This course provides a computational perspective on the analysis of genomic and genome-scale information. The course concentrates on exploration and analysis of information contained in large genomic sequences, but also touches on issues in structural and functional genomics. Topics include genome sequence assembly, local and global sequence alignment, gene and motif finding in sequences, protein structure and folding, and clustering and classification of genes and tissues using gene expression data from microarrays. The goal is for students to understand computational and algorithmic approaches to genomics, as well as to develop practical experience with handling, analyzing, and visualizing information at a genome-scale. The course will expose students to the Perl programming language. Ideally, students coming in to the course should know how to program in some computer language (this need not be Perl), and should have at least some exposure to basic probability, statistics, and molecular or cellular biology. However, there are no formal course prerequisites and significant background will be provided. This course is a valid elective in both biology and computer science.
Meets: Tues/Thurs 10:05-11:20.
Instructor: Hartemink.