A new approach to analyzing gene expression time series data

Ziv Bar-Joseph*, Georg Gerber, David K. Gifford, Tommi S. Jaakkola, Itamar Simon

*Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

138 Scopus citations

Abstract

We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time-points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spine (piecewise polynomial) that is estimated from the observed data and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene specific parameters. We show that unobserved time-points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to non-uniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parametrized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knockout data that produces biologically meaningful results.

Original languageEnglish
Pages39-48
Number of pages10
DOIs
StatePublished - 2002
Externally publishedYes
EventRECOMB 2002: Proceedings of the Sixth Annual International Conference on Computational Biology - Washington, DC, United States
Duration: 18 Apr 200221 Apr 2002

Conference

ConferenceRECOMB 2002: Proceedings of the Sixth Annual International Conference on Computational Biology
Country/TerritoryUnited States
CityWashington, DC
Period18/04/0221/04/02

Fingerprint

Dive into the research topics of 'A new approach to analyzing gene expression time series data'. Together they form a unique fingerprint.

Cite this