TY - JOUR
T1 - Context-specific Bayesian clustering for gene expression data
AU - Barash, Yoseph
AU - Friedman, Nir
PY - 2002
Y1 - 2002
N2 - The recent growth in genomic data and measurements of genome-wide expression patterns allows us to apply computational tools to examine gene regulation by transcription factors. In this work, we present a class of mathematical models that help in understanding the connections between transcription factors and functional classes of genes based on genetic and genomic data. Such a model represents the joint distribution of transcription factor binding sites and of expression levels of a gene in a unified probabilistic model. Learning a combined probability model of binding sites and expression patterns enables us to improve the clustering of the genes based on the discovery of putative binding sites and to detect which binding sites and experiments best characterize a cluster. To learn such models from data, we introduce a new search method that rapidly learns a model according to a Bayesian score. We evaluate our method on synthetic data as well as on real life data and analyze the biological insights it provides. Finally, we demonstrate the applicability of the method to other data analysis problems in gene expression data.
AB - The recent growth in genomic data and measurements of genome-wide expression patterns allows us to apply computational tools to examine gene regulation by transcription factors. In this work, we present a class of mathematical models that help in understanding the connections between transcription factors and functional classes of genes based on genetic and genomic data. Such a model represents the joint distribution of transcription factor binding sites and of expression levels of a gene in a unified probabilistic model. Learning a combined probability model of binding sites and expression patterns enables us to improve the clustering of the genes based on the discovery of putative binding sites and to detect which binding sites and experiments best characterize a cluster. To learn such models from data, we introduce a new search method that rapidly learns a model according to a Bayesian score. We evaluate our method on synthetic data as well as on real life data and analyze the biological insights it provides. Finally, we demonstrate the applicability of the method to other data analysis problems in gene expression data.
KW - Bayesian model selection
KW - Clustering
KW - Gene expression
UR - http://www.scopus.com/inward/record.url?scp=0036100113&partnerID=8YFLogxK
U2 - 10.1089/10665270252935403
DO - 10.1089/10665270252935403
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 12015876
AN - SCOPUS:0036100113
SN - 1066-5277
VL - 9
SP - 169
EP - 191
JO - Journal of Computational Biology
JF - Journal of Computational Biology
IS - 2
ER -