DECOD: Fast and accurate discriminative DNA motif finding

Peter Huggins, Shan Zhong, Idit Shiff, Rachel Beckerman, Oleg Laptenko, Carol Prives, Marcel H. Schulz, Itamar Simon, Ziv Bar-Joseph*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

33 Scopus citations

Abstract

Motivation: Motif discovery is now routinely used in high-throughput studies including large-scale sequencing and proteomics. These datasets present new challenges. The rst is speed. Many motif discovery methods do not scale well to large datasets. Another issue is identifying discriminative rather than generative motifs. Such discriminative motifs are important for identifying co-factors and for explaining changes in behavior between different conditions. Results: To address these issues we developed a method for DECOnvolved Discriminative motif discovery (DECOD). DECOD uses a k-mer count table and so its running time is independent of the size of the input set. By deconvolving the k-mers DECOD considers context information without using the sequences directly. DECOD outperforms previous methods both in speed and in accuracy when using simulated and real biological benchmark data. We performed new binding experiments for p53 mutants and used DECOD to identify p53 co-factors, suggesting new mechanisms for p53 activation.

Original languageEnglish
Article numberbtr412
Pages (from-to)2361-2367
Number of pages7
JournalBioinformatics
Volume27
Issue number17
DOIs
StatePublished - Sep 2011

Bibliographical note

Funding Information:
Funding: National Institute of Health (1RO1 GM085022, in part); National Science Foundation CAREER award (0448453, to Z.B.J., in part). Israeli Cancer Research Foundation; Israeli Cancer Association; Weinkselbaum Family foundation (to I.S., in part).

Fingerprint

Dive into the research topics of 'DECOD: Fast and accurate discriminative DNA motif finding'. Together they form a unique fingerprint.

Cite this