Correlation detection strategies in microbial data sets vary widely in sensitivity and precision

Sophie Weiss, Will Van Treuren, Catherine Lozupone, Karoline Faust, Jonathan Friedman, Ye Deng, Li Charlie Xia, Zhenjiang Zech Xu, Luke Ursell, Eric J. Alm, Amanda Birmingham, Jacob A. Cram, Jed A. Fuhrman, Jeroen Raes, Fengzhu Sun, Jizhong Zhou, Rob Knight*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

458 Scopus citations


Disruption of healthy microbial communities has been linked to numerous diseases, yet microbial interactions are little understood. This is due in part to the large number of bacteria, and the much larger number of interactions (easily in the millions), making experimental investigation very difficult at best and necessitating the nascent field of computational exploration through microbial correlation networks. We benchmark the performance of eight correlation techniques on simulated and real data in response to challenges specific to microbiome studies: fractional sampling of ribosomal RNA sequences, uneven sampling depths, rare microbes and a high proportion of zero counts. Also tested is the ability to distinguish signals from noise, and detect a range of ecological and time-series relationships. Finally, we provide specific recommendations for correlation technique usage. Although some methods perform better than others, there is still considerable need for improvement in current techniques.

Original languageAmerican English
Pages (from-to)1669-1681
Number of pages13
JournalISME Journal
Issue number7
StatePublished - 1 Jul 2016
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2016 International Society for Microbial Ecology.


Dive into the research topics of 'Correlation detection strategies in microbial data sets vary widely in sensitivity and precision'. Together they form a unique fingerprint.

Cite this