TY - CHAP
T1 - Computational methods for high-throughput comparative analyses of natural microbial communities
AU - Preheim, Sarah P.
AU - Perrotta, Allison R.
AU - Friedman, Jonathan
AU - Smilie, Chris
AU - Brito, Ilana
AU - Smith, Mark B.
AU - Alm, Eric
PY - 2013
Y1 - 2013
N2 - One of the most widely employed methods in metagenomics is the amplification and sequencing of the highly conserved ribosomal RNA (rRNA) genes from organisms in complex microbial communities. rRNA surveys, typically using the 16S rRNA gene for prokaryotic identification, provide information about the total diversity and taxonomic affiliation of organisms present in a sample. Greatly enhanced by high-throughput sequencing, these surveys have uncovered the remarkable diversity of uncultured organisms and revealed unappreciated ecological roles ranging from nutrient cycling to human health. This chapter outlines the best practices for comparative analyses of microbial community surveys. We explain how to transform raw data into meaningful units for further analysis and discuss how to calculate sample diversity and community distance metrics. Finally, we outline how to find associations of species with specific metadata and true correlations between species from compositional data. We focus on data generated by next-generation sequencing platforms, using the Illumina platform as a test case, because of its widespread use especially among researchers just entering the field.
AB - One of the most widely employed methods in metagenomics is the amplification and sequencing of the highly conserved ribosomal RNA (rRNA) genes from organisms in complex microbial communities. rRNA surveys, typically using the 16S rRNA gene for prokaryotic identification, provide information about the total diversity and taxonomic affiliation of organisms present in a sample. Greatly enhanced by high-throughput sequencing, these surveys have uncovered the remarkable diversity of uncultured organisms and revealed unappreciated ecological roles ranging from nutrient cycling to human health. This chapter outlines the best practices for comparative analyses of microbial community surveys. We explain how to transform raw data into meaningful units for further analysis and discuss how to calculate sample diversity and community distance metrics. Finally, we outline how to find associations of species with specific metadata and true correlations between species from compositional data. We focus on data generated by next-generation sequencing platforms, using the Illumina platform as a test case, because of its widespread use especially among researchers just entering the field.
KW - 16S ribosomal
KW - RNA survey operational
KW - distance metric correlation analysis
KW - diversity estimates community
KW - taxonomic units
UR - http://www.scopus.com/inward/record.url?scp=84884568058&partnerID=8YFLogxK
U2 - 10.1016/B978-0-12-407863-5.00018-6
DO - 10.1016/B978-0-12-407863-5.00018-6
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.chapter???
C2 - 24060130
AN - SCOPUS:84884568058
SN - 9780124078635
T3 - Methods in Enzymology
SP - 353
EP - 370
BT - Microbial Metagenomics, Metatranscriptomics, and Metaproteomics
PB - Academic Press Inc.
ER -