Accurate age prediction from blood using a small set of DNA methylation sites and a cohort-based machine learning algorithm

Miri Varshavsky, Gil Harari, Benjamin Glaser, Yuval Dor, Ruth Shemer, Tommy Kaplan*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Chronological age prediction from DNA methylation sheds light on human aging, health, and lifespan. Current clocks are mostly based on linear models and rely upon hundreds of sites across the genome. Here, we present GP-age, an epigenetic non-linear cohort-based clock for blood, based upon 11,910 methylomes. Using 30 CpG sites alone, GP-age outperforms state-of-the-art models, with a median accuracy of ∼2 years on held-out blood samples, for both array and sequencing-based data. We show that aging-related changes occur at multiple neighboring CpGs, with implications for using fragment-level analysis of sequencing data in aging research. By training three independent clocks, we show enrichment of donors with consistent deviation between predicted and actual age, suggesting individual rates of biological aging. Overall, we provide a compact yet accurate alternative to array-based clocks for blood, with applications in longitudinal aging research, forensic profiling, and monitoring epigenetic processes in transplantation medicine and cancer.

Original languageEnglish
Article number100567
JournalCell Reports Methods
Volume3
Issue number9
DOIs
StatePublished - 25 Sep 2023

Bibliographical note

Publisher Copyright:
© 2023 The Author(s)

Keywords

  • CP: Genetics
  • CP: Systems biology
  • DNA methylation
  • aging
  • computational biology
  • epigenetics
  • machine learning

Fingerprint

Dive into the research topics of 'Accurate age prediction from blood using a small set of DNA methylation sites and a cohort-based machine learning algorithm'. Together they form a unique fingerprint.

Cite this