A standardized framework for robust fragmentomic feature extraction from cell-free DNA sequencing data

Haichao Wang, Paulius D. Mennea, Yu Kiu Elkie Chan, Zhao Cheng, Maria C. Neofytou, Arif Anwer Surani, Aadhitthya Vijayaraghavan, Emma Jane Ditter, Richard Bowers, Matthew D. Eldridge, Dmitry S. Shcherbo, Christopher G. Smith, Florian Markowetz, Wendy N. Cooper, Tommy Kaplan, Nitzan Rosenfeld*, Hui Zhao*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Fragmentomics features of cell-free DNA represent promising non-invasive biomarkers for cancer diagnosis. A lack of systematic evaluation of biases in feature quantification hinders the adoption of such applications. We compare features derived from whole-genome sequencing of ten healthy donors using nine library kits and ten data-processing routes and validated in 1182 plasma samples from published studies. Our results clarify the variations from library preparation and feature quantification methods. We design the Trim Align Pipeline and cfDNAPro R package as unified interfaces for data pre-processing, feature extraction, and visualization to standardize multi-modal feature engineering and integration for machine learning.

Original languageEnglish
Article number141
JournalGenome Biology
Volume26
Issue number1
DOIs
StatePublished - Dec 2025

Bibliographical note

Publisher Copyright:
© The Author(s) 2025.

Keywords

  • Cancer genomics
  • CfDNA
  • Feature extraction
  • Fragmentomics

Fingerprint

Dive into the research topics of 'A standardized framework for robust fragmentomic feature extraction from cell-free DNA sequencing data'. Together they form a unique fingerprint.

Cite this