Abstract
Fragmentomics features of cell-free DNA represent promising non-invasive biomarkers for cancer diagnosis. A lack of systematic evaluation of biases in feature quantification hinders the adoption of such applications. We compare features derived from whole-genome sequencing of ten healthy donors using nine library kits and ten data-processing routes and validated in 1182 plasma samples from published studies. Our results clarify the variations from library preparation and feature quantification methods. We design the Trim Align Pipeline and cfDNAPro R package as unified interfaces for data pre-processing, feature extraction, and visualization to standardize multi-modal feature engineering and integration for machine learning.
Original language | English |
---|---|
Article number | 141 |
Journal | Genome Biology |
Volume | 26 |
Issue number | 1 |
DOIs | |
State | Published - Dec 2025 |
Bibliographical note
Publisher Copyright:© The Author(s) 2025.
Keywords
- Cancer genomics
- CfDNA
- Feature extraction
- Fragmentomics