TY - JOUR
T1 - Marginalized Frailty-Based Illness-Death Model
T2 - Application to the UK-Biobank Survival Data
AU - Gorfine, Malka
AU - Keret, Nir
AU - Ben Arie, Asaf
AU - Zucker, David
AU - Hsu, Li
N1 - Publisher Copyright:
© 2020 American Statistical Association.
PY - 2021
Y1 - 2021
N2 - The UK Biobank is a large-scale health resource comprising genetic, environmental, and medical information on approximately 500,000 volunteer participants in the United Kingdom, recruited at ages 40–69 during the years 2006–2010. The project monitors the health and well-being of its participants. This work demonstrates how these data can be used to yield the building blocks for an interpretable risk-prediction model, in a semiparametric fashion, based on known genetic and environmental risk factors of various chronic diseases, such as colorectal cancer. An illness-death model is adopted, which inherently is a semi-competing risks model, since death can censor the disease, but not vice versa. Using a shared-frailty approach to account for the dependence between time to disease diagnosis and time to death, we provide a new illness-death model that assumes Cox models for the marginal hazard functions. The recruitment procedure used in this study introduces delayed entry to the data. An additional challenge arising from the recruitment procedure is that information coming from both prevalent and incident cases must be aggregated. Lastly, we do not observe any deaths prior to the minimal recruitment age, 40. In this work, we provide an estimation procedure for our new illness-death model that overcomes all the above challenges. Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.
AB - The UK Biobank is a large-scale health resource comprising genetic, environmental, and medical information on approximately 500,000 volunteer participants in the United Kingdom, recruited at ages 40–69 during the years 2006–2010. The project monitors the health and well-being of its participants. This work demonstrates how these data can be used to yield the building blocks for an interpretable risk-prediction model, in a semiparametric fashion, based on known genetic and environmental risk factors of various chronic diseases, such as colorectal cancer. An illness-death model is adopted, which inherently is a semi-competing risks model, since death can censor the disease, but not vice versa. Using a shared-frailty approach to account for the dependence between time to disease diagnosis and time to death, we provide a new illness-death model that assumes Cox models for the marginal hazard functions. The recruitment procedure used in this study introduces delayed entry to the data. An additional challenge arising from the recruitment procedure is that information coming from both prevalent and incident cases must be aggregated. Lastly, we do not observe any deaths prior to the minimal recruitment age, 40. In this work, we provide an estimation procedure for our new illness-death model that overcomes all the above challenges. Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.
KW - Delayed entry
KW - Dependent left truncation
KW - Frailty model
KW - Random effect
KW - Risk prediction
KW - Semi-competing risks
UR - http://www.scopus.com/inward/record.url?scp=85096626892&partnerID=8YFLogxK
U2 - 10.1080/01621459.2020.1831922
DO - 10.1080/01621459.2020.1831922
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85096626892
SN - 0162-1459
VL - 116
SP - 1155
EP - 1167
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 535
ER -