TY - JOUR
T1 - Estimating Mean Viral Load Trajectory From Intermittent Longitudinal Data and Unknown Time Origins
AU - Woodbridge, Yonatan
AU - Mandel, Micha
AU - Goldberg, Yair
AU - Huppert, Amit
N1 - Publisher Copyright:
© 2025 The Author(s). Statistics in Medicine published by John Wiley & Sons Ltd.
PY - 2025/2/28
Y1 - 2025/2/28
N2 - Viral load (VL) in the respiratory tract is the leading proxy for assessing infectiousness potential. Understanding the dynamics of disease-related VL within the host is of great importance, as it helps to determine different policies and health recommendations. However, normally the VL is measured on individuals only once, in order to confirm infection, and furthermore, the infection date is unknown. It is therefore necessary to develop statistical approaches to estimate the typical VL trajectory. We show here that, under plausible parametric assumptions, two measures of VL on infected individuals can be used to accurately estimate the VL mean function. Specifically, we consider a discrete-time likelihood-based approach to modeling and estimating partial observed longitudinal samples. We study a multivariate normal model for a function of the VL that accounts for possible correlation between measurements within individuals. We derive an expectation-maximization (EM) algorithm which treats the unknown time origins and the missing measurements as latent variables. Our main motivation is the reconstruction of the daily mean VL, given measurements on patients whose VLs were measured multiple times on different days. Such data should and can be obtained at the beginning of a pandemic with the specific goal of estimating the VL dynamics. For demonstration purposes, the method is applied to SARS-Cov-2 cycle-threshold-value data collected in Israel.
AB - Viral load (VL) in the respiratory tract is the leading proxy for assessing infectiousness potential. Understanding the dynamics of disease-related VL within the host is of great importance, as it helps to determine different policies and health recommendations. However, normally the VL is measured on individuals only once, in order to confirm infection, and furthermore, the infection date is unknown. It is therefore necessary to develop statistical approaches to estimate the typical VL trajectory. We show here that, under plausible parametric assumptions, two measures of VL on infected individuals can be used to accurately estimate the VL mean function. Specifically, we consider a discrete-time likelihood-based approach to modeling and estimating partial observed longitudinal samples. We study a multivariate normal model for a function of the VL that accounts for possible correlation between measurements within individuals. We derive an expectation-maximization (EM) algorithm which treats the unknown time origins and the missing measurements as latent variables. Our main motivation is the reconstruction of the daily mean VL, given measurements on patients whose VLs were measured multiple times on different days. Such data should and can be obtained at the beginning of a pandemic with the specific goal of estimating the VL dynamics. For demonstration purposes, the method is applied to SARS-Cov-2 cycle-threshold-value data collected in Israel.
KW - Ct-value
KW - EM algorithm
KW - multivariate normal distribution
KW - SARS-Cov-2
UR - http://www.scopus.com/inward/record.url?scp=85219116344&partnerID=8YFLogxK
U2 - 10.1002/sim.70033
DO - 10.1002/sim.70033
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 39995297
AN - SCOPUS:85219116344
SN - 0277-6715
VL - 44
JO - Statistics in Medicine
JF - Statistics in Medicine
IS - 5
M1 - e70033
ER -