TY - JOUR
T1 - A non-zero variance of Tajima's estimator for two sequences even for infinitely many unlinked loci
AU - King, Léandra
AU - Wakeley, John
AU - Carmi, Shai
N1 - Publisher Copyright: © 2017 Elsevier Inc.
PY - 2018/7
Y1 - 2018/7
N2 - The population-scaled mutation rate, θ, is informative about the effective population size and is thus widely used in population genetics. We show that for two sequences and n unlinked loci, the variance of Tajima's estimator (θ̂), which is the average number of pairwise differences, does not vanish even as n → ∞. The non-zero variance of θ̂ results from a (weak) correlation between coalescence times even at unlinked loci, which, in turn, is due to the underlying fixed pedigree shared by gene genealogies at all loci. We derive the correlation coefficient under a diploid, discrete-time, Wright–Fisher model, and we also derive a simple, closed-form lower bound. We also obtain empirical estimates of the correlation of coalescence times under demographic models inspired by large-scale human genealogies. While the effect we describe is small (Var[θ̂]/θ² ≈ O(1/N_e)), it is important to recognize this feature of statistical population genetics, which runs counter to commonly held notions about unlinked loci.
KW - Coalescent theory
KW - Effective population size
KW - Genealogies
KW - Heterozygosity
KW - Pedigrees
KW - Recombination
UR - http://www.scopus.com/inward/record.url?scp=85017233680&partnerID=8YFLogxK
U2 - 10.1016/j.tpb.2017.03.002
DO - 10.1016/j.tpb.2017.03.002
M3 - Article
C2 - 28341209
AN - SCOPUS:85017233680
SN - 0040-5809
VL - 122
SP - 22
EP - 29
JO - Theoretical Population Biology
JF - Theoretical Population Biology
ER -