TY - JOUR
T1 - Rapidly registering identity-by-descent across ancestral recombination graphs
AU - Yang, Shuo
AU - Carmi, Shai
AU - Pe'er, Itsik
N1 - Publisher Copyright:
© Mary Ann Liebert, Inc. 2016.
PY - 2016/6/1
Y1 - 2016/6/1
N2 - The genomes of remotely related individuals occasionally contain long segments that are identical by descent (IBD). Sharing of IBD segments has many applications in population and medical genetics, and it is thus desirable to study their properties in simulations. However, no current method provides a direct, efficient means to extract IBD segments from simulated genealogies. Here, we introduce computationally efficient approaches to extract ground-truth IBD segments from a sequence of genealogies, or equivalently, an ancestral recombination graph. Specifically, we use a two-step scheme, where we first identify putative shared segments by comparing the common ancestors of all pairs of individuals at some distance apart. This reduces the search space considerably, and we then proceed by determining the true IBD status of the candidate segments. Under some assumptions and when allowing a limited resolution of segment lengths, our run-time complexity is reduced from O(n3 log n) for the naïve algorithm to O(n log n), where n is the number of individuals in the sample.
AB - The genomes of remotely related individuals occasionally contain long segments that are identical by descent (IBD). Sharing of IBD segments has many applications in population and medical genetics, and it is thus desirable to study their properties in simulations. However, no current method provides a direct, efficient means to extract IBD segments from simulated genealogies. Here, we introduce computationally efficient approaches to extract ground-truth IBD segments from a sequence of genealogies, or equivalently, an ancestral recombination graph. Specifically, we use a two-step scheme, where we first identify putative shared segments by comparing the common ancestors of all pairs of individuals at some distance apart. This reduces the search space considerably, and we then proceed by determining the true IBD status of the candidate segments. Under some assumptions and when allowing a limited resolution of segment lengths, our run-time complexity is reduced from O(n3 log n) for the naïve algorithm to O(n log n), where n is the number of individuals in the sample.
KW - ancestral recombination graphs
KW - identity-by-descent
KW - population genetics
KW - simulation
UR - http://www.scopus.com/inward/record.url?scp=84974652918&partnerID=8YFLogxK
U2 - 10.1089/cmb.2016.0016
DO - 10.1089/cmb.2016.0016
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 27104872
AN - SCOPUS:84974652918
SN - 1066-5277
VL - 23
SP - 495
EP - 507
JO - Journal of Computational Biology
JF - Journal of Computational Biology
IS - 6
ER -