The genomes of remotely related individuals occasionally contain long segments that are Identical By Descent (IBD). Sharing of IBD segments has many applications in population and medical genetics, and it is thus desirable to study their properties in simulations. However, no current method provides a direct, efficient means to extract IBD segments from simulated genealogies. Here, we introduce computationally efficient approaches to extract ground-truth IBD segments from a sequence of genealogies, or equivalently, an ancestral recombination graph. Specifically, we use a two-step scheme, where we first identify putative shared segments by comparing the common ancestors of all pairs of individuals at some distance apart. This reduces the search space considerably, and we then proceed by determining the true IBD status of the candidate segments. Under some assumptions and when allowing a limited resolution of segment lengths, our run-time complexity is reduced from O(n3 log n) for the naïve algorithm to O(n log n), where n is the number of individuals in the sample.
|Original language||American English|
|Title of host publication||Research in Computational Molecular Biology - 19th Annual International Conference, RECOMB 2015, Proceedings|
|Editors||Teresa M. Przytycka|
|Number of pages||14|
|State||Published - 2015|
|Event||19th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2015 - Warsaw, Poland|
Duration: 12 Apr 2015 → 15 Apr 2015
|Name||Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|
|Conference||19th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2015|
|Period||12/04/15 → 15/04/15|
Bibliographical notePublisher Copyright:
© Springer International Publishing Switzerland 2015.
- Ancestral Recombination Graphs
- Identity by Descent
- Population Genetics