J Genomics 2020; 8:43-48. doi:10.7150/jgen.39147

Research Paper

A detailed characteristics of bias associated with long runs of homozygosity identification based on medium density SNP microarrays

Tomasz Szmatoła1,2✉, Artur Gurgul1,2, Igor Jasielczuk1,2, Weiwei Fu3, Katarzyna Ropka-Molik2

1. University Centre of Veterinary Medicine, University of Agriculture in Kraków, Al. Mickiewicza 24/28, 30-059 Kraków, Poland.
2. National Research Institute of Animal Production, Department of Animal Molecular Biology, Krakowska 1, 32-083 Balice, Poland
3. College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China

This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/). See http://ivyspring.com/terms for full terms and conditions.
Citation:
Szmatoła T, Gurgul A, Jasielczuk I, Fu W, Ropka-Molik K. A detailed characteristics of bias associated with long runs of homozygosity identification based on medium density SNP microarrays. J Genomics 2020; 8:43-48. doi:10.7150/jgen.39147. Available from http://www.jgenomics.com/v08p0043.htm

File import instruction

Abstract

In the present study, runs of homozygosity (ROH) detected with the use of a standard bovine 54k single nucleotide polymorphism (SNP) genotyping assay and two different ROH detection approaches, based on 50 (M1) or 15 (M2) consecutive SNPs, were compared with results of whole genome sequencing. Both microarray-based methods accurately recognised medium-sized ROH, however, it was found that M2 method seemed to better than M1 identify short ROH, but highly overestimated their number, leading to numerous false positive calls. Moreover, long ROH identified with microarray data tended to break into shorter segments in sequencing data because of the presence of regions with high heterozygosity within the ROH sequences. This may indicate, that these long ROH are formed by closely positioned shorter homozygous segments that may be of older origin or may be created by two similar but not identical haplotypes, showing minor internal recombination signs. Such finding also suggests that at least some of the results of previous studies in regard to long ROH may be biased leading to inaccurate estimations of genomes autozygosity via ROH classification into length categories.

Keywords: runs of homozygosity, autozygosity, microarray, next generation sequencing