Identification of genomic variations of Azeri buffaloes using whole genome sequencing

Document Type : Research Paper

Authors

1 Ph.D. Student, Department of Animal Science, Faculty of Agriculture and Natural Resources, University of Tehran, Karaj, Iran

2 Assistant Professor, Department of Animal Science, Faculty of Agriculture and Natural Resources, University of Tehran, Karaj, Iran

3 Professor, Department of Animal Science, Faculty of Agriculture and Natural Resources, University of Tehran, Karaj, Iran

Abstract

Abstract
Objective
Advances in next generation sequencing technologies have created the ability to efficiently and economically sequence the whole genome more than ever and provide the opportunity to discover and introduce multiple polymorphisms throughout the genome of organisms. Azeri buffalo is one of the most important breeds of Iran and it is scattered in the north to the northwest of the country and is completely adapted to the environmental conditions of this geographical region. The main purpose of this study is to introduce the genomic diversity of Iranian buffaloes and categorize them. Also, introducing the effects of these variations on different genomic regions of Azeri buffaloes have potential applications in breeding programs.
Materials and Methods
In this study, whole genome sequencing of 5 heads of Azeri buffaloes native to Iran was done by Illumina sequencing platform. Data quality was measured by FastQC software. BWA-MEM software was used for alignment with the reference genome. Finally, the variants were identified using freebayes and the SnpEff program was used to calculate the effects of the variants by mentioning their type, location and number. The alignment result of high quality reads with the reference genome showed that the alignment percentage for all 5 samples was over 97.5%, which indicates the high quality of short reads.The coverage in the sequenced samples was determined between 4x and 12.8x.
Results
finally, 76,298,858 million variants were identified, including 57,921,822 SNPs, 6,162,328 indels, 10,534,042 MNPs, and 1,680,666 MIXEDs. Small deletions and insertions with a minimum length of 1 bp, a maximum length of 28 bp and an average of 1.39 bp were identified. From the total number of variants, the highest frequency of variants was observed in intergenic regions, 53,789,879 (62.022 percent), intron 24,003,682 (27.677 percent), respectively, and the variants detected in exons had the lowest number.
Conclusions
Identification of genome-level variations such as snps, small insertions and deletions, and multi-nucleotide polymorphisms in different populations are a valuable resource in genetic research and can be useful in locating genomic segments responsible for important economic traits. Also, the large volume of SNPs enables genome-wide association studies in animals. In addition, the genomic variations identified in the present study can be used to develop high-density SNP arrays in Iranian breeds for genetic and breeding applications.

Keywords


جعفری احمدآبادی سید علی اصغر، عسکری­همت حشمت­اله، محمدآبادی محمدرضا (1402) تاثیر شاهدانه بر بیان ژن DLK1 در بافت‌ قلب بره‌های کرمانی. مجله بیوتکنولوژی کشاورزی، 15(1)، 217-234.
شکری سمیرا، خضری امین، محمدآبادی محمدرضا، خیرالدین حمید (1402) بررسی بیان ژن MYH7  در بافت‌های ران، دست و راسته بره‌های پرواری نژاد کرمانی. مجله بیوتکنولوژی کشاورزی، 15(2)، 217-236.
References
Andrews S (2010) FastQC: a quality control tool for high throughput sequence data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc
Bakhtiarizadeh MR, Alamouti AA (2020) RNA-Seq based genetic variant discovery provides new insights into controlling fat deposition in the tail of sheep. Sci Rep 10, 13525.
Barazandeh A, Mohammadabadi MR, Ghaderi-Zefrehei M, Nezamabadipour H (2016a) Predicting CpG Islands and Their Relationship with Genomic Feature in Cattle by Hidden Markov Model Algorithm. Iran J Appl Anim Sci 6 (3), 571-579.
Barazandeh A, Mohammadabadi MR, Ghaderi-Zefrehei M, Nezamabadipour H (2016b) Genome-wide analysis of CpG islands in some livestock genomes and their relationship with genomic features. Czech J Anim Sci 61, 487.
Blas AL, Ming R, Liu Z et al. (2010) Cloning of the papaya chromoplast-specific lycopene β-cyclase, CpCYC-b, controlling fruit flesh color reveals conserved microsynteny and a recombination hot spot. Plnt Physio 152, 2013-2022.
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinform 30, 2114-2120.
Bordbar F, Mohammadabadi M, Jensen J, et al. (2022) Identification of candidate genes regulating carcass depth and hind leg circumference in simmental beef cattle using Illumina Bovine Beadchip and next-generation sequencing. J Anim 12 (9), e1103.
Borghese A (2013) Buffalo livestock and products in Europe. Buffalo Bulletin 32, 50-74.
Collins DW, Jukes TH (1994) Rates of transition and transversion in coding sequences since the human-rodent divergence. Genomics 20, 386-396.
Garrison E, Marth G (2012) Haplotype-based variant detection from short-read sequencing. arXiv preprint arXiv:12073907.
Guajardo V, Solís S, Almada R et al. (2020) Genome-wide SNP identification in Prunus rootstocks germplasm collections using genotyping-by-sequencing: phylogenetic analysis, distribution of SNPs and prediction of their effect on gene function. Sci Rep 10, 1467.
Jafari Ahmadabadi SAA, Askari-Hemmat H, Mohammadabadi M, et al. (2023) The effect of Cannabis seed on DLK1 gene expression in heart tissue of Kermani lambs. Agric Biotechnol J 15 (1), 217-234 (In Persian).
Kumar S, Nagarajan M, Sandhu J et al. (2007) Mitochondrial DNA analyses of Indian water buffalo support a distinct genetic origin of river and swamp buffalo. Anim Genet 38, 227-232.
Li H, Handsaker B, Wysoker A et al. (2009) The sequence alignment/map format and SAMtools. Bioinform 25, 2078-2079.
Masoudzadeh, S.H., Mohammadabadi, M.R., Khezri, A., et al. (2020) Dlk1 gene expression in different Tissues of lamb. Iran J Appl Anim Sci 10, 669-677.
Michelizzi VN, Dodson MV, Pan Z et al. (2010) Water buffalo genome science comes of age. Int J Biol Sci 6, 333.
Mohamadipoor L, Mohammadabadi M, Amiri Z, et al. (2021) Signature selection analysis reveals candidate genes associated with production traits in Iranian sheep breeds. BMC Vet Res 17 (1), 1-9.
Mohammadabadi M, Masoudzadeh SH, Khezri A, et al. (2021) Fennel (Foeniculum vulgare) seed powder increases Delta-Like Non-Canonical Notch Ligand 1 gene expression in testis, liver, and humeral muscle tissues of growing lambs. Heliyon 7 (12), e08542.
Mohammadinejad F, Mohammadabadi M, Roudbari Z, Sadkowski T (2022) Identification of Key Genes and Biological Pathways Associated with Skeletal Muscle Maturation and Hypertrophy in Bos taurus, Ovis aries, and Sus scrofa. J Anim 12 (24), 3471.
Mokhber M, Moradi-Shahrbabak M, Sadeghi M et al. (2018) A genome-wide scan for signatures of selection in Azeri and Khuzestani buffalo breeds. BMC Genom 19, 1-9.
Rafiepour M, Ebrahimie E, Vahidi MF et al. (2021) Whole-genome resequencing reveals adaptation prior to the divergence of buffalo subspecies. Genome Biol Evol 13, evaa231.
Safaei SMH, Dadpasand M, Mohammadabadi M, et al. (2022) An Origanum majorana Leaf Diet Influences Myogenin Gene Expression, Performance, and Carcass Characteristics in Lambs. J Anim 13 (1), e14.
Safari A  Ghavi Hossein-Zadeh N, Shadparvar AA, Abdollahi AR. (2018) A review on breeding and genetic strategies in Iranian buffaloes. Trop Anim Health Prod 50 (4), 707-714.
Shahsavari M, Mohammadabadi M, Khezri A, et al. (2022) Effect of Fennel (Foeniculum Vulgare) Seed Powder Consumption on Insulin-like Growth Factor 1 Gene Expression in the Liver Tissue of Growing Lambs. Gene Expr 21 (2), 21-26.
Shirasawa K, Fukuoka H, Matsunaga H et al. (2013) Genome-wide association studies using single nucleotide polymorphism markers developed by re-sequencing of the genomes of cultivated tomato. DNA Res 20, 593-603.
Shokri S, Khezri A, Mohammadabadi M, Kheyrodin H (2023). The expression of MYH7 gene in femur, humeral muscle and back muscle tissues of fattening lambs of the Kermani breed. Agric Biotechnol J 15 (2), 217-236 (In Persian).
Tanaka K, Solis CD, Masangkay JS et al. (1996) Phylogenetic relationship among all living species of the genus Bubalus based on DNA sequences of the cytochrome b gene. Biochem Genet 34, 443-452.
Williams JL, Iamartino D, Pruitt KD et al. (2017) Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2 n= 50). Gigascience 6, gix088.
Yindee M, Vlamings B, Wajjwalku W et al. (2010) Y‐chromosomal variation confirms independent domestications of swamp and river buffalo. Anim Genet 41, 433-435.
Zhang Y, Colli L, Barker J (2020) Asian water buffalo: domestication, history and genetics. Anim genet 51, 177-191.
Zimin AV, Marçais G, Puiu D et al. (2013) The MaSuRCA genome assembler. Bioinform 29, 2669-2677.