Abstract
Hybridization capture (also called enrichment) is a molecular technique used to focus high-throughput sequencing effort to a targeted region (or regions) of the genome, thereby reducing costs. This method has been used with great success in a wide range of research topics, including applied conservation genetics, and has been the subject of many studies aiming to optimize and improve the technique. In such technical optimization studies, it is common to minimally sequence (i.e., shallow sequencing) experimental capture libraries to save costs, and evaluate hybridization capture efficiency at lower sequencing effort than would be necessary for downstream analyses. Here I use simulations to show how this practice can produce misleading results and, ultimately, non-optimal experimental design choices. I argue that hybridization capture optimization studies must investigate their experiments at variable sequencing efforts and evaluate results in the context of the required target-space coverage for downstream analyses.
Similar content being viewed by others
Data availability
All necessary code to replicate the simulated data found in Fig. 1 is available at https://github.com/DrLaurenCWhite/Hybridization_Capture_Simulations
References
Arribas P, Andújar C, Bidartondo MI, Bohmann K, Coissac É, Creer S, deWaard JR, Elbrecht V, Ficetola GF, Goberna M, Kennedy S, Krehenwinkel H, Leese F, Novotny V, Ronquist F, Yu DW, Zinger L, Creedy TJ, Meramveliotakis E, Noguerales V, Overcast I, Morlon H, Vogler AP, Papadopoulou A, Emerson BC (2021) Connecting high-throughput biodiversity inventories: opportunities for a site-based genomic framework for global integration and synthesis. Mol Ecol 30:1120–1135. https://doi.org/10.1111/mec.15797
Ávila-Arcos MC, Sandoval-Velasco M, Schroeder H, Carpenter ML, Malaspinas A-S, Wales N, Peñaloza F, Bustamante CD, Gilbert MTP (2015) Comparative performance of two whole-genome capture methodologies on ancient DNA Illumina libraries. Methods Ecol Evol 6(6):725–734. https://doi.org/10.1111/2041-210X.12353
Bodi K, Perera AG, Adams PS, Bintzler D, Dewar K, Grove DS, Kieleczawa J, Lyons RH, Neubert TA, Noll AC, Singh S, Steen R, Zianni M (2013) Comparison of commercially available target enrichment methods for next-generation sequencing. J Biomol Technol 24(2):73–86. https://doi.org/10.7171/jbt.13-2402-002
Bragg JG, Potter S, Bi K, Moritz C (2016) Exon capture phylogenomics: efficacy across scales of divergence. Mol Ecol Resour 16(5):1059–1068. https://doi.org/10.1111/1755-0998.12449
Carpenter ML, Buenrostro JD, Valdiosera C, Schroeder H, Allentoft ME, Sikora M, Rasmussen M, Gravel S, Guillén S, Nekhrizov G, Leshtakov K, Dimitrova D, Theodossiev N, Pettener D, Luiselli D, Sandoval K, Moreno-Estrada A, Li Y, Wang J et al (2013) Pulling out the 1%: Whole-genome capture for the targeted enrichment of ancient DNA sequencing libraries. The American Journal of Human Genetics 93(5):852–864. https://doi.org/10.1016/j.ajhg.2013.10.002
Christmas MJ, Biffin E, Breed MF, Lowe AJ (2017) Targeted capture to assess neutral genomic variation in the narrow-leaf hopbush across a continental biodiversity refugium. Sci Rep 7(1):41367. https://doi.org/10.1038/srep41367
Cole TL, Ksepka DT, Mitchell KJ, Tennyson AJ, Thomas DB, Pan H, Zhang G, Rawlence NJ, Wood JR, Bover P, Bouzat JL (2019) Mitogenomes uncover extinct penguin taxa and reveal island formation as a key driver of speciation. Mol Biol Evol 36(4):784–797
Cruz-Dávalos DI, Llamas B, Gaunitz C, Fages A, Gamba C, Soubrier J, Librado P, Seguin-Orlando A, Pruvost M, Alfarhan AH, Alquraishi SA, Al-Rasheid KAS, Scheu A, Beneke N, Ludwig A, Cooper A, Willerslev E, Orlando L (2017) Experimental conditions improving in-solution target enrichment for ancient DNA. Mol Ecol Resour 17(3):508–522. https://doi.org/10.1111/1755-0998.12595
Cruz-Dávalos DI, Nieves-Colón MA, Sockell A, Poznik GD, Schroeder H, Stone AC, Bustamante CD, Malaspinas A-S, Ávila-Arcos MC (2018) In-solution Y-chromosome capture-enrichment on ancient DNA libraries. BMC Genomics 19(1):608. https://doi.org/10.1186/s12864-018-4945-x
Enk J, Rouillard J-M, Poinar H (2013) Quantitative PCR as a predictor of aligned ancient DNA read counts following targeted enrichment. Biotechniques 55(6):300–309. https://doi.org/10.2144/000114114
Enk J, Devault AM, Kuch M, Murgha YE, Rouillard J-M, Poinar HN (2014) Ancient whole genome enrichment using baits built from modern DNA. Mol Biol Evol 31(5):1292–1294. https://doi.org/10.1093/molbev/msu074
Exposito-Alonso M, Drost H-G, Burbano HA, Weigel D (2020) The Earth BioGenome project: opportunities and challenges for plant genomics and conservation. Plant J 102:222–229. https://doi.org/10.1111/tpj.14631
Faircloth BC, McCormack JE, Crawford NG, Harvey MG, Brumfield RT, Glenn TC (2012) Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales. Syst Biol 61(5):717–726
Fisher S, Barry A, Abreu J, Minie B, Nolan J, Delorey TM, Young G, Fennell TJ, Allen A, Ambrogio L, Berlin AM (2011) A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries. Genome Biol 12(1):1–15
Fontsere C, Alvarez-Estape M, Lester J, Arandjelovic M, Kuhlwilm M, Dieguez P, Agbor A, Angedakin S, Ayimisin EA, Bessone M, Brazzola G, Deschner T, Eno-Nku M, Granjon A-C, Head J, Kadam P, Kalan AK, Kambi M, Langergraber K et al (2021) Maximizing the acquisition of unique reads in noninvasive capture sequencing experiments. Mol Ecol Resour 21(3):745–761. https://doi.org/10.1111/1755-0998.13300
Gasc C, Peyretaillade E, Peyret P (2016) Sequence capture by hybridization to explore modern and ancient genomic diversity in model and nonmodel organisms. Nucleic Acids Res 44(10):4504–4518
Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust EM, Brockman W, Fennell T, Giannoukos G, Fisher S, Russ C, Gabriel S (2009) Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat Biotechnol 27:182–189. https://doi.org/10.1038/nbt.1523
Gorden EM, Sturk-Andreaggi K, Marshall C (2021) Capture enrichment and massively parallel sequencing for human identification. Forensic Sci Int Genet 53:102496
Gretzinger J, Molak M, Reiter E, Pfrengle S, Urban C, Neukamm J, Blant M, Conard NJ, Cupillard C, Dimitrijević V, Drucker DG, Hofman-Kamińska E, Kowalczyk R, Krajcarz MT, Krajcarz M, Münzel SC, Peresani M, Romandini M, Rufí I et al (2019) Large-scale mitogenomic analysis of the phylogeography of the Late Pleistocene cave bear. Sci Rep 9(1):10700. https://doi.org/10.1038/s41598-019-47073-z
Hernandez-Rodriguez J, Arandjelovic M, Lester J, de Filippo C, Weihmann A, Meyer M, Angedakin S, Casals F, Navarro A, Vigilant L, Kühl HS, Langergraber K, Boesch C, Hughes D, Marques-Bonet T (2018) The impact of endogenous content, replicates and pooling on genome capture from faecal samples. Mol Ecol Resour 18(2):319–333. https://doi.org/10.1111/1755-0998.12728
Hohenlohe PA, Funk WC, Rajora OP (2021) Population genomics for wildlife conservation and management. Mol Ecol 30:62–82. https://doi.org/10.1111/mec.15720
Jensen MR, Sigsgaard EE, Liu S, Manica A, Bach SS, Hansen MM, Møller PR, Thomsen PF (2021) Genome-scale target capture of mitochondrial and nuclear environmental DNA from water samples. Mol Ecol Resour 21(3):690–702. https://doi.org/10.1111/1755-0998.13293
Keighley X, Bro-Jørgensen MH, Ahlgren H, Szpak P, Ciucani MM, Barreiro FS, Howse L, Gotfredsen AB, Glykou A, Jordan P, Lidén K, Olsen MT (2021) Predicting sample success for large-scale ancient DNA studies on marine mammals. Mol Ecol Resour 21(4):1149–1166. https://doi.org/10.1111/1755-0998.13331
Kollias S, Poortvliet M, Smolina I, Hoarau G (2015) Low cost sequencing of mitogenomes from museum samples using baits capture and Ion Torrent. Conserv Genet Resour 7:345–348. https://doi.org/10.1007/s12686-015-0433-7
Linck EB, Hanna ZR, Sellas A, Dumbacher JP (2017) Evaluating hybridization capture with RAD probes as a tool for museum genomics with historical bird specimens. Ecol Evol 7(13):4755–4767. https://doi.org/10.1002/ece3.3065
Mariac C, Scarcelli N, Pouzadou J, Barnaud A, Billot C, Faye A, Kougbeadjo A, Maillol V, Martin G, Sabot F, Santoni S (2014) Cost-effective enrichment hybridization capture of chloroplast genomes at deep multiplexing levels for population genetics and phylogeography studies. Mol Ecol Resour 14(6):1103–1113
Mohandesan E, Speller CF, Peters J, Uerpmann HP, Uerpmann M, De Cupere B, Hofreiter M, Burger PA (2017) Combined hybridization capture and shotgun sequencing for ancient DNA analysis of extinct wild and domestic dromedary camel. Mol Ecol Resour 17(2):300–313
Oreskovic A, Lutz BR (2021) Ultrasensitive hybridization capture: reliable detection of< 1 copy/mL short cell-free DNA from large-volume urine samples. PLoS ONE 16(2):e0247851
Paijmans JLA, Fickel J, Courtiol A, Hofreiter M, Förster DW (2016) Impact of enrichment conditions on cross-species capture of fresh and degraded DNA. Mol Ecol Resour 16(1):42–55. https://doi.org/10.1111/1755-0998.12420
Parker LD, Campana MG, Quinta JD, Cypher B, Rivera I, Fleischer RC, Ralls K, Wilbert TR, Boarman R, Boarman WI, Maldonado JE (2022) An efficient method for simultaneous species, individual, and sex identification via in-solution single nucleotide polymorphism capture from low-quality scat samples. Mol Ecol Resour 22(4):1345–1361
Patel RP, Lenz D, Kitchener AC, Fickel J, Förster DW, Wilting A (2017) Threatened but understudied: supporting conservation by understanding the genetic structure of the flat-headed cat. Conserv Genet 18:1423–1433. https://doi.org/10.1007/s10592-017-0990-2
Perry GH, Marioni JC, Melsted P, Gilad Y (2010) Genomic-scale capture and sequencing of endogenous DNA from feces. Mol Ecol 19(24):5332–5344. https://doi.org/10.1111/j.1365-294X.2010.04888.x
Portik DM, Smith LL, Bi K (2016) An evaluation of transcriptome-based exon capture for frog phylogenomics across multiple scales of divergence (Class: Amphibia, Order: Anura). Mol Ecol Resour 16(5):1069–1083. https://doi.org/10.1111/1755-0998.12541
Pyron RA, Beamer DA, Holzheuser CR, Lemmon EM, Lemmon AR, Wynn AH, O’Connell KA (2022) Contextualizing enigmatic extinctions using genomic DNA from fluid-preserved museum specimens of Desmognathus salamanders. Conserv Genet 23:375–386. https://doi.org/10.1007/s10592-021-01424-4
R Core Team (2022) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.
Ramos E, Levinson BT, Chasnoff S, Hughes A, Young AL, Thornton K, Li A, Vallania FL, Province M, Druley TE (2012) Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing. BMC Genomics 13(1):1–15
Schmid S, Genevest R, Gobet E, Suchan T, Sperisen C, Tinner W, Alvarez N (2017) HyRAD-X, a versatile method combining exome capture and RAD sequencing to extract genomic information from ancient DNA. Methods Ecol Evol 8(10):1374–1388. https://doi.org/10.1111/2041-210X.12785
Schulte L, Bernhardt N, Stoof-Leichsenring K, Zimmermann HH, Pestryakova LA, Epp LS, Herzschuh U (2021) Hybridization capture of larch (Larix Mill.) chloroplast genomes from sedimentary ancient DNA reveals past changes of Siberian forest. Mol Ecol Resour 21(3):801–815
Seeber PA, McEwen GK, Löber U, Förster DW, East ML, Melzheimer J, Greenwood AD (2019) Terrestrial mammal surveillance using hybridization capture of environmental DNA from African waterholes. Mol Ecol Resour 19(6):1486–1496. https://doi.org/10.1111/1755-0998.13069
van der Valk T, Durazo FL, Dalén L, Guschanski K (2017) Whole mitochondrial genome capture from faecal samples and museum-preserved specimens. Mol Ecol Resour 17(6):e111–e121. https://doi.org/10.1111/1755-0998.12699
White LC, Fontsere C, Lizano E, Hughes DA, Angedakin S, Arandjelovic M, Granjon A-C, Hans JB, Lester JD, Rabanus-Wallace MT, Rowney C, Städele V, Marques-Bonet T, Langergraber KE, Vigilant L (2019) A roadmap for high-throughput sequencing studies of wild animal populations using noninvasive samples and hybridization capture. Mol Ecol Resour 19(3):609–622. https://doi.org/10.1111/1755-0998.12993
Zhou L, Bawa R, Holliday JA (2014) Exome resequencing reveals signatures of demographic and adaptive processes across the genome and range of black cottonwood (Populus trichocarpa). Mol Ecol 23(10):2486–2499
Ziesemer KA, Ramos-Madrigal J, Mann AE, Brandt BW, Sankaranarayanan K, Ozga AT, Hoogland M, Hofman CA, Salazar-García DC, Frohlich B, Milner GR (2019) The efficacy of whole human genome capture on ancient dental calculus and dentin. Am J Phys Anthropol 168(3):496–509
Acknowledgements
Many thanks to Dr Linda Vigilant (Max Planck Institute for Evolutionary Anthropology), Dr M. Timothy Rabanus-Wallace (University of Melbourne), and Dr Carlo Pacioni (Arthur Rylah Institute) for comments on earlier versions of the manuscript and proof-reading assistance.
Funding
The author declares that no funds, grants, or other support were received during the preparation of this manuscript.
Author information
Authors and Affiliations
Contributions
LCW conceived the study, developed the simulation code and wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The author has no relevant financial or non-financial interests to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
White, L.C. Shallow sequencing can mislead when evaluating hybridization capture methods. Conservation Genet Resour 15, 15–20 (2023). https://doi.org/10.1007/s12686-023-01298-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12686-023-01298-3