Abstract
Olive is an ancient oil-producing tree, widely cultivated in Mediterranean countries, and now spread to other areas of the world, including China. Recently, several molecular databases were constructed in different countries and platforms for olive identification using simple sequence repeats (SSRs) or single-nucleotide polymorphisms (SNPs). However, comparing their results across laboratories was difficult. Herein, hundreds of polymorphic single-copy nuclear sequence markers were developed from the olive genome. Using the advantage of multiplex PCR amplification and high-throughput sequencing, a fingerprint database was constructed for the majority of olives cultivated in China. We used 100 high-quality sequence loci and estimated the genetic diversity and structure among all these varieties. We found that compared with that based on SSRs, the constructed fingerprint database based on these 100 sequences or a few of them, could provide a reliable olive variety identification platform in China, with high discrimination among different varieties using the principle of BLAST algorithm. An example of such identification platform based on this study was displayed on the web for the olive database in China (http://olivedb.cn/jianding). After resolving redundant genotypes, we identified 126 olive varieties with distinct genotypes in China. These varieties could be divided into two clusters, and it was revealed that the grouping of the varieties has a certain relationship with their origin. Herein, it is concluded that these single-copy orthologous nuclear sequences could be used to construct a universal fingerprint database of olives across different laboratories and platforms inexpensively. Based on such a database, variety identification can be performed easily by any laboratory, which would further facilitate olive breeding and variety exchange globally.
Similar content being viewed by others
Data availability
All raw sequence data have been deposited in the Genome Sequence Archive in National Genomics Data Center, China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences under the BioProject accession number PRJCA013968 and the Biosample number SAM103582.
References
Atienza SG, Rdl R, Domínguez-García MC, Martín A, Kilian A, Belaj A (2013) Use of DART markers as a means of better management of the diversity of olive cultivars. Food Res Int 54:2045–2053. https://doi.org/10.1016/j.foodres.2013.08.015
Baali-Cherif D, Besnard G (2005) High genetic diversity and clonal growth in relict populations of Olea europaea subsp. Laperrinei (Oleaceae) from Hoggar, Algeria. Ann Bot 96:823–830. https://doi.org/10.1093/aob/mci232
Baldoni L, Cultrera NG, Mariotti R, Ricciolini C, Arcioni S, Vendramin GG, Buonamici A, Porceddu A, Sarri V, Ojeda MA (2009) A consensus list of microsatellite markers for olive genotyping. Mol Breed 24:213–231. https://doi.org/10.1007/s11032-009-9285-8
Barranco D, Cimato A, Fiorino P, Rallo L, Touzani A, Castañeda C, Serafín F, Trujillo I (2000) World catalogue of olive varieties. International Olive Oil Council, Madrid, p 360
Beghè D, Molano JFG, Fabbri A, Ganino T (2015) Olive biodiversity in Colombia. A molecular study of local germplasm. Sci Hortic 189:122–131. https://doi.org/10.1016/j.scienta.2015.04.003
Belaj A, De la Rosa R, Lorite IJ, Mariotti R, Cultrera NG, Beuzón CR, González-Plaza J, Muñoz-Mérida A, Trelles O, Baldoni L (2018) Usefulness of a new large set of high throughput EST-SNP markers as a tool for olive germplasm collection management. Front Plant Sci 9:1320. https://doi.org/10.3389/fpls.2018.01320
Belaj A, Ninot A, Gómez-Gálvez FJ, El Riachy M, Gurbuz-Veral M, Torres M, Lazaj A, Klepo T, Paz S, Ugarte J (2022) Utility of EST-SNP markers for improving management and use of olive genetic resources: a case study at the worldwide olive germplasm bank of Cordoba. Plants-Basel 11:921. https://doi.org/10.3390/plants11070921
Besnard G, El Bakkali A (2014) Sequence analysis of single-copy genes in two wild olive subspecies: nucleotide diversity and potential use for testing admixture. Genome 57:145–153. https://doi.org/10.1139/gen-2014-0001
Besnard G, Terral JF, Cornille A (2018) On the origins and domestication of the olive: a review and perspectives. Ann Bot 121:385–403. https://doi.org/10.1093/aob/mcx145
Besnard KB, Navascués M, Fernández-Mazuecos M, El Bakkali A, Arrigo N, Baali-Cherif D, Brunini-Bronzini de Caraffa V, Santoni S, Vargas P (2013) The complex history of the olive tree: from late quaternary diversification of mediterranean lineages to primary domestication in the northern Levant. Proc R Soc B-Biol Sci 280:20122833. https://doi.org/10.1098/rspb.2012.2833
Biton I, Doron-Faigenboim A, Jamwal M, Mani Y, Eshed R, Rosen A, Sherman A, Ophir R, Lavee S, Avidan B, Ben-Ari G (2015) Development of a large set of SNP markers for assessing phylogenetic relationships between the olive cultivars composing the Israeli olive germplasm collection. Mol Breed 35:1–14. https://doi.org/10.1007/s11032-015-0304-7
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: A flexible trimmer for illumina sequence data. Bioinformatics 30:2114–2120. https://doi.org/10.1093/bioinformatics/btu170
Boyer F, Mercier C, Bonin A, Le Bras Y, Taberlet P, Coissac E (2016) Obitools: a unix-inspired software package for DNA metabarcoding. Mol Ecol Resour 16:176–182. https://doi.org/10.1111/1755-0998.12428
Cultrera NGM, Sarri V, Lucentini L, Ceccarelli M, Alagna F, Mariotti R, Mousavi S, Ruiz CG, Baldoni L (2019) High levels of variation within gene sequences of Olea europaea L. Front Plant Sci 9:1932. https://doi.org/10.3389/fpls.2018.01932
Diez CM, Trujillo I, Martinez-Urdiroz N, Barranco D, Rallo L, Marfil P, Gaut BS (2015) Olive domestication and diversification in the Mediterranean Basin. New Phytol 206:436–447. https://doi.org/10.1111/nph.13181
Duarte JM, Wall PK, Edger PP, Landherr LL, Ma H, Pires PK, Leebens-Mack J, Depamphilis CW (2010) Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels. BMC Evol Biol 10:1–18. https://doi.org/10.1186/1471-2148-10-61
Earl DA, VonHoldt BM (2012) STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour 4:359–361. https://doi.org/10.1007/s12686-011-9548-7
El Bakkali A, Essalouh L, Tollon C, Rivallan R, Mournet P, Moukhli A, Zaher H, Mekkaoui A, Hadidou A, Sikaoui L, Khadari B (2019) Characterization of worldwide olive germplasm banks of Marrakech (Morocco) and Cordoba (Spain): towards management and use of olive germplasm in breeding programs. PLoS One 14:e0223716. https://doi.org/10.1371/journal.pone.0223716
Emms DM, Kelly S (2019) Orthofinder: phylogenetic orthology inference for comparative genomics. Genome Biol 20:1–14. https://doi.org/10.1186/s13059-019-1832-y
Falush D, Stephens M, Pritchard JK (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164:1567–1587. https://doi.org/10.1093/genetics/164.4.1567
Fedosov DY, Korzhenkov AA, Petrova KO, Sapsay AO, Sharko FS, Toshchakov SV, Kolosova AA, Bakhmutova ED, Patrushev MV (2021) SNP-based analysis reveals authenticity and genetic similarity of Russian indigenous V vinifera grape cultivars. Plants-Basel 10:2696. https://doi.org/10.3390/plants10122696
Francis RM (2017) Pophelper: an R package and web app to analyse and visualize population structure. Mol Ecol Resour 17:27–32. https://doi.org/10.1111/1755-0998.12509
Galatali S, Ghafoor NA, Kaya E (2021) Characterization of olive (Olea europaea L.) genetic resources via PCR-based molecular marker systems. Eur J Biol Biotech 2:26–33. https://doi.org/10.24018/ejbio.2021.2.1.146
Ganino T, Bartolini G, Fabbri A (2006) The classification of olive germplasm. J Hortic Sci Biotech 81:319–334. https://doi.org/10.1080/14620316.2006.11512069
IOOC (2022) International olive oil council. Accessed October 2022. https://www.internationaloliveoil.org/olive-world/olive-tree/
Jiménez-Ruiz J, Ramírez-Tejero JA, Fernández-Pozo N, Leyva-Pérez MO, Yan H, Rosa R, Belaj A, Montes E, Rodríguez-Ariza MO, Navarro F, Barroso JB, Beuzón CR, Valpuesta V, Bombarely A, Luque F (2020) Transposon activation is a major driver in the genome evolution of cultivated olive trees (Olea europaea L.). Plant Genome 13:e20010. https://doi.org/10.1002/tpg2.20010
Johnson MG, Pokorny L, Dodsworth S, Botigue LR, Cowan RS, Devault A, Eiserhardt WL, Epitawalage N, Forest F, Kim JT (2019) A universal probe set for targeted sequencing of 353 nuclear genes from any flowering plant designed using k-medoids clustering. Syst Biol 68:594–606. https://doi.org/10.1093/sysbio/syy086
Julca I, Marcet-Houben M, Cruz F, Gomez-Garrido J, Gaut BS, Diez CM, Gut IG, Alioto TS, Vargas P, Gabaldon T (2020) Genomic evidence for recurrent genetic admixture during the domestication of mediterranean olive trees (Olea europaea L.). BMC Biol 18:148. https://doi.org/10.1186/s12915-020-00881-6
Julca I, Marcet-Houben M, Vargas P, Gabaldón T (2018) Phylogenomics of the olive tree (Olea europaea) reveals the relative contribution of ancient allo-and autopolyploidization events. BMC Biol 16:15. https://doi.org/10.1186/s12915-018-0482-y
Kalogiouri NP, Alygizakis NA, Aalizadeh R, Thomaidis NS (2016) Olive oil authenticity studies by target and nontarget LC–QTOF-MS combined with advanced chemometric techniques. Anal Bioanal Chem 408:7955–7970. https://doi.org/10.1007/s00216-016-9891-3
Kalyaanamoorthy S, Minh BQ, Wong TK, Von Haeseler A, Jermiin LS (2017) Modelfinder: fast model selection for accurate phylogenetic estimates. Nat Methods 14:587–589. https://doi.org/10.1038/nmeth.4285
Katoh K, Standley DM (2013) Mafft multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30:772–780. https://doi.org/10.1093/molbev/mst010
Kaya HB, Cetin O, Kaya H, Sahin M, Sefer F, Kahraman A, Tanyolac B (2013) SNP discovery by Illumina-based transcriptome sequencing of the olive and the genetic characterization of Turkish olive genotypes revealed by AFLP, SSR and SNP markers. PLoS One 8:e73674. https://doi.org/10.1371/journal.pone.0073674
Korir NK, Han J, Shangguan L, Wang C, Kayesh E, Zhang Y, Fang J (2013) Plant variety and cultivar identification: advances and prospects. Crit Rev Biotechnol 33:111–125. https://doi.org/10.3109/07388551.2012.675314
Lalitha S (2000) Primer premier 5. Biotech Software & Internet Report 1:270–272. https://doi.org/10.1089/152791600459894
Lazović B, Klepo T, Adakalić M, Šatović Z, Arbeiter AB, Hladnik M, Strikić F, Liber Z, Bandelj D (2018) Intra-varietal variability and genetic relationships among the homonymic East Adriatic olive (Olea europaea L.) varieties. Sci Hortic 236:175–185. https://doi.org/10.1016/j.scienta.2018.02.053
León L, Martín LM, Rallo L (2004) Phenotypic correlations among agronomic traits in olive progenies. J Am Soc Hortic Sci 129:271–276. https://doi.org/10.21273/JASHS.129.2.0271
Li H (2013) Aligning sequence reads, clone sequences and assembly contigs with bwa-mem. arXiv preprint arXiv:1303–3997. https://doi.org/10.48550/arXiv.1303.3997
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R (2009) The sequence alignment/map format and samtools. Bioinformatics 25:2078–2079. https://doi.org/10.1093/bioinformatics/btp352
Li JZ (2018) Introduction and industry development of olive in China. China Forestry Press, Beijing (in Chinese)
Liang C, Wan T, Xu S, Li B, Li X, Feng Y, Cai Y (2018) Molecular identification and genetic analysis of cherry cultivars using capillary electrophoresis with fluorescence-labeled SSR markers. Biotech 8:1–10. https://doi.org/10.1007/s13205-017-1036-7
Librado P, Rozas J (2009) Dnasp v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25:1451–1452. https://doi.org/10.1093/bioinformatics/btp187
Liu BB, Ma ZY, Ren C, Hodel RG, Sun M, Liu XQ, Liu GN, Hong DY, Zimmer EA, Wen J (2021) Capturing single-copy nuclear genes, organellar genomes, and nuclear ribosomal DNA from deep genome skimming data for plant phylogenetics: a case study in Vitaceae. J Syst Evol 59:1124–1138. https://doi.org/10.1111/jse.12806
Lopes MS, Mendonça D, Sefc KM, FoS G, da Câmara MA (2004) Genetic evidence of intra-cultivar variability within Iberian olive cultivars. HortSci 39:1562–1565. https://doi.org/10.21273/hortsci.39.7.1562
Mammadov J, Aggarwal R, Buyyarapu R, Kumpatla S (2012) SNP markers and their impact on plant breeding. Int J Plant Genomics 2012:728398. https://doi.org/10.1155/2012/728398
Marcelino G, Hiane PA, Freitas KDC, Santana LF, Pott A, Donadon JR, Guimarães RCA (2019) Effects of olive oil and its minor components on cardiovascular diseases, inflammation, and gut microbiota. Nutrients 11:1826. https://doi.org/10.3390/nu11081826
Mousavi S, Mariotti R, Regni L, Nasini L, Bufacchi M, Pandolfi S, Baldoni L, Proietti P (2017) The first molecular identification of an olive collection applying standard simple sequence repeats and novel expressed sequence tag markers. Front Plant Sci 8:1283. https://doi.org/10.3389/fpls.2017.01283
Nei M (1987) Molecular evolutionary genetics. Columbia university press, New York
Nguyen LT, Schmidt HA, Von Haeseler A, Minh BQ (2015) IQ-tree: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 32:268–274. https://doi.org/10.1093/molbev/msu300
Olofsson J, Cantera I, Van de Paer C, Hong-Wa C, Zedane L, Dunning L, Alberti A, Christin PA, Besnard G (2019) Phylogenomics using low-depth whole genome sequencing: a case study with the olive tribe (Oleeae). Mol Ecol Resour 19:877–892. https://doi.org/10.1111/1755-0998.13016
Potapov V, Ong JL (2017) Examining sources of error in PCR by single-molecule sequencing. PLoS One 12:e0169774. https://doi.org/10.1371/journal.pone.0169774
Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155:945–959. https://doi.org/10.1093/genetics/155.2.945
Sarri V, Baldoni L, Porceddu A, Cultrera N, Contento A, Frediani M, Belaj A, Trujillo I, Cionini P (2006) Microsatellite markers are powerful tools for discriminating among olive cultivars and assigning them to geographically defined populations. Genome 49:1606–1615. https://doi.org/10.1139/g06-126
Schuelke M (2000) An economic method for the fluorescent labeling of PCR fragments. Nat Biotechnol 18:233–234. https://doi.org/10.1038/72708
Seppey M, Manni M, Zdobnov EM (2019) Busco: assessing genome assembly and annotation completeness. Methods Mol Biol 1962:227–245. https://doi.org/10.1007/978-1-4939-9173-0_14
Sion S, Savoia MA, Gadaleta S, Piarulli L, Mascio I, Fanelli V, Montemurro C, Miazzi MM (2021) How to choose a good marker to analyze the olive germplasm (Olea europaea L.) and derived products. Genes 12:1474. https://doi.org/10.3390/genes12101474
Sollars ESA, Harper AL, Kelly LJ, Sambles CM, Ramirez-Gonzalez RH, Swarbreck D, Kaithakottil G, Cooper ED, Uauy C, Havlickova L, Worswick G, Studholme DJ, Zohren J, Salmon DL, Clavijo BJ, Li Y, He Z, Fellgett A, McKinney LV et al (2017) Genome sequence and genetic diversity of European ash trees. Nature 541:212–216. https://doi.org/10.1038/nature20786
Su C, Sun J, Zhu W, Peng L (2018) History, distribution, and potential of the olive industry in China: a review. Sustainability 10:1426. https://doi.org/10.3390/su10051426
Suo Z, Li W, Jin X, Zhang H (2016) A new nuclear DNA marker revealing both microsatellite variations and single nucleotide polymorphic loci: a case study on classification of cultivars in Lagerstroemia indica L. J Microb Biochem Technol 8:266–271. https://doi.org/10.4172/1948-5948.1000296
Tripoli E, Giammanco M, Tabacchi G, Di Majo D, Giammanco S, La Guardia M (2005) The phenolic compounds of olive oil: structure, biological activity and beneficial effects on human health. Nutr Res Rev 18:98–112. https://doi.org/10.1079/NRR200495
Trujillo I, Ojeda MA, Urdiroz NM, Potter D, Barranco D, Rallo L, Diez CM (2014) Identification of the worldwide olive germplasm bank of Cordoba (Spain) using SSR and morphological markers. Tree Genet Genomes 10:141–155. https://doi.org/10.1007/s11295-013-0671-3
Wang J, Zhang D, Farooqi TJA, Ma L, Deng Y, Jia Z (2019) The olive (Olea europaea L.) industry in China: its status, opportunities and challenges. Agrofor Syst 93:395–417. https://doi.org/10.1007/s10457-017-0129-y
Winfield M, Burridge A, Ordidge M, Harper H, Wilkinson P, Thorogood D, Copas L, Edwards K, Barker G (2020) Development of a minimal Kasp marker panel for distinguishing genotypes in apple collections. PLoS One 15:e0242940. https://doi.org/10.1371/journal.pone.0242940
Xu WY (2001) Germplasm resources and utilization of olive in China. Changchun Press (in Chinese), Changchun
Xu Y, Huang L, Li JH, Deng Y, Zhang JG, Zeng YF (2022) Identification and genetic diversity analysis of olive cultivars in Longnan based on phenotype and SSR markers. Forest Research 35:33–43. https://doi.org/10.13275/j.cnki.lykxyj.2022.004.004 (in Chinese)
Yadav S, Carvalho J, Trujillo I, Prado M (2021) Microsatellite markers in olives (Olea europaea L.): utility in the cataloging of germplasm, food authenticity and traceability studies. Foods 10:1907. https://doi.org/10.3390/foods10081907
Yılmaz-Düzyaman H, de la Rosa R, León L (2022) Seedling selection in olive breeding progenies. Plants 11:1195. https://doi.org/10.3390/plants11091195
Zhang D, Gao F, Jakovlić I, Zou H, Zhang J, Li WX, Wang GT (2020) Phylosuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol Ecol Resour 20:348–355. https://doi.org/10.1111/1755-0998.13096
Zhang N, Zeng L, Shan H, Ma H (2012) Highly conserved low-copy nuclear genes as effective markers for phylogenetic analyses in angiosperms. New Phytol 195:923–937. https://doi.org/10.1111/j.1469-8137.2012.04212.x
Acknowledgements
We thank two anonymous reviewers for their helpful comments and suggestions, Na li and Haiyun Zhao from the National Improved Olive Variety Breeding Base in Dawan’gou, Longnan city, Gansu province for their help on olive sampling.
Funding
This work was supported by the National Key R&D Program of China (No. 2019YFD1000602).
Author information
Authors and Affiliations
Contributions
The study was designed by YZ and JZ. Material preparation, data collection, and analysis were performed by LH, JL, YZ, YD, and GS. The manuscript was written by LH and YZ, and all authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
ESM 1
Online Resource method:Sequence typing method using OBItools3 software. Online Resource 1 Sampling information of 168 olive accessions. Online Resource 2: The information on primer pairs for 462 loci. Online Resource 3 Tag combinations in primer pair. Online Resource 4 The information on primer pairs for 282 loci. Online Resource 5 The identification results of the variety repetitive pair. Online Resource 6 The identification results of olive accessions. Online Resource 7 The structure results from K = 3 to K = 10. Online Resource 8 The flowchart summarizing the improved protocol (DOCX 18 kb)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Huang, L., Zeng, Y., Li, J. et al. One hundred single-copy nuclear sequence markers for olive variety identification: a case of fingerprinting database construction in China. Mol Breeding 43, 86 (2023). https://doi.org/10.1007/s11032-023-01434-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11032-023-01434-9