Introduction

Major histocompatibility complex (MHC) plays a central role in adaptive immunity of jawed vertebrates shaping immune defenses against a wide range of pathogens. The MHC consists of a large region of the genome containing several loci, including classical class I and class II genes coding for cell surface glycoproteins (or MHC molecules) involved in the processing and presentation of antigens to T lymphocytes. These antigens are small peptides derived by proteolysis from self- and pathogen-expressed proteins in the host. The first class molecules are present on the surface of all nuclear cells and bind to peptides derived from intracellular parasites (e.g., virus), whereas class II are present only on specific immune cells (macrophages, B cells, and dendritic cells) and are involved in the recognition of extra cellular pathogens (e.g., bacteria, protozoa). There are different isotypes of MHC class II molecules, including the DR isotype, found in most mammalian species. These molecules are heterodimers of two non-covalently associated α and β subunits encoded by adjacent genes MHC DRA and MHC DRB. While the DRA gene is usually single copy, the presence of multiple copies of the DRB gene appears to be the rule in most mammalian species. For example, nine loci (including five pseudogenes) have been mapped in human MHC (Horton et al. 2004) and three (one pseudogene) in cattle (Burke et al. 1991). However, the functional relevance of these loci is variable due to their varying levels of expression. For instance, DRB1 is the main DRB gene in human and is the most highly expressed (Emery et al. 1993), whereas only the DRB3 gene is considered to be functionally relevant in cattle (Burke et al. 1991). Structurally, the β-chain, encoded by the DRB gene, consists of the extracellular β1- and β2-domains, the transmembrane domain and the cytoplasmic tail. The β1-domain, which is encoded by the DRB exon 2, is part of the structure of the peptide binding groove where foreign peptides are recognized and processed. Residues within this domain that directly bind to antigens define the peptide binding regions or PBR (Brown et al. 1993) and thus determine the antigen binding repertoire. These regions are often found to be under positive diversifying selection resulting in a high number of alleles that allow the host immune system to react to a wide and variable range of foreign pathogens found in the host environment. As a result, the DRB genes usually exhibit high levels of polymorphisms mainly clustered in the second exon coding for the β1 domain (Babik 2010). For instance, a total of 384 DRB3 alleles have been detected in cattle as shown in the IPD-MHC (Immuno Polymorphism Database, https://www.ebi.ac.uk/ipd/mhc/; Maccari et al. 2017) database, and several alleles have been associated with diseases in different breeds (Behl et al. 2012).

The Arabian camel (Camelus dromedarius) or dromedary is the one-humped camel belonging to the genus Camelus that also includes the domesticated Bactrian (Camelus bactrianus) and wild (Camelus ferus) two-humped camels. The dromedary is a common livestock in several arid and semi-arid regions of the world with significant socio-economic importance allowing production of meat and milk in these areas where the climate precludes many other animals. The species is notoriously known for its resilience and adaptation to harsh environments characterized by extreme temperatures, drought, low-quality food, and scarcity of the resources. Notwithstanding this impressive adaptation to harsh conditions, the dromedary is indeed susceptible to various infectious and parasitic pathogens (Khalafalla 2017) although it showed also resistance to some infectious diseases such as viral FMD (foot and mouth disease, Wernery and Kinne 2012) which affects ruminant species causing a significant economic impact. The dromedary could also be infested by a number of ectoparasites such as ticks and fleas (Sazmand et al. 2019) and has been identified as potential reservoir of the zoonotic Middle East respiratory syndrome (MERS) virus (Dawson et al. 2019).

The humoral immune system of the dromedary camel and other camelids is characterized by the presence of antibodies (IgG2 and IgG3) devoid of lights chains (Hamers-Casterman et al. 1993) in addition to the conventional antibodies (IgG1) found in other species. As regard adaptive immunity, the MHC region displays an overall organization similar to the structure of the mammalian MHC with genes ordered into class I-class III-class II and has been mapped to the long arm of chromosome 20 (Avila et al. 2014; Plasil et al. 2016). Currently, studies regarding the MHC II genes and their diversity in dromedary are very scarce despite the importance of the species as a livestock animal and its capacity of adaptation and resilience to harsh environment challenges. For instance, these genes have not yet been characterized at the mRNA level and the available data were derived solely from genomic DNA with no knowledge of the expression status or the number of loci being analyzed. However, this knowledge is primordial for interpreting the significance and the functional relevance of the detected MHC diversity. In dromedary — and in other Camelus species — the MHC II genes appeared to display higher mean nucleotide diversity than other immune and rest of genome genes as estimated from genome-wide analysis in nine dromedaries (Lado et al. 2020). On the other hand, an unexpectedly low level of polymorphism has been previously reported in MHC class II DR and DQ genes in dromedary as well as in Bactrian and wild camels (Plasil et al. 2016). As far as the DRB gene is concerned, only one non synonymous mutation has been observed from a total of four variable sites detected in the analyzed exon 2 spanning 270 nucleotides (54 animals were analyzed).

This work sets out with the aim to study the main MHC DRB gene in dromedary camel at both mRNA and DNA levels. Specifically, the objectives of the study were to (i) identify and characterize the full length MHC DRB cDNA in dromedary using Sanger sequencing and determine gene structure, (ii) assess the extent of the exon 2 polymorphism and genotype the alleles in the Tunisian dromedary population, and (iii) investigate genetic diversity (segregating sites, nucleotide diversity) and the evolutionary process that shape this diversity. The results showed the presence of at least two transcribed DRB genes in the dromedary genome differing in the size of exon 2 (270 and 282 bp), transcript abundance, and the level of polymorphism. Furthermore, occurrence of alternative splicing resulting from exon 2 skipping was observed in one locus. The findings highlight the importance of including cDNA analysis in the investigation of the MHC diversity.

Materials and methods

Sample collection and DNA extraction

A total of 132 blood samples (about 5 ml) were collected in EDTA tubes from unrelated animals in different herds representing all dromedary camel ecotypes found in Tunisia. The panel includes two trios (sire, dam, and offspring) allowing gene segregation analysis. Samples used for RNA extraction were obtained from six healthy adult animals (three males and three females) belonging to the experimental herd of the Arid Lands Institute of Medenine (Tunisia). An additional blood sample for allele validation at the mRNA level was taken from private herd, preserved in RNA later, and stored at –20 °C. Genomic DNA was extracted from blood samples according to the standard phenol–chloroform protocol (Sambrook et al. 1989) and DNA quantity and quality were evaluated by absorbance using UV5Nano spectrophotometer (Mettler Toledo) and by agarose gel electrophoresis. All sampling, animal handling, and study procedures were in compliance with local regulations.

RNA extraction and cDNA synthesis

Peripheral blood mononuclear cells (PBMCs) were isolated from fresh EDTA blood samples using Histopaque 1077 (Sigma), washed twice with PBS pH 7.2 (Gibco, Life Technologies), and homogenized in TRIzol reagent (Sigma) using rotor–stator homogenizer (Kinematica). For RNA later-preserved sample, blood cells were collected from 2.4 ml after thawing and centrifugation for 1 min and processed as for PBMCs using TRIzol. Total RNA was isolated according to the manufacturer protocol and purified with the PureLink RNA Mini Kit (Ambion, Life Technologies). RNA quality and concentration were assessed on an UV5Nano spectrophotometer (Mettler Toledo) at 260/280 nm wavelength. All RNA samples were treated with DNase I (Thermo Scientific) prior to cDNA synthesis in order to remove genomic DNA. Reverse transcription was performed using the MMLV reverse transcriptase (Invitrogen) following manufacturer protocol in a final volume of 20 µl containing 1 µg of RNA and 2.5 µM of the oligo d(T)15 primer (Promega). Single strand cDNA was diluted ten-fold and stored at − 20 °C.

In silico analysis and primer development

Human DRB1 and bovine DRB3 mRNA sequences (GenBank accession numbers NM_001243965 and NM_001012680, respectively) were used as queries to search dromedary camel ESTs (Al-Swailem et al. 2010) using standalone BLAST (version 2.9.0 + , Altschul et al. 1990). These ESTs were generated from several tissues including blood from nine middle-eastern dromedary camels. The retrieved most similar hits (n = 5) were blasted against the NCBI non-redundant nucleotide (nr) database using online BLAST in order to confirm they were DRB-like in other mammalian species. Besides, they were further searched in the three available dromedary camel genome assemblies to infer gene structure (exon–intron boundaries following canonical AG/GT rule), and in the dromedary RNAseq available in the NCBI database as transcriptome shotgun assembly (TSA, bioproject PRJNA82161). The top hit ESTs were aligned to DRBs from other species, including the human DRB1, the bovine DRB3, and the porcine DRB1 (NM_001113695) mRNAs using Clustal omega available in the EBI web server (https://www.ebi.ac.uk/Tools/msa/clustalo/) to compare gene structure and features. Primers used for amplification of the full-coding region of the DRB1 gene were designed based on these detected ESTs. The remaining primers (DRB2 cDNAs and DRB1 exon 2) were constructed over genomic DNA after aligning transcript (exons) to a genomic sequences using BLAST and Splign (www.ncbi.nlm.nih.gov/sutils/splign) tools. All primers (Table 1) were designed using Primer3 version 4.1 (Untergasser et al. 2012), checked with Primer-Blast (NCBI) and SMS (sequence manipulation suite) version 2 (Stothard 2000).

Table 1 Primers used for the analysis of the dromedary camel DRB1 and DRB2 genes and cDNAs. Underlined nucleotides in the Cd.DRB1.E2F primer correspond to intron 1 sequences and the remaining bases from exon 2. The two first nucleotides in the 5′-end of the Cd.DRB2.F primer were added in order to increase melting temperature (GA in gDNA)

Amplification of the DRB cDNA and gene

The full coding region of the DRB1 cDNA was amplified using primers Cd.DRB1.F-R (Table 1) constructed on exons 1 and 6, respectively. The PCR reactions were performed in a final volume of 25 µl containing 1X PCR buffer, 1.5 mM MgCl2, 0.2 mM dNTPS, 0.4 µM of each primer, 0.5 U of Go Taq DNA Polymerase (Promega), and about 20 ng of cDNA. Thermal profile consisted of an initial denaturation step at 94 °C for 2 min, 35 cycles of denaturation (94 °C–30 s), annealing at 62 °C for 45 s, extension (72 °C–1 min), and a final extension step at 72 °C for 10 min. The same PCR conditions were used for the DRB2 cDNA except for MgCl2 (2 mM) and annealing temperature (63 °C).

Two overlapping fragments spanning whole DRB1 gene were amplified with primers Cd.DRB1.F-I2R (exon1-intron2, ~ 5.3 Kb) and Cd.DRB1.E2F-R (exon2-exon6, ~ 4.5 Kb) using LongAmp Taq DNA Polymerase (New England Biolabs). Amplifications were performed in a total volume of 25 μl and contained 1X LongAmp buffer, 300 µM dNTPS, 1.5 mM MgCl2, 0.4 µM of each primer, 2.5 units of LongAmp Taq polymerase, and about 50 ng of DNA. Thermal cycling started with one cycle at 94 °C for 2 min, followed by 35 cycles of denaturation (94 °C–30 s), annealing (62 °C for 30 s), and extension (65 °C–5 min), with a final extension step at 65 °C for 10 min.

Amplification of the DRB1 exon 2

The entire sequence of the DRB1 exon 2 (282 bp) was amplified using primers Cd.DRB1.I1F—I2R (Table 1) designed in introns 1 and 2, respectively, resulting in a fragment of 382 nucleotides. All samples were genotyped for exon 2 using this set of primers, including samples used for cDNA analysis in order to compare gDNA and cDNA sequences. PCR reactions were carried out as above (for DRB1 cDNA) in either 10 or 25 μl using Go Taq polymerase (Promega) and 10–30 ng of genomic DNA. Cycling conditions were as follows: initial denaturation at 94 °C for 2 min, then 35 cycles of 94 °C for 30 s, 66 °C for 30 s, 72 °C for 30 s, and a final extension at 72 °C for 10 min. Exon 2 (partial sequences) was also amplified using the forward primer Cd.DRB1.E2F used for gene amplification. This primer was designed at the intron1-exon2 boundary allowing the amplification of a 364 bp-fragment including 271 bp from exon 2 (11 nucleotides missed) when used with Cd.DRB1.I2R reverse primer. A subset of a randomly selected samples (n = 79) with known genotype and representing all established alleles were amplified using this set of primers (at annealing temperature of 65 °C) and directly sequenced in order to further validate the genotyping procedure.

Sequencing of MHC DRB genes

All PCR reactions included a negative control (no DNA) and products were visualized by electrophoresis in 1.5% agarose gel in 1 × TAE buffer containing ethidium bromide (0.5 µg/ml). Prior to sequencing, the PCR products were either gel-extracted using Purelink gel extract kit (Invitrogen) or purified with exonuclease I and shrimp alkaline phosphatase (Thermo Fischer). Purified fragments were directly sequenced in both directions using the BigDye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems) on an ABI PRISM 3500 Genetic Analyzer (Applied Biosystems). All samples were sequenced at least twice from two independent PCRs (ranging from 2 to 5 ×). In order to phase alleles from heterozygote samples, DNA fragments (DRB1 cDNA) were cloned using the TOPO TA cloning kit (Invitrogen) and transformed into competent JM109 cells. Positive clones were confirmed for the presence of correct insert by colony PCR and by enzymatic digestion with EcoR I endonuclease (Thermo Fischer). Plasmid DNA from five clones was purified with EZ-10 Spin Column Plasmid DNA minipreps kit (Bio Basic, Canada) and sequenced using M13 forward and reverse primers.

Sequence analysis, allele detection, and nomenclature

DNA sequences were analyzed with Sequencing Analysis software v6.0 (Thermo Fischer) and all chromatograms were manually checked. The obtained sequences were aligned using Clustal omega and compared to the NCBI non-redundant nucleotide (nr) database and to the dromedary camel genomic and transcriptome sequences using BLAST. Coding sequences were translated with SMS v2. For DRB1 genotyping, the obtained homozygote exon 2 sequences were compared to the alleles initially established from full coding cDNAs, and heterozygote samples were resolved using Haplofinder script (Miltiadou et al. 2003). Individuals showing alleles with new mutations were re-sampled — when possible — in order to extract RNA and obtain the full coding cDNA sequences. Alleles were defined on the basis of the standard criterion in MHC genotyping (Babik 2010): sequences are considered to be valid alleles only when they are detected at least twice in two different PCRs from the same or different individual. Alleles were designated following the proposed nomenclature for the MHC in different species (Klein et al. 1990) with serial numbers according to the observed frequencies.

Data analysis

Various indices of nucleotide and amino acid polymorphisms of the DRB1 exon 2 were calculated using DnaSP v6.12 (Rozas et al. 2017) and MEGAX (Kumar et al. 2018). These include nucleotide diversity (π), haplotype diversity (Hd), segregating and parsimony informative sites, and overall mean nucleotide and amino acid genetic distances based on Tamura 3-parameter (T92) evolutionary distances and Poisson corrected amino acid distances, respectively. Genepop v4.7 (Rousset 2008) was used to calculate allele and genotype frequencies and to test for Hardy–Weinberg (HW) equilibrium.

The occurrence of recombination events was examined with GARD method (Kosakovsky Pond et al. 2006) available in the datamonkey server (http://datamonkey.org/, Weaver et al. 2018) and using methods implemented in RDP5 (Recombination Detection Program, Martin et al. 2021) software. These methods include RDP, Geneconv, Maxchi, Chimaera, Bootscan, SiScan, and 3Seq, all used with default settings and at alpha value of 0.05 for significance.

Footprint of positive selection acting on individual sites of the DRB1 exon 2 was evaluated by analyzing the ratio of non-synonymous (dN) to synonymous (dS) substitutions (dN/dS or ω) using maximum likelihood (ML) and Bayesian inference approaches. In the ML approach, sites subject to episodic positive or diversifying selection were detected using the MEME (mixed effects model of evolution, Murrell et al. 2012) method in the HyPhy package implemented in the datamonkey server. Bayesian inference analysis was carried out within MrBayes version 3.2.7 (Ronquist et al. 2012) using the Ny98 model with default priors and two independent runs of four Markov chain Monte Carlo (MCMC) chains for 150,000 generations with a burn-in of 25%. Convergence was assessed following default settings using the average standard deviation of split frequencies between runs (< 0.01), ESS (estimated sample size, > 100), and PSRF (potential scale reduction factor, ~ 1) indices. Codons identified by these two approaches are considered to be under positive selection, whereas codons are considered only potentially positively selected when detected by MEME method or when ω (dN/dS) > 3 and the probability of being positively selected is ≥ 0.9 using Bayesian inference. Since the dromedary DRB1 exon 2 contains an insertion of 12 nucleotides compared to human DRB1, it is difficult to infer residues of the peptide binding region (PBR) and thus to partition the exon 2 into PBR and non PBR codons. Therefore, only overall dN/dS ratio on the entire exon 2 was calculated using DNaSP v6.12 following the modified Nei–Gojobori method with Jukes–Cantor correction.

Analyses of the phylogenetic relationships among the DRB1 exon 2 alleles were performed using ML method in MEGAX and the JC + I model (Jukes and Cantor 1969) which is considered the best fitting model on the basis of the Bayesian information criterion (BIC). The tree robustness was assessed with 2000 bootstrap replications, and outgroup included exons 2 from Bos taurus DRB3 (NM_001012680) and Sus scrofa DRB1 (NM_001113695) genes.

Results

Evidence of two transcribed DRB loci in the Arabian camel genome

In order to identify functional DRB genes in dromedary camel, we firstly focused on transcriptome data through BLAST analysis of the available EST database. We retrieved several contigs including a transcript of 1.3 Kb (CL83contig1) showing 85 and 83% similarity to bovine DRB3 and human DRB1, respectively. Searching for this contig in the dromedary camel genomes revealed the presence of six coding exons exhibiting complete identity to the sequences from the ASM164081v1 genome assembly. Furthermore, several highly similar transcripts were also detected in the dromedary TSA databases.

In addition to these identical matches (to CL83contig1) found in the genomic sequences, further hits covering nearly the entire sequence but with gaps and less identity were observed in the three dromedary camel genomes assemblies available in the NCBI databases. These hits, corresponding to six putative exons, were detected in the minus strand of the dromedary reference genome (assembly accession: GCF_000803125.2) and sequence discrepancies and gaps were mainly located in the 5′ region of the transcript. Identical sequences to these putative exons were detected in raw RNAseq SRA (Sequence Read Archive) though none were found in ESTs and TSA. Close sequence analysis showed that this locus has been annotated in the reference genome assembly (LOC105101410) as DRB-like albeit the locus product was reported as low quality protein due to the discrepancy between genome and mRNA sequences.

Comparative analysis revealed that these two putative DRB loci displayed similarity to the DRB genes from mammalian species as shown by BLAST search against the non-redundant (nr) and the reference sequence RNA (refseq rna) NCBI databases. These observations clearly indicate that the Arabian camel genome contains at least two transcribed DRB loci, designated MhcCadr-DRB1 (initially detected in ESTs) and MhcCadr-DRB2 (observed in the minus strand) in accordance to the proposed nomenclature for the MHC in different species (Klein et al. 1990). These genes are located approximately 155 Kb apart in the chromosome 20 of the dromedary reference genome and are transcribed in opposite directions (Fig. 1A).

Fig. 1
figure 1

Dromedary camel MHC DRB1 and DRB2 genes. A Relative position of the DRB1 and DRB2 genes in chromosome 20 of the reference genome. Genes are represented as arrows indicating the direction of transcription. B Amplification of the DRB2 cDNA (left) showing isoforms X1 and X2, and the DRB1 cDNA (upper right); lower right, amplification of the whole DRB1 gene (~ 9.5 Kb) in two overlapping fragments. C Schematic representation of the dromedary camel MHC DRB1 gene. Introns are shown as lines and exons as rectangles with coding region in purple and UTR in white (sizes are not to scale). ATG indicates the start codon and TGA depicts the stop codon. Arrows represented approximate position of primers used for gene amplification

Low abundance and alternative splicing of the DRB2 mRNA

To determine whether the DRB2 locus is transcribed in our panel, a primer pair flanking the coding region (Table 1) was designed. PCR amplification of cDNA derived from seven dromedaries produced a fragment of expected size (~ 1.1 Kb) in addition to a low-size fragment of about 0.8 Kb (Fig. 1B). Sequence analysis from both fragments showed that they correspond to two DRB2 transcript variants (named X1 and X2) resulting from an RNA alternative splicing event. Alignment of the X1 isoform (1032 bp) to the dromedary genomes revealed that the DRB2 gene is spanned over six exons with an exon 2 size of 270 nucleotides, similar to the DRB genes from other species such as human, bovine, and porcine. The open reading frame contains 801 bp coding for 266 amino acids. The locus is mapped to 20,873,286–20,863,270 positions (~ 10 Kb) within the minus strand of the dromedary chromosome 20 (GenBank accession: CM016646.2). Compared to X1, the X2 isoform (764 bp) lacks 270 nucleotides as a result of exon 2 skipping during RNA splicing (Fig. 1B) with no evidence of frameshift or premature codon stop, and translating to a predicted protein of 176 amino acids. The DRB2 exon 2 sequences are of low complexity, showing a high GC content (about 67.4%). A total of nine SNPs were detected, all in the X2 isoform from two individuals, including five non-synonymous (three in exon 1 and two in exon 3), two synonymous in exons 1 and 3, and two in the 3′-untranslated region. The sequences of the DRB2 X1 and X2 isoforms have been submitted to the GenBank under accession numbers OQ106969 and OQ106970, respectively.

The dromedary DRB2 mRNA (X1) is virtually identical to the genomic sequences from domestic Bactrian (Camelus bactrianus) and wild (Camelus ferus) camels, with only four SNPs observed in exon 1 and one SNP in the 3′-UTR (exon 6). The DRB2 locus was found in the plus strand of chromosome 20 (and DRB1 in minus strand) in the genomes of both species and the same sequence orientation has been observed in the previous assembly of the dromedary reference genome (Camdro2, GCA_000803125.2). This sequence orientation is likely the correct one, and the sequences were probably reversed during the assembly of the current version of the dromedary reference genome Camdro3. The inter genic region between the two DRB loci spanned about 155 and 74 Kb in the dromedary and wild camel genomes, respectively. Compared to other species, the DRB2 mRNA (X1) displayed a nucleotide similarity ranging from ~ 86% (bovine, caprine) to 81.4% (porcine) with intermediate levels for other common species (human, horse, dog, cat, sheep, chimpanzee). Contrary to DRB1, the DRB2 transcript was not detected neither in the ESTs (derived from blood among other tissues) nor in the TSA databases, though identical sequences were found in the raw RNAseq reads generated from blood of three dromedaries (bioproject accession: PRJNA701734). Although the expression level was not quantitatively assessed, this finding suggests that the DRB2 is rather a low abundant transcript, which is also indicated by the observed faint cDNA amplification (Fig. 1B) despite extensive attempts of PCR optimization.

From the seven individuals sequenced, four were homozygous for the X2 variant and thus lacking exon 2, three were heterozygous X1/X2, and none homozygous for the full X1 variant. The functionality of the DRβ proteins in mammals is primarily determined by the β1 domain, which is a structural part of the peptide-binding cleft. Residues within this domain are encoded by exon 2, including the amino acids in the PBR that interact directly with the foreign antigens. Consequently, although the DRB2 gene is transcribed as shown by mRNA analysis, it is unlikely that it would be of functional relevance due to the exon 2 skipping during transcription. These observations, together with the noticed low abundance of the transcripts, clearly indicate that the DRB2 is not the main DRB gene in the dromedary camel and thus, it is of minor importance as regard the adaptive immunity in this species.

Characterization of the DRB1 cDNA and gene

As previously mentioned, the DRB1 locus corresponds to the transcript (CL83contig1) detected from the EST database displaying complete identity to the dromedary genomic sequences and high similarity to the bovine DRB3 and human DRB1 mRNAs. A couple of primers (Table 1) was designed based on this contig to PCR amplify the full coding region using cDNA derived from four animals. Subsequently, three animals were included in order to validate alleles detected during exon 2 diversity screening. Contrary to the DRB2, a sharp, high intensity expected fragment (980 bp) without extra bands was observed (Fig. 1B). Alignment of the obtained sequences to dromedary genomes revealed that the DRB1 gene consists of six exons (Fig. 1C) with an open reading frame of 813 bp encoding for 270 amino acids. All exons were coding and the intron sizes varied from 309 (intron 5) to 4801 (intron 1) nucleotides. The gene size is about 9.5 Kb. The dromedary DRβ1 protein contains an additional four residues (GEQG) at positions 74–77 when compared to DRβ from other mammalian species (266 aa) as a result of an insertion of 12 nucleotides in exon 2 (282 bp) and this feature seems to be specific to the Camelidae family. The four amino acids include an acidic residue (glutamic acid) along with one polar (glutamine) and to two non-polar (glycine) residues. The insertion is located upstream of the major part of the PBR sites (16 from a total of 24 codons) making it difficult to infer dromedary PBR by comparison to human DRβ1. The DRB1 had 88% nucleotide similarity and 79.6% amino acid similarity with the DRB2 (isoform X1). Alignment of the dromedary DRB1 and DRB2 nucleotide and protein sequences are shown in Fig. S1.

Several transcripts showing complete identity to the obtained DRB1 cDNA sequences were found in the dromedary transcriptome data (ESTs, TSA, and SRA) indicating a wide expression and transcript abundance and corroborating the observations from cDNA amplification (Fig. 1B). Two cDNA sequences (911 bp) obtained from two different animals were identical to CL83contig1 and to the genomic sequences from one contig detected in the ASM164081v1 genome assembly containing the whole DRB1 gene (LSZX01099695, positions: 3231–12,569). On the contrary, we found several discrepancies when these sequences were compared to the current assembly of the dromedary reference genome (GCF_000803125.2) likely due sequence or assembly errors of this region of the genome. Specifically, although almost identical sequences of the 5′-region of the gene (from exon 1 to exon 3 with two missed nucleotides in exon 2) were observed, the sequences completely diverge from exon 3 (starting at position 54) onwards. Based on these partial sequences, a DRB-like locus was predicted (LOC105101413) in the NCBI reference genome annotation (the locus product was annotated as low quality protein). In this regard, similar sequence errors and equivocal annotation have been previously reported in the MHC region of the dromedary reference genome (Lado et al. 2021). The DRB1 gene has been correctly mapped to the previous assembly version Camdro2 (GCA_000803125.2) though with minor sequence errors (281 bp of exon 2).

Given these observed discrepancies and in order to assess the size and the structure of the DRB1 gene, two overlapping fragments covering the whole gene were amplified by PCR using genomic DNA and their ends were directly sequenced. The size of amplified fragments is ~ 9.5 Kb as expected (Fig. 1B) and the obtained sequences (exons one, two, and from the end of intron three to exon six as well as partial introns 1 and 2) were identical to the sequences of the contig LSZX01099695. This finding thus confirms that this contig indeed contains the DRB1 gene. The retrieved genomic sequences have been submitted to the GenBank under accession numbers OQ106978-80. The DRB1 gene is highly conserved in the genus Camelus; the sequences of the whole gene in the dromedary (LSZX01099695:3231–12,569) and in the wild camel (VSZR01000013:c21690641-21,681,287) were practically identical even in introns. Only an insertion of 16 nucleotides in the repetitive sequences at the start of the intron 2 and one SNP in intron 3 were observed in the Camelus ferus gene compared to the dromedary camel. Relatively high similarities were also observed between the dromedary DRB1 and the DRBs from other mammalian species at both nucleotide (83–87%) and protein (75–80%) levels.

Polymorphism screening and genetic diversity of the DRB1 exon 2

A total of 132 animals were successfully genotyped by direct sequencing of the entire exon 2 (282 bp) amplified using primers Cd-DRB1.I1F-I2R (Fig. S2). The panel includes two families (sire, dam, and offspring) as well as the seven animals genotyped at the mRNA level. Patterns of chromatograms obtained after direct sequencing are consistent with the presence of a single DRB1 locus since no more than two peaks were observed in all obtained sequences. Complete identity was also observed between the cDNA and the genomic DNA sequences derived from the same animals. Thirty three (33) variable nucleotide sites in exon 2 were detected in the analyzed panel translating in 18 amino acid changes, corresponding to 19% of the 95 encoded residues (Table 2). Only one site showed three possible amino acid residues at position 104 in the protein, all remaining nucleotide and protein sites were bi-allelic. Overall, an intermediate level of nucleotide diversity (π) was observed (~ 0.049) in the analyzed population (Table 2).

Table 2 Measures of the DRB1 exon 2 genetic diversity. Sn, segregating nucleotide sites; Si, parsimony informative sites; k, average number of nucleotide differences; π, average nucleotide diversity; Hd, haplotype diversity; Saa, segregating amino acid sites; dnt and daa, overall mean nucleotide and amino acid genetic distances (± standard errors); and n, number of animals

A total of seven exon 2 alleles (haplotypes) resulting from the combination of these polymorphisms were identified in the Tunisian dromedary population and designated Cadr-DRB1*01 through Cadr-DRB1*07. Alignment of the nucleotide and predicted amino acid sequences of these alleles are shown in Figs. 2 and 3, respectively. Allele-specific sites were found in three alleles (Cadr-DRB1*01, 04, and 06). All alleles differed by at least two amino acids (Cadr-DRB1*02-Cadr-DRB1*03, Cadr-DRB1*05-Cadr-DRB1*07) with a maximum of 15 differences observed between Cadr-DRB1*01 and Cadr-DRB1*02. The average number of nucleotide differences (k) was 13.24 and the mean nucleotide and amino acid distances were 5 and 8%, respectively (Table 2). A low mean ratio of transitions to transversions (Ti/Tv) over all sequence pairs was observed (0.616), close to the theoretical expected value of 0.5, and ranging from 0 (Cadr-DRB1*02-Cadr-DRB1*05) to 2.5 (Cadr-DRB1*02-Cadr-DRB1*03). Analysis of trio genotypes showed Mendelian inheritance with sharing of identical alleles within each trio without evidence of recombination. The first family was homozygous for the second allele while the second trio was heterozygous displaying Cadr-DRB1*01–05, Cadr-DRB1*01–02, and Cadr-DRB1*01–02 genotypes for sire, dam, and offspring, respectively.

Fig. 2
figure 2

Alignment of the nucleotide sequences of the dromedary camel MHC DRB1 exon 2 alleles (282 bp). Allele-specific sites are highlighted in blue and the insertion of the 12 nucleotides at positions 120–131 (corresponding to the positions 220–231 in the coding region) is highlighted by a grey rectangle

Fig. 3
figure 3

Alignment of the deduced amino acid sequences of the dromedary camel MHC DRB1 exon 2 alleles. The first (P) and last (V) residues are also encoded by exons 1 and 3, respectively. Allele-specific sites are highlighted in blue, and the insertion of the four residues at positions 74–77 is indicated in bold and highlighted by a grey rectangle. Numbering of amino acid residues is relative to the DRβ1 protein (270 aa) above the alignment and to the residues encoded by exon 2 below the alignment. Detected sites under positive selection are indicated by + symbol highlighted in green (detected by MEME method and Bayesian inference) and grey (Bayesian inference)

Of the 132 animals genotyped, only 38 were heterozygous showing ten different allele combinations (from 21 possible), and this deficit of heterozygotes (Ho = 0.29, He = 0.68) resulted in a significant deviation from HW equilibrium. The most frequent allele was Cadr-DRB1*01 found in 48 homozygous and 30 heterozygous animals, followed by Cadr-DRB1*02 present in 24 homozygous and 27 heterozygous individuals. Twenty animals (from 38) were heterozygous Cadr-DRB1*01/Cadr-DRB1*02. Apart from the Cadr-DRB1*07, all other alleles were observed in heterozygote combinations with either Cadr-DRB1*01 or Cadr-DRB1*02. These two highly divergent alleles, differing by 29 nucleotides and 15 amino acids, had a cumulative frequency of 76% (48 and 28 respectively). Frequency of other alleles varied from 9.1% (Cadr-DRB1*03) to 2.7% (Cadr-DRB1*06) with alleles Cadr-DRB1*04 and Cadr-DRB1*05 having an intermediate frequency (5.7%). The less abundant allele in our panel was Cadr-DRB1*07, found in only one homozygous animal. However, this allele was detected in TSA (GenBank accession: GAEB01042196) as well as in raw RNAseq SRA (SRX10094694). Three other alleles detected here were found in the EST database (Cadr-DRB1*01, Cadr-DRB1*03, and Cadr-DRB1*05), two in RNAseq SRA (Cadr-DRB1*01, Cadr-DRB1*03), and one in TSA (Cadr-DRB1*02). Thus, only the Cadr-DRB1*04 allele present in this panel with 5.7% frequency and the Cadr-DRB1*06 (2.7%) were not found in the available dromedary transcriptome resources.

Both genomic sequences (LSZX01099695) from the ASM164081v1 assembly and the CL83Contig1 transcript initially detected in the ESTs displayed Cadr-DRB1*01 genotype. This allele was also observed in the Camelus ferus reference genome (GCF_009834535.1). In the Bactrian camel reference genome (assembly accession: GCF_000767855.1), the DRB1 gene was split into two contigs with partial sequences of exon 2 (191 nucleotides) found at the end of the contig (JARL01022239). These sequences (which contain 23 of 33 variable sites) are identical to the Cadr-DRB1*01 allele. In this species, four transcripts were detected in the TSA (bioproject PRJNA62509), none of them was identical to the alleles detected here. The seven DRB1 exon 2 alleles have been deposited in GenBank under accession numbers OQ106971-77.

Five alleles (Cadr-DRB1*01–05) were characterized at the mRNA level by sequencing the full coding regions from seven animals. No samples were available for mRNA analysis of the two remaining alleles. Three animals, two homozygous and one heterozygous, displayed the Cadr-DRB1*01 allele. Outside the β1 domain encoded by exon 2, four variations were detected in exon 3, including three non-synonymous (p.V137A, p.N183S, and p.V197A; Cadr-DRB1*01 is the reference). No evidence for mutations was found neither in the remaining exons nor in the untranslated regions. The substitution of valine residue by alanine at position 197 resulted in a new allele since it was observed only in one animal harboring the reference allele Cadr-DRB1*01. Therefore, the reference allele showing valine at this position was renamed Cadr-DRB1*01:01 following MHC nomenclature, whereas alanine at this position in the same allele will corresponds to Cadr-DRB1*01:02. The other four alleles (Cadr-DRB1*02–05) showed A-S-V amino acids at positions 137–183-197, respectively. The synonymous mutation (c.537 T > C) occurred in the alleles Cadr-DRB1*02–05 and was not observed in Cadr-DRB1*01:02. The alignment of the full coding nucleotide and the deduced amino acid sequences of the six alleles are shown in Fig. S3. The sequences of these alleles have been deposited in GenBank under accession numbers OQ106963-68.

As outlined in methods, a subset of animals (n = 79) displaying all detected genotypes were also genotyped using different forward primer (Cd.DRB1.E2F) for partial exon 2 amplification (271 bp) (Fig. S2) and the results showed complete concordance of genotypes. This finding validated the genotyping procedure and ruled out the likelihood of null alleles. On the other hand, this method could also be used for DRB1 genotyping in dromedary camel since the 11 missed nucleotides at the start of exon 2 which are included in the primer sequences turned out to be monomorphic in all genotyped animals.

Recombination and positive selection analysis

No significant evidence of recombination was detected in the DRB1 exon 2 when using methods implemented in RDP5 software. Although two potential recombination events at sites 61–170 and 100–215 were found, the involved parental sequences were not clearly identified and they turned out to be non-significant for the seven used methods. GARD analysis revealed one recombination event with inferred breakpoints at positions 26 and 189, the same position 26 but in conjunction with position 211 was also detected by DnaSP6 which used the method of Hudson (1987). Even considering the occurrence of this recombination event as highly plausible, it is unlikely that it would have pronounced effects on codon-based analysis of positive selection given the low level of recombination (only one event) and the limited number of the analyzed sequences (Anisimova et al. 2003).

Overall, the mean dN/dS ratio estimated across all codon sites of the second exon was less than 1 (ω = 0.89) indicating a lack of positive selection at the DRB1 locus. However, when considering individual codons for dN/dS ratio estimation, signals of episodic diversifying selection were detected at three positions (9, 71, and 86) by MEME method (Fig. 3). The same codons were also detected by Bayesian inference, all showing a high estimate of ω value (3.2–3.6) albeit with varying posterior probabilities (0.76, 0.9, and 0.87 for the three codons, respectively). An additional codon at position 78 was detected by the same procedure as being under positive selection (ω = 3.84) with strong evidence (posterior probability of 0.98). Three other codons at positions 33, 57, and 61 were detected as having an omega ratio > to 3 but with reduced posterior probabilities (0.72–0.77). Codon 9 falls within the peptide binding region (PBR) on the basis of the homology to the human DRβ1 molecule (Brown et al. 1993), whereas PBR inference could not be ascertained for codons 71, 78, and 86 since they were located downstream of the four amino acids insertion observed in the dromedary DRβ1 protein.

Phylogenetic analysis of the DRB1 exon 2 alleles

The ML phylogenetic tree based on the entire DRB1 exon 2 alleles with bovine and porcine alleles as outgroups is depicted in Fig. 4. The tree divided exon 2 sequences into two separate lineages represented by the two highly divergent Cadr-DRB1*01 (with Cadr-DRB1*03, 06 and 07) and Cadr-DRB1*02 (with Cadr-DRB1*04 and 05) alleles. Within each lineage, two alleles appeared to be closely related and were clustered together with high bootstrap support, Cadr-DRB1*01-Cadr-DRB1*06 (90%) and Cadr-DRB1*04- Cadr-DRB1*05 (87%). The remaining alleles formed each distinct sub-lineage but with weak bootstrap support (less than 70%).

Fig. 4
figure 4

Maximum likelihood phylogenetic tree of the seven dromedary MHC DRB1 exon 2 alleles. The numbers on the nodes represent the bootstrap percentages that support each node after 2000 replicates

Discussion

To date, few is known as regard the MHC II genes in the dromedary camel, e.g., number of expressed loci, gene structure and polymorphism, despite the importance of the species from the socio-economic point of view (as livestock animal thriving in several arid and semi-arid regions) and from the biological and evolutionary perspectives (genetic basis of adaptation to harsh environments, biological peculiarities). This study thus provided first insights on the expressed MHC class II DRB loci and their diversity in the Arabian camel Camelus dromedarius. Focusing essentially on the expressed loci rather than solely genomic DNA was motivated by the fact that several DRB genes (and pseudogenes) have been observed in several species with varying levels of expression and functionality. Furthermore, alleles detected using genomic DNA are not always observed at the cDNA level (O'Connor and Westerdahl 2021) and thus not mirroring the real functional diversity.

In this regard, and as in several other species where multiple DRB loci were found, the dromedary camel genome contains at least two transcribed loci, designated MhcCadr-DRB1 and MhcCadr-DRB2. These two loci share overall gene structure with six exons but exhibited different exon 2 sizes (282 vs. 270 bp) and transcript abundance patterns. Cadr-DRB1 is thought to be the main DRB locus in the dromedary camel on the basis of transcript abundance and genetic diversity, whereas Cadr-DRB2 locus seems of secondary relevance due to exon 2 skipping during transcription and to the observed low transcript abundance and polymorphism. The presence of multiple copies of the DRB locus with varying levels of expression and polymorphism has been observed in several mammalian species. For example, the main DRB gene in human, HLA-DRB1, is highly polymorphic and is expressed at much higher level than its paralogs DRB3, DRB4, and DRB5 (Emery et al. 1993). Similarly, only BoLA-DRB3 is functional in cattle; the second gene BoLA-DRB2 is monomorphic and showing very low levels of expression (Burke et al. 1991). Multiple copies of MHC loci usually arise by gene duplication events following the “birth-and-death” model of evolution (Nei et al. 1997). In this context, and in addition to the two DRB genes characterized here, a DRB2-like locus containing a 270 bp exon 2 has been predicted (LOC105101437) in the NCBI annotation of the dromedary camel reference genome. The extent of expression and genetic diversity of this locus as well as its relevance to the dromedary immunity remains to be investigated. Even though no evidence of additional copies of the DRB1 gene was found in the sequenced dromedary genomes, the presence of duplicated copies could not be ruled out because DRB copy number variation between individuals within the same species appears to be not uncommon in mammalian species, including in human (Arvidsson et al. 1995). Thus, and in the light of the limited number of the sequenced dromedary genomes currently available, this possibility should be considered as it remains fairly plausible.

The dromedary DRB1 gene contains an insertion of twelve nucleotides in exon 2 resulting in the introduction of four amino acids in the antigen presenting domain. This insertion extends the dromedary exon 2 to 282 bp (and the protein to 270 amino acids) and appears to be a specific feature of Camelidae family since it was not observed in other mammalian species DRBs (Fig. 5). This feature adds another peculiarity to the camelids’ immune system characterized by the presence of the well-known HCAbs (heavy-chain antibodies) devoid of light chains (Hamers-Casterman et al. 1993). It is expected that the four amino acid extensions of the β1 domain in dromedary camel DRβ1 will have an impact on the composition of the PBR sites and the repertoire of bound peptides. Further investigations are needed to evaluate the extent of this impact and to study the characteristics of the dromedary PBR and peptide binding repertoires. Similar insertions of twelve nucleotides in the exon 2 (282 bp) of the dromedary DQB gene have been reported (Plasil et al. 2016) and this appears to be a common characteristic of the dromedary MHC DR and DQ genes.

Fig. 5
figure 5

Alignment of the amino acids encoded by the MHC DRB exon 2 in the dromedary camel and in other species. The four amino acids insertion in the dromedary DRβ1 is highlighted by the light grey box and indicated in bold. PBR codons in human DRβ1 chain are indicated by the + symbol. BoLA (Bos taurus, NP_001012698), Eqca (Equus caballus, NP_001136287), Susc (Sus scrofa, NP_001302687), Feca (Felis catus, NP_001121544), Calu (Canis lupus familiarus, NP_001014768), Ovar (Ovis aries, NP_001116874), Cahi (Capra hircus, NP_001301146), Mamu (Macaca mulatta, NP_001305273), Patr (Pan troglodytes, NP_001266704), Rano (Rattus norvegicus, NP_001008884), and HLA (Homo sapiens, NP_001346123)

A straightforward protocol for the dromedary DRB1 exon 2 amplification was developed using primers in the flanking introns (and thus avoiding polymorphic exon sequences) allowing analysis of the whole exon 2. The reliability of this method was further validated with different forward primer designed at the intron–exon boundary (Fig. S2). High primer annealing temperatures (65–66 °C) ensured specific amplification and facilitated subsequent sequencing steps. Full concordance between genotypes (from homozygous and heterozygous animals) obtained using these two methods was observed.

Screening for the DRB1 exon 2 polymorphism revealed a contrasting pattern of the DRB1 diversity in the dromedary camel. Although a substantial sequence variation at the DRB1 exon 2 has been detected with 33 segregating nucleotides (11.7%) and 18 variable amino acids sites (19%), this variation has been “condensed” in a relatively reduced number of haplotypes (alleles), seven in total (and eight when considering exon 3). The reasons behind this observed pattern are not clear. While the number of alleles was relatively small, significant differences between these alleles were observed, with a difference of 2–15 amino acids in the protein sequences. Six from the seven detected exon 2 alleles were found at frequencies more than 1% and five at more than 5%. Low polymorphism levels have been previously reported in the dromedary and Bactrian camel MHC class II genes (Plasil et al. 2016); however, as far as the DRB is concerned, this low polymorphism was not surprisingly. Rather, it corroborates the results found in this work since the analyzed fragment was from a DRB2-like locus (exon 2 of 270 bp) characterized by low levels of polymorphism and transcript abundance as shown here for the DRB2 gene. The DRB1 alleles appear to be widespread among the dromedary populations since they were observed in animals from diverse geographic and — presumed — different ancestry origins: the ESTs and RNAseq SRA were derived from Middle-Eastern dromedaries, the TSA was derived from Indian animals, and the ASM164081v1 genome (from an Algerian dromedary, breed Targui) and the data from this study (Tunisian dromedary camels) were derived from North-African individuals.

The number of the DRB alleles varies considerably among mammalian species from monomorphism or few alleles (Mikko et al. 1999) to several hundred in human or cattle species as shown in the IPD-MHC database. Nevertheless, this high variability of allelic richness should be regarded cautiously considering the number of analyzed samples, population or breed coverage, and gene duplication. In this context, the dromedary camel allelic diversity could be considered within the range of DRB polymorphism of other mammalian species, albeit in the lower end. On the other hand, it is anticipated that the DRB1 allele repertoire will certainly be extended by genotyping more diverse dromedary populations. In this regard, evidence of three additional alleles was found (an allele from one animal from our panel, and two alleles detected in the raw RNAseq SRA) although they remain to be validated. These alleles did not arise from new SNPs but were rather different haplotypes from the variants detected here.

Contrasting with the relatively limited allelic richness, an overall intermediate level of nucleotide diversity (π = 0.049) was observed at the DRB1 exon 2. Comparatively, this level is slightly less than that observed in cattle for example, an intensively and widely studied species, where π ranged from 0.068 to 0.09 (Salim et al. 2021). In dromedary, much lower nucleotide diversity has been reported for both SNPs and indels (6.2 10−4) and non-synonymous (1.72 10−4) variants when considering all MHC genes (Lado et al. 2020) estimated from whole genome sequences. Obviously, these are overall estimates of nucleotide diversity trends that could not be compared to the π value shown here as it was obtained from only DRB1 exon 2. Patterns of nucleotide diversity are not uniform among the MHC region due to the varying levels of selection pressure, and generally showing extreme but interrupted levels (Gaudieri et al. 2000).

The observed heterozygote deficiency (Ho = 0.29, He = 0.68) in the Tunisian dromedary despite the diversity of seven alleles raises the possibility of the presence of missed alleles during PCR amplification, i.e., null alleles. However, this scenario is highly unlikely since fully matched genotypes — in both homozygous and heterozygous animals — were observed using two genotyping protocols. Furthermore, total agreement of the cDNA and gDNA sequences and no Mendelian mismatches in trio analysis were observed. In addition, primer sequences used for amplification of either DRB1 cDNA or exon 2 were fully conserved in both domesticated Bactrian and wild camels. On the other hand, no noticeable genetic structure has been detected in the analyzed population using neutral microsatellites markers (Nouairia et al. 2015) that would explain this heterozygote deficit by the Wahlund effect. Rather, these results could be likely related to the breeding systems practiced (low-input systems with low reproduction rate) or to the population size fluctuations (pronounced reduction during the second half of last century followed by steady recovery in last decades).

Otherwise, this heterozygote deficiency is indicative of a lack of ongoing balancing selection in the population on the basis of the heterozygote advantage assumption (or overdominant selection). The analysis of non-synonymous and synonymous substitutions revealed no evidence for positive selection acting on the DRB1 exon 2 in the past when considering the entire sequences (PBR and non-PBR, dN/dS < 1). The distinction between PBR and non PBR codons was not possible due to the insertion of four amino acids in the dromedary protein compared to the human DRβ1. However, when codons were taken individually, four sites were inferred as to be under positive selection, including one codon (at position 9) congruent with predicted antigen-binding sites of the human HLA-DRβ1 (Fig. 5), the three remaining codons could not be inferred since they are located downstream of the insertion. These findings suggest a weak positive selection pressure acting on the dromedary DRB1 gene that might be related to a reduced likelihood of exposure to pathogens in the harsh and arid environments where dromedary camel usually thrives (at least in Tunisia). In this context, it would be of interest to compare the genetic diversity of the DRB1 in dromedary populations from regions with different environments and likelihood of pathogen load, for example, from the dry and hyper-arid areas (e.g., Saharan and desert zones) and from the hot and relatively most wetter regions (e.g., sub-Saharan and Sahel). Another possible explanation of the relaxed selection pressure may be related to the breeding systems currently practiced in this species. Compared to other livestock species, the dromedary camel is usually reared under an “extensive” management systems with low-input and low density herds that may contribute to a limited pathogen transmission between animals. Nevertheless, these observations and hypothesis have also to be considered in the light of the evolution and demographic history of the species that might contribute to the observed genetic diversity and selection patterns. Particularly, signals of pronounced and long-term population bottlenecks during Pleistocene and domestication process have been detected in dromedary camel (Fitak et al. 2020) resulting in an overall reduced genome-wide diversity and a relatively lack of sub-structure in the dromedary populations.

The common allele Cadr-DRB1*01 was the most frequent and distinct one compared to other alleles, differing by 8–15 amino acids, and showing a significant excess of homozygotes in the population. It is therefore possible that this genotype excess was related to pathogen-driven selection where homozygote genotype might confers more protection or survival against specific pathogens (Bernatchez and Landry 2003). However, this remains only speculative as many other confounding factors may be involved and has to be validated by empirical functional studies. On the other hand, the fact that this allele was present in Camelus ferus — and probably in Camelus bactrianus — suggests trans-species polymorphism (TSP) across the genus Camelus. TSP occurs when identical or similar alleles are detected in related species and this phenomenon has been observed in several animal orders, including primates and rodents (Klein et al. 2007). Allele sharing between related species could result from convergent selection, from introgression, or as unaltered remnant from the speciation process. Although no data is currently available regarding the DRB1 gene in the two other Camelus species, it is likely that the Cadr-DRB1*01 allele may predate the split of dromedary and two-humped wild camel from the common ancestor 4.4 million years ago (Wu et al. 2014) and is still persisting in the two species. Such persistence over a long evolutionary period of the DRB1 polymorphisms implies that the two species have been — and are still — submitted to a relatively similar parasitic and pathogen challenges (at least those recognized by DRB1). Even though they are found in distinct geographic regions, both species share similar environment and occupy harsh and arid desert habitat characterized by scarcity of resources and extreme temperatures. In this context, analysis of the DRB1 genetic diversity in the wild Camelus ferus and domesticated Camelus bactrianus camels would elucidate the extent of allele sharing and relationship within the Camelus genus as well as the involved underlying genetic basis (TSP, convergent selection, etc.). This would contribute to better understand the evolutionary history of these species. For this purpose, the genotyping procedures developed in this work could be directly applied in both species since the primers used for the DRB1 exon 2 screening as well as for cDNA analysis are perfectly conserved in this genus. Particularly, such studies are of high relevance for the wild camel species since it is considered critically endangered on the IUCN (International Union for Conservation of Nature) red list with a census of few hundred individuals.

In summary, the present study lays the groundwork for future investigations into the DRB1 diversity in the genus Camelus by providing the first comprehensive analysis of the expressed MHC class II DRB genes in the dromedary camel and by developing optimized protocols for genetic diversity screening. This makes it possible to consider in the future targeted functional studies in these species, for example, by analyzing the association of the DRB1 genotypes with parasite infestation (i.e., ticks) and infectious diseases (i.e., MERS caused by MERS coronavirus). In this regard, some human HLA-DRB1 alleles have been associated with the severe acute respiratory syndrome (COVID-19) caused by the SARS-CoV-2 virus (Augusto and Hollenbach 2022). Further investigations are needed to identify other DRB-like genes, analyze their genetic diversity and their expression levels (along with the DRB1 and DRB2 loci) using quantitative PCR, and evaluate their relevance to the dromedary camel adaptive immunity.