Development and characterization of twenty-eight polymorphic microsatellite for rice field eel (Monopterus albus) using RAD tag sequencing
Received: January 01, 2018
Accepted: February 14, 2018
Published: February 22, 2018
Genet.Mol.Res. 17(1): gmr16039880
Monopterus albus is a special economic aquatic animal in China. Most of cultured M. albus are wild species captured from different areas, and this makes the wild resources of M. albus are under great threat. Microsatellite markers can help us understand the genetic resources of M. albus in different areas. In this study, using RAD-seq (restriction site associated DNA sequencing) protocol, a total of 9,897 microsatellites were identified in the genome with average frequency of 195 microsatellites per megabase of genomic sequences. Among these SSRs, the dinucleotide repeat motif was the most abundant type representing 61.77 % of the total microsatellite loci, followed by pentanucleotides (13.91 %), trinucleotides (12.08 %), tetranucleotides (9.30 %) and hexanucleotides (2.94 %). A total of 100 SSR primers were designed for PCR ampliï¬cation. The polymorphism analysis showed that 28 primer pairs could successfully amplify the polymorphic fragments and the number of alleles per polymorphic locus ranged from 2 to 13, with an average of 5.70 alleles/locus. The values of observed and expected heterozygosity’s ranged from 0.0333 to 0.8000 and 0.0333 to 0.8887, respectively. This microsatellite locus will be useful to understand the structure, genetic diversity, and genetic difference of this species.
The swamp eel (Monopterus albus), belongs to Order Synbranchiformes, Family Synbranchidae, and is widely distributed in the muddy ponds, swamps, canals, and rice fields (Chew SF, et al. 2015). Because of its great growth performance and rich nutrient content, swamp eel has become a commercially important farmed species in China (Chen D, et al. 2016). Due to the difficulties in artificial breeding, all cultured M. albus come from wild species, and this makes the wild resources of M. albus are under great threat. Proper conservation management of this species is of interest to ensure the sustainable exploitation. Knowledge of the population genetic structure and genetic resources condition is vital for conservation endeavors of this species. Microsatellites are very suitable markers for such studies because of their high frequency of distribution (Tóth G, 2000) co-dominance, reproducibility, and high level of polymorphism (Wang S, 2009). Recently, some microsatellites have been isolated for M. albus (Li WT, 2007). However, additional loci will facilitate high-resolution analyses such as pedigree reconstruction and assignment test.
The traditional development of microsatellite markers, based on probe hybridization, is time-consuming and expensive (An HS, 2012; Ekblom R, 2011). Current next generation sequencing (NGS) technologies can generate a large number of sequences and provide a great convenient for the acquisition of genetic markers . Especially the “Restriction-site Associated DNA” (RAD) method, using the Illumina platform, it has been proved an efficient and cost-effective method for SSRs discovery (Castoe TA, et al. 2012)
In this study, the de novo RAD-seq of M. albus were performed to discovery microsatellite markers. Furthermore, we developed 28 polymorphic microsatellite markers which could provide an effective tool for population genetic structure and conservation genetics researches of this species.
Material and Methods
Sample of M. albus were collected from wild population from Jiangxi Province, China (n=30). Genomic DNA was extracted from muscle tissue, using the traditional phenol chloroform method. Extracted DNA were dissolved in ddH2O and assessed by loading on a 1.5% agarose gel, stored at -20°C.
RAD-seq sequencing and contigs assembly
The RAD library was constructed at Meiji Inc (China) according to the protocol described by Etter et al. Briefly, genomic DNA was digested with PstI, then the P1 adapter (an adapter contains a matching sticky-end, Illumina sequencing priming sites, and the “barcoding” a nucleotide 4 or 5 bp long for sample identification) was ligated to the products of the restriction reaction. The reactions were then pooled, randomly sheared to a mean size of 500 bp, the DNA in the rage 300-800 bp was selected following electrophoresis. After end repair and the addition of 3’-adenine overhangs, DNA was then ligated to a second adapter (P2). PCR amplification was performed using P1-and P2-adapter primes, enriching for RAD tags that contain both adapters, and preparing them to be hybridized to HiSeq 2000 sequencer. Illumina raw data were processed using Stacks process_radtags. pl script (Catchen J, 2013), and then the high quality first and second reads were retained for de novo assembly using Velvet (Namiki T, 2012).
SSR identification and primer design
All assembly contigs were used to identify SSR motifs by MISA (Beier S, 2017). The search criteria were that the minimum repetitions of di-, tri-, tetra-, penta-, and hexa-nucleotides were 6, 5, 4, 3 and 3, respectively. Mononucleotide repeats were excluded. Using primer premier 5.0 software (Premier Biosoft International), 100 pairs of primers were then designed flanking the repeat regions of interest and the target amplicon size was in the range of 100-250 bp. All the primer pairs were subject to the temperature gradient PCR system for annealing temperature optimization. Only the primers which could successfully amplify the target fragments were used for the next polymorphism analysis.
Microsatellite polymorphism verification and data analysis
30 wild individuals were used to test the polymorphic. Each amplification was performed in a 15-μL final volume containing 1.5 μL of 10 × PCR buffer, 1.2 μL of dNTP (10 mM), 0.3 U of Taq DNA polymerase, 0.2 μM of each primer, 50 ng of genomic DNA. The PCR parameters were 94°C for 5 min (initial denaturation), followed by 30 cycles with 94°C for 30 s, annealing temperature 30 s, 72°C for 60 s and an additional extension at 72°C for 10 min. Polymorphic amplification products were separated on 12% non-denaturing polyacrylamide gel and stained with ethidium bromide. After establishing the AB matrix, the software Genepop v1.32 (Rousset F, 2008). was applied to estimate microsatellite alleles, observed heterozygosities, HWE and expected heterozygosities. Polymorphic information content (PIC) was obtained by PIC_CALC software. Micro-Checker was used to test for the presence of null alleles (Oosterhout CV, 2004).
Results and Discussion
RAD-seq sequencing and contigs assembly
Compared with the traditional enrichment library, RAD-seq technology is more convenient and less time consuming in SSRs identification (Miller MR, 2007). In this study, the sequencing procedure generated 18.53 million raw reads, and after filtering using process_radtags. pl script, 18.23 million clean paired reads were retained (Table 1). These clean reads were assembled into 249,278 contigs (105-852 bp) using Velvet. The average contigs length was 204 bp, and most contigs was 101-300 bp (96.7%) (Figure 1). Raw data have been made available through the Sequence Read Archive (SRA) repository at NCBI (SRR5980345).
|Illumina sequencing information||M. albus|
|Raw paired Reads (million)||18.53|
|Clear paired Read (million)||18.23|
|Number of contigs||249,278|
|Contigs length range (bp, min-max)||105-852|
|Average contigs length (bp)||204|
|Total contigs length of SSR analyzed (bp, million)||50.87|
|Number of contigs containing SSR||9,321|
|Number of SSR||9,897|
Table 1: Summary statistics of the RAD-seq sequencing via Illumina.
Characterization of SSRs
A total of 9,897 microsatellites were identified with average frequency of 195 microsatellites per megabase of genomic sequences (Table 1), which was little higher than that documented in swamp eel whole genome (179/Mb) (Li Z, 2017), but low than that in Clarias batrachus (247/Mb) (Srivastava S, 2016), This difference may be due to the sizes of the databases and bioinformatics software tools.
Among these SSRs, the most abundant repeat motif were dinucleotides (61.77%), followed by pentanucleotides (13.91%), trinucleotides (12.08%), tetranucleotides (9.30%) and hexanucleotides (2.94%) (Figure 2A). Similar results were found in many other creatures, such as Buhalus Bubalis (Qi WH, 2015), Panulirus guttatus (Truelove N, 2016) Clarias batrachus (Srivastava S, 2016) and Scophthalmus maximus (Pardo BG, 2007) in which the dinucleotides were the most abundant repeat motif.
Similar to other fishes (Zhang J, 2014; Kim KS, 2016) in dinucleotides repeat motif, AC/GT motifs (76.00%) were most abundant and GC/CG (0.15%) repeats were extremely rare in M. albus (Figure 2B). Among the trinucleotides, AAT/ATT (24.08%) was most abundant motifs, followed by AGG/CCT (18.39%), AAC/GTT (13.29%) and AGC/GCT (13.29%), whereas CGG/CCG (0.42%) motifs were the least (Figure 2C). Three most frequent repeats in tetranucleotide were AAAT/ATTT (21.52%), AAAC/GTTT (14.46%) and ATGG/CCAT (13.91%) (Figure 2D). Among pentanucleotide repeats, the top three types of repeat motifs are AAATC/GATTT (15.25%), AAAAC/GTTTT (12.27%) and AAAAT/ATTTT (8.93%) (Figure 2E). In hexanucleotides, GATTCT/AGAATC (10.65%), AAAAAT/ATTTTT (9.62%) and GGGTTA/TAACCC (4.46%) were the most abundant repeats (Figure 2F). The abundance of A-rich repeat motifs in genome SSRs indicated its significant evolution role in fish species (Li Z, 2017).
Development of polymorphic microsatellite markers
A total of 100 SSR primers were designed for PCR amplification and the results showed that 28 primer pairs could successfully amplify the polymorphic fragments. The polymorphic percentage was 28%, which was lower than Ichthyoelephas longirostris (60%) (Landinez-Garcia RM, 2016) Lateolabrax japonicus (42%) (Li BB, 2016) and Geophagus brasiliensishigh (36%) (Ferreira DG, 2013), but higher than Clarias batrachus (17%) , hybrid giant tiger grouper (20%) (Huang YS, 2015) and Girella elevate (22%) (McWilliam RA, 2016).
Across 30 individuals surveyed, the number of alleles at these informative loci varied from 2 (SSRSZ78, SSRSZ92, SSRSZ98) to 13 (SSRSZ97), with an average of 5.70 alleles/locus (Table 2).
|Locus||Accession no.||Primer sequences (5'-3')||Repeat motif||Ta (℃)||Allele size (bp)||Na||Ho||He||PIC||P|
Notes: Ta, annealing temperature; Na, observed number of alleles; Ne, Effective number of alleles; Ho, observed heterozygosity; He, expected heterozygosity; PIC, Polymorphic information content; P, P-value for deviation from Hardy-Weinberg equilibrium (p<0.05).
Table 2: Characteristics of 28 microsatellite loci in Monopterus albus
The expected heterozygosity ranged from 0.0333 to 0.8887, while the observed heterozygosity varied from 0.0333 to 0.8000. The average value of PIC was 0.4987, indicating the high level of polymorphism of these loci. After Bonferroni correction in Genepop v1.32, ten loci showed significant deviation from HWE. The microsatellite markers characterized here would contribute to the investigation of the genetic diversity and population structure in this species.
This work was supported by grants from the National Natural Science Foundation of China (No. 31160530 and No. 31360634), the National Natural Science Foundation of Jiangxi Province (20114BAB214003), the Science and Technology Support Program of Jiangxi Province (20122BBF60074) and the Project of Education Department in Jiangxi Province (GJJ13289).
About the Authors
- Chew SF, Gan J (2005). Nitrogen metabolism and excretion in the swamp eel, Monopterus albus, during 6 or 40 days of estivation in mud. Physiological and Biochemical Zoology. 78, 620-629. https://doi.org/10.1086/430233
- Chen D, Chu W, He Y, Liang XF, et al. (2016) Characteristics and phylogenetic studies of complete mitochondrial DNA based on the rice field eel (Monopterus albus) from four different areas. Mitochondrial DNA. 27: 2419-2420. https://doi.org/10.3109/19401736.2015.1030623
- Tóth G, Gáspári Z, Jurka J (2000) Microsatellites in Different Eukaryotic Genomes: Survey and Analysis. Genome Research. 10: 967-981. https://doi.org/10.1016/j.gene.2011.08.016
- Wang S, Zhang L, Matz M (2009). Microsatellite Characterization and Marker Development from Public EST and WGS Databases in the Reef-Building Coral Acropora millepora (Cnidaria, Anthozoa, Scleractinia). Journal of Heredity. 100 329-337. https://doi.org/10.1093/jhered/esn100
- Li WT, Liao XL, Yu XM, Cheng L, et al. (2007). Isolation and characterization of polymorphic microsatellites in a sex-reversal fish, rice field eel (Monopterus albus). Molecular Ecology Notes. 7: 705-707. https://doi.org/10.1111/j.1471-8286.2007.01683.x
- An HS, Lee JW (2012). Development of microsatellite markers for the Korean Mussel, Mytilus coruscus (Mytilidae) using next-generation sequencing. International Journal of Molecular Sciences. 13: 10583-10593. https://doi.org/10.3390/ijms130810583
- Iniguez-Luy FL, Voort AV, Osborn TC (2008). Development of a set of public SSR markers derived from genomic sequence of a rapid cycling Brassica oleracea L. genotype. Theoretical and Applied Genetics. 117: 977-985. https://doi.org/10.1007/s00122-008-0837-9.
- Ekblom R, Galindo J (2011). Applications of next generation sequencing in molecular ecology of non-model organisms. Heredity. 107: 1-15. https://doi.org/10.1038/hdy.2010.152
- Castoe TA, Poole AW, de Koning AP, Jones KL, et al. (2012). Rapid microsatellite identification from Illumina paired-end genomic sequencing in two birds and a snake. PLoS One. 7:e30953. https://doi.org/10.1371/journal.pone.0030953
- Etter PD, Bassham S, Hohenlohe PA, Johnson EA, et al. (2011) SNP discovery and genotyping for evolutionary genetics using RAD sequencing. Methods in molecular biology. 772: 157-178. https://doi.org/10.1007/978-1-61779-228-1_9
- Catchen J, Hohenlohe PA, Bassham S, Amores A, et al. (2013). Stacks: an analysis tool set for population genomics. Molecular Ecology. 22, 3124-3140. https://doi.org/10.1111/mec.12354
- Namiki T, Hachiya T, Tanaka H, Sakakibara Y (2012). MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads. Nucleic Acids Research. 40: e155. https://doi.org/10.1093/nar/gks678.
- Beier S, Thiel T, Munch T, Scholz U (2017). MISA-web: a web server for microsatellite prediction. Bioinformatics. 33: 2583-2585. https://doi.org/10.1093/bioinformatics/btx198
- Rousset F (2008). genepop'007: a complete re-implementation of the genepop software for Windows and Linux. Molecular Ecology Resources. 8: 103-106. https://doi.org/10.1111/j.1471-8286.2007.01931.x
- Oosterhout CV, Hutchinson WF, Wills DPM, Shipley P (2004). micro-checker software for identifying and correcting genotyping errors in microsatellite data. Molecular Ecology Notes. 4: 535-538. https://doi.org/10.1111/j.1471-8286.2004.00684.x
- Miller MR, Dunham JP, Amores A, Cresko WA, et al. (2007). Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. Genome Research. 17: 240-248. https://doi.org/10.1101/gr.5681207
- Li Z, Chen F, Huang C, Zheng W, et al. (2017). Genome-wide mapping and characterization of microsatellites in the swamp eel genome. Scientific Reports. 7:3157. https://doi.org/10.1038/s41598-017-03330-7.
- Srivastava S, Kushwaha B, Prakash J, Kumar R, et al. (2016). Development and characterization of genic SSR markers from low depth genome sequence of Clarias batrachus (magur). Journal of Genetics. 95: 603-609. https://doi.org/10.1007/s12041-016-0672-8
- Qi WH, Jiang XM, Du LM, Xiao GS, et al. (2015). Genome-Wide Survey and Analysis of Microsatellite Sequences in Bovid Species. PLoS One. 10: e0133667. https://doi.org/10.1371/journal.pone.0133667
- Truelove N, Behringer DC, Butler Iv MJ, Preziosi RF (2016). Isolation and characterization of eight polymorphic microsatellites for the spotted spiny lobster, Panulirus guttatus. Peer J. 4: e1467. https://doi.org/10.7717/peerj.1467
- Pardo BG, Fernandez C, Hermida M, Vazquez-Lopez A, (2007). Development and characterization of 248 novel microsatellite markers in turbot (Scophthalmus maximus). Genome. 50, 329-332. https://doi.org/10.1139/g06-154
- Zhang J, Ma W, Song X, Lin Q, Gui JF, et al. (2014). Characterization and development of EST-SSR markers derived from transcriptome of yellow catfish. Molecules. 19: 16402-16415. https://doi.org/10.3390/molecules191016402.
- Kim KS, Noh CH, Moon SJ, Han SH (2016). Development of novel tetra- and trinucleotide microsatellite markers for giant grouper Epinephelus lanceolatus using 454 pyrosequencing. Molecular Biology Reports. 43: 541-548. https://doi.org/10.1007/s11033-016-3980-4
- Landinez-Garcia RM, Marquez EJ (2016). Development and characterization of 24 polymorphic microsatellite loci for the freshwater fish Ichthyoelephas longirostris (Characiformes: Prochilodontidae). Peer J. 4: e2419. https://doi.org/10.7717/peerj.2419
- Li BB, Li ZB, Shangguan JB, Ning YF (2016). Screening and characterization of novel microsatellite loci in Lateolabrax japonicus. Genetics and Molecular Research.15 https://doi.org/10.4238/gmr15048378
- Ferreira DG, Galindo BA, Alves AN, Almeida FS (2013). Development and characterization of 14 microsatellite loci in the Neotropical fish Geophagus brasiliensis (Perciformes, Cichlidae). Journal Fish Biology. 83:1430-1438. https://doi.org/10.1111/jfb.12227
- Huang YS, Li ZB, Ning YF, Shangguan JB, et al. (2015). Isolation and characterization of microsatellite loci in hybrid giant tiger grouper. Genetics and Molecular Research. 14: 14706-14710.
- McWilliam RA, Adams KR, Minchinton TE, Ayre DJ (2016). Characterization of 11 Polymorphic Microsatellite Markers for Black Drummer (Girella elevata) Developed Using 454 Next-Generation Sequencing. Journal of Heredity. 107: 670-673. https://doi.org/10.1093/jhered/esw066
- Share This