Pinus kesiya Royle ex Gordon var. langbianensis (A. Chev.) Gaussen (Pinaceae) is an important forest tree species in Yunnan Province, China. It has been recorded at altitudes from 600–1800 m in the southern, semihumid climate zone of Yunnan (Editorial Committee of Flora of China, 1978; Wu, 1986) and accounts for 11% of the forest area and 1.0 × 108 m3 of the forest volume (Jiang et al., 2007). The wood is extensively used in building, furniture, and the fiber industry. Pinus kesiya var. langbianensis is also highly valued for its high resin content, with an annual output of 179,100,000 kg (Editorial Committee of Flora of China, 1978; Wu, 1986; Dong et al., 2009). Output of gum turpentine from P. kesiya var. langbianensis accounted for more than 90% of the total output in Yunnan (Yin et al., 2005). However, germplasm resources of P. kesiya var. langbianensis have decreased in recent years as a result of overexploitation (Zhao et al., 2016).
Information on genetic diversity and spatial structure in P. kesiya var. langbianensis is important for its future conservation and can be used to help guide local forest management (Sanchez et al., 2014). No specific conservation strategy is available for this species, in part due to the limited understanding of genetic diversity and structure of the natural populations. As a primary forest tree species in southern Yunnan Province, resource conservation of P. kesiya var. langbianensis will benefit the entire ecological system in the region (Li et al., 2015). Therefore, in this study we developed novel microsatellite markers for P. kesiya var. langbianensis by applying next-generation sequencing to investigate the genetic diversity and population structure of this species at the molecular level.
METHODS AND RESULTS
Needle samples of 60 individuals from four P. kesiya var. langbianensis populations located in Yunnan Province, China, and 59 individuals from four related species (P. massoniana D. Don, P. densata Mast, P. tabuliformis Carrière, P. yunnanensis Franch.) were collected (Appendix 1). All needle samples were dried and preserved in silica gel. Total genomic DNA was isolated from dried needle samples using the cetyltrimethylammonium bromide (CTAB) method (Doyle and Doyle, 1990). Paired-end libraries were constructed on four individuals sampled from Puer City Institute of Forestry Sciences and sequenced by a customer sequencing service (Beijing Honor Tech Co. Ltd., Beijing, China) using the Illumina HiSeq 2500 Sequencing System (Illumina, San Diego, California, USA). Clean reads were assembled using Trinity version 2.2.0 ( https://github.com/trinityrnaseq/trinityrnaseq/wiki; Grabherr et al., 2011; Haas et al., 2013). Data quality control was carried out with the software FastQC (Andrews, 2010). The Q30 percentage exceeded 90% and the GC contents were 45.60– 46.56%, which suggests that the sequencing was highly reliable. Data filtering was carried out according to the following criteria: (1) removed reads with adapters; (2) removed reads with unknown bases >10%; and (3) removed lowquality reads (defined as reads having >50% bases with quality value ≤5). A total of 104,392 unigenes were obtained with an N50 length of 1349 bp. The data have been deposited in the Short Read Archive (SRA) database of the National Center for Biotechnology Information (NCBI; accession no. SRP093696). Genomic microsatellite loci for P. kesiya var. langbianensis were detected using the software MicroSAtellite Identification Tool (MISA; Thiel et al., 2003). Two thousand three hundred forty-nine simple sequence repeat (SSR) loci were designed with Primer Premier 5.0 (PREMIER Biosoft International, Palo Alto, California, USA). Among them, 192 SSR loci with dinucleotide or trinucleotide SSR motifs were randomly chosen to screen using four individuals from four populations. One hundred fifty-nine out of 192 SSR loci were amplified successfully, and 79 out of 159 SSR loci were polymorphic. Eighteen polymorphic SSR loci (Table 1) were then randomly selected for characterization using 60 individuals from four populations (Appendix 1). The SSR amplifications were multiplexed in a 10-µL reaction containing 30 ng of genomic DNA, 0.15 µM of each primer, 5 µL Mix (0.05 units/µL Taq DNA Polymerase, 0.4 mM dNTPs, 4.0 mM MgCl2; Beijing Ruibio Biotech Co. Ltd., Beijing, China), and 1× PCR Buffer. The amplification protocol was: 95°C for 5 min; followed by 30 cycles at 95°C for 30 s, the annealing temperature for each primer (Table 1) for 30 s, and 72°C for 30 s; and a final extension step at 72°C for 7 min.
Table 1.
Characteristics of 18 polymorphic microsatellite loci in Pinus kesiya var. langbianensis.
Amplification products were resolved using capillary electrophoresis on an ABI 3730xl DNA Analyzer (Applied Biosystems, Waltham, Massachusetts, USA). The electropherograms were analyzed using GeneMarker version 2.2.0 with GeneScan 500 LIZ as a size standard (Applied Biosystems). CONVERT version 1.3.1 (Glaubitz, 2004) was used to convert input files for analysis in subsequent software. The genetic diversity parameters of polymorphic loci, including the number of alleles, observed heterozygosity, and expected heterozygosity were calculated by GenAlEx 6.4 (Peakall and Smouse, 2006), and Hardy–Weinberg equilibrium (HWE) for each locus and the presence of linkage disequilibria for all pairwise loci in each population were tested with POPGENE 1.32 (Yeh et al., 1997). Among the 60 genotyped individuals, the number of alleles per locus varied from one to 11 (Table 2). The observed heterozygosity and expected heterozygosity ranged from 0.000 to 0.800 and from 0.000 to 0.840, respectively. There were no significant departures from HWE over all loci for any populations, but some populations deviated from HWE at up to 15 loci (P < 0.05) (Table 2). A significant linkage disequilibrium (P < 0.01) was detected in three pairwise SSR loci in two (Populations 1 and 2) out of four populations (Population 1: Pkvl007 and Pkvl010, Pkvl008 and Pkvl010; Population 2: Pkvl006 and Pkvl014). Furthermore, 16 out of 18 SSR loci were successfully amplified in 59 individuals of four related species (14–15 individuals for each species; Appendix 1). Most of these loci were polymorphic (Table 3).
CONCLUSIONS
The set of 18 novel SSR markers reported in this study will be helpful for population genetic analysis in P. kesiya var. langbianensis, which will offer valuable information for the formulation of the rational utilization and conservation strategies of this species in the future. Furthermore, the successful cross-species amplification of these SSR markers in Pinus (Pinaceae) suggests their potential to be used in studies of genetic variation for related pine species.
Table 2.
Genetic properties of 18 polymorphic microsatellite markers in four Pinus kesiya var. langbianensis populations.a
Table 3.
Cross-amplification results showing the number of alleles detected in 18 loci from Pinus kesiya var. langbianensis in four related species.a
ACKNOWLEDGMENTS
This work was supported by the National Natural Science Foundation of China (grant no. 31360189, 31260191, 31500536) and the Foundation of Southwest Forestry University (grant no. 01102-111436).