The Australasian tree genus Eucalyptus L'Hér. (Myrtaceae) comprises more than 700 species (Brooker, 2000) and is economically significant to the forestry industry. Vast genomic resources for model species have allowed molecular markers to be developed in related nonmodel taxa of conservation significance. Eucalyptus gomphocephala DC. (subg. Symphyomyrtus, common name tuart) is endemic to the Swan Coastal Plain of southwestern Australia, and is a culturally iconic species in the region. Severe pathogen-mediated tree decline (Cai et al., 2010), together with extensive habitat fragmentation in urbanized areas, has required rapid conservation actions for the species. To inform seed sourcing strategies for ecological restoration and forest management in the context of climate change, measures of adaptive population genetic diversity and structure are critical, but no microsatellite markers (simple sequence repeats [SSRs]) have yet been tested or optimized for E. gomphocephala. Specifically, SSRs within gene coding regions such as expressed sequence tags (ESTs) will be valuable for investigating variation in functional and potentially adaptive regions of the genome.
Here, we design novel PCR primer pairs for SSRs within Eucalyptus ESTs, targeting those that are homologous to annotated genes and therefore those with putative ecological relevance. For the first time, we optimize a subset for utility in population genetic analysis of E. gomphocephala to inform future conservation programs. To demonstrate the markers' broader utility in the genus, we report cross-species amplification in three additional ecologically important species (E. camaldulensis Dehnh., E. marginata Sm., E. victrix L. A. S. Johnson & K. D. Hill). This is the first set of gene-homologous EST-SSRs to our knowledge that have been reported and tested in natural populations of nonmodel eucalypt species of ecological and conservation significance. These EST-SSR primers add value to a recently expanding set of EST-SSRs being made available for Eucalyptus, increasing the variety of genes that can be investigated in novel population and conservation genetic studies of nonmodel species.
METHODS AND RESULTS
A total of 36,001 Eucalyptus EST sequences were downloaded from GenBank in 2007 and screened for microsatellite repeats with Tandem Repeats Finder (Benson, 1999) using default parameters, and organized using Tandem Repeats Database version 2.30 (Gelfand et al., 2007). A total of 1098 repeats were detected in 1073 (3%) of 36,001 ESTs. Sequences were clustered using the sequence assembly program CAP3 (Huang and Madan, 1999) with default parameters, into a set of 128 nonredundant contigs and 401 singletons (Appendix S1 (APPS1300004_Appendix S1.doc) and S2 (APPS1300004_Appendix S2.doc)). A total of 154 of 529 (29%) ESTs were homologous to various annotated genes, following BLASTN searches of the National Center for Biotechnology Information's nucleotide (nr/nt) collection, using an E-value threshold of 10-10. PCR primers were designed for all 154 sequences using Primer3 (Rozen and Skaletsky, 2000). Annotated gene homologs were specifically targeted so that loci could inform hypotheses relating to adaptive genetic variation in future studies of E. gomphocephala. All primer pairs and putative gene functions are provided in Appendix S3 (APPS1300004_Appendix S3.doc).
TABLE 1.
Characteristics of 11 new EST-SSR loci developed and optimized for Eucalyptus gomphocephala.
We prioritized a random subset of 44 EST-SSRs for further PCR optimization in E. gomphocephala. Primer pairs were synthesized by GeneWorks Pty Ltd (Hindmarsh, Australia) and initially tested for amplification products on a screening panel of seven E. gomphocephala individuals. DNA was extracted from freeze-dried leaf material using a NucleoSpin 8/96 Plant II Core Kit, with buffer set PL2/3 (Macherey-Nagel GmbH & Co., Düren, Germany), as per the manufacturer's instructions. PCRs were carried out in a total volume of 10 µL, containing 10 ng genomic DNA template, 1×PCR Polymerization Buffer containing dNTPs, 0.2 µM of each primer, 0.44 units Taq DNA polymerase, and 2 mM of MgCl2 (reagents from Fisher Biotech, Perth, Western Australia, Australia). Thermocycling was carried out as follows: denaturation at 94°C for 5 min; followed by 30 cycles of 94°C for 1 min, annealing at 55°C for 30 s, and extension at 72°C for 30 s; followed by a final extension at 72°C for 15 min. Annealing temperatures (Ta) were optimized for each locus and products were visualized on 2% agarose gels stained with SYBR Safe (Invitrogen, Carlsbad, California, USA).
A set of 17 (39%) primer pairs reliably amplified a product of expected size in E. gomphocephala. These loci were screened for allele size polymorphism and genotyped in multiplex PCR of between two and four loci per reaction, using a commercial kit (PCR Multiplex Kit using Q Solution; QIAGEN, Hilden, Germany). Forward primers were 5′ end-labeled with WellRED fluorescent dyes (D2, D3, D4; Sigma-Aldrich, St. Louis, Missouri, USA). Reactions were carried out in a total volume of 12.5 µL, containing 5–30 ng DNA template, 1× QIAGEN Multiplex PCR Master Mix, 0.5× Q-solution, and 0.2 µM of each primer (with some exceptions, Table 1). PCR was conducted with an initial activation step at 95°C for 15 min; followed by 30 cycles of 94°C for 30 s, annealing at Ta (Table 1) for 90 s, and extension at 72°C for 60 s; followed by final extension at 72°C for 10 min. Products were genotyped using a CEQ 8800 Genetic Analysis System and analyzed using CEQ Fragment Analysis Software (Beckman Coulter, Brea, California, USA). Two loci (EGM09 and EGM24) exhibited amplification failures or dubious peaks in multiplex and were analyzed in singleplex for further reactions. Following genotyping tests, 11 ESTSSRs were polymorphic and produced reliable electrophoretic profiles in E. gomphocephala (Tables 1 and 2). For these loci, we further investigated the most likely gene annotations by conducting a BLASTX search against proteins in the UniRef100 database (European Molecular Biology Laboratory—European Bioinformatics Institute [EMBL-EBI], http://www.ebi.ac.uk/Tools/blastall/) (Table 1).
To screen genetic diversity of the 11 EST-SSRs, we genotyped 60 E. gomphocephala individuals from two natural populations: ‘Yalgorup’ (voucher: R. Davis 1390, PERTH 04473167) and ‘Ludlow’ (voucher: C. A. Gardner, PERTH 01350501) (Table 2). Genetic diversity parameters were calculated with GenAlEx version 6.4.1 (Peakall and Smouse, 2006). Deviation from Hardy—Weinberg equilibrium (HWE) and linkage disequilibrium among loci were calculated with GENEPOP version 4.0.10 (Raymond and Rousset, 1995; Rousset, 2008), using the Bonferroni correction for multiple testing. The total number of alleles per locus ranged from two to 13 (mean = 6) (Table 1). Observed and expected heterozygosities ranged from 0.172 to 0.867 and 0.238 to 0.861, respectively (Table 2). Significant deviation from HWE was detected for EGM09 at Ludlow. We did not detect evidence of linkage disequilibrium between any pair of loci in more than one population. The potential presence of null alleles and their frequency (r) in each population were estimated using MICRO-CHECKER version 2.2.3 (van Oosterhout et al., 2004), and were predicted for EGM09 and EGM24 in both populations (r range = 0.17–0.29) and for EGM35 at Yalgorup (r = 0.09). The potential for stuttering was predicted for EGM09 at Ludlow.
TABLE 2.
Population genetic properties of the 11 newly developed ESTSSRs in two natural populations of Eucalyptus gomphocephala. a
All 17 primer pairs that initially amplified E. gomphocephala DNA were screened for cross-amplification in three additional species, belonging to diverse sections within two subgenera: E. camaldulensis (sect. Exsertaria) and E. victrix (sect. Adnataria) (both subg. Symphyomyrtus), and E. marginata (sect. Longistylus) (subg. Eucalyptus) according to the methods described above. Samples were collected from natural populations (Appendix 1). Sixteen out of 17 (94%) loci cross-amplified within subg. Symphyomyrtus, and 14 out of 17 (82%) cross-amplified across the subgenus boundary to subg. Eucalyptus, demonstrating high transferability rates within the genus.
CONCLUSIONS
This new set of 11 polymorphic EST-SSR loci will enable the characterization of population genetic diversity and structure throughout the species' range of E. gomphocephala in conjunction with putatively neutral, nuclear genomic SSRs (gSSRs). Given their gene-linked nature, the EST-SSRs can be used to test for signatures of selection in natural populations. The high transferability of the loci demonstrates their suitability for application to other species of ecological, conservation, or economic importance in the genus. The broader set of 133 EST-SSRs represents a resource that could be exploited to optimize additional gene-linked EST-SSRs for Eucalyptus and further expand the types of genes available for investigation. Data obtained using these markers will be particularly valuable to inform seed sourcing and conservation strategies in natural E. gomphocephala populations.
LITERATURE CITED
Appendices
APPENDIX 1.
Cross-species amplification of 17 EST-SSR loci from Eucalyptus gomphocephala to three additional eucalypt species from two subgenera.a,b,c
[1] The authors thank E. Veneklaas, M. Wallace, P. Nevill, J. Anthony, M. Foley, and A. Bradbury for field, laboratory, and conceptual assistance and for advice. We also thank E. O'Brien and P. Nevill for providing E. marginata, E. victrix, and E. camaldulensis DNA. This project was funded by an Australian Research Council Linkage Grant (LP0669757) with additional support from Alcoa of Australia and BHP Billiton Worsley Alumina Pty Ltd.