Open Access
Translator Disclaimer
1 October 2014 The Complete Mitochondrial Genomes of the Fenton's Wood White, Leptidea morsei, and the Lemon Emigrant, Catopsilia pomona
Juan-Juan Hao, Jia-Sheng Hao, Xiao-Yan Sun, Lan-Lan Zhang, Qun Yang
Author Affiliations +

The complete mitochondrial genomes of Leptidea morsei Fenton (Lepidoptera: Pieridae: Dismorphiinae) and Catopsilia pomona (F.) (Lepidoptera: Pieridae: Coliadinae) were determined to be 15,122 and 15,142 bp in length, respectively, with that of L. morsei being the smallest among all known butterflies. Both mitogenomes contained 37 genes and an A T-rich region, with the gene order identical to those of other butterflies, except for the presence of a tRNA-like insertion, tRNALeu (UUR), in C. pomona. The nucleotide compositions of both genomes were higher in A and T (80.2% for L. morsei and 81.3% for C. pomona) than C and G; the A T bias had a significant effect on the codon usage and the amino acid composition. The protein-coding genes utilized the standard mitochondrial start codon ATN, except the COI gene using CGA as the initiation codon, as reported in other butterflies. The intergenic spacer sequence between the tRNASer (UCN) and ND1 genes contained the ATACTAA motif. The A T-rich region harbored a poly-T stretch and a conserved ATAGA motif located at the end of the region. In addition, there was a triplicated 23 bp repeat and a microsatellite-like (TA)9(AT)3 element in the A T-rich region of the L. morsei mitogenome, while in C. pomona, there was a duplicated 24 bp repeat element and a microsatellite-like (TA)9 element. The phylogenetic trees of the main butterfly lineages (Hesperiidae, Papilionidae, Pieridae, Nymphalidae, Lycaenidae, and Riodinidae) were reconstructed with maximum likelihood and Bayesian inference methods based on the 13 concatenated nucleotide sequences of protein-coding genes, and both trees showed that the Pieridae family is sister to Lycaenidae. Although this result contradicts the traditional morphologically based views, it agrees with other recent studies based on mitochondrial genomic data.


The animal mitochondrial genomes (mitogenomes) are usually circular molecules of 14–19 kb in size, containing 37 genes (including 13 protein-coding genes, two rRNA genes, and 22 tRNA genes) and a non-coding A+Trich region that regulates the transcription and replication of the mitogenome (Clayton 1992). Due to its simple and compact structure, fast evolutionary rate, and maternal inheritage, it has been used frequently in the studies of population genetics, molecular evolution, phylogenetics, phylogeography, and evolutionary biology (Simon et al. 1994). In recent years, as the DNA sequencing technology has been progressing rapidly, more and more complete animal mitogenome sequences have been determined. To date, more than 240 complete or near-complete mitochondrial DNA sequences have been identified from insects, including 67 from lepidopterans. Of these, the available sequences are mainly from six superfamilies (Bombycoidea, Geometroidea, Papilionoidea, Noctuoidea, Tortricoidea, and Pyraloidea). In total, 33 of these lepidopteran mitogenomes are from butterflies (Table 1).

Pieridae is one of the largest families of Papilionoidea, containing 76 genera and approximately 1,100 species worldwide, mostly distributed in tropical Africa and Asia (Courtney 1986, Watt et al. 1996, Brunton 1998, Stavenga et al. 2004, Kemp et al. 2005). Their adults are generally of medium size and typically white, orange, and yellow in color (Chapman 1895). Taxonomically, they are currently divided into four subfamilies (Dismorphiinae, Pierinae, Coliadinae, and Pseudopontiinae). In addition, phylogenetically, they may stand as a key group to clarify the intra-familiar butterfly relationships. For example, they were traditionally considered to be the sister to the Papilionidae (Ehrlich 1958, Scott 1985). However, more and more evidence indicated that they were sister to the grouping of (Nymphalidae (Riodinidae, Lycaenidae)) (Kristensen 1976, de Jong et al. 1996, Weller et al. 1996, Ackery et al. 1999, Wahlberg et al. 2005) or sister to (Riodinidae + Lycaenidae) (Kim, M. J. et al. 2010, Chai et al. 2012). To our dissatisfaction, up to the present, only four mitogenomes of pierid species (Artogeia melete [Menetries], Pieris rapae [L.], Delias hyparete [L.], and Aporia crataegi [L.]) are available, and thus more pierid mitogenome data are needed to enrich the taxon sampling for use in phylogenetic studies.

The Fenton's wood white, Leptidea morsei Fenton, and the lemon emigrant, Catopsilia pomona (F.), are the two representative species of the family Pieridae. Leptidea morsei is distributed mainly throughout Europe, Siberia, Ussuri, Korea, northern China, and Japan. It is found occasionally in damp, grassy vegetation at the sunny edges of woods, as well as in grassy woodland. Its larvae feed on peas, and adults are seen twice per year from April to May and June to July. Catopsilia pomona is ubiquitously distributed from areas of southeast Asia (Sikkim, Malaysia, Philippines) to Australia. It is found often in secondary forests, along river courses, and even in the hot arid deserts throughout the year. Its colors are usually variable, chiefly lemon-yellow with an apical black margin (Rienks 1985).

In this study, we determined and analyzed the complete mitogenome sequences of these two pierid species and compared these sequences with those of other butterfly species available to clarify the phylogenetic relationships among the main butterfly lineages. The new sequence data will provide valuable information for the studies of lepidopteran comparative genomics, molecular evolution, and other relevant areas.

Materials and Methods

Sample collection

Adult individuals of L. morsei and C. pomona were collected from Shanxi and Hainan Provinces, China, in August 2008 and July 2009, respectively. After sample collection, the fresh materials were placed into 100% ethanol immediately for DNA fixation and stored at -20°C until used for genomic DNA extraction.

DNA extraction and amplification by PCR

Total genomic DNA of L. morsei and C. pomona was extracted from the thoracic muscle of an adult individual by using the glass bead method after Hao et al. (2005). Insect universal primers were used for the amplification of the COI, CytB, 16S rRNA, and 12S rRNA genes (Simon et al. 1994). Primers for the ND2, ND4, COIII, and ND5 amplification were designed via the alignment of the respective sequences from all the butterflies available by using Clustal X1.8 and Primer Premier 5.0 softwares (Thompson et al. 1997, Singh et al. 1998). Seven long fragments (COICOIII, COIIIND5, ND5CytB, CytB16S, 16S12S, 12SND2, ND2COI) were amplified via long PCR using Takara LA Taq™ (Takara Co., The long PCR conditions were as follows: an initial denaturation at 95°C for 5 min, followed by 30 cycles of denaturation at 95°C for 50 sec, annealing at 50-55°C (depending on primer pairs) for 50 sec, and elongation at 68°C for 150 sec during the first 15 cycles and then an additional 5 sec per cycle during the last 15 cycles, and a final extension at 68°C for 10 min. All PCR fragments were sequenced directly in both strands after purification with the QIA quick PCR Purification Kit (QIAGEN,, except for the 12SND2 fragment of L. morsei, which was sequenced after cloning. All of the long PCR fragments were sequenced by using the primer walking strategy.

Sequence analysis

The raw sequences from the overlapping fragments were proofread and assembled in BioEdit version 7.0 (Hall 1999). Proteincoding genes, rRNA genes, and A+T-rich regions were determined via the alignment of the sequences by using Clustal X1.8 software (Thompson et al. 1997). The nucleotide sequences of the protein-coding genes were translated based on the invertebrate mtDNA genetic code. The tRNAs were identified by tRNAscan-SE software version 1.21 (Lowe and Eddy 1997). The putative tRNAs that could not be found by tRNAscan-SE were confirmed by sequence comparisons between the Pieridae and other butterfly tRNAs. Nucleotide composition and codon usage were calculated by using MEGA5.1 software (Kumar et al. 2004), and the tandem repeats in the A+T-rich regions were predicted by the Tandem Repeats Finder available online ( (Benson et al. 1999). Sequence data were deposited in the GenBank database under the accession numbers JX274648 for L. morsei and JX274649 for C. pomona.

Phylogenetic analysis

For the phylogenetic analysis, 13 concatenated nucleotide sequences of protein-coding genes of 33 available butterfly mitogenome sequences (two newly sequenced in this study and 31 extracted from GenBank, Table 1) were aligned by using Clustal X1.8 (Thompson 1997). The phylogenetic trees were then reconstructed with the maximum likelihood and Bayesian inference methods using the moth species Adoxophyes honmai Yasuda (Lepidoptera: Tortricidae) (GenBank accession number DQ073916) as the outgroup.

In the maximum likelihood and Bayesian inference analyses, the third position of all the codons was excluded, and the best fitting substitution model GTR + I + G (Lanave et al. 1984) was selected via a comparison of Akaike Information Criterion scores (Akaike 1974), calculated by using the Modeltest software version 3.7 (Posada and Crandal 1998). The maximum likelihood analyses were conducted in PAUP version 4.0b8 (Swofford 2002) under the following conditions: tree searching by TBR (tree bisection and reconnection) branch swapping (10 random-addition sequences); specifying the number of substitution rate categories as four; and using a BIONJ distance-based tree as the starting tree. The confidence values of each node of the maximum likelihood tree were evaluated via the bootstrapping test with 1,000 iterations. Bayesian analyses were performed by using the program MrBayes 3.1 (Huelsenbeck and Ronquist 2001). Two independent runs of four incrementally heated MCMC chains (one cold chain and three hot chains) were simultaneously run for one million generations in all datasets. Each set was sampled every 100 generations with a burn-in of 25%, and when the average standard deviation of split frequencies was less than 0.01, stationarity was considered to be reached. The confidence values of the Bayesian inference tree were presented as the Bayesian posterior probabilities in percentages.

Results and Discussion

General features

The complete mtDNA sequences of L. morsei and C. pomona were 15,122 and 15,142 bp in length, respectively, with that of L. morsei being the shortest among all known sequences of butterfly species (Table 2). Each genome was composed of the typical 13 proteincoding genes, 22 tRNA genes, two rRNA genes, and one major non-coding A+T-rich region. The gene order was identical to that of other sequences of butterflies but different from that found in the ancestral insects with respect to the location of tRNAMet. That is to say, the tRNAMet was located between the control region and tRNAIle, giving the derived control region (CR)-Met (M)-Ile (I)-Glu (Q) arrangement instead of that of the insect ground plan CR-I-Q-M (Fig. 1). The total sizes of protein-coding genes, rRNAs, and tRNAs were all well within the corresponding ranges of those found in other butterfly species (Table 2). The size proportions of coding genes to the whole genome of these two and four other pierid species were slightly higher than those of butterflies of other families (Table 3). By contrast, their non-coding sequences, including intergenic spacers and the A+T-rich region, were slightly shorter than those of other family taxa. The A+T-rich region of C. pomona was the shortest, whereas that of Papilio maraho Shiraki & Sonan (Lepidoptera: Papilionidae) was the longest (Table 4). The majority strand coding nine proteincoding genes and 14 tRNAs were 7,804 and 7,836 bp, respectively, whereas the minority strand coding four protein-coding genes, eight tRNAs, and two rRNA genes were 6,901 and 6,934 bp, respectively, for the two pierid species.

The nucleotide compositions of the two entire mitogenome sequences were biased significantly toward A and T (Table 5). These A+T contents were generally consistent with those of other butterfly mitogenomes, which ranged from 79.1% in Eumenis autonoe Esper (Lepidoptera: Nymphalidae) (Kim, M. J. et al. 2010) to 82.7% in Coreana raphaelis Oberthür (Lepidoptera: Lycaenidae) (Kim, I. et al. 2006). The base composition bias of an individual strand can be described by A+T skewness, caculated by (A%—T%)/(A%+T%), and G+C skewness, calculated by (G%—C%)/(G%+C%). The A+T and G+C skewness values in majority strands were calculated to be -0.122 and -0.121, respectively, for L. morsei , and -0.119 and -0.087, respecitively, for C. pomona.

Protein-coding genes

All the protein-coding genes in L. morsei and C. pomona started with a typical ATN codon, with the only exception represented by the CGA start codon of the COI gene. For L. morsei, three genes (ND2, ND5, ND6) started with ATT, one (ATP8) with ATC, two (ND3, ND4) with ATA, and six (ATP6, ND1, COII, COIII, ND4L, CytB) with ATG. In comparison with L. morsei, ATP8, ND3, and ND6 in C. pomona possessed different start codons, namely ATT, ATT, and ATC, respectively. These start codons were well-conserved in the sequenced butterfly mitogenomes. For instance, the ND2 gene usually used ATT as the start codon, whereas the COII and ATP6 genes frequently used ATG as the start codon (Table 6).

Nine protein-coding genes were terminated with the standard stop codon TAA, whereas the COI, COII, ND4, and ND5 genes used T as a truncated stop codon. The only difference in stop codons between the two pierid species was found in the ND3 gene, that is, L. morsei used TAA instead of TAG, which appeared in C. pomona. Furthermore, incomplete stop codons were detected frequently in the COI, COII, and ND5 genes in most insects, including all sequenced butterfly species (Table 6). Incomplete stop codons would produce functional stop codons after polycistronic transcript cleavage and polyadenylation (Ojala et al. 1981).

Previous studies reported that most lepidopterans used the codon CGA as the start codon for COI (Fig. 2). However, exceptions have been reported; for example, TTG was proposed as the start codon in Acraea issoria Hübner (Lepidoptera: Nymphalidae) (Hu et al. 2010), Caligula boisduvalii Eversmann (Lepidoptera: Saturniidae) (Kim, I. et al. 2006), and Fabriciana nerippe Felder (Lepidoptera: Nymphalidae) (Kim, M. J. et al. 2011a); ATT in A. crataegi (Park et al. 2012) and Ctenoptilum vasava Moore (Lepidoptera: Hesperiidae) (Hao et al. 2012); TTAG in Bombyx mori L. (Lepidoptera: Bombycidae) (Yukuhiro et al. 2002) and C. raphaelis (Hong, G. Y. et al. 2009); and ATTTAG in Ostrinia nubilalis Hübner and Ostrinia furnacalis Guenée (Lepidoptera: Crambidae) (Coates et al. 2005). In this study, a typical ATN initiator for COI in L. morsei and C. pomona was not detected at their starting sites. The putative ATT start codon is commonly located upstream of the COI gene and frequently followed immediately by the TAG or TAA stop codon; thus, the CGA, not ATT, probably acted as the start codon here as in most butterflies.

Excluding stop codons, the A+T contents of protein-coding genes in L. morsei and C. pomona were 79.16% and 80.02% (Table 5), respectively, and these values were similar to those detected in other butterflies, which ranged from 76.8% in E. autonoe (Kim, M. J. et al. 2010) to 81.5% in C. raphaelis (Kim, I. et al. 2006) (Table 2). When the first, second, and third codon positions were considered separately, the highest A+T contents were in the third positions for L. morsei and C. pomona. In addition, the highest T contents were detected in the second positions, and the lowest G contents in the third positions (Table 5).

Exclusive of the stop codon, 3,713 and 3,724 amino acids were encoded by the mitogenomes of L. morsei and C. pomona, respectively (Table 2). The amino acid numbers were well within the size range of 3,586 in Sasakia charonda kuriyamaensis Shirozu (Lepidoptera: Nymphalidae) (Hakozaki et al., unpublished, GenBank accession number NC_014223.1) to 3,740 in Abisara fylloides Moore (Lepidoptera: Riodinidae) (Shi et al. unpublished, College of Life Sciences, Anhui Normal University, China) detected in other butterflies. Among the amino acids, UUU (Phe), UUA (Leu), AUU (Ile), AUA (Met), and AAU (Asn) were the most frequently used codons in L. morsei and C. pomona (Table 7), and similar codons were found in other butterfly species, such as C. vasava (Hao et al. 2012), A. crataegi (Park et al. 2012), and Sericinus montela Gray (Lepidoptera: Papilionidae) (Ji et al. 2012).

Transfer RNAs and ribosomal RNAs

There were 22 tRNA genes (two each for serine and leucine, and one for each of the other amino acids) identified within the two pierid mitogenomes. An additional tRNA-like sequence (tRNALeu [UUR]) located within the 16S rRNA gene was detected in the mitogenome of C. pomona. The 22 tRNA genes ranging in size from 60 to 71 bp were interspersed throughout the two whole mitogenomes. The total sizes of the L. morsei and C. pomona tRNAs were 1,416 and 1,446 bp, respectively, with 80.68 and 81.05% A+T contents, respectively. All tRNAs could be folded into the typical clover leaf secondary structure, whereas tRNASer (AGN) in both mitochondrial genomes lacked the dihydrouridine (DHU) loop. This feature has been shown in the majority of metazoan mitogenomes, including all those sequenced from butterflies (Kim, I. et al. 2006; Hong, G. Y. et al. 2009; Hu et al. 2010; Kim, M. I. et al. 2009; Xia et al. 2011; Wang et al. 2011; Kim, M. J. et al. 2010, 2011a, 2011b; Chen et al. 2012; Shi et al. 2012; Tian et al. 2012). The anticodon for tRNASer (AGN) in mitogenomes of butterfly species was either TCT, GCT, or ACT, whereas only GCT was detected in all other sequenced mitogenomes of pierid species (P. rapae, A. crataegi, D. hyparete, and A. melete) (Hong, G. Y. et al. 2009, Mao et al. 2010, Park et al. 2012, Shi et al. 2012). The tRNA-like structure (tRNALeu [UUR]) was detected in the 16S rRNA gene of C. pomona, and a similar observation had been made in C. vasava (Hao et al. 2012). Interestingly, the 81 bp insertion of the tRNA-like sequence was made up completely of A and T, without G and C nucleotides. The L. morsei and C. pomona anticodon sequences of each tRNA isotype were identical to those of all other sequenced butterfly mitogenomes. As in other insects, unmatched base pairs were also detected in the stems of tRNAs. For L. morsei, there were 32 unmatched base pairs, consisting of 25 G-U, one A-A, and six U-U mismatches, whereas in C. pomona, 21 G-U, two A-A, and six U-U mismatches were identified.

Both of the two pierid mitogenomes harbored a large and a small ribosomal RNA subunit (16S rRNA and 12S rRNA), located between tRNALeu (CUN) and tRNAVal, and between tRNAVal and the A+T-rich region, respectively. The length of the 16S rRNA and 12S rRNA genes in L. morsei were 1,337 and 764 bp, respectively, with A+T contents of 84.29 and 83.25%, respectively; the 16S rRNA and 12S rRNA in C. pomona were 1,332 and 779 bp in length, respectively, with A+T contents of 85.21 and 85.11%, respectively (Table 2).

Intergenic spacers and overlapping sequences

The mitogenomes of L. morsei and C. pomona harbored 11 and 15 intergenic spacers, ranging from 1 to 39 bp (94 bp in total) and 1 to 24 bp (87 bp in total), respectively (Table 3). Among these, only three intergenic spacers were longer than 10 bp in both pierid species (Table 3). The longest intergenic spacers, located between the tRNAGln and ND2 genes in L. morsei and C. pomona, were 39 and 24 bp in length, respectively, with A+T contents of 90.35 and 91.67% respectively. This spacer is present in all of the butterfly mitogenomes sequenced to date, whereas absent in all nonlepidopteran insects. Another long spacer harboring the 7 bp ATACTAA motif, located between the tRNASer (UCN) and ND1 genes, has been observed commonly in most insect groups including all other butterflies.

In addition, there were 33 overlapping nucleotides scattered over 13 locations in L. morsei, and 26 nucleotide overlaps scattered over eight locations in C. pomona (Table 2). Among these overlaps in the two pierid species, the longest one was 8 bp in length and located between tRNATrp and tRNACys with the 7 bp motif AGCCTTA; the second longest one was 7 bp in length and located between ATP8 and ATP6 with the 7 bp motif ATGATAA. Both of these motifs have been observed in the mitogenomes of many butterfly species, including all the other pierids sequenced.

The A+T-rich region

The A+T-rich regions of L. morsei and C. pomona were 356 and 313 bp in length, respectively, with A+T contents of 89.60 and 97.13%, respectively. Among the A+T-rich regions of all the butterfly mitogenomes sequenced, that of C. pomona was the shortest in length and the highest in A+T content (Table 2). The A+T-rich regions of L. morsei and C. pomona contained the motif ATAGA, followed by a 19 and 18 bp poly-T stretch, respectively (Fig. 3). Besides, the regions also included microsatellite-like elements, such as (TA)9(AT)3 in L. morsei and (TA)9 in C. pomona, which were preceded by the ATTA motif characteristic of lepidopteran mitogenomes. Additionally, a triplicated 23 bp and a duplicated 24 bp repeat element of unknown function were found in L. morsei and C. pomona, respectively (Fig. 3), and similar repeat elements were detected in other butterflies, such as A. melete (Hong, G. Y. et al. 2009), E. autonoe (Kim, M. J. et al. 2010), and Agehana maraho (Shiraki and Sonan) (Lepidoptera: Papilionidae) (syn. Papilio maraho) (Wu et al. 2010).

Phylogenetic analysis

Several competing hypotheses exist on the phylogenetic family relationships in butterflies. Ehrlich (1967), Ehrlich and Ehrlich (1967), and Scott (1985) demonstrated the close relationship between the Nymphalidae and Lycaenidae and that between the Pieridae and Papilionidae via numerical taxonomic methods and morphological characters. Son and Kim (2011) and Wahlberg et al. (2005) obtained results consistent with the traditional view of the sister relationship between the Pieridae and the Nymphalidae + Lycaenidae group, with Papilionidae as the basal lineage, in agreement with the earlier hypothesis of Kristensen (1976) and supported by several recent studies (Akaike 1974, Kristensen 1976, de Jong et al. 1996, Ackery et al. 1999, Wahlberg et al. 2005, Liao et al. 2010, Kim, M. I. et al. 2010, Kim, M. J. et al. 2011b, Son et al. 2011). However, Chai et al. (2012) and Zhang et al. (2012), on the basis of mitochondrial genomic data, proposed a close relationship between the Pieridae and Lycaenidae, with the Nymphalidae being the sister group, in agreement with the hypothesis of Robbins (1988). Therefore, controversy exists regarding the relationships among the Nymphalidae, Pieridae, and Lycaenidae.

In this study, we conducted phylogenetic analyses via Bayesian inference and maximum likelihood methods, using concatenated nucleotide datasets of 13 protein-coding genes (6,582 aligned sites, 910 gaps, and 3,746 excluded positions), resulting in similar tree topologies of the butterfly families Papilionidae, Pieridae, Lycaenidae, and Nymphalidae (Fig. 4 A and B). The presented trees showed two major clusters (Fig. 4 A and B). The first one had the Papilionidae as the basal lineage, and the other one included the rest of the butterfly families. Maximum likelihood and Bayesian inference trees suggest a close relationship between Pieridae and Lycaenidae (Fig. 4 A and B), in agreement with the prevailing phylogeny of butterfly families (Hesperiidae (Papilionidae (Nymphalidae (Pieridae, Lycaenidae)))). Although the close relationship of the Pieridae and Lycaenidae proposed herein is contradictory to the traditional view (Pieridae (Nymphalidae, Lycaenidae)), the result is consistent with those of recent studies (Kim, M. J. et al. 2010, Chai et al. 2012, Hao et al. 2012, Zhang et al. 2012). However, uncertainty does exist regarding the sister relationship of the Pieridae and Lycaenidae as shown in the maximum likelihood tree (Fig. 4A). We also note that this relationship has been derived mainly from the protein-coding genes of the mitochondrial genome of butterflies.

In conclusion, although it may still be immature to suggest that the phylogeny of butterfly families is resolved, we do suggest that the sister relationship of the Pieridae and Lycaenidae is supported at the mitogenomic level in this study. We believe that the problem of butterfly phylogeny should be resolved step by step as gene sequence data and morphologic characters are being accumulated, and hopefully more sophisticated analytic tools will become available.

Figure 1.

Map of the circular mitochondrial genome of L. morsei. Genes encoded in the H-strand (clockwise orientation) are colored in red or blue. Genes encoded in the L-strand (anticlockwise orientation) are colored in orange or green. Abbreviations for the genes: COI-III for cytochrome oxidase subunits, CYTB for cytochrome b, and ND1-6 for NADH dehydrogenase components. tRNAs are denoted as one-letter symbols according to the IUPAC-IUB single-letter amino acid codes. High quality figures are available online.


Figure 2.

Alignment of the initiation codons of the COI genes of lepidopterans, including those of L. morsei and C. pomona. The first four or five codons for COI and their amino acids are shown on the right-hand side of the figure. Underlined nucleotides indicate the adjacent partial sequence of tRNATyr. Arrows indicate the transcriptional direction. Boxed nucleotides indicate the currently proposed translation initiators for the COI gene of lepidopteran insects. The start codon for L. morsei and C. pomona is designated as CGA. High quality figures are available online.


Figure 3.

Structures of the A+T-rich regions in the mitogenomes of L. morsei (A) and C. pomona (B). High quality figures are available online.


Figure 4.

Phylogenetic trees of the butterflies in this study based on the nucleotide sequences of 13 protein-coding genes. (A) Maximum likelihood tree. (B) Bayesian inference tree. Numbers at each node indicate bootstrap percentage of maximum likelihood analysis and posterior probability of Bayesian inference analysis. High quality figures are available online.


Table 1.

Mitogenomes of the 33 butterfly species used in this study.


Table 2.

Characteristics of mitogenomes of the 33 butterfly species available.


Table 3.

Summarized characteristics of the mitogenomes of L. morsei (Lm) and C. pomona (Cp).


Table 4.

Size proportion of coding genes, intergenic spacers, and the A+T-rich region to the whole genome of the butterflies in this study.


Table 5.

Nucleotide compositions in L. morsei (Lm) and C. pomona (Cp).


Table 6.

The 13 protein-coding gene initiation and termination codons in the mitogenomes of the 33 butterfly species in this study.


Table 7.

The codon usage in the mitogenomes of L. morsei (Lm) and C. pomona (Cp).



This work was supported by the National Science Foundation of China (Grant No.41172004), the Chinese Academy of Sciences (Grant No. KZCX22YW2JC104), and a grant from the State Key Laboratory of Palaeobiology and Stratigraphy, Nanjing Institute of Geology and Palaeontology, Chinese Academy of Sciences (Grant No. 104143).



P. R. Ackery , R. de Jong , and R. I. Vane-Wright . 1999. The butterflies: Hedyloidea, Hesperioidea, and Papilionoidea, pp. 263–300. In N. P. Kirstensen (ed.), Lepidoptera, moths and butterflies, vol. 1, Evolution, systematics, and biogeography (Handbook of zoology series). De Gruyter, Berlin, Germany. Google Scholar


H. Akaike 1974. A new look at the statistical model identification. IEEE Control Systems Society 19: 716–723. Google Scholar


G. Benson 1999. Tandem Repeats Finder: a program to analyze DNA sequences. Nucleic Acids Research 27: 573–580. Google Scholar


C.F.A. Brunton 1998. The evolution of ultraviolet patterns in European Colias butterflies (Lepidoptera, Pieridae): a phylogeny using mitochondrial DNA. Heredity 80: 611–616. Google Scholar


H. N. Chai , Y. Z. Du , and B. P. Zhai . 2012. Characterization of the complete mitochondrial genomes of Cnaphalocrocis medinalis and Chilo suppressalis (Lepidoptera: Pyralidae). International Journal of Biological Sciences 8: 561–579. Google Scholar


T. A. Chapman 1895. Notes on butterfly pupae, with some remarks on the phylogenesis of the Rhopalocera. Entomologist's Record and Journal of Variation 6: 101–107, 125–131, 147–152. Google Scholar


M. Chen , L. L. Tian , Q. H. Shi , T. W. Cao , and J. S. Hao . 2012. Complete mitogenome of the lesser purple emperor Apatura ilia (Lepidoptera: Nymphalidae: Apaturinae) and comparison with other nymphalid butterflies. Zoological Research 33: 191–201. Google Scholar


D. A. Clayton 1992. Transcription and replication of animal mitochondrial DNA. International Review of Cytology 141: 217–232. Google Scholar


B. S. Coates , D. V. Sumerford , R. L. Hellmich , and L. C. Lewis . 2005. Partial mitochondrial genome sequences of Ostrinia nubilalis and Ostrinia furnicalis. International Journal of Biological Sciences 1: 13–18. Google Scholar


S. P. Courtney 1986. The ecology of pierid butterflies: dynamics and interactions. Advances in Ecological Research 15: 51–131. Google Scholar


R. de Jong , R. I. Vane-Wright , and P. R. Ackery . 1996. The higher classification of butterflies (Lepidoptera): problems and prospects. Insect Systematics and Evolution 27: 65–101. Google Scholar


P. R. Ehrlich 1958. The comparative morphology, phylogeny and higher classification of the butterflies (Lepidoptera: Papilionoidea). University of Kansas Science Bulletin 39: 305–364. Google Scholar


P. R. Ehrlich , and A. H. Ehrlich . 1967. The phenetic relationships of the butterflies. I. Adult taxonomy and the non-specificity hypothesis. Systematic Zoology 16: 301–317. Google Scholar


T. A. Hall 1999. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series 41: 95–98. Google Scholar


J. S. Hao , C. X. Li , X. Y. Sun , and Q. Yang . 2005. The phenetic relationships of the butterflies based on mitochondrial 16S rRNA sequences. Chinese Science Bulletin 12: 1205–1211. Google Scholar


J. S. Hao , Q. Q. Sun , H. B. Zhao , X. Y. Sun , Y. H. Gai , and Q. Yang . 2012. The complete mitochondrial genome of Ctenoptilum vasava (Lepidoptera: Hesperiidae: Pyrginae) and its phylogenetic implication. Comparative and Functional Genomics 2012: 328049. doi: 10.1155/2012/328049Google Scholar


G. Y. Hong , S. T. Jiang , M. Yu , Y. Yang , F. Li , F. S. Xue , and Z. J. Wei . 2009. The complete nucleotide sequence of the mitochondrial genome of the cabbage butterfly, Artogeia melete (Lepidoptera: Pieridae). Acta Biochimica et Biophysica Sinica 41: 446–455. Google Scholar


M. Y. Hong , E. M. Lee , Y. H. Jo , H. C. Park , S. R. Kim , J. S. Hwang , B. R. Jin , P. D. Kang , K. G. Kim , Y. S. Han , and I. Kim . 2008. Complete nucleotide sequence and organization of the mitogenome of the silk moth Caligula boisduvalii (Lepidoptera: Saturniidae) and comparison with other lepidopteran insects. Gene 413: 49–57. Google Scholar


J. Hu , J. S. Hao , D. X. Zhang , D. Y. Huang , S. Cameron , and C. D. Zhu . 2010. The complete mitochondrial genome of the yellow coaster, Acraea issoria (Lepidoptera: Nymphalidae: Heliconiinae: Acraeini): sequence, gene organization and a unique tRNA translocation event. Molecular Biology Reports 37: 3431–3438. Google Scholar


J. P. Huelsenbeck , and F. Ronquist . 2001. MrBayes: Bayesian inference of phylogeny. Bioinformatics 17: 754–755. Google Scholar


L. W. Ji , J. S. Hao , Y. Wang , D. Y. Huang , J. L. Zhao , and C. D. Zhu . 2012. The mitochondrial genome of the dragon swallowtail, Sericinus montela Gray (Lepidoptera: Papilionidae), and its phylogenetic implication. Acta Entomologica Sinica 55: 91–100. Google Scholar


D. J. Kemp , R. L. Rutowski , and M. Mendoza . 2005. Colour pattern evolution in butterflies: a phylogenetic analysis of structural ultraviolet and melanic markings in North American sulphurs. Evolutionary Ecology Research 7: 133–141. Google Scholar


I. Kim , E. M. Lee , K. Y. Seol , E. Y. Yun , Y. B. Lee , J. S. Hwang , and B. R. Jin . 2006. The mitochondrial genome of the Korean hairstreak, Coreana raphaelis (Lepidoptera: Lycaenidae). Insect Molecular Biology 15: 217–225. Google Scholar


M. I. Kim , J. Y. Baek , M. J. Kim , H. C. Jeong , K. G. Kim , C. H. Bae , Y. S. Han , B. R. Jin , and I. Kim . 2009. Complete nucleotide sequence and organization of the mitogenome of the red spotted Apollo butterfly, Parnassius bremeri (Lepidoptera: Papilionidae) and comparison with other lepidopteran insects. Molecules and Cells 28: 347–363. Google Scholar


M. I. Kim , X. L. Wan , M. J. Kim , H. C. Jeong , N. H. Ahn , K. G. Kim , Y. S. Han , and I. Kim . 2010. Phylogenetic relationships of true butterflies (Lepidoptera: Papilionoidea) inferred from COI, 16S rRNA and EF-1α sequences. Molecules and Cells 30: 409–425. Google Scholar


M. J. Kim , X. L. Wan , K. G. Kim , J. S. Hwang , and I. Kim . 2010. Complete nucleotide sequence and organization of the mitogenome of endangered Eumenis autonoe (Lepidoptera: Nymphalidae). African Journal of Biotechnology 9: 735–754. Google Scholar


M. J. Kim , H. C. Jeong , S. R. Kim , and I. Kim . 2011a. Complete mitochondrial genome of the nerippe fritillary butterfly, Argynnis nerippe (Lepidoptera: Nymphalidae). Mitochondrial DNA 22: 86–88. Google Scholar


M. J. Kim , A. R. Kang , H. C. Jeong , K. G. Kim , and I. Kim . 2011b. Reconstructing intraordinal relationships in Lepidoptera using mitochondrial genome data with the description of two newly sequenced lycaenids, Spindasis takanonis and Protantigius superans (Lepidoptera: Lycaenidae). Molecular Phylogenetics and Evolution 61: 436–445. Google Scholar


N. P. Kristensen 1976. Remarks on the family level phylogeny of butterflies (Insecta, Lepidoptera, Rhopalocera). Zeitschrift für Zoologische Systematik und Evolutionsforschung 14: 25–33. Google Scholar


S. Kumar , K. Tamura , and M. Nei . 2004. MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Briefings in Bioinformatics 5: 150–163. Google Scholar


C. Lanave , G. Preparata , C. Saccone , and G. Serio . 1984. A new method for calculating evolutionary substitution rates. Journal of Molecular Evolution 20: 86–93. Google Scholar


F. Liao , L. Wang , S. Wu , Y. P. Li , L. Zhao , G. M. Huang , C. J. Niu , Y. Q. Liu , and M. G. Li . 2010. The complete mitochondrial genome of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae). International Journal of Biological Sciences 6: 172–186. Google Scholar


T. M. Lowe , and S. R. Eddy . 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Research 25: 955–964. Google Scholar


Z. H. Mao , J. S. Hao , G. P. Zhu , J. Hu , M. M. Si , and C. D. Zhu . 2010. Sequencing and analysis of the complete mitochondrial genome of Pieris rapae Linnaeus (Lepidoptera: Pieridae). Acta Entomologica Sinica 5: 1295–1304. Google Scholar


D. Ojala , J. Montoya , and G. Attardi . 1981. tRNA punctuation model of RNA processing in human mitochondria. Nature 290: 470–474. Google Scholar


J. S. Park , Y. Cho , M. J. Kim , S. H. Nam , and I Kim . 2012. Description of complete mitochondrial genome of the black-veined white, Aporia crataegi (Lepidoptera: Papilionoidea), and comparison to papilionoid species. Journal of Asia-Pacific Entomology 15: 331–341. Google Scholar


D. Posada , and K. A. Crandal . 1998. Modeltest: testing the model of DNA substitution. Bioinformatics 14: 817–818. Google Scholar


X. M. Qin , Q. X. Guan , D. L. Zeng , F. Qin , and H. M. Li . 2012. Complete mitochondrial genome of Kallima inachus (Lepidoptera: Nymphalidae: Nymphalinae): comparison of K. inachus and Argynnis hyperbius. Mitochondrial DNA 23(4):318–320. Google Scholar


J. H. Rienks 1985. Phenotypic response to photoperiod and temperature in a tropical pierid butterfly. Australian Journal of Zoology 33: 837–847. Google Scholar


R. K. Robbins 1988. Comparative morphology of the butterfly foreleg coxa and trochanter (Lepidoptera) and its systematic implications. Proceedings of the Entomological Society of Washington 90: 133–154. Google Scholar


J. A. Scott 1985. The phylogeny of butterflies (Papilionoidea and Hesperioidea). Journal of Research on the Lepidoptera 23: 241–281. Google Scholar


Q. H. Shi , J. Xia , X. Y. Sun , J. S. Hao , and Q. Yang . 2012. Complete mitogenome of the Painted Jezebel, Delis hyparete Linnaeus (Lepidoptera: Pieridae) and its comparison with other butterfly species. Zoological Research 33: 111–120. Google Scholar


C. Simon , F. Frati , A. Beckenbach , B. Cresp , H. Liu , and P. Flook . 1994. Evolution, weighting, and phylogenetic utility of mitochondrial gene sequences and a compilation of conserved polymerase chain reaction primers. Annals of the Entomological Society of America 87: 651–701. Google Scholar


V. K. Singh , A. K. Mangalam , S. Dwivedi , and S. Naik . 1998. Primer Premier: program for design of degenerate primers from a protein sequence. Bio Techniques 24: 318–319. Google Scholar


Y. Son , and Y. Kim . 2011. The complete mitochondrial genome of Grapholita molesta (Lepidoptera: Tortricidae). Annals of the Entomological Society of America 104: 788–799. Google Scholar


D. G. Stavenga , S. Stowe , K. Siebke , J. Zeil , and K. Arikawa . 2004. Butterfly wing colours: scale beads make white pierid wings brighter. Proceedings of the Royal Society of London B 271: 1577–1584. Google Scholar


D. L. Swofford 2002. PAUP: Phylogenetic analysis using parsimony (and other methods) version 4.10. Sinauer Associates, Sunderland, MA. Google Scholar


J. D. Thompson , T. J. Gibson , F. Plewniak , F. Jeanmougin , and D. G. Higgins . 1997. The Clustal X Windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research 25: 4876–4882. Google Scholar


L. L. Tian , X. Y. Sun , M. Chen , Y. Gai , J. S. Hao , and Q. Yang . 2012. Complete mitochondrial genome of the five-dot sergeant Parathyma sulpitia (Nymphalidae: Limenitidinae) and its phylogenetic implications. Zoological Research 33: 133–143. Google Scholar


N. Wahlberg , M. F. Braby , A. V. Z. Brower , R. de Jong , M. M. Lee , S. Nylin , N. Pierce , F. A. Sperling , R. Vila , A. D. Warren . 2005. Synergistic effects of combining morphological and molecular data in resolving the phylogeny of butterflies and skippers. Proceedings of the Royal Society B: Biological Sciences 272: 1577–1586. Google Scholar


X. C. Wang , X. Y. Sun , Q. Q. Sun , D. X. Zhang , J. Hu , Q. Yang , and J. S. Hao . 2011. The complete mitochondrial genome of the laced fritillary Argyreus hyperbius (Lepidoptera: Nymphalidae). Zoological Research 32: 465–475. Google Scholar


W. B. Watt , K. Donohue , and P. A. Carter . 1996. Adaptation at specific loci. VI. Divergence vs. parallelism of polymorphic allozymes in molecular function and fitness component effects among Colias species (Lepidoptera, Pieridae). Molecular Biology and Evolution 13: 699–709. Google Scholar


S. J. Weller , D. P. Pashley , and J. A. Martin . 1996. Reassessment of butterfly family relationships using independent genes and morphology. Annals of the Entomological Society of America 89: 184–192. Google Scholar


L. W. Wu , D. C. Lees , S. H. Yen , C. C. Lu , and Y. F. Hsu . 2010. The complete mitochondrial genome of the near-threatened swallowtail, Agehana maraho (Lepidoptera: Papilionidae): evaluating sequence variability and suitable markers for conservation genetic studies. Entomological News 121: 267–280. Google Scholar


J. Xia , J. Hu , G. P. Zhu , C. D. Zhu , and J. S. Hao . 2011. Complete mitochondrial DNA sequence of Calinaga davidis. Acta Entomologica Sinica 54: 555–565. Google Scholar


K. Yukuhiro , H. Sezutsu , M. Itoh , K. Shimizu , and Y. Banno . 2002. Significant levels of sequence divergence and gene rearrangements have occurred between the mitochondrial genomes of the wild mulberry silkmoth, Bombyx mandarina , and its close relative, the domesticated silkmoth, Bombyx mori. Molecular Biology and Evolution 19: 1385–1389. Google Scholar


M. Zhang , X. P. Nie , T. W. Cao , J. P. Wang , T. Li , X. N. Zhang , Y. P. Guo , E. B. Ma , and Y. Zhong . 2012. The complete mitochondrial genome of the butterfly Apatura metis (Lepidoptera: Nymphalidae). Molecular Biology Reports 39: 6529–6536. Google Scholar
This is an open access paper. We use the Creative Commons Attribution 3.0 license that permits unrestricted use, provided that the paper is properly attributed.
Juan-Juan Hao, Jia-Sheng Hao, Xiao-Yan Sun, Lan-Lan Zhang, and Qun Yang "The Complete Mitochondrial Genomes of the Fenton's Wood White, Leptidea morsei, and the Lemon Emigrant, Catopsilia pomona," Journal of Insect Science 14(130), 1-22, (1 October 2014).
Received: 8 October 2012; Accepted: 5 June 2013; Published: 1 October 2014

mitochondrial genome
phylogenetic analysis
Get copyright permission
Back to Top