Open Access
27 October 2022 Instar Determination for the Tomato Leafminer Tuta absoluta (Lepidoptera: Gelechiidae) Using the Density-Based OPTICS Clustering Algorithm
Wenqian Wang, Guanli Xiao, Baoyun Yang, Jvhui Ye, Xu Zhang, Yaqiang Zheng, Bin Chen
Author Affiliations +
Abstract

The tomato leafminer Tuta absoluta (Meyrick) is one of the most harmful pests of solanaceous crops. Its larval morphological characteristics are similar, making the distinguishing between different larval instars difficult. Accurate identification of T. absoluta instars is necessary either for population outbreak forecasting, or developing successful control programs. Although a clustering algorithm can be used to determine the number of larval instars, little is known regarding the use of density-based ordering points to identify the clustering structure (OPTICS) and determine the number of larvae. In this study, larval instars of 240 T. absoluta individuals were determined by the density-based OPTICS clustering method, based on mandible width, and head capsule width and length. To verify the feasibility of the OPTICS clustering method, we compared it with the density-based spatial clustering of applications with noise (DBSCAN) clustering algorithm, Gaussian mixture models, and k-means. Additionally, the instars determined by the clustering methods were verified using the Brooks–Dyar rule, Crosby rule, and linear regression model. The instars determined by the OPTICS clustering method were equal to those determined by the other types of clustering algorithms, and the instar results were consistent with the Brooks–Dyar rule, Crosby rule, frequency analysis, and logarithmic regression model. These results indicated that the OPTICS clustering method is robust for determining insect larva instar phase. Moreover, it was found that three morphological indices of T. absoluta can be used for determining instars of this pest in the field, which may provide important information for the management of T. absoluta populations.

Determination of the instar distribution of a pest population can provide important information for management because spray applications at a particular stage of pest larval development may increase the control efficacy. Thus, accurate instar determination in pests is important for forecasting the outbreak of pest insects, life table analyses, key factor analyses, and other aspects of biology (Esperk et al. 2007). The toxicity of some insecticides, as well as the efficiency larval parasitoids, and the level of expression of certain genes with instar-wise variation are instar-specific (Ramasubramanian et al. 2020). The most direct method for determining the instar number is identifying morphological features by observing the molting larvae (Puri 1925, Kiguchi and Agui 1981). However, this method is suitable for insects with significant differences in the instar morphological characteristics and short growth cycles that can be easily distinguished. For larval is that involve similar morphological characteristics, long growth periods, and spend most of their time inside leaves, stems, and fruits of plants. Thus, the instars cannot be easily distinguished.

Therefore, some rules, models, and clustering algorithms have been developed to solve this problem. Traditional instar division is based on Dyar's rule, where a certain external morphological index of the insect body is measured (Dyar and Rhinebeck 1890). The larval development and growth rule was analyzed by the frequency distribution method to determine the age of the larvae, and the growth rule of Brooks (1886), Floater (1996), and Crosby (1973) was used to assist the inference. However, these methods are based on the fact that biological characteristics tend to have a normal distribution, the frequency density curve is bell-shaped, and the instar number is then preliminarily assumed according to the data distribution state (Mcclellan et al. 1994, Jones et al. 2005, Wang et al. 2009). If the overlap area of two normal distributions is too large, then insect larval instars cannot be determined.

To account for these problems, some cluster analyses, such as the Gaussian mixture models (GMM, Wu et al. 2013), linear discriminant method (LDA, Hunt and Chapman 2001), k-means clustering algorithm (Yang et al. 2018), univariate kernel density estimation (Chen and Seybold 2013), adaptive kernel smoothing method (Cen et al. 2015), adaptive bivariate kernel smoothing method (Cen et al. 2018), density-based spatial clustering of applications with noise (DBSCAN) (Zheng et al. 2019), and artificial neural networks (Carmo et al. 2020), have been proposed. DBSCAN is a typical density clustering algorithm. Compared with k-means clustering and Gaussian mixture models, which are generally only applicable to convex sample sets, DBSCAN can be applied to both convex and nonconvex sample sets (Hahsler et al. 2019). However, its inability to find clusters of varying densities is a notable drawback of DBSCAN, resulting from the fact that a combination of a specific neighborhood size with a single density threshold minPts is used to determine if a point resides in a dense neighborhood.

Ordering points to identify the clustering structure (OPTICS) is proposed based on the DBSCAN clustering algorithm to reduce the instability of the clustering results caused by the parameter settings (Ankerst et al. 1999). OPTICS borrows the core density-reachable concept from DBSCAN. However, while DBSCAN may be thought of as a clustering algorithm, searching for natural groups in data, OPTICS is an augmented ordering algorithm from which either flat or hierarchical clustering results can be derived. OPTICS introduces two additional concepts called core-distance and reachability-distance. The algorithm starts with a point and expands its neighborhood in a manner similar to that of DBSCAN, but it explores new points in the order of lowest to highest core-distance. The order in which the points are explored in addition to each point's core- and reachability-distance is the final result of the algorithm (Hahsler et al. 2019). However, to the best of our knowledge, no study using this algorithm has focused on determining the number of insect instars in the field of entomology.

The tomato leafminer Tuta absoluta (Meyrick) (Lepidoptera: Gelechiidae) is an important tomato pest native to Peru, South America (Desneux et al. 2010). This insect has been found in more than 80 countries and regions in South America, Europe, Africa, Central America, and Asia (Desneux et al. 2011, Biondi et al. 2017, Campos et al. 2017, Blazhevski et al. 2018, Mansour et al. 2018, Han et al. 2019, Zhang et al. 2020). Tuta absoluta is considered a typical invasive species due to its capacity to develop very quickly under suitable agroecological conditions, rapidly spread in new areas, and cause economically related damage (Tropea et al. 2012, Martins et al. 2016). If no prevention and control measures are taken, then this pest can cause losses of up to 80–100% in the yield of tomato crops in recently invaded areas, and thus, it has become a major threat to global tomato production (Desneux et al. 2010). Therefore, accurate and effective prevention and control of tomato leafminer is extremely urgent.

Although previous studies found that there are four larval instars in the life cycle of T. absoluta (Desneux et al. 2010), because its leaf-mining and stem-, or fruit-boring behavior, and its larval morphological characteristics are similar, it is difficult to accurately determine the specific larval instar. However, because it was proven that the third instar larvae of T. absoluta is the most susceptible to some strains of Beauveria bassiana (Balsamo) (Hypocreales: Cordycipitaceae) and Bacillus thuringiensis (Berliner) (Bacillales: Bacillaceae), while susceptibility was lower in the second instar larvae stage (Tsoulnara and Port 2016), the determination of the specific larval instar is very important for the control this pest. Furthermore, the predatory insect Dicyphus errans (Wolff) (Hemiptera: Miridae) is more willing to prey on first instar larvae of T. absoluta (Ingegno et al. 2013). Therefore, rapid and accurate determination of the specific larval instar of the tomato leafminer is necessary for predicting its occurrence and for comprehensive prevention and control. Unless its larval instars can be accurately determined, there will be great difficulty in preventing and controlling this pest.

In this study, we used the OPTICS clustering algorithm (density-based clustering) as a new method to assess the larval instar grouping of T. absoluta larvae. Larval instars were measured using head capsule width and length and mandible width of the newly hatched to mature T. absoluta larvae. The results of the OPTICS clustering algorithm were compared with DBSCAN, another density-based clustering algorithm, as well as Gaussian mixture models (centroid-based clustering) and k-means clustering (distribution-based clustering). Through this work, we intend to provide a theoretical basis for studying the biological characteristics of T. absoluta, predicting pest outbreaks, and improving the existing integrated pest management strategy.

Materials and Methods

Insect Colony

The initial population of T. absoluta was collected in the greenhouse of Yunnan Agricultural University (102°44′56″E, 25°07′54″N). The experimental population was subcultured for three generations at room temperature on the Shouhe tomato.

Measurements of T. absoluta Morphological Characteristics

The adults of the above populations were placed in a nylon cage with potted tomato plants (1 × 1 m, 200 mesh) and raised at room temperature. After 24 hr of oviposition, the adults were removed, and the eggs were incubated. During the entire study period (from the newly hatched larvae to mature larva pupation), 20 larvae were randomly collected daily at 9:00 a.m. The end of larval instar was identified when the mature larvae dropped from infested leaves. Larvae were rinsed from the leaves and collected into a Petri dish using a paintbrush and a 2.5-ml liquid injector and then preserved in 70% alcohol (Jones et al. 2005, Yang et al. 2018, Zheng et al. 2019). The head capsule width and length and mandible width of the larvae were measured to the nearest 0.001 cm using a Leica stereo-microscope M205FA (Leica). These three characteristics were used together in the analysis to determine the instar stage.

OPTICS Clustering

Because OPTICS clustering was developed from the DBSCAN algorithm, we will first briefly introduce the DBSCAN algorithm. The DBSCAN algorithm is one of the most cited density-based clustering algorithms (Microsoft Academic Search 2017), and it is probably the most commonly used density-based clustering algorithm in today's scientific community (Ester et al. 1996). The central idea behind DBSCAN and its extensions and revisions is the notion that points are assigned to the same cluster if they are density reachable from each other. The DBSCAN algorithm identifies all such clusters by systematically finding all core points and expanding each to all density-reachable points. It is not always easy to determine the appropriate values for the two parameters and minPts. The parameters depend on the dataset and influence each other. Detailed information regarding these clusters has been described in previous studies (Hahsler et al. 2019, Zheng et al. 2019).

The inability to find clusters of varying density is a notable drawback of DBSCAN resulting from the fact that a combination of a specific neighborhood size with a single density threshold minPts is used to determine if a point resides in a dense neighborhood. OPTICS requires the same and minPts parameters as DBSCAN. However, the parameter is theoretically unnecessary and is only used for the practical purpose of reducing the runtime complexity of the algorithm. To describe OPTICS, we introduced two additional concepts called core-distance and reachability-distance.

Definition 1:

-neighborhood. The ∊-neighborhood, N (p), of a data point p is the set of points within a specified radius ∊ around p.

e01_940.gif

where d is some distance measure and ∊ ∈ R+. Note that together with p ∈ D this definition implies that point p is always part of its own ∊-neighborhood, i.e., p ∊ always holds. The size of the neighborhood |N(p)| showed as a simple unnormalized kernel density estimate around p using a uniform kernel with a bandwidth of ∊.

Definition 2:

Core-distance. The core-distance of a point p ∈D with respect to minPts and ∊ is defined as:

e02_940.gif

where minPts-dist(p) is the distance from p to its minPts-1 nearest neighbor, i.e., the minimal radius a neighborhood of size minPts centred at and including p would have.

Definition 3:

Reachability-distance. The reachability-distance of a point p ∈ D to a point q ∈ D parameterized by ∊ and minPts is defined as:

e03_940.gif

The reachability-distance of a core point p with respect to object q is the smallest neighborhood radius such that p would be directly density-reachable from q. Note that the parameters, although they have the same name, work differently than those in DBSCAN. OPTICS is typically set to a very large value compared to that of DBSCAN. Therefore, OPTICS will consider additional nearest neighbors in the core-distance calculation, and minPts affects the smoothness of the reachability distribution, where larger values will lead to a smoother reachability distribution. This scenario needs to be considered when choosing appropriate parameters. It is worth noting that the parameter is strictly used for computational reasons, and it restricts the number of points considered in the neighborhood search. It can safely be set to the maximum k-nearest neighbor (k-NN) distance, where k = minPts, and will achieve the same result as if were set to. Detailed information regarding these clusters has been described in previous studies (Hahsler et al. 2019).

The instar results for the OPTICS algorithm were assessed by the Brooks–Dyar and Crosby rules, frequency analysis, and logarithmic regression model (Dyar and Rhinebeck 1890, Crosby 1973). Detailed information regarding these procedures has been described in previous studies (Wu et al. 2013, Yang et al. 2018, Zheng et al. 2019).

Comparative Tests

To verify that the OPTICS clustering method is appropriate, DBSCAN clustering, k-means clustering, and Gaussian mixture models were also used to determine instars for T. absoluta. Detailed information regarding DBSCAN clustering is described above. Gaussian mixture model-based clustering was conducted based on the maximum-BIC model selected using the Bayesian information criterion (BIC). The BIC for parameterized Gaussian mixture models was fitted using the EM algorithm initialized by model-based hierarchical clustering. Detailed information on Gaussian mixture models has been described in previous studies (Wu et al. 2013, Scrucca et al. 2016, Zheng et al. 2019). Detailed information on k-means clustering has also been described in previous studies (MacQueen 1967, Wagstaff et al. 2001, Yang et al. 2018). The instar results for the above three clustering methods were also assessed by the Brooks–Dyar and Crosby rules, frequency analysis, and logarithmic regression model (Dyar and Rhinebeck 1890, Crosby 1973).

Statistical Analysis Software

The statistical analyses were performed in R software, version 4.1.0 (R Development Core Team, 2017). OPTICS and DBSCAN clustering were performed using the ‘dbscan’ package (Hahsler et al. 2019, Zheng et al. 2019). Gaussian mixture models were performed using the ‘mclust’ package (Scrucca et al. 2016), and k-means clustering was performed using the ‘Amap’ package (Lucas 2019). Linear regression equation fitting was performed using the ‘basic Trendline’ package (Mei et al. 2018). Data visualization was performed using the ‘ggpubr’, ‘ggplot2’, and ‘scatterplot3d’ packages (Ligges et al. 2002, Kassambara 2017).

Results

Determining T. absoluta Larval Instars by the OPTICS Clustering Algorithm

The OPTICS clustering algorithm was performed at a default MinPts of 5 points, and the results are shown as a reachability plot and convex hull plot (Fig. 1a and b). The OPTICS clustering algorithm was used on 240 larval individuals to correctly categorize them into one of four T. absoluta larval instar stages using head capsule width and length and mandible width. A total of 69, 42, 35, and 94 larvae at the first, second, third, and fourth instar phases, respectively, were recorded.

The averages of each clustering group, as determined by the OPTICS clustering algorithm, were verified using the Brooks–Dyar rule, Crosby rule, size-frequency distribution, and linear regression model of the Brooks–Dyar equation (Table 1 and Fig. 1). The Brooks–Dyar index of head capsule length and width and mandible width calculated using the OPTICS clustering algorithm ranged from 1.569 to 1.720, 1.497 to 1.601, and 1.439 to 1.632, respectively, indicating a constant rate. The Crosby index of major morphological characteristics determined by the OPTICS clustering algorithm was less than 10%, showing that the cluster determined by the OPTICS clustering algorithm was sustainable.

Fig. 1.

Determining the T. absoluta larval instar stage using the ordering points to identify the clustering structure (OPTICS) clustering algorithm. (a–c) The results for T. absoluta larval instars, as determined by the OPTICS clustering algorithm map using the original data; (d–f) OPTICS clustering algorithm map of size-frequency distributions. The different T. absoluta larval instars determined using the OPTICS clustering algorithm are annotated with different colors.

img-z4-1_940.jpg

The T. absoluta larval instars determined by the OPTICS clustering algorithm were mapped to the size-frequency distribution (Fig. 1d–f). The clusters of all morphological characteristics fit the size-frequency distribution well, and there was no overlap in the histogram of the size-frequency distribution. We also performed principal component analysis (PCA) (Fig. 1b), and the results showed that the four clusters produced by the OPTICS clustering algorithm were completely mapped.

The log-transformed size of the morphological characteristics in each clustering group, as determined by the OPTICS clustering algorithm, was plotted using the linear regression model of the Brooks–Dyar equation (Fig. 2). The Brooks–Dyar equation for mandible width was y=0.440 x-0.543 (R2 = 0.981, df = 238, RSS (residual sum of squares) = 1.392, p < 0.0001), and the growth rate constant eb was 1.553. The Brooks–Dyar equation for head capsule width was y=0.452 x+0.199 (R2 = 0.992, df = 238, RSS=0.657, p < 0.0001), and the growth rate constant eb was 1.571. The Brooks–Dyar equation for head capsule length was y=0.486 x-0.265 (R2 = 0.981, df = 238, RSS=1.773, p < 0.0001), and the growth rate constant eb was 1.626. Thus, the size of each morphological characteristic exponentially increased with the T. absoluta larval instars. The results also showed that the clusters created by the OPTICS clustering algorithm were consistent with those created by the Brooks–Dyar rule, further supporting its reliability for determining the number of T. absoluta larval instars.

Table 1.

Statistical analyses of the T. absoluta larval instars using four types of clustering algorithms

img-Anq_940.gif

Comparative Tests

To verify that the OPTICS clustering algorithm is appropriate and scientific, DBSCAN clustering, k-means clustering, and Gaussian mixture model creation were performed. The same instar clusters determined by the OPTICS clustering algorithm were determined by the DBSCAN clustering, k-means clustering, and Gaussian mixture models. For DBSCAN clustering, the same instar clusters were obtained with a neighborhood radius eps of 0.04 and a minimum number of points (MinPts) of 4 (Fig. 3). According to the largest BIC value (Fig. 4), the unconstrained model with four clusters was selected, and the same instar clusters were determined by the Gaussian mixture models. For k-means clustering, the same instar clusters were obtained at k = 4.

The clusters of all morphological characteristics, as determined by DBSCAN clustering, k-means clustering, and the Gaussian mixture models, were the same as those determined by the OPTICS clustering algorithm and fit the Brooks–Dyar rule and Crosby rule well (Table 1). Thus, OPTICS clustering can be used for determining T. absoluta larval instars.

Discussion

In this study, the density-based OPTICS clustering algorithm was used to determine the number of T. absoluta larval instars based on head capsule width and length and mandible width, and the results were the same as those obtained through the DBSCAN clustering algorithm (density-based clustering), Gaussian mixture models (centroid-based clustering), and k-means clustering (distribution-based clustering) and fit the Brooks–Dyar and Crosby rules, and size-frequency distribution well. Our results showed that T. absoluta larval instars were reliably and effectively determined by the OPTICS clustering algorithm and three other types of clustering algorithms. The results indicated that this density-based clustering method is a promising tool for the identification of T. absoluta larval instars. It will not only be the necessary raw material for forecasting but also for developing successful control programs for T. absoluta.

Gaussian mixture models, as distribution-based clustering has been used in determing the instars of Blaptica dubia (Serville) (Blattodea: Blaberidae) (Wu et al. 2013), k-means clustering, as centroid-based clustering has been used in determing the Simulium quinquestriatum (Shiraki) (Diptera: Simuliidae) (Yang et al. 2018). However, these algorithms are greatly influenced by subjective factors of users. For Gaussian mixture models, the clustering was conducted based on the model selected using the BIC, which is rather complicated. For k-means clustering, the k parameter signifies the user-specified number of clusters to find (Hahsler et al. 2019). Density-based clustering approaches have become increasingly popular due to their ability to capture clusters of arbitrary shapes, including nonconvex shapes. Although, density-based clustering of density-based spatial clustering of applications with noise (DBSCAN) clustering has been successfully used to determine the instars of Phthorimaea operculella (Zeller) (Lepidoptera: Gelechiidae) (Zheng et al. 2019). while this density-based clustering, two initial parameters , and minPts need to be manually set and entered by the user, and the clustering results are very sensitive to the values of these two parameters. Slightly different parameter settings will produce different clustering results and has some difficulties in distinguishing separated clusters if they are located too close to each other, even though they have different densities (Kanagala and Krishnaiah 2016). To overcome this problem, OPTICS algorithm was developed. Compared with the DBSCAN algorithm, the OPTICS algorithm addresses the defects of DBSCAN. OPTICS borrows the core density-reachable concept from DBSCAN. Although the OPTICS options also require the same two parameters as DBSCAN, the parameter is not necessary in theory and is only used to reduce the runtime complexity of the algorithm (Hahsler et al. 2019). In this study, our results show that the OPTICS clustering algorithm can be successfully determined T. absoluta instar stages. It should be noted that the same instar stages of T. absoluta clusters were determined by the OPTICS algorithm and the other algorithms, and these stages may be easily distinguishable for T. absoluta instar and more useful for dealing with complex data for insect instars. However, the advantages of the OPTICS clustering algorithm compared with those of the other three clustering methods are that it reduces the sensitivity to initial parameters, the clustering results are stable and unique, suggesting that the OPTICS clustering algorithm is a promising alternative approach for the identification of insect larval instars.

Fig. 2.

Linear regression relationship between the measurements of the morphological characteristics with 95% confidence intervals and instar number. (a) Mandible width, (b) head capsule width, and (c) head capsule length.

img-z6-1_940.jpg

Fig. 3.

k-nearest neighbor (k-NN) distance plot (a knee—the optimal neighborhood radius eps—is visible near a 4-NN distance of 0.04).

img-z6-3_940.jpg

Fig. 4.

Bayesian information criterion (BIC) plot of the measurement dataset of the T. absoluta larval morphological characteristics. Note: ‘EII' is spherical and has an equal volume; ‘VII' is spherical and has an unequal volume; ‘EEI' is diagonal and has an equal volume and shape; ‘VEI' is diagonal and has a varying volume and equal shape; ‘EVI' is diagonal and has a varying volume and varying shape; ‘VVI' is diagonal and has a varying volume and shape; ‘EEE' is ellipsoidal and has an equal volume, shape, and orientation; ‘EVE’ is ellipsoidal and has an equal volume and orientation; ‘VEE’ is ellipsoidal and has an equal shape and orientation; ‘VVE’ is ellipsoidal and has an equal orientation; ‘EEV’ is ellipsoidal and has equal volume and equal shape; ‘VEV’ is ellipsoidal and has an equal shape; ‘EVV’ is ellipsoidal and has an equal volume; and ‘VVV’ is ellipsoidal and has a varying volume, shape, and orientation. Where the maximum is taken over all of the models and numbers of components considered.

img-z6-6_940.jpg

No overlapping size-frequency distributions for the morphological characteristics, head capsule width and length or mandible width were found, indicating that these three morphological characteristics can be distinguished in T. absoluta larval instars. Among the three parameters, the division of head capsule width was the clearest, indicating that head capsule width can be used as the most optimal index for age identification, and this phenomenon also exists at the instar division of many other insects (Flaherty et al. 2012, Chen and Seybold 2013, Thakur 2016, Zheng et al. 2019). In summary, the size of three morphological characteristics for different instar stages is specified by the OPTICS clustering algorithm. This data can be used for rapid and accurate instar division and increase the efficiency of integrated management strategies for this pest in the field.

The instar number of insect larvae exhibits intraspecific differences. Photoperiod, temperature, humidity, food quality and quantity, and feeding density are the most common factors affecting the instar number of insect larvae. In the case of insufficient food or unfavorable environmental conditions, the instar number of insect larvae will change (Esperk et al. 2007, Shintani and Ishikawa 2010, Goguen and Moreau 2013), and the instar number usually tends to increase under adverse rather than favorable conditions (Esperk et al. 2007). Therefore, to increase the accuracy of instar divisions and thus increase the effectiveness of pest control, it is necessary for future studies to combine field data to observe the differences between field- and laboratory-reared instars under different photo-period population densities at different temperatures and under different nutritional conditions. Nevertheless, the use of the OPTICS clustering algorithm to determine T. absoluta larval instar stages is herein shown to be reliable and effective, which indicates that this method can also be used to determine the instar stages of other insects. In this study, the OPTICs algorithm was used to determine the size range of the different instars of T. absoluta, which provided basic information for the classification of larval instars. However, in the field, the insect is small, and its morphological indicators are still difficult to determine in filed for farmers even technicians in plant protection station, but the instar of larva can be determined using a microscope and manually measure the characteristics under the fewer samples in field. Moreover, these methods will be more practical in field when the development of simple measurement tools (such as designing a software to judge the age by taking photos) and the development of the algorithm into a simple program (such as determining the age by inputting measured data), it will be a further study.

Acknowledgments

We thank the Science and Technology Key Special Project of Yunnan Province (2019ZG00902) and Technology Major Project of the China National Tobacco Corporation (110202101049[LS-09]), and Guizhou Provincial Science and Technology Projects (Qian Ke He [2021] general 203).

Author Contributions

W.W., B.C., and Y.Z. conceived the research. W.W., G.X., B.Y., X.Z., and B.C. conducted the experiments. W.W., B.C., B.Y., and J.Y. analyzed the data and conducted the statistical analyses. W.W. wrote the manuscript. W.W., G.X., B.C., and Y.Z. revised the manuscript. All authors have read and agreed to the published version of the manuscript.

References Cited

1.

Ankerst, M., M. M. Breunig, H. P. Kriegel, and J. Sander. 1999. OPTICS: ordering points to identify the clustering structure. Proc. 1999 ACM SIGMOD Int Conf Manag Data. 49–60. Google Scholar

2.

Biondi, A., R. Guedes, F. H. Wan, and N. Desneux. 2017. Ecology, worldwide spread, and management of the invasive South American tomato pinworm, Tuta absoluta: past, present, and future. Annu. Rev. Entomol. 63: 239–258. Google Scholar

3.

Blazhevski, S., A. P. Kalaitzaki, and A. E. Tsagkarakis. 2018. Impact of nitrogen and potassium fertilization regimes on the biology of the tomato leaf miner Tuta absoluta. Entomol. Gen. 37: 157–174. Google Scholar

4.

Brooks, W. K. 1886. Report on the Stomatopoda collected by H. M. S. Challenger during the Years 1873–1876, vol. 5. Neill and Co., London. p. 105. Google Scholar

5.

Campos, M., A. Biondi, A. Adiga, R. N. C. Guedes, and N. Desneux. 2017. From the Western Palaearctic region to beyond: Tuta absoluta 10 years after invading Europe. J. Pest Sci. 90: 787–796. Google Scholar

6.

Carmo, D. G., E. S. Farias, T. L. Costa, E. A. Queiroz, M. Nascimento, and M. C. Picanço. 2020. Instar determination of Blaptostethus pallescens (Hemiptera: Anthocoridae) using artificial neural networks. Ann. Entomol. Soc. 113: 50–54. Google Scholar

7.

Cen, G. J., X. Y. Gao, X. Z. Long, X. R. Zeng, J. M. Liu, D. W. Wei, and Y. H. Yu. 2018. An adaptive bivariate kernel smoothing method for determining instars of Austrosimulium tillyardianum (Diptera: Simuliidae) larvae. Ecol. Entomol. 43: 412–421. Google Scholar

8.

Cen, G. J., Y. H. Yu, X. R. Zeng, X. Z. Long, D. W. We, X. Y. Gao, and T. Zeng. 2015. An adaptive kernel smoothing method for classifying, Austrosimulium tillyardianumr, (Diptera: Simuliidae) larval instars. J. Insect Sci. 15: 159–166. Google Scholar

9.

Chen, Y., and S. J. Seybold. 2013. Application of a frequency distribution method for determining instars of the beet armyworm (Lepidoptera: Noctuidae) from widths of cast head capsules. J. Econ. Entomol. 106: 800–806. Google Scholar

10.

Crosby, T. K. 1973. Dyar's rule predated by Brooks' rule. N. Z. Entomol. 5: 175–176. Google Scholar

11.

Desneux, N., M. G. Luna, T. Guillemaud, and A. Urbaneja. 2011. The invasive South American tomato pinworm, Tuta absoluta, continues to spread in Afro-Eurasia and beyond: the new threat to tomato world production. J. Pest Sci. 84: 403–408. Google Scholar

12.

Desneux, N., E. Wajnberg, K. A. G. Wyckhuys, G. Burgio, S. Arpaia, C. A. Narva´ez-Vasquez, J. Gonza´lez-Cabrera, D. C. Ruescas, E. Tabone, J. Frandon , et al. 2010. Biological invasion of European tomato crops by Tuta absoluta: ecology, geographic expansion and prospects for biological control. J. Pest Sci. 83: 197–215. Google Scholar

13.

Dyar, H. G., and N. Y. Rhinebeck. 1890. The number of molts of lepidopterous larvae. Psyche. 5: 420–422. Google Scholar

14.

Esperk, T., T. Tammaru, and S. Nylin. 2007. Intraspecific variability in number of larval instars in insects. J. Environ. Entomol. 100: 627–645. Google Scholar

15.

Ester, M., H. P. Kriegel, J. Sanders, and X. Xu. 1996. A density-based algorithm for discovering clusters in large spatial databases with noise , pp. 226–231. In KDD96 Proceedings. AAAI Press. Google Scholar

16.

Flaherty, L., J. Regniere, and J. Sweeney. 2012. Number of instars and sexual dimorphism of Tetropium fuscum (Coleoptera: Cerambycidae) larvae determined by maximum likelihood. Can. Entomol. 144: 720–726. Google Scholar

17.

Floater, G. J. 1996. The Brooks-Dyar rule and morphometrics of the processionary caterpillar Ochrogaster lunifer Herrich-Schaffer (Lepidoptera: Thaumetopoeidae). Aust. J. Entomol. 35: 271–278. Google Scholar

18.

Goguen, J., and G. Moreau. 2013. Factors affecting head capsule development in field populations of Altica sylvia (Coleoptera: Chrysomelidae). Can. Entomol. 145: 40–47. Google Scholar

19.

Hahsler, M., M. Piekenbrock, and D. Doran. 2019. DBSCAN: Fast Density-Based Clustering with R. J. Stat. Softw. 99: 1–30. Google Scholar

20.

Han, P., Y. Bayram, L. Shaltiel-Harpaz, F. Sohrabi, A. Saji, U. E. Esenali, A. Jalilov, A. Ali, P. R. Shashank, K. Ismoilov , et al. 2019. Tuta absoluta continues to disperse in Asia: damage, ongoing management and future challenges. J. Pest Sci. 92: 1317–1327. Google Scholar

21.

Hunt, G., and E. R. Chapman. 2001. Evaluating hypotheses of instar-grouping in arthropods: a maximum likelihood approach. Paleobiology. 27: 466–484. Google Scholar

22.

Ingegno, B. L., C. Ferracini, D. Gallinotti, A. Alma, and L. Tavella. 2013. Evaluation of the effectiveness of Dicyphus errans (Wolff) as predator of Tuta absoluta (Meyrick). Biol. Control 67: 246–252. Google Scholar

23.

Jones, V. P., C. C. Eastburn, T. D. Wilburn, and J. F. Brunner. 2005. Instar-specific phenology of Pandemis pyrusana and Choristoneura rosaceana (Lepidoptera: Tortricidae) in Washington apple orchards. J. Econ. Entomol. 98: 875–883. Google Scholar

24.

Kanagala, H. K., and V. V. J. R. Krishnaiah. 2016. A comparative study of K-Means, DBSCAN and OPTICS. In 2016 International Conference on Computer Communication and Informatics (ICCCI). IEEE. Google Scholar

25.

Kassambara, A. 2017. ggpubr: ‘ggplot2’ based publication ready plots (version 0.2). Retrieved from  https://cran.r-project.org/web/packages/ggpubr/index.htmlGoogle Scholar

26.

Kiguchi, K., and N. Agui. 1981. Ecdysteroid levels and developmental events during larval moulting in the silkworm, Bombyx mori. J. Insect Physiol. 27: 805–812. Google Scholar

27.

Ligges U., M. Maechler, and S. Schnackenberg. 2002. Scatterplot3d-An R package for visualizing multivariate data. Retrieved from  https://cran.rproject.org/web/packages/scatterplot3d/scatterplot3d.pdfGoogle Scholar

28.

Lucas, A. 2019. Amap package. Retrieved from  https://cran.r-project.org/web/packages/amap/amap.pdfGoogle Scholar

29.

MacQueen, J. B. 1967. Some methods for classification and analysis of multivariate observations, pp. 281–297. In Proceedings of the fifth symposium on math, statistics, and probability. Berkeley, CA: University of California Press. Google Scholar

30.

Mansour, R., T. Brévault, A. Chailleux, A. Cherif, K. Grissa-Lebdi, K. Haddi, S. A. Mohamed, R. S. Nofemela, A. Oke, S. Sylla , et al. 2018. Occurrence, biology, natural enemies and management Tuta absoluta in Africa. Entomol. Gen. 38: 83–111. Google Scholar

31.

Martins, J. C., M. C. Picanco, L. Bacci, R. N. C. Guedes, Jr. P. A. Santana, D. O. Ferreira, and M. Chediak. 2016. Life table determination of thermal requirements of the tomato borer Tuta absoluta. J. Pest Sci. 89: 897–908. Google Scholar

32.

Mcclellan, Q. C., and J. A. Logan. 1994. Instar determination for the gypsy moth (Lepidoptera: Lymantriidae) based on the frequency distribution of head capsule widths. Environ. Entomol. 23: 248–253. Google Scholar

33.

Mei, W., G. Yu, and J. Lai. 2018. Trendline and confidence interval of basic regression models to plot. Retrieved from  https://cran.r-project.org/web/packages/basicTrendline/index.htmlGoogle Scholar

34.

Microsoft Academic Search. 2017. Top publications in data mining.  https://academic.microsoft.com/#/search?iq=@data%20mining@&q=data%20mining&filters=&from=56&sort=3. Accessed on October 02, 2017. Google Scholar

35.

Puri, I. M. 1925. On the life-history and structure of the early stages of simuliidae (Diptera, Nematocera). Part I. Parasitology. 17: 335295–335369. Google Scholar

36.

R Development Core Team. 2017. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from  https://cran.r-project.org/web/packages/basicTrendline/index.htmlGoogle Scholar

37.

Ramasubramanian, T., R. T. Sonai, and S. E. Madhu. 2020. Instar determination for sugarcane internode borer Chilo sacchariphagus indicus (Kapur) (Lepidoptera: Crambidae). J. Asia-Pac. Entomol. 24: 461–469. Google Scholar

38.

Scrucca, L., F. Michael, T. B. Murphy, and A. E. Raftery. 2016. mclust 5: Clustering, classification and density estimation using Gaussian finite mixture models. R J. 8: 289–317. Google Scholar

39.

Shintani, Y., and Y. Ishikawa. 2010. Photoperiodic control of larval diapause in the yellow-spotted longicorn beetle, Psacothea hilaris: analysis by photoperiod manipulation. Entomol. Exp. Et Appl. 86: 41–48. Google Scholar

40.

Thakur, B. 2016. The study of head capsule width of different larval instars of Indian gypsy moth Lymantria obfuscata Walker in Himachal Pradesh (India). J. Entomol. Zool. Stud. 4: 42–46. Google Scholar

41.

Tropea, G. G., G. Siscaro, A. Biondi, and L. Zappala. 2012. Tuta absoluta, a South American pest of tomato now in the EPPO region: biology, distribution and damage. EPPO Bull. 42: 205–210. Google Scholar

42.

Tsoulnara, D., and G. Port. 2016. Efficacy of a Beauveria bassiana strain, Bacillus thuringiensis and their combination against the tomato leafminer Tuta absoluta. Entomol. Hell. 25: 23–30. Google Scholar

43.

Wagstaff, K., C. Cardie, S. Rogers, and S. Schrödl. 2001. Constrained K-means clustering with background knowledge. In Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001). Williams College. Morgan Kaufmann Publishers Inc., Williamstown, MA. Google Scholar

44.

Wang, J. W., Y. Q. Luo, and S. X. Zong. 2009. Characterization of larval instars of artemisia weevil, Adosomus sp. For. Sci. Pract.. 11: 20–23. Google Scholar

45.

Wu, H., A. G. Appel, and X. P. Hu. 2013. Instar determination of Blaptica dubia (Blattodea: Blaberidae) using Gaussian mixture models. Ann. Entomol. Soc. Am. 106: 323–328. Google Scholar

46.

Yang, Y. M., R. Jia, H. Xun, J. Yang, Q. Chen, X. G. Zeng, and M. Yang. 2018. Determining the number of instars in Simulium quinquestriatum (Diptera: Simuliidae) using k-Means clustering via the Canberra distance. J. Med. Entomol. 55: 808–816. Google Scholar

47.

Zhang, G. F., D. Y. Ma, Y. S. Wang, Y. H. Gao, W. X. Liu, R. Zhang, W. J. Fu, X. Q. Xian, J. Wang, M. Kuang , et al. 2020. First report of the South American tomato leafminer, Tuta absoluta (Meyrick), in China. J. Integr. Agric. 19: 1912–1917. Google Scholar

48.

Zheng, Y. Q., W. Q. Wang, B. Chen, L. M. Zhang, S. Phangthavong, Z. T. Su, L. Y. Zhang, and G. L. Xiao. 2019. Determining the number of instars in potato tuber moth Phthorimaea operculella (Zeller) using density-based DBSCAN clustering. J. Appl. Entomol. 143: 1080–1088. Google Scholar
© The Author(s) 2022. Published by Oxford University Press on behalf of Entomological Society of America.
Wenqian Wang, Guanli Xiao, Baoyun Yang, Jvhui Ye, Xu Zhang, Yaqiang Zheng, and Bin Chen "Instar Determination for the Tomato Leafminer Tuta absoluta (Lepidoptera: Gelechiidae) Using the Density-Based OPTICS Clustering Algorithm," Environmental Entomology 51(5), 940-947, (27 October 2022). https://doi.org/10.1093/ee/nvac065
Received: 11 March 2022; Accepted: 27 July 2022; Published: 27 October 2022
KEYWORDS
larval instar
OPTICS clustering algorithm
Tomato leafminer
Tuta absoluta
Back to Top