Click on “Download PDF” for the PDF version or on the title for the HTML version.


If you are not an ASABE member or if your employer has not arranged for access to the full-text, Click here for options.

An automated online learning framework for insect pest image classification model enhancement

Published by the American Society of Agricultural and Biological Engineers, St. Joseph, Michigan www.asabe.org

Citation:  2023 ASABE Annual International Meeting  2300292.(doi:10.13031/aim.202300292)
Authors:   Ching-Kuang Chen, Ta-Te Lin
Keywords:   Online Learning, Deep Learning, Gaussian Mixture Model, Insect Pest Classification

Abstract. To effectively manage insect pests during crop growth, it is important to have accurate data on the types and numbers of insect pests present in the field. The prime way to ascertain this information is to use the image-based deep-learning recognition model for insect detection and recognition. Making a deep learning model generalizable, typically requires a large amount of manually labelled data, which can be both time-consuming and labor-intensive. Thus, this research proposed an automated online learning framework for insect pests image classification for two main purposes: (1) To automate the collection of training dataset without the need for manual labelling in order to reduce the time and effort required for data preparation; (2) To maintain or improve the model performance over time by continuously tuning the pre-existing model by incorporating new data. The AIoT imaging devices developed by our previous research were used to automatically and periodically collect images of sticky paper traps. The new training samples were selected by using the Gaussian Mixture Model (GMM) trained on the features of the insect images, extracted by the CNN model, and setting the threshold by calculating the percentile value of the log-probability of the training dataset. After the new samples were collected, the fine-tuning method was used to update the previous reference model to continuously maintain or improve the model performance without human effort. In this study, the dataset was collected by our system for five years and the performance of supervised learning and the proposed automated online learning method were compared. The results showed that the proposed method gradually improved the F1-score from 0.898 to 0.956, which was comparable to the F1-score of 0.958 achieved by supervised learning. The GMM sample cleaning process ensures that the collected samples have certain feature similarities with the correct samples. Therefore, even if noisy or incorrect samples are collected, they will not cause the model to crash during the retraining process. This result indicates that the proposed method effectively enhances model performance while reducing human labor, making it a promising and resource-efficient approach for improving model performance.

(Download PDF)    (Export to EndNotes)