Department of Data Science and Information Technology, Faculty of Science,Udon Thani Rajabhat University, Udonthani, Thailand.
Department of Data Science and Information Technology, Faculty of Science,Udon Thani Rajabhat University, Udonthani, Thailand.
Master's student in Data Science and Information Technology
Department of Data Science and Information Technology, Faculty of Science,Udon Thani Rajabhat University, Udonthani, Thailand.
Assistant Professor in the Department of Data Science and Information Technology at Udon Thani Rajabhat University.
Department of Data Science and Information Technology, Faculty of Science,Udon Thani Rajabhat University, Udonthani, Thailand.
Lecturer in the Department of Data Science and Information Technology at Udon Thani Rajabhat University.
This study aimed to develop and evaluate the performance of deep learning models for classifying the quality of Golden Nam Dok Mai mangoes under real-world environmental conditions. A dataset of 1,200 mango images, categorized into three quality levels—Grade A, B, and C—was utilized. The experiments were conducted using 5-fold cross-validation across five distinct neural network architectures: custom CNN, ResNet50, VGG16, MobileNetV2, and EfficientNetB0, while comparing performance under conditions with and without image augmentation. The experimental results revealed that the MobileNetV2 model achieved the most distinguished performance, with both accuracy and F1-score reaching a perfect 100% after the fine-tuning process. This was followed by ResNet50, which achieved an accuracy of 99.44%, while VGG16 and EfficientNetB0 also demonstrated robust classification capabilities. Although statistical analysis using one-way ANOVA indicated no significant difference among the models (p > 0.05), the average performance of MobileNetV2 remained the highest. Beyond classification precision, this study emphasized computational costs and latency for industrial applicability. To ensure measurement stability, each model underwent 100 warm-up iterations. The custom CNN exhibited the lowest latency at 5.48 ms per image, followed by VGG16 and ResNet50 with latencies of 12.73 ms and 36.54 ms, respectively. The MobileNetV2 recorded a latency of 42.64 ms, confirming their technical readiness for real-time processing in high-speed conveyor systems. Furthermore, the perfect results obtained from MobileNetV2 were validated for stability, suggesting that the model effectively captured discriminative physical visual features under the defined experimental conditions, without evident overfitting. These findings highlight the significant potential of transfer learning combined with fine-tuning in enhancing fruit grading standards and increasing the competitiveness of the Thai agricultural sector. Finally, this research serves as a fundamental framework for future advancements, including the integration of multimodal learning and the implementation of Edge AI for commercial use and smart farming solutions.

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.