Hybrid machine learning for imbalanced lettuce disease classification
International Journal of Artificial Intelligence
Abstract
This study investigates a hybrid machine learning framework combining EfficientNet-B3 feature extraction with classical classifiers for lettuce disease classification under conditions of extreme class imbalance. The system utilizes EfficientNet-B3 to extract high-dimensional feature embeddings from 2,337 images, which are subsequently classified using support vector machine (SVM), random forest (RF), and k-nearest neighbors (KNN). Although the proposed SVM-based model achieves a high overall accuracy of 94.01%, experimental results reveal a substantial performance discrepancy compared to the macro F1-score of 37.94%. This critical gap indicates that while the model successfully identifies the majority classes, it fails to recognize rare disease categories with limited samples. Theoretical analysis suggests that while SVM handles high-dimensional feature spaces more effectively than RF and KNN, the deep features extracted are biased toward majority class characteristics. These findings highlight the severe limitations of accuracy-centric evaluation in agricultural diagnostics and demonstrate that deep feature extraction alone is insufficient to guarantee robust detection for minority pathologies. The study concludes that relying on aggregate accuracy can mask diagnostic failures, emphasizing the urgent need for per-class performance analysis and data-level mitigation strategies in future research.
Discover Our Library
Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.





