Efficient text detection and recognition in natural scene images using novel blended ensemble deep learning
International Journal of Artificial Intelligence
Abstract
Text detection and recognition in natural scene images is a critical task in computer vision, with applications ranging from document analysis to autonomous navigation. This work presents a robust and efficient pipeline that integrates YOLOv8 for text detection and EasyOCR for recognition, enhanced by an adaptive preprocessing mechanism between the two stages. The YOLOv8 model is trained on a custom dataset with polygonal annotations converted into YOLO format ensures precise bounding box formations around the text regions. An adaptive preprocessing module dynamically optimizes the detected regions adjusting resolution, noise reduction, and orientation before passing them to EasyOCR, significantly improving robustness. The lightweight yet powerful EasyOCR engine then recognizes text across diverse fonts, styles, and orientations. Evaluated on the benchmark Total-Text dataset, the proposed method demonstrates superior performance in detection accuracy, recognition precision, and computational efficiency. Additionally, this work provides a detailed analysis of training metrics, to validate the model’s robustness. The proposed system is scalable and can be integrated into real-time applications such as license plate recognition, document digitization, and assistive technologies for the visually impaired.
Discover Our Library
Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.





