Efficient text detection and recognition in natural scene images using novel blended ensemble deep learning

Rajeswari Reddy Patil

Aradhana Dammergidda

International Journal of Artificial Intelligence

Efficient text detection and recognition in natural scene images using novel blended ensemble deep learning

Abstract

Text detection and recognition in natural scene images is a critical task in computer vision, with applications ranging from document analysis to autonomous navigation. This work presents a robust and efficient pipeline that integrates YOLOv8 for text detection and EasyOCR for recognition, enhanced by an adaptive preprocessing mechanism between the two stages. The YOLOv8 model is trained on a custom dataset with polygonal annotations converted into YOLO format ensures precise bounding box formations around the text regions. An adaptive preprocessing module dynamically optimizes the detected regions adjusting resolution, noise reduction, and orientation before passing them to EasyOCR, significantly improving robustness. The lightweight yet powerful EasyOCR engine then recognizes text across diverse fonts, styles, and orientations. Evaluated on the benchmark Total-Text dataset, the proposed method demonstrates superior performance in detection accuracy, recognition precision, and computational efficiency. Additionally, this work provides a detailed analysis of training metrics, to validate the model’s robustness. The proposed system is scalable and can be integrated into real-time applications such as license plate recognition, document digitization, and assistive technologies for the visually impaired.

Cite

Full View

DOI

10.11591/ijai.v15.i2.pp1664-1679

ISSN Information

2089-4872

Pages

1664-1679

More Information

Volume 15

Issue 2

Publish at 2026-04-01

Discover Our Library

Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.

Explore Now