Umapada Pal
A Comprehensive Review on Text Detection and Recognition in Scene Images
Pal, Umapada; Halder, Arnab; Shivakumara, Palaiahnakote; Blumenstein, Michael
Authors
Arnab Halder
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer
Michael Blumenstein
Abstract
Detecting and recognizing text in natural scene images and videos is vital for several real-world applications, such as the analysis of crime scene CCTV footage, sports videos, and autonomous driving, to name a few. These applications pose several challenges, notably the detection and identification of arbitrarily oriented and arbitrarily shaped text in videos and natural environments. Many methods have been developed to address these challenges, including advanced deep-learning models and transformers. Because so many methods are available in the literature, it is not easy to understand their open challenges, applications, directions, scope, limitations, and weaknesses. There is therefore a need for a survey that highlights and discusses the strengths and weaknesses of the developed methods. This review presents different categories of work and discusses their importance, limitations, new challenges, applications, and, finally, future directions, so that readers can choose appropriate methods and directions for research on text detection and recognition in natural scenes and videos.
Citation
Pal, U., Halder, A., Shivakumara, P., & Blumenstein, M. (2024). A Comprehensive Review on Text Detection and Recognition in Scene Images. Artificial Intelligence and Applications. https://doi.org/10.47852/AIA42022755
Journal Article Type | Article |
---|---|
Acceptance Date | Sep 25, 2024 |
Publication Date | Oct 28, 2024 |
Deposit Date | Nov 22, 2024 |
Publicly Available Date | Nov 26, 2024 |
Journal | Artificial Intelligence and Applications |
Peer Reviewed | Peer Reviewed |
Series ISSN | 2811-0854 |
DOI | https://doi.org/10.47852/AIA42022755 |
Files
Published Version
(2.8 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/