Sangheeta Roy
Rough-fuzzy based scene categorization for text detection and recognition in video
Roy, Sangheeta; Shivakumara, Palaiahnakote; Jain, Namita; Khare, Vijeta; Dutta, Anjan; Pal, Umapada; Lu, Tong
Authors
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer
Namita Jain
Vijeta Khare
Anjan Dutta
Umapada Pal
Tong Lu
Abstract
Scene image or video understanding is a challenging task, especially when the number of video types increases drastically with high variations in background and foreground. This paper proposes a new method for categorizing scene videos into different classes, namely Animation, Outlet, Sports, e-Learning, Medical, Weather, Defense, Economics, Animal Planet and Technology, to improve the performance of text detection and recognition, which is an effective approach to scene image or video understanding. To this end, we first present a new combination of rough and fuzzy concepts to study the irregular shapes of edge components in input scene videos, which helps classify edge components into several groups. Next, the proposed method explores the gradient direction information of each pixel in each edge component group to extract stroke-based features by dividing each group into several intra and inter planes. We further extract correlation and covariance features to encode semantic features located inside planes or between planes. The features of the intra and inter planes of the groups are then concatenated to form a feature matrix. Finally, the feature matrix is verified with temporal frames and fed to a neural network for categorization. Experimental results show that the proposed method outperforms existing state-of-the-art methods; at the same time, the performance of text detection and recognition methods also improves significantly as a result of categorization.
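As a rough illustration of the feature step described in the abstract, the sketch below bins edge pixels by gradient direction into "planes" and then computes covariance and correlation between pairs of planes. The abstract does not specify how planes are formed or which statistics are paired, so the four-way direction quantization, the function names (`direction_planes`, `plane_features`) and the pairwise layout are all assumptions for illustration, not the authors' implementation. The rough-fuzzy grouping, temporal verification and neural network stages are not shown.

```python
import math

def direction_planes(gray, edge_pixels, n_planes=4):
    """Assign each edge pixel to one plane by quantized gradient direction.

    gray: 2D list of intensities; edge_pixels: list of (row, col) tuples.
    Returns n_planes indicator vectors, indexed in edge_pixels order.
    The 4-bin quantization is an assumption, not taken from the paper.
    """
    h, w = len(gray), len(gray[0])
    planes = [[0.0] * len(edge_pixels) for _ in range(n_planes)]
    for i, (y, x) in enumerate(edge_pixels):
        # Central differences, clamped at the image border.
        gx = gray[y][min(x + 1, w - 1)] - gray[y][max(x - 1, 0)]
        gy = gray[min(y + 1, h - 1)][x] - gray[max(y - 1, 0)][x]
        theta = math.atan2(gy, gx)  # direction in [-pi, pi]
        k = int((theta + math.pi) / (2 * math.pi) * n_planes)
        planes[min(k, n_planes - 1)][i] = 1.0
    return planes

def covariance(a, b):
    """Population covariance of two equal-length vectors."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    return sum((p - ma) * (q - mb) for p, q in zip(a, b)) / len(a)

def correlation(a, b):
    """Pearson correlation; returns 0.0 when either vector is constant."""
    denom = math.sqrt(covariance(a, a) * covariance(b, b))
    return covariance(a, b) / denom if denom > 0 else 0.0

def plane_features(planes):
    """Concatenate covariance and correlation for every pair of planes."""
    feats = []
    for i in range(len(planes)):
        for j in range(i + 1, len(planes)):
            feats.append(covariance(planes[i], planes[j]))
            feats.append(correlation(planes[i], planes[j]))
    return feats

# Toy usage: a 5x5 synthetic image with every pixel treated as an edge pixel.
gray = [[x + 2 * y * y for x in range(5)] for y in range(5)]
edge_pixels = [(y, x) for y in range(5) for x in range(5)]
planes = direction_planes(gray, edge_pixels)
features = plane_features(planes)
```

With four planes this yields six plane pairs and therefore twelve values per edge component group; in a pipeline like the one the abstract outlines, such vectors from each group would be concatenated before classification.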
Citation
Roy, S., Shivakumara, P., Jain, N., Khare, V., Dutta, A., Pal, U., & Lu, T. (2018). Rough-fuzzy based scene categorization for text detection and recognition in video. Pattern Recognition, 80, 64-82. https://doi.org/10.1016/j.patcog.2018.02.014
Field | Value |
---|---|
Journal Article Type | Article |
Acceptance Date | Feb 11, 2018 |
Online Publication Date | Mar 12, 2018 |
Publication Date | 2018-08 |
Deposit Date | Feb 2, 2024 |
Journal | Pattern Recognition |
Print ISSN | 0031-3203 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 80 |
Pages | 64-82 |
DOI | https://doi.org/10.1016/j.patcog.2018.02.014 |