Rough-fuzzy based scene categorization for text detection and recognition in video
Roy, Sangheeta; Shivakumara, Palaiahnakote; Jain, Namita; Khare, Vijeta; Dutta, Anjan; Pal, Umapada; Lu, Tong
Authors
Sangheeta Roy
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer in Computer Vision
Namita Jain
Vijeta Khare
Anjan Dutta
Umapada Pal
Tong Lu
Abstract
Scene image or video understanding is a challenging task, especially when the number of video types increases drastically with high variations in background and foreground. This paper proposes a new method for categorizing scene videos into different classes, namely Animation, Outlet, Sports, e-Learning, Medical, Weather, Defense, Economics, Animal Planet and Technology, to improve the performance of text detection and recognition, which is an effective approach for scene image or video understanding. For this purpose, we first present a new combination of rough and fuzzy concepts to study the irregular shapes of edge components in input scene videos, which helps to classify edge components into several groups. Next, the proposed method explores the gradient direction information of each pixel in each edge component group to extract stroke-based features by dividing each group into several intra and inter planes. We further extract correlation and covariance features to encode semantic features located inside planes or between planes. The features of the intra and inter planes of the groups are then concatenated to form a feature matrix. Finally, the feature matrix is verified with temporal frames and fed to a neural network for categorization. Experimental results show that the proposed method outperforms existing state-of-the-art methods; at the same time, the performance of text detection and recognition methods also improves significantly as a result of categorization.
Citation
Roy, S., Shivakumara, P., Jain, N., Khare, V., Dutta, A., Pal, U., & Lu, T. (2018). Rough-fuzzy based scene categorization for text detection and recognition in video. Pattern Recognition, 80, 64-82. https://doi.org/10.1016/j.patcog.2018.02.014
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 11, 2018 |
Online Publication Date | Mar 12, 2018 |
Publication Date | Aug 2018 |
Deposit Date | Feb 2, 2024 |
Journal | Pattern Recognition |
Print ISSN | 0031-3203 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 80 |
Pages | 64-82 |
DOI | https://doi.org/10.1016/j.patcog.2018.02.014 |