Kazuki Minemura
Multi-oriented text detection for intra-frame in H.264/AVC video
Minemura, Kazuki; Palaiahnakote, Shivakumara; Wong, KokSheik
Authors
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer in Computer Vision
KokSheik Wong
Abstract
Text detection in compressed video has received much attention in recent years due to the effectiveness of DCT coefficients and motion vectors in realizing several applications. In this paper, a new text detection method that utilizes AC coefficients in H.264/AVC compressed video is proposed. The median deviation of coefficients from a specific subband is first computed; k-means clustering and morphological operations are then applied to classify the text candidates. The majority orientation is used to eliminate false positive candidate groups that have different orientations. Local block energy information is then extracted to obtain the final text candidates. Experimental results show that the proposed method outperforms the existing methods in either computational time or accuracy when detecting horizontal text. Furthermore, for non-horizontal text, the proposed method is superior to all the conventional methods considered.
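The first two stages named in the abstract (a per-block deviation feature, followed by clustering and morphology) can be illustrated with a minimal sketch. This is not the authors' implementation: the 4x4 block layout, the choice of subband positions, the definition of "median deviation" as the mean absolute deviation from the median, the use of two clusters, and the 3x3 structuring element are all assumptions made for illustration. The function names are hypothetical, and the orientation and block-energy stages are not covered.

```python
import numpy as np

def median_deviation(coeffs, subband):
    """Per-block feature: mean absolute deviation from the median of the
    selected coefficient magnitudes (an assumed definition).
    coeffs: (H, W, 4, 4) array, one 4x4 coefficient block per grid cell.
    subband: list of (row, col) positions inside a block (assumed choice)."""
    rows = [r for r, _ in subband]
    cols = [c for _, c in subband]
    sel = np.abs(coeffs[:, :, rows, cols]).astype(float)   # (H, W, n)
    med = np.median(sel, axis=-1, keepdims=True)
    return np.mean(np.abs(sel - med), axis=-1)             # (H, W)

def two_means(values, iters=20):
    """1-D k-means with k=2; returns a mask of the higher-valued cluster."""
    v = values.ravel()
    centers = np.array([v.min(), v.max()], dtype=float)
    for _ in range(iters):
        labels = np.abs(v[:, None] - centers[None, :]).argmin(axis=1)
        for k in range(2):
            if np.any(labels == k):
                centers[k] = v[labels == k].mean()
    labels = np.abs(v[:, None] - centers[None, :]).argmin(axis=1)
    return (labels == 1).reshape(values.shape)

def dilate(mask):
    """Binary dilation with a 3x3 structuring element."""
    h, w = mask.shape
    padded = np.pad(mask, 1, constant_values=False)
    out = np.zeros_like(mask)
    for dy in range(3):
        for dx in range(3):
            out |= padded[dy:dy + h, dx:dx + w]
    return out

def erode(mask):
    """Binary erosion via duality with dilation."""
    return ~dilate(~mask)

def detect_text_blocks(coeffs, subband):
    """Return (raw, closed): the clustered mask and its morphological closing."""
    raw = two_means(median_deviation(coeffs, subband))
    closed = erode(dilate(raw))   # closing bridges small gaps between blocks
    return raw, closed

# Synthetic example: a horizontal run of "text-like" blocks with a one-block gap.
coeffs = np.zeros((6, 10, 4, 4))
for c in (2, 3, 5, 6):
    coeffs[2, c, 0, 1:4] = (40, 5, 0)
subband = [(0, 1), (0, 2), (0, 3)]   # hypothetical subband positions
raw, closed = detect_text_blocks(coeffs, subband)
```

In this synthetic case the clustering marks the four high-deviation blocks, and the closing step bridges the single-block gap so that the candidate region becomes one contiguous horizontal run.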
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) |
Start Date | Dec 1, 2014 |
End Date | Dec 4, 2014 |
Online Publication Date | Jan 29, 2015 |
Publication Date | Jan 29, 2015 |
Deposit Date | Nov 15, 2024 |
Publisher | Institute of Electrical and Electronics Engineers |
Book Title | 2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) |
ISBN | 9781479961207 |
DOI | https://doi.org/10.1109/ISPACS.2014.7024478 |