Skip to main content

Research Repository

Advanced Search

Multi-oriented text detection for intra-frame in H.264/AVC video

Minemura, Kazuki; Palaiahnakote, Shivakumara; Wong, KokSheik

Authors

Kazuki Minemura

KokSheik Wong



Abstract

Text detection in compressed video has received much attention in recent years due to the effectiveness of DCT coefficients and motion vectors in realizing several applications. In this paper, a new text detection, which utilizes AC coefficients in the H.264/AVC compressed video, is proposed. The proposed median deviation of coefficients from a specific subband is first computed, then the k-means clustering and morphological operations are applied to classify the text candidates. The majority orientation is considered to eliminate false positive candidate groups that have different orientations. Local block energy information is extracted to obtain the final text candidates. Experimental results show that the proposed method outperforms the existing methods either in computational time or accuracy in detecting horizontal text. Furthermore, for non-horizontal text, the proposed method is superior to all the conventional methods considered.

Presentation Conference Type Conference Paper (published)
Conference Name 2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)
Start Date Dec 1, 2014
End Date Dec 4, 2014
Online Publication Date Jan 29, 2015
Publication Date Jan 29, 2015
Deposit Date Nov 15, 2024
Publisher Institute of Electrical and Electronics Engineers
Book Title 2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)
ISBN 9781479961207
DOI https://doi.org/10.1109/ISPACS.2014.7024478