Yao Xiao
A text-context-aware CNN network for multi-oriented and multi-language scene text detection
Xiao, Yao; Xue, Minglong; Lu, Tong; Wu, Yirui; Palaiahnakote, Shivakumara
Authors
Abstract
The existing deep learning based state-of-theart scene text detection methods treat scene texts a type of general objects, or segment text regions directly. The latter category achieves remarkable detection results on arbitrary orientation and large aspect ratios of scene texts based on instance segmentation algorithms. However, due to the lack of context information with consideration of scene text unique characteristics, directly applying instance segmentation to text detection task is prone to result in low accuracy, especially producing false positive detection results. To ease this problem, we propose a novel text-context-aware scene text detection CNN structure, which appropriately encodes channel and spatial attention information to construct context-aware and discriminative feature map for multi-oriented and multi-language text detection tasks. With high representation ability of text context-aware feature map, the proposed instance segmentation based method can not only robustly detect multi-oriented and multi-language text from natural scene images, but also produce better text detection results by greatly reducing false positives. Experiments on ICDAR2015 and ICDAR2017-MLT datasets show that the proposed method has achieved superior performances in precision, recall and F-measure than most of the existing studies.
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 2019 International Conference on Document Analysis and Recognition (ICDAR) |
Start Date | Sep 20, 2019 |
End Date | Sep 25, 2019 |
Online Publication Date | Feb 3, 2020 |
Publication Date | Feb 3, 2020 |
Deposit Date | Nov 15, 2024 |
Publisher | Institute of Electrical and Electronics Engineers |
Series ISSN | 2379-2140 |
Book Title | 2019 International Conference on Document Analysis and Recognition (ICDAR) |
ISBN | 9781728128610 |
DOI | https://doi.org/10.1109/ICDAR.2019.00116 |
You might also like
Altered Handwritten Text Detection in Document Images Using Deep Learning
(2024)
Journal Article
A novel autoencoder for structural anomalies detection in river tunnel operation
(2023)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search