G Retsinas
Historical typewritten document recognition using minimal user interaction
Retsinas, G; Gatos, B; Antonacopoulos, Apostolos; Louloudis, G; Stamatopoulos, N
Authors
B Gatos
Prof Apostolos Antonacopoulos A.Antonacopoulos@salford.ac.uk
Professor
G Louloudis
N Stamatopoulos
Abstract
Recognition of low-quality historical typewritten documents can still be considered as a challenging and difficult task due to several issues i.e. the existence of faint and degraded characters, stains, tears, punch holes etc. In this paper, we exploit the unique characteristics of historical typewritten documents in order to propose an efficient recognition methodology that requires minimum user interaction. It is based on a pre-processing stage in order to enhance the quality and extract connected components, on a semi-supervised clustering for detecting the most representative character samples and on a segmentation-free recognition stage based on a template matching and cross-correlation technique. Experimental results prove that even with minimum user interaction, the proposed method can lead to promising accuracy results.
Citation
Retsinas, G., Gatos, B., Antonacopoulos, A., Louloudis, G., & Stamatopoulos, N. (2015). Historical typewritten document recognition using minimal user interaction. In Proceedings of the 3rd Workshop on Historical Document Imaging and Processing (HIP2015) (31-38). ACM Digital Library. https://doi.org/10.1145/2809544.2809559
Publication Date | Jan 1, 2015 |
---|---|
Deposit Date | Mar 22, 2016 |
Pages | 31-38 |
Book Title | Proceedings of the 3rd Workshop on Historical Document Imaging and Processing (HIP2015) |
ISBN | 9781450336024 |
DOI | https://doi.org/10.1145/2809544.2809559 |
Additional Information | Funders : European Commission |
You might also like
Text line segmentation from struck-out handwritten document images
(2022)
Journal Article
A new deep wavefront based model for text localization in 3D video
(2021)
Journal Article
A survey of OCR evaluation tools and metrics
(2021)
Conference Proceeding
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search