V Kluzner
Word-Based adaptive OCR for historical books
Kluzner, V; Tzadok, A; Shimony, Y; Walach, E; Antonacopoulos, A
Authors
Abstract
The aim of this work is to propose a new approach to the recognition of historical texts by providing an adaptive mechanism that automatically tunes itself to a specific book. The system is based on clustering together all the similar words in a book/text and simultaneously handling entire class. The paper describes the architecture of such a system and new algorithms that have been developed for robust word image comparison (including registration, optical flow based distortion compensation, and adaptive binarization). Results for a large dataset are presented as well. Over 23% recognition improvement is demonstrated.
Citation
Kluzner, V., Tzadok, A., Shimony, Y., Walach, E., & Antonacopoulos, A. (2009). Word-Based adaptive OCR for historical books. In 2009 10th International Conference on Document Analysis and Recognition. https://doi.org/10.1109/ICDAR.2009.133
Conference Name | Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on |
---|---|
Conference Location | Barcelona |
Start Date | Jul 26, 2009 |
End Date | Jul 29, 2009 |
Publication Date | Jan 1, 2009 |
Deposit Date | Jun 19, 2014 |
Book Title | 2009 10th International Conference on Document Analysis and Recognition |
DOI | https://doi.org/10.1109/ICDAR.2009.133 |
Keywords | historical book, image recognition, optical character recognition, word-based adaptive OCR. |
Publisher URL | http://dx.doi.org/10.1109/ICDAR.2009.133 |
Related Public URLs | http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=5277471 |
Additional Information | Event Type : Conference |
You might also like
A survey of OCR evaluation tools and metrics
(2021)
Conference Proceeding
VISE : an interface for Visual Search and Exploration of museum collections
(2019)
Journal Article
Efficient and effective OCR engine training
(2019)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search