Research Repository

All Outputs (2)

Word-Based adaptive OCR for historical books (2009)
Conference Proceeding
Kluzner, V., Tzadok, A., Shimony, Y., Walach, E., & Antonacopoulos, A. (2009). Word-Based adaptive OCR for historical books. In 2009 10th International Conference on Document Analysis and Recognition. https://doi.org/10.1109/ICDAR.2009.133

The aim of this work is to propose a new approach to the recognition of historical texts by providing an adaptive mechanism that automatically tunes itself to a specific book. The system is based on clustering together all the similar words in a book...

A new framework for recognition of heavily degraded characters in historical typewritten documents based on semi-supervised clustering (2009)
Conference Proceeding
Pletschacher, S., Hu, J., & Antonacopoulos, A. (2009). A new framework for recognition of heavily degraded characters in historical typewritten documents based on semi-supervised clustering. In 2009 10th International Conference on Document Analysis and Recognition. https://doi.org/10.1109/ICDAR.2009.267

This paper presents a new semi-supervised clustering framework for the recognition of heavily degraded characters in historical typewritten documents, where off-the-shelf OCR typically fails. The constraints are generated using typographical (colle...