Skip to main content

Research Repository

Advanced Search

All Outputs (22)

Aletheia - An advanced document layout and text ground-truthing system for production environments (2011)
Conference Proceeding
Clausner, C., Pletschacher, S., & Antonacopoulos, A. (2011). Aletheia - An advanced document layout and text ground-truthing system for production environments. In 2011 International Conference on Document Analysis and Recognition ICDAR 2011. https://doi.org/10.1109/ICDAR.2011.19

Large-scale digitisation has led to a number of new possibilities with regard to adaptive and learning based methods in the field of Document Image Analysis and OCR. For ground truth production of large corpora, however, there is still a gap in terms... Read More about Aletheia - An advanced document layout and text ground-truthing system for production environments.

The ENP image and ground truth dataset of historical newspapers
Book Chapter
Clausner, C., Papadopoulos, C., Pletschacher, S., & Antonacopoulos, A. The ENP image and ground truth dataset of historical newspapers. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR) (931-935). IEEE-CPS. https://doi.org/10.1109/ICDAR.2015.7333898

This paper presents a research dataset of historical newspapers comprising over 500 page images, uniquely representative of European cultural heritage from the digitization projects of 12 national and major European libraries, created within the scop... Read More about The ENP image and ground truth dataset of historical newspapers.