Skip to main content

Research Repository

Advanced Search

All Outputs (48)

Visual representation of text in web documents and its interpretation (2005)
Book Chapter
Karatzas, D., & Antonacopoulos, A. (2005). Visual representation of text in web documents and its interpretation. In G. Malcolm (Ed.), Multidisciplinary approaches to visual representations and interpretations (181-196). Elsevier. https://doi.org/10.1016/S1571-0831%2804%2980041-8

This chapter examines the uses of text and its representation on Web documents in terms of the challenges in its interpretation. Particular attention is paid to the significant problem of non-uniform representation of text. This non-uniformity is mai... Read More about Visual representation of text in web documents and its interpretation.

A robust braille recognition system (2004)
Book Chapter
Antonacopoulos, A., & Bridson, D. (2004). A robust braille recognition system. In A. Dengel, & S. Marinai (Eds.), Document analysis systems VI (533-545). Springer Berlin / Heidelberg. https://doi.org/10.1007/b100557

Braille is the most effective means of written communication between visually-impaired and sighted people. This paper describes a new system that recognizes Braille characters in scanned Braille document pages. Unlike most other approaches, an ine... Read More about A robust braille recognition system.

Page segmentation using the description of the background (1998)
Journal Article
Antonacopoulos, A. (1998). Page segmentation using the description of the background. Computer Vision and Image Understanding, 70(3), 350-369. https://doi.org/10.1006/cviu.1998.0691

There is an ever increasing number of publications which do not have the “traditional” layout where printed regions are rectangu- lar. Text paragraphs and areas of graphic type may be of any shape, individually rotated and in any arrangement. Previou... Read More about Page segmentation using the description of the background.

The ENP image and ground truth dataset of historical newspapers
Book Chapter
Clausner, C., Papadopoulos, C., Pletschacher, S., & Antonacopoulos, A. The ENP image and ground truth dataset of historical newspapers. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR) (931-935). IEEE-CPS. https://doi.org/10.1109/ICDAR.2015.7333898

This paper presents a research dataset of historical newspapers comprising over 500 page images, uniquely representative of European cultural heritage from the digitization projects of 12 national and major European libraries, created within the scop... Read More about The ENP image and ground truth dataset of historical newspapers.

The lifecycle of a digital historical document: structure and content
Conference Proceeding
Antonacopoulos, A., Wiszniewski, B., Krawczyk, H., & Karatzas, D. The lifecycle of a digital historical document: structure and content.

This paper describes the lifecycle of a digital historical document, from template-based structure definition through to content extraction from the scanned pages and its final reconstitution as an electronic document (combining content and semantic... Read More about The lifecycle of a digital historical document: structure and content.