Research Repository

Prof Apostolos Antonacopoulos' Outputs (51)

Correction of arbitrary geometric artefacts in historical documents (2010)
Thesis
Rahnemoonfar, M. Correction of arbitrary geometric artefacts in historical documents. (Thesis). Salford: University of Salford

The research presented in this thesis addresses the problem of correction of arbitrary
geometric artefacts in historical documents. Geometric distortions in historical
documents may be introduced at any time during the...

A new framework for recognition of heavily degraded characters in historical typewritten documents based on semi-supervised clustering (2009)
Presentation / Conference Contribution

This paper presents a new semi-supervised clustering
framework for the recognition of heavily degraded characters
in historical typewritten documents, where off-the-shelf
OCR typically fails. The constraints are generated
using typographical (colle...

Visual representation of text in web documents and its interpretation (2005)
Book Chapter
Karatzas, D., & Antonacopoulos, A. (2005). Visual representation of text in web documents and its interpretation. In G. Malcolm (Ed.), Multidisciplinary approaches to visual representations and interpretations (181-196). Elsevier. https://doi.org/10.1016/S1571-0831%2804%2980041-8

This chapter examines the uses of text and its representation on Web documents in terms of the challenges in its interpretation. Particular attention is paid to the significant problem of non-uniform representation of text. This non-uniformity is mai...

A robust braille recognition system (2004)
Book Chapter
Antonacopoulos, A., & Bridson, D. (2004). A robust braille recognition system. In A. Dengel, & S. Marinai (Eds.), Document analysis systems VI (533-545). Springer Berlin / Heidelberg. https://doi.org/10.1007/b100557

Braille is the most effective means of written communication between
visually-impaired and sighted people. This paper describes a new system
that recognizes Braille characters in scanned Braille document pages. Unlike
most other approaches, an ine...

Page segmentation using the description of the background (1998)
Journal Article
Antonacopoulos, A. (1998). Page segmentation using the description of the background. Computer Vision and Image Understanding, 70(3), 350-369. https://doi.org/10.1006/cviu.1998.0691

There is an ever increasing number of publications which do not have the “traditional” layout where printed regions are rectangular. Text paragraphs and areas of graphic type may be of any shape, individually rotated and in any arrangement. Previou...

The ENP image and ground truth dataset of historical newspapers
Book Chapter
Clausner, C., Papadopoulos, C., Pletschacher, S., & Antonacopoulos, A. The ENP image and ground truth dataset of historical newspapers. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR) (931-935). IEEE-CPS. https://doi.org/10.1109/ICDAR.2015.7333898

This paper presents a research dataset of historical newspapers comprising over 500 page images, uniquely representative of European cultural heritage from the digitization projects of 12 national and major European libraries, created within the scop...