Prof Apostolos Antonacopoulos A.Antonacopoulos@salford.ac.uk
Professor
Prof Apostolos Antonacopoulos A.Antonacopoulos@salford.ac.uk
Professor
Dr Christian Clausner C.Clausner@salford.ac.uk
Senior Research Fellow
C Papadopoulos
Mr Stefan Pletschacher S.Pletschacher@salford.ac.uk
Lecturer
This paper presents an objective comparative evaluation of page segmentation and region classification methods for documents with complex layouts. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2015, presenting the results of the evaluation of eight methods – four submitted, two state-of-the art systems (one commercial and one open-source) and their two immediately previous versions. Three scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions and two evaluating both segmentation and region classification (one with emphasis on text and the other focusing only on text). The results indicate that an innovative approach has a clear advantage but there is still a considerable need to develop robust methods that deal with layout challenges, especially with the non-text content.
Antonacopoulos, A., Clausner, C., Papadopoulos, C., & Pletschacher, S. (2015). ICDAR2015 competition on recognition of documents with complex layouts - RDCL2015. In 2015 13th International Conference on Document Analysis and Recognition (ICDAR) (1151-1155). IEEE. https://doi.org/10.1109/ICDAR.2015.7333941
Publication Date | Jan 1, 2015 |
---|---|
Deposit Date | Mar 22, 2016 |
Pages | 1151-1155 |
Book Title | 2015 13th International Conference on Document Analysis and Recognition (ICDAR) |
ISBN | 9781479918058 |
DOI | https://doi.org/10.1109/ICDAR.2015.7333941 |
Publisher URL | http://dx.doi.org/10.1109/ICDAR.2015.7333941 |
Additional Information | Funders : European Commission |
Efficient and effective OCR engine training
(2019)
Journal Article
Highlights of the novel dewaterability estimation test (DET) device
(2019)
Journal Article
The ENP image and ground truth dataset of historical newspapers
(-0001)
Book Chapter
A survey of OCR evaluation tools and metrics
(2021)
Presentation / Conference Contribution
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search