Mr Christian Clausner C.Clausner@salford.ac.uk
Senior Research Fellow
ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017
Clausner, C; Antonacopoulos, A; Pletschacher, S
Authors
Prof Apostolos Antonacopoulos A.Antonacopoulos@salford.ac.uk
Professor
Mr Stefan Pletschacher S.Pletschacher@salford.ac.uk
Lecturer
Abstract
This paper presents an objective comparative evaluation of page segmentation and region classification methods for documents with complex layouts. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2017, presenting the results of the evaluation of seven methods – five submitted, two state-of-the-art systems (commercial and open-source). Three scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions and two evaluating both segmentation and region classification (one focusing only on text regions). For the first time, nested region content (table cells, chart labels etc.) are evaluated in addition to the top-level page content. Text recognition was a bonus challenge and was not taken up by all participants. The results indicate that an innovative approach has a clear advantage but there is still a considerable need to develop robust methods that deal with layout challenges, especially with the non-textual content.
Citation
Clausner, C., Antonacopoulos, A., & Pletschacher, S. (2017). ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017. . https://doi.org/10.1109/ICDAR.2017.229
Conference Name | 14th International Conference on Document Analysis and Recognition (ICDAR2017) |
---|---|
Conference Location | Kyoto, Japan |
Start Date | Nov 13, 2017 |
End Date | Nov 15, 2017 |
Publication Date | Nov 15, 2017 |
Deposit Date | Nov 20, 2017 |
Publicly Available Date | Dec 7, 2017 |
ISBN | 9781538635865 |
DOI | https://doi.org/10.1109/ICDAR.2017.229 |
Related Public URLs | http://u-pat.org/ICDAR2017/index.php |
Files
ICDAR2017 Competition on Recognition of Documents with Complex Layouts.pdf
(1.1 Mb)
PDF
You might also like
Text line segmentation from struck-out handwritten document images
(2022)
Journal Article
A survey of OCR evaluation tools and metrics
(2021)
Conference Proceeding
VISE : an interface for Visual Search and Exploration of museum collections
(2019)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search