Skip to main content

Research Repository

Advanced Search

ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017

Clausner, C; Antonacopoulos, A; Pletschacher, S

ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017 Thumbnail


Authors



Abstract

This paper presents an objective comparative evaluation of page segmentation and region classification methods for documents with complex layouts. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2017, presenting the results of the evaluation of seven methods – five submitted, two state-of-the-art systems (commercial and open-source). Three scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions and two evaluating both segmentation and region classification (one focusing only on text regions). For the first time, nested region content (table cells, chart labels etc.) are evaluated in addition to the top-level page content. Text recognition was a bonus challenge and was not taken up by all participants. The results indicate that an innovative approach has a clear advantage but there is still a considerable need to develop robust methods that deal with layout challenges, especially with the non-textual content.

Citation

Clausner, C., Antonacopoulos, A., & Pletschacher, S. (2017). ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017. . https://doi.org/10.1109/ICDAR.2017.229

Conference Name 14th International Conference on Document Analysis and Recognition (ICDAR2017)
Conference Location Kyoto, Japan
Start Date Nov 13, 2017
End Date Nov 15, 2017
Publication Date Nov 15, 2017
Deposit Date Nov 20, 2017
Publicly Available Date Dec 7, 2017
ISBN 9781538635865
DOI https://doi.org/10.1109/ICDAR.2017.229
Related Public URLs http://u-pat.org/ICDAR2017/index.php

Files

ICDAR2017 Competition on Recognition of Documents with Complex Layouts.pdf (1.1 Mb)
PDF





You might also like



Downloadable Citations