M Rahnemoonfar
Restoration of arbitrarily warped historical document images using flow lines
Rahnemoonfar, M; Antonacopoulos, A
Abstract
Historical documents frequently suffer from
arbitrary geometric distortions (warping and folds) due to
storage conditions, use and to, some extent, the printing
process of the time. In addition, page curl can be
prominent due to the scanning technique used. Such
distortions adversely affect OCR and print-on-demand
quality. Previous approaches to
geometric restoration
either focus only on the correction of page curl or require
supplementary informatio
n obtained by additional
scanning hardware ` not practical for existing scans. This
paper presents a new approach to detect and restore
arbitrary warping and folds, in addition to page curl.
Warped text lines and the smooth deformation between
them are precisely modelled as primary and secondary
flow lines that are then restored to their original linear
shape. Preliminary, but representative, experimental
results, in comparison to a leading page curl removal
method and an industry-standard commercial system,
demonstrate the effectiveness of the proposed method
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | Document Analysis and Recognition (ICDAR), 2011 International Conference |
Start Date | Sep 18, 2011 |
End Date | Sep 21, 2011 |
Publication Date | Jan 1, 2011 |
Deposit Date | Sep 10, 2014 |
Book Title | 2011 International Conference on Document Analysis and Recognition |
DOI | https://doi.org/10.1109/ICDAR.2011.184 |
Publisher URL | http://dx.doi.org/10.1109/ICDAR.2011.184 |
Additional Information | Event Type : Conference Funders : Funder not known |
You might also like
A new deep CNN for 3D text localization in the wild through shadow removal
(2023)
Journal Article
NAME – A Rich XML Format for Named Entity and Relation Tagging
(2023)
Presentation / Conference Contribution
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search