Lokesh Nandanwar
A New Method for Detecting Altered Text in Document Images
Nandanwar, Lokesh; Shivakumara, Palaiahnakote; Pal, Umapada; Lu, Tong; Lopresti, Daniel; Seraogi, Bhagesh; B. Chaudhuri, Bidyut
Authors
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer
Umapada Pal
Tong Lu
Daniel Lopresti
Bhagesh Seraogi
Bidyut B. Chaudhuri
Abstract
As more and more office documents are captured, stored, and shared in digital format, and as image editing software becomes increasingly more powerful, there is a growing concern about document authenticity. For example, texts in property documents can be altered to make an illegal deal, or the date on an airline ticket can be altered to gain entry to airport terminals by breaching security. To prevent such illicit activities, this paper presents a new method for detecting altered text in a document. The proposed method explores the relationship between positive and negative coefficients of a DCT to extract the effect of distortions caused by tampering operations. Here we divide DCT coefficients into positive and negative classes, then reconstructs images from the inverse DCT of the respective positive and negative coefficients. Next, we perform Laplacian filtering over reconstructed images for widening the gap between the values of text and other pixels. Then filtered images of positive and negative coefficients are fused by an average operation. For a fused image, we generate Canny and Sobel edge images in order to investigate the effect of distortion through quality measures, namely, MSE, PSNR and SSIM used as features. In addition, for the fused image, the proposed method extracts features based on histograms over the residual images. The features are then passed on to a deep Convolutional Neural Network for classification. The proposed method is tested on our own dataset as well as two standard datasets, namely IMEI and the ICPR 2018 Fraud Contest dataset. The results show that the proposed method is effective and outperforms existing methods.
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | Pattern Recognition and Artificial Intelligence International Conference, ICPRAI 2020 |
Start Date | Oct 19, 2020 |
End Date | Oct 23, 2020 |
Online Publication Date | Oct 9, 2020 |
Publication Date | Oct 9, 2020 |
Deposit Date | Nov 15, 2024 |
Publisher | Springer |
Pages | 93-108 |
Series Title | Lecture Notes in Computer Science |
Series ISSN | 1611-3349 |
Book Title | Pattern Recognition and Artificial Intelligence |
ISBN | 978-3-030-59829-7 |
DOI | https://doi.org/10.1007/978-3-030-59830-3_8 |
You might also like
Altered Handwritten Text Detection in Document Images Using Deep Learning
(2024)
Journal Article
A novel autoencoder for structural anomalies detection in river tunnel operation
(2023)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search