Aritro Pal Choudhury
A New Symmetry based Transformer for Text Spotting in Person and Vehicle Re-Identification Images
Pal Choudhury, Aritro; Palaiahnakote, Shivakumara; Pal, Umapada
Abstract
Text spotting in person and vehicle re-identification images is challenging because the same person or vehicle appears in multiple views. Most existing models target text spotting in natural scene images, whereas our work focuses on person and vehicle re-identification images. The rationale is that persons and vehicles share symmetry properties, and that the bib number on the torso and the license plate number on the vehicle are both text. The method divides the input image into patches and uses a vision transformer to encode the patches into linear embeddings. The linearly embedded patches are fed to a feature similarity index step, which uses phase congruency and gradient magnitude to detect symmetric patches. A transformer is proposed to encode and capture textual information from the symmetric patches for text detection and recognition. The decoder receives the attention features from the encoder and feeds a multi-task head with the information about the detected and recognized text. Experiments on person and vehicle image benchmarks, namely the (Person) Re-ID, RBNR, UFPR-ALPR and RodoSol datasets, show a significant improvement in performance over other text spotting models. The effectiveness of the proposed model is further validated on natural scene image benchmarks, namely ICDAR 2015, Total-Text and CTW1500. Furthermore, cross-dataset validation shows that the proposed method is independent of domain.
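The abstract does not give implementation details, so the following is only a rough sketch of the patch-and-symmetry idea, not the authors' method. It assumes a grayscale image, non-overlapping square patches, and a left-right mirror pairing of patches. It scores each pair with the gradient-magnitude similarity term used in the feature similarity index literature, and it omits phase congruency entirely. The function names, the patch size and the constant `t2` are illustrative assumptions.

```python
import numpy as np


def split_into_patches(image, patch):
    """Split an (H, W) array into non-overlapping patch x patch tiles.

    Returns an array of shape (rows, cols, patch, patch); any remainder
    at the bottom/right edge is cropped (an assumption of this sketch).
    """
    h, w = image.shape
    rows, cols = h // patch, w // patch
    tiles = image[: rows * patch, : cols * patch].reshape(rows, patch, cols, patch)
    return tiles.transpose(0, 2, 1, 3)


def gradient_magnitude(tile):
    """Per-pixel gradient magnitude using central differences."""
    gy, gx = np.gradient(tile.astype(np.float64))
    return np.hypot(gx, gy)


def gradient_similarity(a, b, t2=160.0):
    """Mean gradient-magnitude similarity of two equally sized tiles.

    Uses (2*Ga*Gb + t2) / (Ga^2 + Gb^2 + t2); t2 is an assumed
    stabilising constant, not a value taken from the paper.
    """
    ga, gb = gradient_magnitude(a), gradient_magnitude(b)
    s = (2.0 * ga * gb + t2) / (ga**2 + gb**2 + t2)
    return float(s.mean())


def symmetric_patch_scores(image, patch=16):
    """Score each left-half patch against its mirrored right-half partner."""
    tiles = split_into_patches(image, patch)
    rows, cols = tiles.shape[:2]
    scores = np.zeros((rows, cols // 2))
    for r in range(rows):
        for c in range(cols // 2):
            left = tiles[r, c]
            # Flip the partner horizontally so mirrored content lines up.
            right_mirrored = tiles[r, cols - 1 - c][:, ::-1]
            scores[r, c] = gradient_similarity(left, right_mirrored)
    return scores
```

Under these assumptions, a score of 1 means the two patches have identical gradient magnitudes after mirroring, and lower scores indicate weaker symmetry. A threshold on the scores could then select candidate symmetric patches for the later stages.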
Citation
Pal Choudhury, A., Palaiahnakote, S., & Pal, U. (2024). A New Symmetry based Transformer for Text Spotting in Person and Vehicle Re-Identification Images. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/S0218001424550115
Journal Article Type | Article |
---|---|
Acceptance Date | Jul 20, 2024 |
Publication Date | Sep 13, 2024 |
Deposit Date | Jul 20, 2024 |
Publicly Available Date | Sep 14, 2025 |
Journal | International Journal of Pattern Recognition and Artificial Intelligence |
Print ISSN | 0218-0014 |
Publisher | World Scientific Publishing |
Peer Reviewed | Peer Reviewed |
DOI | https://doi.org/10.1142/S0218001424550115 |
Keywords | Scene text detection; Scene text recognition; Transformer; Feature similarity index; Text spotting |
Files
This file is under embargo until Sep 14, 2025 due to copyright reasons.
Contact S.Palaiahnakote@salford.ac.uk to request a copy for personal use.