Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer
A New Contrastive Learning-Based Vision Transformer for Sentiment Analysis Using Scene Text Images
Palaiahnakote, Shivakumara; Kapri, Dhruv; Saleem, Muhammad Hammad; Pal, Umapada
Authors
Dhruv Kapri
Dr Muhammad Hammad Saleem M.H.Saleem@salford.ac.uk
Lecturer in Computer Science (AI)
Umapada Pal
Abstract
Sentiment analysis using scene text images is complex and challenging because it has an arbitrary background, and the method should rely on only visual features. Unlike most existing methods that use either text or images or both, this study uses only scene text images for sentiment analysis. The intuition to use only scene text images is that sometimes users express their feelings and emotions or convey their messages by writing text in different shapes with diverse background designs. It is noted that the existing methods ignore such vital cues for sentiment analysis. This work explores a vision transformer to extract visual features that represent contextual information about the appearance of the text image. Further, to strengthen the visual features, the proposed work introduces contrastive learning which maximizes the gap between inter-classes and minimizes the gap between intra-classes of positive, negative, and neutral. To demonstrate the effectiveness of the proposed method, it is tested on our own constructed dataset and benchmark dataset. A comparative study of our method with the existing method shows the proposed method is superior in the classification of positive, negative, and neutral scene text images.
Citation
Palaiahnakote, S., Kapri, D., Saleem, M. H., & Pal, U. (2024). A New Contrastive Learning-Based Vision Transformer for Sentiment Analysis Using Scene Text Images. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424520293
Journal Article Type | Article |
---|---|
Acceptance Date | Oct 3, 2024 |
Online Publication Date | Dec 23, 2024 |
Publication Date | Oct 17, 2024 |
Deposit Date | Nov 15, 2024 |
Publicly Available Date | Oct 18, 2025 |
Journal | International Journal of Pattern Recognition and Artificial Intelligence |
Print ISSN | 0218-0014 |
Electronic ISSN | 1793-6381 |
Publisher | World Scientific Publishing |
Peer Reviewed | Peer Reviewed |
DOI | https://doi.org/10.1142/s0218001424520293 |
Files
This file is under embargo until Oct 18, 2025 due to copyright reasons.
Contact S.Palaiahnakote@salford.ac.uk to request a copy for personal use.
You might also like
An Adaptive Xception Model for Classification of Brain Tumors
(2024)
Journal Article
Altered Handwritten Text Detection in Document Images Using Deep Learning
(2024)
Journal Article
NDOrder: Exploring a Novel Decoding Order for Scene Text Recognition
(2024)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search