Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer in Computer Vision
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer in Computer Vision
Dhruv Kapri
Dr Muhammad Hammad Saleem M.H.Saleem@salford.ac.uk
Lecturer in Computer Science (AI)
Umapada Pal
Sentiment analysis using scene text images is complex and challenging because it has an arbitrary background, and the method should rely on only visual features. Unlike most existing methods that use either text or images or both, this study uses only scene text images for sentiment analysis. The intuition to use only scene text images is that sometimes users express their feelings and emotions or convey their messages by writing text in different shapes with diverse background designs. It is noted that the existing methods ignore such vital cues for sentiment analysis. This work explores a vision transformer to extract visual features that represent contextual information about the appearance of the text image. Further, to strengthen the visual features, the proposed work introduces contrastive learning which maximizes the gap between inter-classes and minimizes the gap between intra-classes of positive, negative, and neutral. To demonstrate the effectiveness of the proposed method, it is tested on our own constructed dataset and benchmark dataset. A comparative study of our method with the existing method shows the proposed method is superior in the classification of positive, negative, and neutral scene text images.
Palaiahnakote, S., Kapri, D., Saleem, M. H., & Pal, U. (2024). A New Contrastive Learning-Based Vision Transformer for Sentiment Analysis Using Scene Text Images. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424520293
Journal Article Type | Article |
---|---|
Acceptance Date | Oct 3, 2024 |
Online Publication Date | Dec 23, 2024 |
Publication Date | Oct 17, 2024 |
Deposit Date | Nov 15, 2024 |
Publicly Available Date | Oct 18, 2025 |
Journal | International Journal of Pattern Recognition and Artificial Intelligence |
Print ISSN | 0218-0014 |
Electronic ISSN | 1793-6381 |
Publisher | World Scientific Publishing |
Peer Reviewed | Peer Reviewed |
DOI | https://doi.org/10.1142/s0218001424520293 |
This file is under embargo until Oct 18, 2025 due to copyright reasons.
Contact S.Palaiahnakote@salford.ac.uk to request a copy for personal use.
A Newly Adopted YOLOv9 Model for Detecting Mould Regions Inside of Buildings
(2024)
Journal Article
Spatial-Frequency Based EEG Features for Classification of Human Emotions
(2024)
Journal Article
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search