Skip to main content

Research Repository

Advanced Search

All Outputs (21)

Soft set-based MSER end-to-end system for occluded scene text detection, recognition and prediction (2024)
Journal Article
Das, A., Palaiahnakote, S., Banerjee, A., Antonacopoulos, A., & Pal, U. (2024). Soft set-based MSER end-to-end system for occluded scene text detection, recognition and prediction. Knowledge-Based Systems, https://doi.org/10.1016/j.knosys.2024.112593

The presence of unpredictable occlusions on natural scene text is a significant challenge, exacerbating the difficulties already posed on text detection and recognition by the variability of such images. Addressing the need for a robust, consistently... Read More about Soft set-based MSER end-to-end system for occluded scene text detection, recognition and prediction.

Spatial-Frequency Based EEG Features for Classification of Human Emotions (2024)
Journal Article
S. Gornale, S., Palaiahnakote, S., Unki, A., & Vadera, S. (2024). Spatial-Frequency Based EEG Features for Classification of Human Emotions. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/S0218001424570143

Human emotion classification without bias and unfairness is challenging because most existing
image-based methods are directly or indirectly affected by subjectivity. Therefore, we propose
an EEG (Electroencephalogram) based model for an accurate e... Read More about Spatial-Frequency Based EEG Features for Classification of Human Emotions.

A Novel Infogain and Multi-Axial Wavelet-based Transformer for Personality Traits Question Answering (2024)
Journal Article
Biswas, K., Palaiahnakote, S., Bhattacharya, S., Pal, U., & Sarkar, R. (2024). A Novel Infogain and Multi-Axial Wavelet-based Transformer for Personality Traits Question Answering. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/S0218001424510236

Visual Question Answering (VQA) is one of the attractive topics in the field of multimedia, affective,
and empathic computing to garner user interest. Unlike existing models which aim at addressing chal-
lenges of VQA for the scene images, this wor... Read More about A Novel Infogain and Multi-Axial Wavelet-based Transformer for Personality Traits Question Answering.

A New Adopted YOLOv9 Model for Detecting Mould Regions Inside of Buildings (2024)
Journal Article
Mansouri, T., Shadab Mashuk, M., Palaiahnakote, S., Chacko, A., Sykes, L., & Alameer, A. (2024). A New Adopted YOLOv9 Model for Detecting Mould Regions Inside of Buildings. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/S0218001424500253

Molds on wall and ceiling surfaces in damp indoor environments especially in houses with poor insulation and ventilation are common in the UK. Since it releases toxic chemicals as it grows, it is a serious health hazard for occupants who live in such... Read More about A New Adopted YOLOv9 Model for Detecting Mould Regions Inside of Buildings.

Domain‐independent adaptive histogram‐based features for pomegranate fruit and leaf diseases classification (2024)
Journal Article
Prajwala, M., Palaiahnakote, S., Prajwal Kumar, P., Maheshwarappa Gopinath, S., Basavanna, M., & P. Lopresti, D. (2024). Domain‐independent adaptive histogram‐based features for pomegranate fruit and leaf diseases classification. CAAI Transactions on Intelligence Technology, https://doi.org/10.1049/cit2.12390

Disease identification for fruits and leaves in the field of agriculture is important for estimating production, crop yield and earnings for farmers. In the specific case of pomegranates, this is challenging because of the wide range of possible dise... Read More about Domain‐independent adaptive histogram‐based features for pomegranate fruit and leaf diseases classification.

A Comprehensive Review on Text Detection and Recognition in Scene Images (2024)
Journal Article
Pal, U., Halder, A., Shivakumara, P., & Blumenstein, M. (2024). A Comprehensive Review on Text Detection and Recognition in Scene Images. #Journal not on list, https://doi.org/10.47852/AIA42022755

Detecting and recognizing text in natural scene images and videos is vital for several real-world applications, such as in the analysis of Crime scene CCTV footage, sports videos, and autonomous driving, to name a few. Therefore, one can expect sever... Read More about A Comprehensive Review on Text Detection and Recognition in Scene Images.

A New Contrastive Learning based Vision Transformer for Sentiment Analysis using Scene Text Images (2024)
Journal Article
Palaiahnakote, S., Kapri, D., Saleem, M. H., & Pal, U. (2024). A New Contrastive Learning based Vision Transformer for Sentiment Analysis using Scene Text Images. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/S0218001424520293

Sentiment analysis using scene text images is complex and challenging because it has an arbitrary
background, and the method should rely on only visual features. Unlike most existing methods that use
either text or images or both, this study uses o... Read More about A New Contrastive Learning based Vision Transformer for Sentiment Analysis using Scene Text Images.

Oil Palm Tree Detection in UAV Imagery Using an Enhanced RetinaNet (2024)
Journal Article
Lee, S. S., Lim, L. G., Palaiahnakote, S., Cheong, J. X., Sow, S., Lock, M., …Malaysia. (in press). Oil Palm Tree Detection in UAV Imagery Using an Enhanced RetinaNet. Computers and Electronics in Agriculture,

21 Accurate inventory management of oil palm trees is crucial for optimizing yield and monitoring 22 the health and growth of plantations. However, detecting and counting oil palm trees, particularly 23 young trees that blend into complex environment... Read More about Oil Palm Tree Detection in UAV Imagery Using an Enhanced RetinaNet.

A novel domain independent scene text localizer (2024)
Journal Article
Roy, A., Palaiahnakote, S., Pal, U., & Liu, C.-L. (2024). A novel domain independent scene text localizer. Pattern recognition, 158, Article 111015. https://doi.org/10.1016/j.patcog.2024.111015

Text localization across multiple domains is crucial for applications like autonomous driving and tracking marathon runners. This work introduces DIPCYT, a novel model that utilizes Domain Independent Partial Convolution and a Yolov5-based Transforme... Read More about A novel domain independent scene text localizer.

A New Symmetry-Based Transformer for Text Spotting in Person and Vehicle Re-Identification Images (2024)
Journal Article
Choudhury, A. P., Palaiahnakote, S., & Pal, U. (2024). A New Symmetry-Based Transformer for Text Spotting in Person and Vehicle Re-Identification Images. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424550115

Text spotting in person and vehicle re-identification images is complex due to the presence of multiple views of the same person and vehicle. Most existing models focus on text spotting in natural scene images, our work focuses on spotting in person... Read More about A New Symmetry-Based Transformer for Text Spotting in Person and Vehicle Re-Identification Images.

A New Unsupervised Approach for Text Localization in Shaky and Non-shaky Scene Video (2024)
Conference Proceeding
Halder, A., Palaiahnakote, S., Pal, U., Blumenstein, M., & Liu, C.-L. (2024). A New Unsupervised Approach for Text Localization in Shaky and Non-shaky Scene Video. . https://doi.org/10.1007/978-3-031-70549-6_10

Text Detection in shaky and non-shaky videos is challenging due to poor video quality and the presence of static and dynamic obstacles. Video captured by a shaky camera due to wind is considered shaky video, while video captured by a fixed camera is... Read More about A New Unsupervised Approach for Text Localization in Shaky and Non-shaky Scene Video.

Sffl: Self-Aware Fairness Federated Learning Framework for Heterogeneous Data Distributions (2024)
Working Paper
Zhang, J., Li, Y., Wu, D., Zhao, Y., & Palaiahnakote, S. Sffl: Self-Aware Fairness Federated Learning Framework for Heterogeneous Data Distributions

Recent years have witnessed increasing development towards federated learning. However, federated learning has been proven to show biased predictions against certain demographic groups, such as sex or race, especially under heterogeneous data distrib... Read More about Sffl: Self-Aware Fairness Federated Learning Framework for Heterogeneous Data Distributions.

An Adaptive Xception Model for Classification of Brain Tumors (2024)
Journal Article
Thakur, A., Mahesh, T. R., Khan, S. B., Palaiahnakote, S., Kumar V, V., Vinoth Kumar, V., …Mashat, A. (in press). An Adaptive Xception Model for Classification of Brain Tumors. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424560056

Classification of different brain tumors is challenging due to unpredictable variations in intra-inter-classes. Unlike existing methods which are not effective for images of complex backgrounds, the proposed work aims at accurate classification of di... Read More about An Adaptive Xception Model for Classification of Brain Tumors.

A New Approach for Classification of Spices to Make Special Herbal Tea Using Caralluma Fimbriata (2024)
Journal Article
Kumar, P. P., Palaiahnakote, S., & Patil, R. (2024). A New Approach for Classification of Spices to Make Special Herbal Tea Using Caralluma Fimbriata. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424570039

Classification of multiple types of spice images is automatically challenging due to conflict between the texture patterns of spice images. This work aims to develop an automatic system for classifying different types of spice images so that the syst... Read More about A New Approach for Classification of Spices to Make Special Herbal Tea Using Caralluma Fimbriata.

TANet: Text region attention learning for vehicle re-identification (2024)
Journal Article
Hu, W., Zhan, H., Shivakumara, P., Pal, U., & Lu, Y. (2024). TANet: Text region attention learning for vehicle re-identification. Engineering Applications of Artificial Intelligence, 133, https://doi.org/10.1016/j.engappai.2024.108448

In recent years, the challenge of distinguishing vehicles of the same model has prompted a shift towards leveraging both global appearances and local features, such as lighting and rearview mirrors, for vehicle re-identification (ReID). Despite advan... Read More about TANet: Text region attention learning for vehicle re-identification.

Altered Handwritten Text Detection in Document Images Using Deep Learning (2024)
Journal Article
Patil, G., Palaiahnakote, S., Gornale, S. S., & Lopresti, D. P. (2024). Altered Handwritten Text Detection in Document Images Using Deep Learning. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424520062

Handwritten documents possess immense significance in domains such as law, history, and administration. However, they are vulnerable to forgery, which can undermine their credibility and reliability. This paper aims to establish a dependable techniqu... Read More about Altered Handwritten Text Detection in Document Images Using Deep Learning.

TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting (2024)
Journal Article
Banerjee, A., Palaiahnakote, S., Antonacopoulos, A., Pal, U., Lu, T., & Canet, J. L. (2024). TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting. IEEE Transactions on Multimedia, 1-15. https://doi.org/10.1109/tmm.2024.3378458

Text spotting in natural scenes is of increasing interest and significance due to its critical role in several applications, such as visual question answering, named entity recognition and event rumor detection on social media. One of the newly emerg... Read More about TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting.

NDOrder: Exploring a Novel Decoding Order for Scene Text Recognition (2024)
Journal Article
Zhong, D., Zhan, H., Lyu, S., Liu, C., Yin, B., Palaiahankote, S., …Lu, Y. (2024). NDOrder: Exploring a Novel Decoding Order for Scene Text Recognition. Expert systems with applications, 249, https://doi.org/10.1016/j.eswa.2024.123771

Text recognition in scene images is still considered as a challenging task for the computer vision and pattern recognition community. For text images affected by multiple adverse factors, such as occlusion (due to obstacles) and poor quality (due to... Read More about NDOrder: Exploring a Novel Decoding Order for Scene Text Recognition.

A Robust Script Independent Handwriting System for Gender Identification (2024)
Journal Article
Palaiahnakote, S., Kaljahi, M. A., Kanchan, S., Pal, U., Lopresti, D., & Lu, T. (2024). A Robust Script Independent Handwriting System for Gender Identification. Expert systems with applications, 249, https://doi.org/10.1016/j.eswa.2024.123576

Gender identification at the word level in a multi-script environment is challenging due to variations posed by free-style handwriting of individuals and geographical differences in writing styles. This paper presents a new approach, Multi-Orientatio... Read More about A Robust Script Independent Handwriting System for Gender Identification.

A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network (2024)
Journal Article
Vinoth Kumar, V., Palaiahnakote, S., Khan, S. B., & Almusharraf, A. (2024). A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network. Malaysian journal of computer science, 37(1), 89–106. https://doi.org/10.22452/mjcs.vol37no1.5

Creating a computational device to identify human emotions via voice analysis represents a notable achievement in the sector of human-computer interaction, especially within the healthcare domain. We propose a new lightweight model for addressing cha... Read More about A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network.