Skip to main content

Research Repository

Advanced Search

All Outputs (85)

A New Symmetry-Based Transformer for Text Spotting in Person and Vehicle Re-Identification Images (2024)
Journal Article
Choudhury, A. P., Palaiahnakote, S., & Pal, U. (2024). A New Symmetry-Based Transformer for Text Spotting in Person and Vehicle Re-Identification Images. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424550115

Text spotting in person and vehicle re-identification images is complex due to the presence of multiple views of the same person and vehicle. Most existing models focus on text spotting in natural scene images, our work focuses on spotting in person... Read More about A New Symmetry-Based Transformer for Text Spotting in Person and Vehicle Re-Identification Images.

Sffl: Self-Aware Fairness Federated Learning Framework for Heterogeneous Data Distributions (2024)
Working Paper
Zhang, J., Li, Y., Wu, D., Zhao, Y., & Palaiahnakote, S. Sffl: Self-Aware Fairness Federated Learning Framework for Heterogeneous Data Distributions

Recent years have witnessed increasing development towards federated learning. However, federated learning has been proven to show biased predictions against certain demographic groups, such as sex or race, especially under heterogeneous data distrib... Read More about Sffl: Self-Aware Fairness Federated Learning Framework for Heterogeneous Data Distributions.

A New Approach for Classification of Spices to Make Special Herbal Tea Using Caralluma Fimbriata (2024)
Journal Article
Kumar, P. P., Palaiahnakote, S., & Patil, R. (2024). A New Approach for Classification of Spices to Make Special Herbal Tea Using Caralluma Fimbriata. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424570039

Classification of multiple types of spice images is automatically challenging due to conflict between the texture patterns of spice images. This work aims to develop an automatic system for classifying different types of spice images so that the syst... Read More about A New Approach for Classification of Spices to Make Special Herbal Tea Using Caralluma Fimbriata.

TANet: Text region attention learning for vehicle re-identification (2024)
Journal Article
Hu, W., Zhan, H., Shivakumara, P., Pal, U., & Lu, Y. (2024). TANet: Text region attention learning for vehicle re-identification. Engineering Applications of Artificial Intelligence, 133, https://doi.org/10.1016/j.engappai.2024.108448

In recent years, the challenge of distinguishing vehicles of the same model has prompted a shift towards leveraging both global appearances and local features, such as lighting and rearview mirrors, for vehicle re-identification (ReID). Despite advan... Read More about TANet: Text region attention learning for vehicle re-identification.

Altered Handwritten Text Detection in Document Images Using Deep Learning (2024)
Journal Article
Patil, G., Palaiahnakote, S., Gornale, S. S., & Lopresti, D. P. (2024). Altered Handwritten Text Detection in Document Images Using Deep Learning. International Journal of Pattern Recognition and Artificial Intelligence, https://doi.org/10.1142/s0218001424520062

Handwritten documents possess immense significance in domains such as law, history, and administration. However, they are vulnerable to forgery, which can undermine their credibility and reliability. This paper aims to establish a dependable techniqu... Read More about Altered Handwritten Text Detection in Document Images Using Deep Learning.

TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting (2024)
Journal Article
Banerjee, A., Palaiahnakote, S., Antonacopoulos, A., Pal, U., Lu, T., & Canet, J. L. (2024). TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting. IEEE Transactions on Multimedia, 1-15. https://doi.org/10.1109/tmm.2024.3378458

Text spotting in natural scenes is of increasing interest and significance due to its critical role in several applications, such as visual question answering, named entity recognition and event rumor detection on social media. One of the newly emerg... Read More about TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting.

A Robust Script Independent Handwriting System for Gender Identification (2024)
Journal Article
Palaiahnakote, S., Kaljahi, M. A., Kanchan, S., Pal, U., Lopresti, D., & Lu, T. (2024). A Robust Script Independent Handwriting System for Gender Identification. Expert systems with applications, 249, https://doi.org/10.1016/j.eswa.2024.123576

Gender identification at the word level in a multi-script environment is challenging due to variations posed by free-style handwriting of individuals and geographical differences in writing styles. This paper presents a new approach, Multi-Orientatio... Read More about A Robust Script Independent Handwriting System for Gender Identification.

A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network (2024)
Journal Article
Vinoth Kumar, V., Palaiahnakote, S., Khan, S. B., & Almusharraf, A. (2024). A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network. Malaysian journal of computer science, 37(1), 89–106. https://doi.org/10.22452/mjcs.vol37no1.5

Creating a computational device to identify human emotions via voice analysis represents a notable achievement in the sector of human-computer interaction, especially within the healthcare domain. We propose a new lightweight model for addressing cha... Read More about A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network.

A novel autoencoder for structural anomalies detection in river tunnel operation (2023)
Journal Article
TAN, X.-Y., Palaiahnakote, S., Chen, W., Cheng, K., & Du, B. (2024). A novel autoencoder for structural anomalies detection in river tunnel operation. Expert systems with applications, 244, https://doi.org/10.1016/j.eswa.2023.122906

Anomaly diagnosis is essential to prevent disasters and ensure long-term stable operation of tunnels. However, the diversity and scarcity of abnormal data make it difficult to identify outliers, especially to diagnose structural anomalies from poor-q... Read More about A novel autoencoder for structural anomalies detection in river tunnel operation.

A Locally Weighted Linear Regression Based Approach for Arbitrary Moving Shaky and Non-Shaky Video Classification (2023)
Journal Article
Halder, A., Shivakumara, P., Pal, U., Blumenstein, M., & Ghosal, P. (2023). A Locally Weighted Linear Regression Based Approach for Arbitrary Moving Shaky and Non-Shaky Video Classification. International Journal of Pattern Recognition and Artificial Intelligence, 38(1), https://doi.org/10.1142/S0218001423510199

Classification and identification of objects are complex and challenging in pattern recognition and artificial intelligence if a shaky and nonshaky camera captures the videos at different distances during the day and nighttime. This work presents a... Read More about A Locally Weighted Linear Regression Based Approach for Arbitrary Moving Shaky and Non-Shaky Video Classification.

A New Lightweight Script Independent Scene Text Style Transfer Network (2023)
Journal Article
Shivakumara, P., Roy, A., Nandanwar, L., Pal, U., Lu, Y., & Liu, C.-L. (2023). A New Lightweight Script Independent Scene Text Style Transfer Network. International Journal of Pattern Recognition and Artificial Intelligence, 37(13), https://doi.org/10.1142/S0218001423530038

Scene text style transfer without a language barrier is an open challenge for the video and scene text recognition community because this plays a vital role in poster, web design, augmenting character images, and editing characters to improve scene... Read More about A New Lightweight Script Independent Scene Text Style Transfer Network.

A New Few-Shot Learning-Based Model for Prohibited Objects Detection in Cluttered Baggage X-Ray Images Through Edge Detection and Reverse Validation (2023)
Journal Article
Liu, K., Lyu, S., Shivakumara, P., Blumenstein, M., & Lu, Y. (2023). A New Few-Shot Learning-Based Model for Prohibited Objects Detection in Cluttered Baggage X-Ray Images Through Edge Detection and Reverse Validation. IEEE Signal Processing Letters, 30, https://doi.org/10.1109/LSP.2023.3326088

Detecting prohibited items via X-ray screening at airports and sensitive venues is essential for preventing smuggling and breaches of security. The difficulty in prohibited items inspection lies in accurately detecting prohibited items in complex X-r... Read More about A New Few-Shot Learning-Based Model for Prohibited Objects Detection in Cluttered Baggage X-Ray Images Through Edge Detection and Reverse Validation.

Editorial: Intelligent computing in farmland water conservancy for smart agriculture (2023)
Journal Article
(2023). Editorial: Intelligent computing in farmland water conservancy for smart agriculture. Frontiers in Plant Science, https://doi.org/10.3389/fpls.2023.1236010

In the past few decades, the rapid development of agriculture has put forward high requirements for efficient management of water resources, so as to rationally utilize natural resources and increase their sustainability. It is noted that there is a... Read More about Editorial: Intelligent computing in farmland water conservancy for smart agriculture.