Skip to main content

Research Repository

Advanced Search

All Outputs (75)

A Robust Script Independent Handwriting System for Gender Identification (2024)
Journal Article
Palaiahnakote, S., Kaljahi, M. A., Kanchan, S., Pal, U., Lopresti, D., & Lu, T. (2024). A Robust Script Independent Handwriting System for Gender Identification. Expert systems with applications, 249, https://doi.org/10.1016/j.eswa.2024.123576

Gender identification at the word level in a multi-script environment is challenging due to variations posed by free-style handwriting of individuals and geographical differences in writing styles. This paper presents a new approach, Multi-Orientatio... Read More about A Robust Script Independent Handwriting System for Gender Identification.

A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network (2024)
Journal Article
Vinoth Kumar, V., Palaiahnakote, S., Khan, S. B., & Almusharraf, A. (2024). A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network. Malaysian journal of computer science, 37(1), 89–106. https://doi.org/10.22452/mjcs.vol37no1.5

Creating a computational device to identify human emotions via voice analysis represents a notable achievement in the sector of human-computer interaction, especially within the healthcare domain. We propose a new lightweight model for addressing cha... Read More about A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network.

HLB Disease Detection in Omani Lime Trees Using Hyperspectral Imaging Based Techniques (2024)
Journal Article
Menezes, J., Dharmalingam, R., & Shivakumara, P. (2024). HLB Disease Detection in Omani Lime Trees Using Hyperspectral Imaging Based Techniques. https://doi.org/10.1007/978-3-031-53085-2_7

In the recent years omani acid lime cultivation and production has been affected by Citrus greening or Huanglongbing (HLB) disease. HLB disease is one of the most destructive diseases for citrus with no remedies or countermeasures to stop the disease... Read More about HLB Disease Detection in Omani Lime Trees Using Hyperspectral Imaging Based Techniques.

A novel autoencoder for structural anomalies detection in river tunnel operation (2023)
Journal Article
TAN, X.-Y., Palaiahnakote, S., Chen, W., Cheng, K., & Du, B. (2024). A novel autoencoder for structural anomalies detection in river tunnel operation. Expert systems with applications, 244, https://doi.org/10.1016/j.eswa.2023.122906

Anomaly diagnosis is essential to prevent disasters and ensure long-term stable operation of tunnels. However, the diversity and scarcity of abnormal data make it difficult to identify outliers, especially to diagnose structural anomalies from poor-q... Read More about A novel autoencoder for structural anomalies detection in river tunnel operation.

A Locally Weighted Linear Regression Based Approach for Arbitrary Moving Shaky and Non-Shaky Video Classification (2023)
Journal Article
Halder, A., Shivakumara, P., Pal, U., Blumenstein, M., & Ghosal, P. (2023). A Locally Weighted Linear Regression Based Approach for Arbitrary Moving Shaky and Non-Shaky Video Classification. International Journal of Pattern Recognition and Artificial Intelligence, 38(1), https://doi.org/10.1142/S0218001423510199

Classification and identification of objects are complex and challenging in pattern recognition and artificial intelligence if a shaky and nonshaky camera captures the videos at different distances during the day and nighttime. This work presents a... Read More about A Locally Weighted Linear Regression Based Approach for Arbitrary Moving Shaky and Non-Shaky Video Classification.

A New Lightweight Script Independent Scene Text Style Transfer Network (2023)
Journal Article
Shivakumara, P., Roy, A., Nandanwar, L., Pal, U., Lu, Y., & Liu, C.-L. (2023). A New Lightweight Script Independent Scene Text Style Transfer Network. International Journal of Pattern Recognition and Artificial Intelligence, 37(13), https://doi.org/10.1142/S0218001423530038

Scene text style transfer without a language barrier is an open challenge for the video and scene text recognition community because this plays a vital role in poster, web design, augmenting character images, and editing characters to improve scene... Read More about A New Lightweight Script Independent Scene Text Style Transfer Network.

An Attention based Fusion of ResNet50 and InceptionV3 Model for Water Meter Digit Recognition (2023)
Journal Article
Alkhaled, L., Roy, A., & Palaiahnakote, S. (2023). An Attention based Fusion of ResNet50 and InceptionV3 Model for Water Meter Digit Recognition. #Journal not on list, https://doi.org/10.47852/bonviewAIA32021197

Digital water meter digit recognition from images of water meter readings is a challenging research problem. One key reason is that this might be a lack of publicly available datasets to develop such methods. Another reason is the digits suffer from... Read More about An Attention based Fusion of ResNet50 and InceptionV3 Model for Water Meter Digit Recognition.

A New Few-Shot Learning-Based Model for Prohibited Objects Detection in Cluttered Baggage X-Ray Images Through Edge Detection and Reverse Validation (2023)
Journal Article
Liu, K., Lyu, S., Shivakumara, P., Blumenstein, M., & Lu, Y. (2023). A New Few-Shot Learning-Based Model for Prohibited Objects Detection in Cluttered Baggage X-Ray Images Through Edge Detection and Reverse Validation. IEEE Signal Processing Letters, 30, https://doi.org/10.1109/LSP.2023.3326088

Detecting prohibited items via X-ray screening at airports and sensitive venues is essential for preventing smuggling and breaches of security. The difficulty in prohibited items inspection lies in accurately detecting prohibited items in complex X-r... Read More about A New Few-Shot Learning-Based Model for Prohibited Objects Detection in Cluttered Baggage X-Ray Images Through Edge Detection and Reverse Validation.

A Robust SLIC Based Approach for Segmentation using Canny Edge Detector (2023)
Journal Article
Pal, S., Roy, A., Shivakumara, P., & Pal, U. (2023). A Robust SLIC Based Approach for Segmentation using Canny Edge Detector. #Journal not on list, https://doi.org/10.47852/bonviewAIA32021196

An accurate image segmentation in noisy environment is complex and challenging. Unlike existing state-of-the-art methods that use superpixels for successful segmentation, we propose a new approach for noise-robust SLIC (Simple Linear Iterative Cluste... Read More about A Robust SLIC Based Approach for Segmentation using Canny Edge Detector.

Editorial: Intelligent computing in farmland water conservancy for smart agriculture (2023)
Journal Article
(2023). Editorial: Intelligent computing in farmland water conservancy for smart agriculture. Frontiers in Plant Science, https://doi.org/10.3389/fpls.2023.1236010

In the past few decades, the rapid development of agriculture has put forward high requirements for efficient management of water resources, so as to rationally utilize natural resources and increase their sustainability. It is noted that there is a... Read More about Editorial: Intelligent computing in farmland water conservancy for smart agriculture.

A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images (2023)
Journal Article
Shivakumara, P., Banerjee, A., Pal, U., Nandanwar, L., Lu, T., & Liu, C.-L. (2023). A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images. IEEE transactions on image processing : a publication of the IEEE Signal Processing Society, 32, 3552 - 3566. https://doi.org/10.1109/TIP.2023.3287038

Due to the adverse effect of quality caused by different social media and arbitrary languages in natural scenes, detecting text from social media images and transferring its style is challenging. This paper presents a novel end-to-end model for text... Read More about A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images.

Classification of aesthetic natural scene images using statistical and semantic features (2023)
Journal Article
Biswas, K., Shivakumara, P., Pal, U., Lu, T., Blumenstein, M., & Lladós, J. (2023). Classification of aesthetic natural scene images using statistical and semantic features. Multimedia Tools and Applications, 82, 13507–13532. https://doi.org/10.1007/s11042-022-13924-7

Aesthetic image analysis is essential for improving the performance of multimedia image retrieval systems, especially from a repository of social media and multimedia content stored on mobile devices. This paper presents a novel method for classifyin... Read More about Classification of aesthetic natural scene images using statistical and semantic features.

Spatio-Temporal FFT Based Approach for Arbitrarily Moving Object Classification in Videos of Protected and Sensitive Scenes (2023)
Journal Article
Asadzadehkaljahi, M., Halder, A., Palaiahnakote, S., & Pal, U. (2023). Spatio-Temporal FFT Based Approach for Arbitrarily Moving Object Classification in Videos of Protected and Sensitive Scenes. #Journal not on list, https://doi.org/10.47852/bonviewAIA3202553

Arbitrary moving object detection including vehicles and human beings in the real environment, such as protected and sensitive areas, is challenging due to the arbitrary deformation and directions caused by shaky camera and wind. This work aims at ad... Read More about Spatio-Temporal FFT Based Approach for Arbitrarily Moving Object Classification in Videos of Protected and Sensitive Scenes.

Spatiotemporal Edges for Arbitrarily Moving Video Classification in Protected and Sensitive Scenes (2023)
Journal Article
Asadzadehkaljahi, M., Halder, A., Pal, U., & Shivakumara, P. (2023). Spatiotemporal Edges for Arbitrarily Moving Video Classification in Protected and Sensitive Scenes. #Journal not on list, https://doi.org/10.47852/bonviewAIA320526

Classification of arbitrary moving objects including vehicles and human beings in a real environment (such as protected and sensitive areas) is challenging due to arbitrary deformation and directions caused by shaky camera and wind. This work aims at... Read More about Spatiotemporal Edges for Arbitrarily Moving Video Classification in Protected and Sensitive Scenes.

Spatiotemporal Edges for Arbitrarily Moving Video Classification in Protected and Sensitive Scenes (2023)
Journal Article
Asadzadehkaljahi, M., Halder, A., Pal, U., & Shivakumara, P. (2023). Spatiotemporal Edges for Arbitrarily Moving Video Classification in Protected and Sensitive Scenes. #Journal not on list, https://doi.org/10.47852/bonviewAIA3202526

Classification of arbitrary moving objects including vehicles and human beings in a real environment (such as protected and sensitive areas) is challenging due to arbitrary deformation and directions caused by shaky camera and wind. This work aims at... Read More about Spatiotemporal Edges for Arbitrarily Moving Video Classification in Protected and Sensitive Scenes.

Tldsmi: Genetic Algorithm Based Network for Text Localization in Distorted Social Media Images (2023)
Working Paper
Palaiahnakote, S., Pavan Kumar, C., Aggarwal, P., Sharma, S., Chandana, P., Basavanna, M., & Pal, U. Tldsmi: Genetic Algorithm Based Network for Text Localization in Distorted Social Media Images

This paper presents a novel model for understanding social image content through text localization. For text localization, we explore Maximally Stable Extremal Regions (MSER) for detecting components, that works by clustering pixels having similar pr... Read More about Tldsmi: Genetic Algorithm Based Network for Text Localization in Distorted Social Media Images.

EAU-Net: A New Edge-Attention Based U-Net for Nationality Identification (2022)
Conference Proceeding
Pal Choudhury, A., Shivakumara, P., Pal, U., & Liu, C.-L. (2022). EAU-Net: A New Edge-Attention Based U-Net for Nationality Identification. In Frontiers in Handwriting Recognition 18th International Conference, ICFHR 2022, Hyderabad, India, December 4–7, 2022, Proceedings (137-152). https://doi.org/10.1007/978-3-031-21648-0_10

Identifying crime or individuals is one of the key tasks toward smart and safe city development when different nationals are involved. In this regard, identifying Nationality/Ethnicity through handwriting has received special attention. But due to fr... Read More about EAU-Net: A New Edge-Attention Based U-Net for Nationality Identification.

License Plate Number Detection in Drone Images (2022)
Journal Article
Mokayed, H., Palaiahnakote, S., Alkhaled, L., & N. AL-Masri, A. (2022). License Plate Number Detection in Drone Images. #Journal not on list, https://doi.org/10.47852/bonviewAIA2202421

For an intelligent transportation system, identifying license plate numbers in drone photos is difficult, and it is used in practical applications like parking management, traffic management, automatically organizing parking spots, etc. The primary g... Read More about License Plate Number Detection in Drone Images.

An Augmented Reality-Based Approach for Designing Interactive Food Menu of Restaurant Using Android (2022)
Journal Article
Nur Amin, S., Shivakumara, P., Xue Jun, T., Yang Chong, K., Leong Lon Zan, D., & Rahavendra, R. (2022). An Augmented Reality-Based Approach for Designing Interactive Food Menu of Restaurant Using Android. #Journal not on list, https://doi.org/10.47852/bonviewAIA2202354

The food industry is becoming competitive on a daily basis and introducing newer cuisines to the menu in an attempt to rise up the ladder. But they still are not being able to improve their performances because customers often only have the waiters t... Read More about An Augmented Reality-Based Approach for Designing Interactive Food Menu of Restaurant Using Android.