Dr Shivakumara Palaiahnakote

Classification of aesthetic natural scene images using statistical and semantic features (2023)
Journal Article
Biswas, K., Shivakumara, P., Pal, U., Lu, T., Blumenstein, M., & Lladós, J. (2023). Classification of aesthetic natural scene images using statistical and semantic features. Multimedia Tools and Applications, 82, 13507–13532. https://doi.org/10.1007/s11042-022-13924-7

Aesthetic image analysis is essential for improving the performance of multimedia image retrieval systems, especially from a repository of social media and multimedia content stored on mobile devices. This paper presents a novel method for classifyin... Read More about Classification of aesthetic natural scene images using statistical and semantic features.

Spatio-Temporal FFT Based Approach for Arbitrarily Moving Object Classification in Videos of Protected and Sensitive Scenes (2023)
Journal Article

Arbitrary moving object detection including vehicles and human beings in the real environment, such as protected and sensitive areas, is challenging due to the arbitrary deformation and directions caused by shaky camera and wind. This work aims at ad... Read More about Spatio-Temporal FFT Based Approach for Arbitrarily Moving Object Classification in Videos of Protected and Sensitive Scenes.

Spatiotemporal Edges for Arbitrarily Moving Video Classification in Protected and Sensitive Scenes (2023)
Journal Article

Classification of arbitrary moving objects including vehicles and human beings in a real environment (such as protected and sensitive areas) is challenging due to arbitrary deformation and directions caused by shaky camera and wind. This work aims at... Read More about Spatiotemporal Edges for Arbitrarily Moving Video Classification in Protected and Sensitive Scenes.

Tldsmi: Genetic Algorithm Based Network for Text Localization in Distorted Social Media Images (2023)
Working Paper
Palaiahnakote, S., Pavan Kumar, C., Aggarwal, P., Sharma, S., Chandana, P., Basavanna, M., & Pal, U. Tldsmi: Genetic Algorithm Based Network for Text Localization in Distorted Social Media Images

This paper presents a novel model for understanding social image content through text localization. For text localization, we explore Maximally Stable Extremal Regions (MSER) for detecting components, that works by clustering pixels having similar pr... Read More about Tldsmi: Genetic Algorithm Based Network for Text Localization in Distorted Social Media Images.

Inaugural Editorial (2023)
Journal Article

It gives me great pleasure to write an editorial note for the first edition of the new journal of Artificial Intelligence and Applications (AIA). Before I begin, I would like to thank our Managing Editor, Mrs. Yu Zhang of Bon View Publishing, Singapo... Read More about Inaugural Editorial.

EAU-Net: A New Edge-Attention Based U-Net for Nationality Identification (2022)
Presentation / Conference Contribution

Identifying crime or individuals is one of the key tasks toward smart and safe city development when different nationals are involved. In this regard, identifying Nationality/Ethnicity through handwriting has received special attention. But due to fr... Read More about EAU-Net: A New Edge-Attention Based U-Net for Nationality Identification.

License Plate Number Detection in Drone Images (2022)
Journal Article

For an intelligent transportation system, identifying license plate numbers in drone photos is difficult, and it is used in practical applications like parking management, traffic management, automatically organizing parking spots, etc. The primary g... Read More about License Plate Number Detection in Drone Images.

An Augmented Reality-Based Approach for Designing Interactive Food Menu of Restaurant Using Android (2022)
Journal Article

The food industry is becoming competitive on a daily basis and introducing newer cuisines to the menu in an attempt to rise up the ladder. But they still are not being able to improve their performances because customers often only have the waiters t... Read More about An Augmented Reality-Based Approach for Designing Interactive Food Menu of Restaurant Using Android.

A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection (2022)
Journal Article

Detecting forged handwriting is important in a wide variety of machine learning applications, and it is challenging when the input images are degraded with noise and blur. This article presents a new model based on conformable moments (CMs) and deep... Read More about A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection.

Fake News Detection Techniques on Social Media: A Survey (2022)
Journal Article
Ali, I., Nizam Bin Ayub, M., Shivakumara, P., & Fazmidar Binti Mohd Noor, N. (2022). Fake News Detection Techniques on Social Media: A Survey. Wireless Communications and Mobile Computing, https://doi.org/10.1155/2022/6072084

Social media platforms like Twitter have become common tools for disseminating and consuming news because of the ease with which users can get access to and consume it. This paper focuses on the identification of false news and the use of cutting... Read More about Fake News Detection Techniques on Social Media: A Survey.

New Deep Spatio-Structural Features of Handwritten Text Lines for Document Age Classification (2022)
Journal Article
Shivakumara, P., Das, A., S. Raghunandan, K., Pal, U., & Blumenstein, M. (2022). New Deep Spatio-Structural Features of Handwritten Text Lines for Document Age Classification. International Journal of Pattern Recognition and Artificial Intelligence, 36(9), https://doi.org/10.1142/S0218001422520139

Document age estimation using handwritten text line images is useful for several pattern recognition and artificial intelligence applications such as forged signature verification, writer identification, gender identification, personality traits iden... Read More about New Deep Spatio-Structural Features of Handwritten Text Lines for Document Age Classification.

Local Resultant Gradient Vector Difference and Inpainting for 3D Text Detection in the Wild (2022)
Journal Article
Zhong, D., Shivakumara, P., Nandanwar, L., Pal, U., Blumenstein, M., & Lu, Y. (2022). Local Resultant Gradient Vector Difference and Inpainting for 3D Text Detection in the Wild. International Journal of Pattern Recognition and Artificial Intelligence, 36(8), Article 2253005. https://doi.org/10.1142/S0218001422530056

Three-dimensional (3D) text appearing in natural scene images is common due to 3D cameras and the capture of text from different angles, which presents new problems for text detection. This is because of the presence of depth information, shadows, an... Read More about Local Resultant Gradient Vector Difference and Inpainting for 3D Text Detection in the Wild.

A new ontology-based multimodal classification system for social media images of personality traits (2022)
Journal Article
Biswas, K., Shivakumara, P., Pal, U., & Lu, T. (2023). A new ontology-based multimodal classification system for social media images of personality traits. Signal, Image and Video Processing, 17, 543-551. https://doi.org/10.1007/s11760-022-02259-3

Number of users of social media is increasing exponentially. People are getting addicted to social media, and because of such addiction, it sometimes causes psychological and mental effects on the users. Understanding user interaction with social med... Read More about A new ontology-based multimodal classification system for social media images of personality traits.

An Episodic Learning Network for Text Detection on Human Bodies in Sports Images (2022)
Journal Article
Nath Chowdhury, P., Shivakumara, P., Raghavendra, R., Nag, S., Pal, U., Lu, T., & Lopresti, D. (2022). An Episodic Learning Network for Text Detection on Human Bodies in Sports Images. IEEE Transactions on Circuits and Systems for Video Technology, 32, 2279 - 2289. https://doi.org/10.1109/TCSVT.2021.3092713

Due to the proliferation of sports-related multimedia content on the WWW, effective visual search and retrieval present interesting research challenges. These are caused by poor image quality, a wide range of possible camera points of view, pose vari... Read More about An Episodic Learning Network for Text Detection on Human Bodies in Sports Images.

Multi‐gradient‐direction based deep learning model for arecanut disease identification (2022)
Journal Article
B. Mallikarjuna, S., Shivakumara, P., Khare, V., Basavanna, M., Pal, U., & Poornima, B. (2022). Multi‐gradient‐direction based deep learning model for arecanut disease identification. CAAI Transactions on Intelligence Technology, 7(2), 156–166. https://doi.org/10.1049/cit2.12088

Arecanut disease identification is a challenging problem in the field of image processing. In this work, we present a new combination of multi-gradient-direction and deep convolutional neural networks for arecanut disease identification, namely,... Read More about Multi‐gradient‐direction based deep learning model for arecanut disease identification.

A deep action-oriented video image classification system for text detection and recognition (2021)
Journal Article
Chaudhuri, A., Shivakumara, P., Nath Chowdhury, P., Pal, U., Lu, T., Lopresti, D., & Hemantha Kumar, G. (2021). A deep action-oriented video image classification system for text detection and recognition. SN Applied Sciences, 3, Article 838. https://doi.org/10.1007/s42452-021-04821-z

For the video images with complex actions, achieving accurate text detection and recognition results is very challenging. This paper presents a hybrid model for classification of action-oriented video images which reduces the complexity of the proble... Read More about A deep action-oriented video image classification system for text detection and recognition.

Mining text from natural scene and video images: A survey (2021)
Journal Article
Shivakumara, P., Alaei, A., & Pal, U. (2021). Mining text from natural scene and video images: A survey. Data Mining and Knowledge Discovery, 11(6), https://doi.org/10.1002/widm.1428

In computer terminology, mining is considered as extracting meaningful information or knowledge from a large amount of data/information using computers. The meaningful information can be extracted from normal text, and images obtained from different... Read More about Mining text from natural scene and video images: A survey.

Deformable scene text detection using harmonic features and modified pixel aggregation network (2021)
Journal Article
Jain, T., Palaiahnakote, S., Pal, U., & Liu, C.-L. (2021). Deformable scene text detection using harmonic features and modified pixel aggregation network. Pattern Recognition Letters, 152, 135-142. https://doi.org/10.1016/j.patrec.2021.10.006

Although text detection methods have addressed several challenges in the past, there is a dearth of effective methods for text detection in deformable images, such as images containing text embedded on cloth, banners, rubber, sports jerseys, uniforms... Read More about Deformable scene text detection using harmonic features and modified pixel aggregation network.

Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic (2021)
Journal Article
Mokayed, H., Shivakumara, P., Saini, R., Liwicki, M., Chee Hin, L., & Pal, U. (2021). Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic. IEEE Access, 9, https://doi.org/10.1109/ACCESS.2021.3103279

This paper proposes a simple yet effective method for anomaly detection in natural scene images improving natural scene text detection and recognition. In the last decade, there has been significant progress towards text detection and recognition in... Read More about Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic.

Dr Shivakumara Palaiahnakote's Outputs (84)