Skip to main content

Research Repository

Advanced Search

All Outputs (75)

A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection (2022)
Journal Article
Nandanwar, L., Shivakumara, P., A. Jalab, H., W. Ibrahim, R., Raghavendra, R., Pal, U., …Blumenstein, M. (2022). A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection. IEEE transactions on neural networks and learning systems, 1-14. https://doi.org/10.1109/TNNLS.2022.3204390

Detecting forged handwriting is important in a wide variety of machine learning applications, and it is challenging when the input images are degraded with noise and blur. This article presents a new model based on conformable moments (CMs) and deep... Read More about A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection.

Fake News Detection Techniques on Social Media: A Survey (2022)
Journal Article
Ali, I., Nizam Bin Ayub, M., Shivakumara, P., & Fazmidar Binti Mohd Noor, N. (2022). Fake News Detection Techniques on Social Media: A Survey. Wireless Communications and Mobile Computing, https://doi.org/10.1155/2022/6072084



Social media platforms like Twitter have become common tools for disseminating and consuming news because of the ease with which users can get access to and consume it. This paper focuses on the identification of false news and the use of cutting... Read More about Fake News Detection Techniques on Social Media: A Survey.

New Deep Spatio-Structural Features of Handwritten Text Lines for Document Age Classification (2022)
Journal Article
Shivakumara, P., Das, A., S. Raghunandan, K., Pal, U., & Blumenstein, M. (2022). New Deep Spatio-Structural Features of Handwritten Text Lines for Document Age Classification. International Journal of Pattern Recognition and Artificial Intelligence, 36(9), https://doi.org/10.1142/S0218001422520139

Document age estimation using handwritten text line images is useful for several pattern recognition and artificial intelligence applications such as forged signature verification, writer identification, gender identification, personality traits iden... Read More about New Deep Spatio-Structural Features of Handwritten Text Lines for Document Age Classification.

Local Resultant Gradient Vector Difference and Inpainting for 3D Text Detection in the Wild (2022)
Journal Article
Zhong, D., Shivakumara, P., Nandanwar, L., Pal, U., Blumenstein, M., & Lu, Y. (2022). Local Resultant Gradient Vector Difference and Inpainting for 3D Text Detection in the Wild. International Journal of Pattern Recognition and Artificial Intelligence, 36(8), Article 2253005. https://doi.org/10.1142/S0218001422530056

Three-dimensional (3D) text appearing in natural scene images is common due to 3D cameras and the capture of text from different angles, which presents new problems for text detection. This is because of the presence of depth information, shadows, an... Read More about Local Resultant Gradient Vector Difference and Inpainting for 3D Text Detection in the Wild.

A new ontology-based multimodal classification system for social media images of personality traits (2022)
Journal Article
Biswas, K., Shivakumara, P., Pal, U., & Lu, T. (2023). A new ontology-based multimodal classification system for social media images of personality traits. Signal, Image and Video Processing, 17, 543-551. https://doi.org/10.1007/s11760-022-02259-3

Number of users of social media is increasing exponentially. People are getting addicted to social media, and because of such addiction, it sometimes causes psychological and mental effects on the users. Understanding user interaction with social med... Read More about A new ontology-based multimodal classification system for social media images of personality traits.

An Episodic Learning Network for Text Detection on Human Bodies in Sports Images (2022)
Journal Article
Nath Chowdhury, P., Shivakumara, P., Raghavendra, R., Nag, S., Pal, U., Lu, T., & Lopresti, D. (2022). An Episodic Learning Network for Text Detection on Human Bodies in Sports Images. IEEE Transactions on Circuits and Systems for Video Technology, 32, 2279 - 2289. https://doi.org/10.1109/TCSVT.2021.3092713

Due to the proliferation of sports-related multimedia content on the WWW, effective visual search and retrieval present interesting research challenges. These are caused by poor image quality, a wide range of possible camera points of view, pose vari... Read More about An Episodic Learning Network for Text Detection on Human Bodies in Sports Images.

Multi‐gradient‐direction based deep learning model for arecanut disease identification (2022)
Journal Article
B. Mallikarjuna, S., Shivakumara, P., Khare, V., Basavanna, M., Pal, U., & Poornima, B. (2022). Multi‐gradient‐direction based deep learning model for arecanut disease identification. CAAI Transactions on Intelligence Technology, 7(2), 156–166. https://doi.org/10.1049/cit2.12088



Arecanut disease identification is a challenging problem in the field of image processing. In this work, we present a new combination of multi-gradient-direction and deep convolutional neural networks for arecanut disease identification, namely,... Read More about Multi‐gradient‐direction based deep learning model for arecanut disease identification.

A deep action-oriented video image classification system for text detection and recognition (2021)
Journal Article
Chaudhuri, A., Shivakumara, P., Nath Chowdhury, P., Pal, U., Lu, T., Lopresti, D., & Hemantha Kumar, G. (2021). A deep action-oriented video image classification system for text detection and recognition. SN Applied Sciences, 3, Article 838. https://doi.org/10.1007/s42452-021-04821-z

For the video images with complex actions, achieving accurate text detection and recognition results is very challenging. This paper presents a hybrid model for classification of action-oriented video images which reduces the complexity of the proble... Read More about A deep action-oriented video image classification system for text detection and recognition.

Mining text from natural scene and video images: A survey (2021)
Journal Article
Shivakumara, P., Alaei, A., & Pal, U. (2021). Mining text from natural scene and video images: A survey. Data Mining and Knowledge Discovery, 11(6), https://doi.org/10.1002/widm.1428

In computer terminology, mining is considered as extracting meaningful information or knowledge from a large amount of data/information using computers. The meaningful information can be extracted from normal text, and images obtained from different... Read More about Mining text from natural scene and video images: A survey.

Deformable scene text detection using harmonic features and modified pixel aggregation network (2021)
Journal Article
Jain, T., Palaiahnakote, S., Pal, U., & Liu, C.-L. (2021). Deformable scene text detection using harmonic features and modified pixel aggregation network. Pattern Recognition Letters, 152, 135-142. https://doi.org/10.1016/j.patrec.2021.10.006

Although text detection methods have addressed several challenges in the past, there is a dearth of effective methods for text detection in deformable images, such as images containing text embedded on cloth, banners, rubber, sports jerseys, uniforms... Read More about Deformable scene text detection using harmonic features and modified pixel aggregation network.

Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic (2021)
Journal Article
Mokayed, H., Shivakumara, P., Saini, R., Liwicki, M., Chee Hin, L., & Pal, U. (2021). Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic. IEEE Access, 9, https://doi.org/10.1109/ACCESS.2021.3103279

This paper proposes a simple yet effective method for anomaly detection in natural scene images improving natural scene text detection and recognition. In the last decade, there has been significant progress towards text detection and recognition in... Read More about Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic.

ARNet: Active-Reference Network for Few-Shot Image Semantic Segmentation (2021)
Conference Proceeding
Shi, G., Wu, Y., Palaiahnakote, S., Pal, U., & Lu, T. (2021). ARNet: Active-Reference Network for Few-Shot Image Semantic Segmentation. In 2021 IEEE International Conference on Multimedia and Expo (ICME). https://doi.org/10.1109/ICME51207.2021.9428425

To make predictions on unseen classes, few-shot segmentation becomes a research focus recently. However, most methods build on pixel-level annotation requiring quantity of manual work. Moreover, inherent information on same-category objects to guide... Read More about ARNet: Active-Reference Network for Few-Shot Image Semantic Segmentation.

A new context-based feature for classification of emotions in photographs (2021)
Journal Article
Krishnani, D., Shivakumara, P., Lu, T., Pal, U., Lopresti, D., & Hemantha Kumar, G. (2021). A new context-based feature for classification of emotions in photographs. Multimedia Tools and Applications, 80, 15589–15618. https://doi.org/10.1007/s11042-020-10404-8

A high volume of images is shared on the public Internet each day. Many of these are photographs of people with facial expressions and actions displaying various emotions. In this work, we examine the problem of classifying broad categories of emotio... Read More about A new context-based feature for classification of emotions in photographs.

A survey on video content rating: taxonomy, challenges and open issues (2021)
Journal Article
Khaksar Pour, A., Chaw Seng, W., Palaiahnakote, S., Tahaei, H., & Badrul Anuar, N. (2021). A survey on video content rating: taxonomy, challenges and open issues. Multimedia Tools and Applications, 80, 24121-24145. https://doi.org/10.1007/s11042-021-10838-8

Rating a video based on its content is one of the most important solutions to classify videos for audience age groups. In this regard, Film content rating and TV programmes rating are the only two most common rating systems which have been accomplish... Read More about A survey on video content rating: taxonomy, challenges and open issues.

DCT-phase statistics for forged IMEI numbers and air ticket detection (2021)
Journal Article
Nandanwar, L., Shivakumara, P., Kanchan, S., Basavaraja, V., Guru, D., Pal, U., …Blumenstein, M. (2021). DCT-phase statistics for forged IMEI numbers and air ticket detection. Expert systems with applications, 164, https://doi.org/10.1016/j.eswa.2020.114014

New tools have been developing with the intention of having more flexibility and greater user-friendliness for editing the images and documents in digital technologies, but, unfortunately, they are also being used for manipulating and tampering infor... Read More about DCT-phase statistics for forged IMEI numbers and air ticket detection.

A New Method for Detecting Altered Text in Document Images (2020)
Conference Proceeding
Nandanwar, L., Shivakumara, P., Pal, U., Lu, T., Lopresti, D., Seraogi, B., & B. Chaudhuri, B. (2020). A New Method for Detecting Altered Text in Document Images. In Pattern Recognition and Artificial Intelligence (93-108). https://doi.org/10.1007/978-3-030-59830-3_8

As more and more office documents are captured, stored, and shared in digital format, and as image editing software becomes increasingly more powerful, there is a growing concern about document authenticity. For example, texts in property documents c... Read More about A New Method for Detecting Altered Text in Document Images.

A New DCT-FFT Fusion Based Method for Caption and Scene Text Classification in Action Video Images (2020)
Conference Proceeding
Nandanwar, L., Shivakumara, P., Manna, S., Pal, U., Lu, T., & Blumenstein, M. (2020). A New DCT-FFT Fusion Based Method for Caption and Scene Text Classification in Action Video Images. In Pattern Recognition and Artificial Intelligence. https://doi.org/10.1007/978-3-030-59830-3_7

Achieving better recognition rate for text in video action images is challenging due to multi-type texts with unpredictable backgrounds. We propose a new method for the classification of captions (which is edited text) and scene texts (which is part... Read More about A New DCT-FFT Fusion Based Method for Caption and Scene Text Classification in Action Video Images.

A new unified method for detecting text from marathon runners and sports players in video (PR-D-19-01078R2) (2020)
Journal Article
Nag, S., Shivakumara, P., Pal, U., Lu, T., & Blumenstein, M. (2020). A new unified method for detecting text from marathon runners and sports players in video (PR-D-19-01078R2). Pattern recognition, 107, https://doi.org/10.1016/j.patcog.2020.107476

Detecting text located on the torsos of marathon runners and sports players in video is a challenging issue due to poor quality and adverse effects caused by flexible/colorful clothing, and different structures of human bodies or actions. This paper... Read More about A new unified method for detecting text from marathon runners and sports players in video (PR-D-19-01078R2).

Saliency-based bit plane detection for network applications (2020)
Journal Article
Asadzadeh Kaljahi, M., Shivakumara, P., Hakak, S., Yamani Idna Idris, M., Hossein Anisi, M., & Rajan, D. (2020). Saliency-based bit plane detection for network applications. Multimedia Tools and Applications, 79, 18495–18513. https://doi.org/10.1007/s11042-020-08741-9

Transmitting image data without losing significant information is challenging for any network application especially when large color images are transmitted through TCP communication protocol. This is due to network limitations such as buffer overflo... Read More about Saliency-based bit plane detection for network applications.

A text-context-aware CNN network for multi-oriented and multi-language scene text detection (2020)
Conference Proceeding
Xiao, Y., Xue, M., Lu, T., Wu, Y., & Palaiahnakote, S. (2020). A text-context-aware CNN network for multi-oriented and multi-language scene text detection. In 2019 International Conference on Document Analysis and Recognition (ICDAR). https://doi.org/10.1109/ICDAR.2019.00116

The existing deep learning based state-of-theart scene text detection methods treat scene texts a type of general objects, or segment text regions directly. The latter category achieves remarkable detection results on arbitrary orientation and large... Read More about A text-context-aware CNN network for multi-oriented and multi-language scene text detection.