Research Repository

Outputs (26)

A novel autoencoder for structural anomalies detection in river tunnel operation (2023)
Journal Article
Tan, X.-Y., Palaiahnakote, S., Chen, W., Cheng, K., & Du, B. (2024). A novel autoencoder for structural anomalies detection in river tunnel operation. Expert Systems with Applications, 244. https://doi.org/10.1016/j.eswa.2023.122906

Anomaly diagnosis is essential to prevent disasters and ensure long-term stable operation of tunnels. However, the diversity and scarcity of abnormal data make it difficult to identify outliers, especially to diagnose structural anomalies from poor-q...

A New Few-Shot Learning-Based Model for Prohibited Objects Detection in Cluttered Baggage X-Ray Images Through Edge Detection and Reverse Validation (2023)
Journal Article
Liu, K., Lyu, S., Shivakumara, P., Blumenstein, M., & Lu, Y. (2023). A New Few-Shot Learning-Based Model for Prohibited Objects Detection in Cluttered Baggage X-Ray Images Through Edge Detection and Reverse Validation. IEEE Signal Processing Letters, 30. https://doi.org/10.1109/LSP.2023.3326088

Detecting prohibited items via X-ray screening at airports and sensitive venues is essential for preventing smuggling and breaches of security. The difficulty in prohibited items inspection lies in accurately detecting prohibited items in complex X-r...

A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images (2023)
Journal Article
Shivakumara, P., Banerjee, A., Pal, U., Nandanwar, L., Lu, T., & Liu, C.-L. (2023). A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images. IEEE Transactions on Image Processing, 32, 3552–3566. https://doi.org/10.1109/TIP.2023.3287038

Due to the adverse effect of quality caused by different social media and arbitrary languages in natural scenes, detecting text from social media images and transferring its style is challenging. This paper presents a novel end-to-end model for text...

Classification of aesthetic natural scene images using statistical and semantic features (2023)
Journal Article
Biswas, K., Shivakumara, P., Pal, U., Lu, T., Blumenstein, M., & Lladós, J. (2023). Classification of aesthetic natural scene images using statistical and semantic features. Multimedia Tools and Applications, 82, 13507–13532. https://doi.org/10.1007/s11042-022-13924-7

Aesthetic image analysis is essential for improving the performance of multimedia image retrieval systems, especially from a repository of social media and multimedia content stored on mobile devices. This paper presents a novel method for classifyin...

A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection (2022)
Journal Article
Nandanwar, L., Shivakumara, P., Jalab, H. A., Ibrahim, R. W., Raghavendra, R., Pal, U., …Blumenstein, M. (2022). A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection. IEEE Transactions on Neural Networks and Learning Systems, 1-14. https://doi.org/10.1109/TNNLS.2022.3204390

Detecting forged handwriting is important in a wide variety of machine learning applications, and it is challenging when the input images are degraded with noise and blur. This article presents a new model based on conformable moments (CMs) and deep...

An Episodic Learning Network for Text Detection on Human Bodies in Sports Images (2022)
Journal Article
Nath Chowdhury, P., Shivakumara, P., Raghavendra, R., Nag, S., Pal, U., Lu, T., & Lopresti, D. (2022). An Episodic Learning Network for Text Detection on Human Bodies in Sports Images. IEEE Transactions on Circuits and Systems for Video Technology, 32, 2279–2289. https://doi.org/10.1109/TCSVT.2021.3092713

Due to the proliferation of sports-related multimedia content on the WWW, effective visual search and retrieval present interesting research challenges. These are caused by poor image quality, a wide range of possible camera points of view, pose vari...

Mining text from natural scene and video images: A survey (2021)
Journal Article
Shivakumara, P., Alaei, A., & Pal, U. (2021). Mining text from natural scene and video images: A survey. WIREs Data Mining and Knowledge Discovery, 11(6). https://doi.org/10.1002/widm.1428

In computer terminology, mining is considered as extracting meaningful information or knowledge from a large amount of data/information using computers. The meaningful information can be extracted from normal text, and images obtained from different...

A deep action-oriented video image classification system for text detection and recognition (2021)
Journal Article
Chaudhuri, A., Shivakumara, P., Nath Chowdhury, P., Pal, U., Lu, T., Lopresti, D., & Hemantha Kumar, G. (2021). A deep action-oriented video image classification system for text detection and recognition. SN Applied Sciences, 3, Article 838. https://doi.org/10.1007/s42452-021-04821-z

For the video images with complex actions, achieving accurate text detection and recognition results is very challenging. This paper presents a hybrid model for classification of action-oriented video images which reduces the complexity of the proble...

Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic (2021)
Journal Article
Mokayed, H., Shivakumara, P., Saini, R., Liwicki, M., Chee Hin, L., & Pal, U. (2021). Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic. IEEE Access, 9. https://doi.org/10.1109/ACCESS.2021.3103279

This paper proposes a simple yet effective method for anomaly detection in natural scene images improving natural scene text detection and recognition. In the last decade, there has been significant progress towards text detection and recognition in...

A new context-based feature for classification of emotions in photographs (2021)
Journal Article
Krishnani, D., Shivakumara, P., Lu, T., Pal, U., Lopresti, D., & Hemantha Kumar, G. (2021). A new context-based feature for classification of emotions in photographs. Multimedia Tools and Applications, 80, 15589–15618. https://doi.org/10.1007/s11042-020-10404-8

A high volume of images is shared on the public Internet each day. Many of these are photographs of people with facial expressions and actions displaying various emotions. In this work, we examine the problem of classifying broad categories of emotio...