Research Repository

All Outputs (8)

A deep action-oriented video image classification system for text detection and recognition (2021)
Journal Article
Chaudhuri, A., Shivakumara, P., Nath Chowdhury, P., Pal, U., Lu, T., Lopresti, D., & Hemantha Kumar, G. (2021). A deep action-oriented video image classification system for text detection and recognition. SN Applied Sciences, 3, Article 838. https://doi.org/10.1007/s42452-021-04821-z

For the video images with complex actions, achieving accurate text detection and recognition results is very challenging. This paper presents a hybrid model for classification of action-oriented video images which reduces the complexity of the proble...

Mining text from natural scene and video images: A survey (2021)
Journal Article
Shivakumara, P., Alaei, A., & Pal, U. (2021). Mining text from natural scene and video images: A survey. Data Mining and Knowledge Discovery, 11(6). https://doi.org/10.1002/widm.1428

In computer terminology, mining is considered as extracting meaningful information or knowledge from a large amount of data/information using computers. The meaningful information can be extracted from normal text, and images obtained from different...

Deformable scene text detection using harmonic features and modified pixel aggregation network (2021)
Journal Article
Jain, T., Palaiahnakote, S., Pal, U., & Liu, C.-L. (2021). Deformable scene text detection using harmonic features and modified pixel aggregation network. Pattern Recognition Letters, 152, 135-142. https://doi.org/10.1016/j.patrec.2021.10.006

Although text detection methods have addressed several challenges in the past, there is a dearth of effective methods for text detection in deformable images, such as images containing text embedded on cloth, banners, rubber, sports jerseys, uniforms...

Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic (2021)
Journal Article
Mokayed, H., Shivakumara, P., Saini, R., Liwicki, M., Chee Hin, L., & Pal, U. (2021). Anomaly Detection in Natural Scene Images Based on Enhanced Fine-Grained Saliency and Fuzzy Logic. IEEE Access, 9. https://doi.org/10.1109/ACCESS.2021.3103279

This paper proposes a simple yet effective method for anomaly detection in natural scene images improving natural scene text detection and recognition. In the last decade, there has been significant progress towards text detection and recognition in...

ARNet: Active-Reference Network for Few-Shot Image Semantic Segmentation (2021)
Conference Proceeding
Shi, G., Wu, Y., Palaiahnakote, S., Pal, U., & Lu, T. (2021). ARNet: Active-Reference Network for Few-Shot Image Semantic Segmentation. In 2021 IEEE International Conference on Multimedia and Expo (ICME). https://doi.org/10.1109/ICME51207.2021.9428425

To make predictions on unseen classes, few-shot segmentation becomes a research focus recently. However, most methods build on pixel-level annotation requiring quantity of manual work. Moreover, inherent information on same-category objects to guide...

A new context-based feature for classification of emotions in photographs (2021)
Journal Article
Krishnani, D., Shivakumara, P., Lu, T., Pal, U., Lopresti, D., & Hemantha Kumar, G. (2021). A new context-based feature for classification of emotions in photographs. Multimedia Tools and Applications, 80, 15589–15618. https://doi.org/10.1007/s11042-020-10404-8

A high volume of images is shared on the public Internet each day. Many of these are photographs of people with facial expressions and actions displaying various emotions. In this work, we examine the problem of classifying broad categories of emotio...

A survey on video content rating: taxonomy, challenges and open issues (2021)
Journal Article
Khaksar Pour, A., Chaw Seng, W., Palaiahnakote, S., Tahaei, H., & Badrul Anuar, N. (2021). A survey on video content rating: taxonomy, challenges and open issues. Multimedia Tools and Applications, 80, 24121-24145. https://doi.org/10.1007/s11042-021-10838-8

Rating a video based on its content is one of the most important solutions to classify videos for audience age groups. In this regard, Film content rating and TV programmes rating are the only two most common rating systems which have been accomplish...

DCT-phase statistics for forged IMEI numbers and air ticket detection (2021)
Journal Article
Nandanwar, L., Shivakumara, P., Kanchan, S., Basavaraja, V., Guru, D., Pal, U., … Blumenstein, M. (2021). DCT-phase statistics for forged IMEI numbers and air ticket detection. Expert Systems with Applications, 164. https://doi.org/10.1016/j.eswa.2020.114014

New tools have been developing with the intention of having more flexibility and greater user-friendliness for editing the images and documents in digital technologies, but, unfortunately, they are also being used for manipulating and tampering infor...