School of Science, Engineering & Environment

TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting (2024)
Journal Article
Banerjee, A., Palaiahnakote, S., Antonacopoulos, A., Pal, U., Lu, T., & Canet, J. L. (2024). TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting. IEEE Transactions on Multimedia, 1-15. https://doi.org/10.1109/tmm.2024.3378458

Text spotting in natural scenes is of increasing interest and significance due to its critical role in several applications, such as visual question answering, named entity recognition and event rumor detection on social media. One of the newly emerg... Read More about TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting.

A new deep CNN for 3D text localization in the wild through shadow removal (2023)
Journal Article
Shivakumara, P., Banerjee, A., Nandanwar, L., Pal, U., Antonacopoulos, A., Lu, T., & Blumenstein, M. (2024). A new deep CNN for 3D text localization in the wild through shadow removal. Computer Vision and Image Understanding, 238, https://doi.org/10.1016/j.cviu.2023.103863

Text localization in the wild is challenging due to the presence of 2D and 3D texts, the presence of shadows, arbitrary orientated text with non-linear arrangements, varying lighting conditions as well as complex background. This paper proposes the f... Read More about A new deep CNN for 3D text localization in the wild through shadow removal.

NAME – A Rich XML Format for Named Entity and Relation Tagging (2023)
Conference Proceeding
Clausner, C., Pletschacher, S., & Antonacopoulos, A. (2023). NAME – A Rich XML Format for Named Entity and Relation Tagging. In HIP '23: Proceedings of the 7th International Workshop on Historical Document Imaging and Processing (91-96). https://doi.org/10.1145/3604951.3605521

We present NAME XML, a schema for named entities and relations in documents. The standout features are: option to reference a variety of document formats (such as PAGE XML or plain text), support of entity hierarchies, custom entity types via ontolog... Read More about NAME – A Rich XML Format for Named Entity and Relation Tagging.

Text line segmentation from struck-out handwritten document images (2022)
Journal Article
Shivakumara, P., Jain, T., Pal, U., Surana, N., Antonacopoulos, A., & Lu, T. (2022). Text line segmentation from struck-out handwritten document images. Expert systems with applications, 210, 118266. https://doi.org/10.1016/j.eswa.2022.118266

In the case of freestyle everyday handwritten documents, writing, erasing, striking out, and overwriting are common behaviors of the writers. This not cleanly-written text poses significant challenges for text line segmentation. Accurate text line se... Read More about Text line segmentation from struck-out handwritten document images.

A new deep wavefront based model for text localization in 3D video (2021)
Journal Article
Nandanwar, L., Shivakumara, P., Ramachandra, R., Lu, T., Pal, U., Antonacopoulos, A., & Lu, Y. (2021). A new deep wavefront based model for text localization in 3D video. IEEE Transactions on Circuits and Systems for Video Technology, 32(6), 3375-3389. https://doi.org/10.1109/TCSVT.2021.3110990

With the evolution of electronic devices, such as 3D cameras, addressing the challenges of text localization in 3D video (e.g., for indexing) is increasingly drawing the attention of the multimedia and video processing community. Existing methods foc... Read More about A new deep wavefront based model for text localization in 3D video.

A survey of OCR evaluation tools and metrics (2021)
Conference Proceeding
Neudecker, C., Baierer, K., Gerber, M., Clausner, C., Antonacopoulos, A., & Pletschacher, S. (2021). A survey of OCR evaluation tools and metrics. In HIP '21: The 6th International Workshop on Historical Document Imaging and Processing. https://doi.org/10.1145/3476887.3476888

The millions of pages of historical documents that are digitized in libraries are increasingly used in contexts that have more specific requirements for OCR quality than keyword search. How to comprehensively, efficiently and reliably assess the qual... Read More about A survey of OCR evaluation tools and metrics.

Exploiting available domain knowledge to improve the retrieval and recommendation of Digital Cultural Heritage materials (2021)
Thesis
Usman, M. Exploiting available domain knowledge to improve the retrieval and recommendation of Digital Cultural Heritage materials. (Thesis). University of Salford

Cultural Heritage (CH) institutions, such as museums, have recently embraced computing techniques to digitise CH materials (artefacts, paintings, books etc) and to make accessible those digital representations through their online portals to millions... Read More about Exploiting available domain knowledge to improve the retrieval and recommendation of Digital Cultural Heritage materials.

Computer analysis for registration and change detection of retinal images (2021)
Thesis
Elmuntser, A. Computer analysis for registration and change detection of retinal images. (Thesis). University of Salford

The current system of retinal screening is manual; It requires repetitive examination of a large number of retinal images by professional optometrists who try to identify the presence of abnormalities. As a result of the manual and repetitive nature... Read More about Computer analysis for registration and change detection of retinal images.

Flexible character accuracy measure for reading-order-independent evaluation (2020)
Journal Article
Clausner, C., Pletschacher, S., & Antonacopoulos, A. (2020). Flexible character accuracy measure for reading-order-independent evaluation. Pattern Recognition Letters, 131, 390-397. https://doi.org/10.1016/j.patrec.2020.02.003

The extraction of textual information from scanned document pages is a fundamental stage in any digitisation effort and directly determines the success of the overall document analysis and understanding application scenarios. To evaluate and improve... Read More about Flexible character accuracy measure for reading-order-independent evaluation.

VISE : an interface for Visual Search and Exploration of museum collections (2019)
Journal Article
Usman, M., & Antonacopoulos, A. (2020). VISE : an interface for Visual Search and Exploration of museum collections. Journal on Computing and Cultural Heritage, 12(4), 1-9. https://doi.org/10.1145/3340936

This article presents VISE, an interface that enables VIsual Search and Exploration across collections of approximately 836,000 museum objects extracted from the websites of the National Museums Scotland and the Rijksmuseum in the Netherlands. VISE p... Read More about VISE : an interface for Visual Search and Exploration of museum collections.

Outputs (55)