Research Repository

All Outputs (48)

TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting (2024)
Journal Article
Banerjee, A., Palaiahnakote, S., Antonacopoulos, A., Pal, U., Lu, T., & Canet, J. L. (2024). TTS: Hilbert Transform-based Generative Adversarial Network for Tattoo and Scene Text Spotting. IEEE Transactions on Multimedia, 1-15. https://doi.org/10.1109/tmm.2024.3378458

Text spotting in natural scenes is of increasing interest and significance due to its critical role in several applications, such as visual question answering, named entity recognition and event rumor detection on social media. One of the newly emerg...

A new deep CNN for 3D text localization in the wild through shadow removal (2023)
Journal Article
Shivakumara, P., Banerjee, A., Nandanwar, L., Pal, U., Antonacopoulos, A., Lu, T., & Blumenstein, M. (2024). A new deep CNN for 3D text localization in the wild through shadow removal. Computer Vision and Image Understanding, 238. https://doi.org/10.1016/j.cviu.2023.103863

Text localization in the wild is challenging due to the presence of 2D and 3D texts, the presence of shadows, arbitrary orientated text with non-linear arrangements, varying lighting conditions as well as complex background. This paper proposes the f...

NAME – A Rich XML Format for Named Entity and Relation Tagging (2023)
Conference Proceeding
Clausner, C., Pletschacher, S., & Antonacopoulos, A. (2023). NAME – A Rich XML Format for Named Entity and Relation Tagging. In HIP '23: Proceedings of the 7th International Workshop on Historical Document Imaging and Processing (91-96). https://doi.org/10.1145/3604951.3605521

We present NAME XML, a schema for named entities and relations in documents. The standout features are: option to reference a variety of document formats (such as PAGE XML or plain text), support of entity hierarchies, custom entity types via ontolog...

Text line segmentation from struck-out handwritten document images (2022)
Journal Article
Shivakumara, P., Jain, T., Pal, U., Surana, N., Antonacopoulos, A., & Lu, T. (2022). Text line segmentation from struck-out handwritten document images. Expert systems with applications, 210, 118266. https://doi.org/10.1016/j.eswa.2022.118266

In the case of freestyle everyday handwritten documents, writing, erasing, striking out, and overwriting are common behaviors of the writers. This not cleanly-written text poses significant challenges for text line segmentation. Accurate text line se...

A new deep wavefront based model for text localization in 3D video (2021)
Journal Article
Nandanwar, L., Shivakumara, P., Ramachandra, R., Lu, T., Pal, U., Antonacopoulos, A., & Lu, Y. (2021). A new deep wavefront based model for text localization in 3D video. IEEE Transactions on Circuits and Systems for Video Technology, 32(6), 3375-3389. https://doi.org/10.1109/TCSVT.2021.3110990

With the evolution of electronic devices, such as 3D cameras, addressing the challenges of text localization in 3D video (e.g., for indexing) is increasingly drawing the attention of the multimedia and video processing community. Existing methods foc...

A survey of OCR evaluation tools and metrics (2021)
Conference Proceeding
Neudecker, C., Baierer, K., Gerber, M., Clausner, C., Antonacopoulos, A., & Pletschacher, S. (2021). A survey of OCR evaluation tools and metrics. In HIP '21: The 6th International Workshop on Historical Document Imaging and Processing. https://doi.org/10.1145/3476887.3476888

The millions of pages of historical documents that are digitized in libraries are increasingly used in contexts that have more specific requirements for OCR quality than keyword search. How to comprehensively, efficiently and reliably assess the qual...

Exploiting available domain knowledge to improve the retrieval and recommendation of Digital Cultural Heritage materials (2021)
Thesis
Usman, M. Exploiting available domain knowledge to improve the retrieval and recommendation of Digital Cultural Heritage materials. (Thesis). University of Salford

Cultural Heritage (CH) institutions, such as museums, have recently embraced computing techniques to digitise CH materials (artefacts, paintings, books etc) and to make accessible those digital representations through their online portals to millions...

Computer analysis for registration and change detection of retinal images (2021)
Thesis
Elmuntser, A. Computer analysis for registration and change detection of retinal images. (Thesis). University of Salford

The current system of retinal screening is manual; it requires repetitive examination of a large number of retinal images by professional optometrists who try to identify the presence of abnormalities. As a result of the manual and repetitive nature...

Flexible character accuracy measure for reading-order-independent evaluation (2020)
Journal Article
Clausner, C., Pletschacher, S., & Antonacopoulos, A. (2020). Flexible character accuracy measure for reading-order-independent evaluation. Pattern Recognition Letters, 131, 390-397. https://doi.org/10.1016/j.patrec.2020.02.003

The extraction of textual information from scanned document pages is a fundamental stage in any digitisation effort and directly determines the success of the overall document analysis and understanding application scenarios. To evaluate and improve...

VISE : an interface for Visual Search and Exploration of museum collections (2019)
Journal Article
Usman, M., & Antonacopoulos, A. (2020). VISE : an interface for Visual Search and Exploration of museum collections. Journal on Computing and Cultural Heritage, 12(4), 1-9. https://doi.org/10.1145/3340936

This article presents VISE, an interface that enables VIsual Search and Exploration across collections of approximately 836,000 museum objects extracted from the websites of the National Museums Scotland and the Rijksmuseum in the Netherlands. VISE p...

Efficient and effective OCR engine training (2019)
Journal Article
Clausner, C., Antonacopoulos, A., & Pletschacher, S. (2020). Efficient and effective OCR engine training. International Journal on Document Analysis and Recognition, 23(1), 73-78. https://doi.org/10.1007/s10032-019-00347-8

We present an efficient and effective approach to train OCR engines using the Aletheia document analysis system. All components required for training are seamlessly integrated into Aletheia: training data preparation, the OCR engine’s training proces...

Crowdsourcing historical tabular data : 1961 census of England and Wales (2019)
Conference Proceeding
Clausner, C., Hayes, J., & Antonacopoulos, A. (2019). Crowdsourcing historical tabular data : 1961 census of England and Wales. In Proceedings of the 5th International Workshop on Historical Document Imaging and Processing - HIP '19. https://doi.org/10.1145/3352631.3352643

This paper describes how crowdsourcing can be incorporated as an integral part of a comprehensive technical workflow to identify, extract and validate data from large volumes of printed tabular statistics, and transform them into operable digital dat...

Towards the extraction of statistical information from digitised numerical tables - the Medical Officer of Health reports scoping study (2019)
Conference Proceeding
Clausner, C., Antonacopoulos, A., Henshaw, C., & Hayes, J. (2019). Towards the extraction of statistical information from digitised numerical tables - the Medical Officer of Health reports scoping study. In DATeCH2019 Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage. https://doi.org/10.1145/3322905.3322932

Numerical data of considerable significance is present in historical documents in tabular form. Due to the challenges involved in the extraction of this data from the scanned documents it is not available to researchers in a useful representation tha...

Highlights of the novel dewaterability estimation test (DET) device (2019)
Journal Article
Scholz, M., Almuktar, S., Clausner, C., & Antonacopoulos, A. (2020). Highlights of the novel dewaterability estimation test (DET) device. Environmental Technology, 41(20), 2594-2602. https://doi.org/10.1080/09593330.2019.1575916

Many industries, which are producing sludge in large quantities, depend on sludge dewatering technology to reduce the corresponding water content. A key design parameter for dewatering equipment is the capillary suction time (CST) test, which has, ho...

ICFHR 2018 Competition on recognition of historical Arabic scientific manuscripts - RASM2018 (2018)
Conference Proceeding
Clausner, C., Antonacopoulos, A., McGregor, N., & Wilson-Nunn, D. (2018). ICFHR 2018 Competition on recognition of historical Arabic scientific manuscripts - RASM2018. https://doi.org/10.1109/ICFHR-2018.2018.00088

This paper presents an objective comparative evaluation of page analysis and recognition methods for historical scientific manuscripts with text in Arabic language and script. It describes the competition (modus operandi, dataset and evaluation metho...

Security and usability in a hybrid property based graphical authentication system (2018)
Thesis
Suru, H. (in press). Security and usability in a hybrid property based graphical authentication system. (Thesis). University of Salford

Alphanumeric text and PINs continue to be the dominant authentication methods in spite of the numerous concerns by security researchers of their inability to properly address usability and security flaws and to effectively combine usability and secur...

Ontology and framework for semantic labelling of document data and software methods (2018)
Conference Proceeding
Clausner, C., & Antonacopoulos, A. (2018). Ontology and framework for semantic labelling of document data and software methods. https://doi.org/10.1109/DAS.2018.46

We present a metadata labelling framework for datasets, software tools, and workflows. An ontology for document image analysis was developed with deep support for historical data. An accompanying open source software framework was implemented to enab...

Document analysis and text recognition (2018)
Book
(2018). V. Märgner, U. Pal, & A. Antonacopoulos (Eds.), Document analysis and text recognition. World Scientific. https://doi.org/10.1142/10689

The compendium presents the latest results of the most prominent competitions held in the field of Document Analysis and Text Recognition. It includes a description of the participating systems and the underlying methods on one hand and the datasets...

Study protocol : responding to the needs of patients with IgA nephropathy, a social media approach (2017)
Journal Article
Graham-Brown, M., Vasilica, C., Oates, T., Light, B., Clausner, C., Antonacopoulos, A., …Barratt, J. (2017). Study protocol : responding to the needs of patients with IgA nephropathy, a social media approach. Clinical Kidney Journal, 11(4), 474-478. https://doi.org/10.1093/ckj/sfx131

Background: IgA nephropathy is the most common cause of glomerulonephritis in the Western world and predominantly affects young adults. Demographically these patients are the biggest users of social media. With increasing numbers of patients turning...

ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017 (2017)
Conference Proceeding
Clausner, C., Antonacopoulos, A., & Pletschacher, S. (2017). ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017. https://doi.org/10.1109/ICDAR.2017.229

This paper presents an objective comparative evaluation of page segmentation and region classification methods for documents with complex layouts. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context o...