Unearthing the recent past : digitising and understanding statistical information from census tables
Clausner, C; Hayes, J; Antonacopoulos, A; Pletschacher, S
Authors
Mr Christian Clausner C.Clausner@salford.ac.uk
Senior Research Fellow
J Hayes
Prof Apostolos Antonacopoulos A.Antonacopoulos@salford.ac.uk
Professor
Mr Stefan Pletschacher S.Pletschacher@salford.ac.uk
Lecturer
Abstract
Censuses comprise a wealth of information at a large (national) scale that allows governments (who commission them) and the public to obtain a detailed snapshot of how people live (geographical distribution and characteristics). In addition to underpinning socioeconomic research, the study of historical Census statistics provides a unique opportunity to understand several characteristics of a country and its heritage. This paper presents an overview of the background, challenges, implemented pre-processing, recognition and post-processing pipeline, and the information-rich results obtained through a pilot digitisation project on the 1961 Census of England and Wales (the first census for which computers were used to process data and output very detailed information, a vital part of which is only available in the form of degraded historical computer printouts). The experience gained and the resulting methodology can also be used for digitising and understanding tabular information in a large variety of application scenarios.
Citation
Clausner, C., Hayes, J., Antonacopoulos, A., & Pletschacher, S. (2017). Unearthing the recent past : digitising and understanding statistical information from census tables. In Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage - DATeCH2017. https://doi.org/10.1145/3078081.3078106
Conference Name | Second International Conference on Digital Access to Textual Cultural Heritage (DATeCH 2017) |
---|---|
Conference Location | Goettingen, Germany |
Start Date | Jun 1, 2017 |
End Date | Jun 2, 2017 |
Publication Date | Jun 1, 2017 |
Deposit Date | Aug 7, 2017 |
Book Title | Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage - DATeCH2017 |
ISBN | 9781450352659 |
DOI | https://doi.org/10.1145/3078081.3078106 |
Related Public URLs | http://ddays.digitisation.eu/datech-2017/ |