University of Limerick
Browse
OShea_2023_Tabular.pdf (3.05 MB)

Tabular corner detection in historical Irish records

Download (3.05 MB)
journal contribution
posted on 2023-11-15, 09:37 authored by Edna OSheaEdna OShea

The process of extracting relevant data from historical handwritten documents can be time-consuming and challenging. In Ireland, from 1864 to 1922, government records regarding births, deaths, and marriages were documented by local registrars using printed tabular structures. Leveraging this systematic approach, we employ a neural network capable of segmenting scanned versions of these record documents. We sought to isolate the corner points with the goal of extracting the vital tabular elements and transforming them into consistently structured standalone images. By achieving uniformity in the segmented images, we enable more accurate row and column segmentation, enhancing our ability to isolate and classify individual cell contents effectively. This process must accommodate varying image qualities, different tabular orientations and sizes resulting from diverse scanning procedures, as well as faded and damaged ink lines that naturally occur over time.

Funding

SFI Centre for Research Training in Artificial Intelligence

Science Foundation Ireland

Find out more...

Automatic Design of Digital Circuits (ADDC)

Science Foundation Ireland

Find out more...

History

Publication

DocEng '23: Proceedings of the ACM Symposium on Document Engineering 2023

Publisher

Association for Computing Machinery

Sustainable development goals

  • (15) Life On Land

Department or School

  • Computer Science & Information Systems

Usage metrics

    University of Limerick

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC