University of Limerick
Browse

From Data science to modular workflows changing perspectives from data to platform: DBDIrl 1864-1922 Case study

Download (2.27 MB)
journal contribution
posted on 2024-11-22, 12:08 authored by Edna OSheaEdna OShea, Marco Krumrey, Daniel Sami Mitwalli, Sebastian Teumert, Tiziana MargariaTiziana Margaria

Many historical data collections foot on handwritten documents and registers, whose consultation is often very difficult due to the conservation state of the physical artefacts, and whose comprehension is also made difficult by the handwriting, difficult to interpret, and the language used, different from the modern terminology. Therefore significant research efforts by historians, demographers, population health scientists and others have been started in the past with the aim of making such data collections digitally available, first on the basis of images and then as readily available repositories of transcribed data in electronically queryable formats. For the purpose of extracting data from the Irish Civil registers of deaths in the DBDIrl 1864-1922 project (https://www. dbdirl.com), an AI-ML Data Analytics Pipeline was proposed as a working approach validated on a subset of the data. However, the pipeline requires manual steps and it is not applicable as is on similar datasets without significant modifications to its inner workings. We are currently transforming this prototyped, single purpose product to a modular, fully automated workflow, intended to be used and reconfigured for new data in a low-code/no-code fashion by domain experts like historians. We explain our adopted analysis and refactoring process, illustrate it on part of the pipeline, including how we faced obstacles and handled pitfalls. We also evaluate its potential to become a methodical approach to transforming an interactive program to a fully automated process, in a low-code/no-code workflow style, that can be easily reused, reconfigured and extended to be able to tailor it to other datasets as needed.

Funding

SFI Centre for Research Training in Artificial Intelligence

Science Foundation Ireland

Find out more...

History

Publication

Bridging the Gap Between AI and Reality (AISoLA 2023), pp. 84-103

Publisher

Springer Nature

Note

Conference Bridging the Gap Between AI and Reality (AISoLA 2023)

Other Funding information

Science Foundation Ireland (SFI) under grants number 18/CRT/6223 574 (SFI Centre of Research Training in AI) and 21/SPP/9979 (R@ISE).

Also affiliated with

  • Health Research Institute (HRI)
  • LERO - The Science Foundation Ireland Research Centre for Software

Department or School

  • Computer Science & Information Systems

Usage metrics

    University of Limerick

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC