From Data science to modular workflows changing perspectives from data to platform: DBDIrl 1864-1922 Case study
Many historical data collections foot on handwritten documents and registers, whose consultation is often very difficult due to the conservation state of the physical artefacts, and whose comprehension is also made difficult by the handwriting, difficult to interpret, and the language used, different from the modern terminology. Therefore significant research efforts by historians, demographers, population health scientists and others have been started in the past with the aim of making such data collections digitally available, first on the basis of images and then as readily available repositories of transcribed data in electronically queryable formats. For the purpose of extracting data from the Irish Civil registers of deaths in the DBDIrl 1864-1922 project (https://www. dbdirl.com), an AI-ML Data Analytics Pipeline was proposed as a working approach validated on a subset of the data. However, the pipeline requires manual steps and it is not applicable as is on similar datasets without significant modifications to its inner workings. We are currently transforming this prototyped, single purpose product to a modular, fully automated workflow, intended to be used and reconfigured for new data in a low-code/no-code fashion by domain experts like historians. We explain our adopted analysis and refactoring process, illustrate it on part of the pipeline, including how we faced obstacles and handled pitfalls. We also evaluate its potential to become a methodical approach to transforming an interactive program to a fully automated process, in a low-code/no-code workflow style, that can be easily reused, reconfigured and extended to be able to tailor it to other datasets as needed.
Funding
SFI Centre for Research Training in Artificial Intelligence
Science Foundation Ireland
Find out more...History
Publication
Bridging the Gap Between AI and Reality (AISoLA 2023), pp. 84-103Publisher
Springer NatureNote
Conference Bridging the Gap Between AI and Reality (AISoLA 2023)Other Funding information
Science Foundation Ireland (SFI) under grants number 18/CRT/6223 574 (SFI Centre of Research Training in AI) and 21/SPP/9979 (R@ISE).Also affiliated with
- Health Research Institute (HRI)
- LERO - The Science Foundation Ireland Research Centre for Software
External identifier
Department or School
- Computer Science & Information Systems