IDPP: Imbalanced datasets pipelines in Pyrus
We showcase and demonstrate IDPP, a Pyrus-based tool that offers a collection of pipelines for the analysis of imbalanced datasets. Like Pyrus, IDPP is a web-based, low-code/no-code graphical modelling environment for ML and data analytics applications. On a case study from the medical domain, we solve the challenge of re-using AI/ML models that do not address data with imbalanced class by implementing ML algorithms in Python that do the re-balancing. We then use these algorithms and the original ML models in the IDPP pipelines. With IDPP, our low-code development approach to balance datasets for AI/ML applications can be used by non-coders. It simplifies the data-preprocessing stage of any AI/ML project pipeline, which can potentially improve the performance of the models. The tool demo will showcase the low-code implementation and no-code reuse and repurposing of AI-based systems through end-to end Pyrus pipelines.
Funding
SFI Centre for Research Training in Artificial Intelligence
Science Foundation Ireland
Find out more...History
Publication
Engineering of Computer-Based Systems: 8th International Conference, ECBS 2023, Västerås, Sweden, October 16–18, 2023, ProceedingsOct 2023, pp. 60-69Publisher
Association for Computing MachineryRights
"© ACM, 2023 This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Engineering of Computer-Based Systems: 8th International Conference, ECBS 2023, Västerås, Sweden, October 16–18, 2023, ProceedingsOct 2023, pp. 60-69, https://doi.org/10.1007/978-3-031-49252-5_6Sustainable development goals
- (4) Quality Education
External identifier
Department or School
- Mathematics & Statistics