University of Limerick
Browse

Multi-objective classification and feature selection of Covid-19 proteins sequences using NSGA-II and MAP-elites

Download (619.49 kB)
journal contribution
posted on 2022-11-29, 12:15 authored by Vijay Sambhe, Shanmukha Rajesh, ENRIQUE NAREDOENRIQUE NAREDO, Douglas Mota DiasDouglas Mota Dias, Meghana KshirsagarMeghana Kshirsagar, Conor RyanConor Ryan

The advent of the Covid-19 pandemic has resulted in a global crisis making the health systems vulnerable, challenging the research community to find novel approaches to facilitate early detection of infections. This open-up a window of opportunity to exploit machine learning and artificial intelligence techniques to address some of the issues related to this disease. In this work, we address the classification of ten SARS-CoV-2 protein sequences related to Covid-19 using k-mer frequency as features and considering two objectives; classification performance and feature selection. The first set of experiments considered the objectives one at the time, four techniques were used for the feature selection and twelve well known machine learning methods, where three are neural network based for the classification. The second set of experiments considered a multi-objective approach where we tested a well known multi-objective approach Non-dominated Sorting Genetic Algorithm II (NSGA-II), and the Multi-dimensional Archive of Phenotypic Elites (MAP-Elites), which considers quality+diversity containers to guide the search through elite solutions. The experimental results shows that ResNet and PCA is the best combination using single objectives. Whereas, for the mulit-classification, NSGA-II outperforms ME with two out of three classifiers, while ME gets competitive results bringing more diverse set of solutions.

Funding

SFI Centre for Research Training in Artificial Intelligence

Science Foundation Ireland

Find out more...

Automatic Design of Digital Circuits (ADDC)

Science Foundation Ireland

Find out more...

History

Publication

Proceedings of the 13th International Conference on Agents and Artificial Intelligence (ICAART 2021), 2, pp. 246-256

Publisher

SciTePress

Other Funding information

The fourth author is partially financed by the Coordenac¸ao˜ de Aperfeic¸oamento de Pessoal de N´ıvel Superior - Brasil (CAPES) - Finance Code 001.

Also affiliated with

  • LERO - The Irish Software Research Centre

Department or School

  • Computer Science & Information Systems

Usage metrics

    University of Limerick

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC