Skip to main navigation Skip to search Skip to main content

Ensemble of Artificial Intelligence Techniques for Bacterial Antimicrobial Resistance (AMR) Estimation Using Topic Modeling and Similarity Measure

  • Vellore Institute of Technology
  • Noroff University College

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

In recent times, bacterial Antimicrobial Resistance (AMR) analyses becomes a hot study topic. The AMR comprises information related to the antibiotic product name, class name, subclass name, type, subtype, gene type, etc., which can fight against the illness. However, the tagging language used to determine the data is of free context. These contexts often contain ambiguous data, which leads to a hugely challenging issue in retrieving, organizing, merging, and finding the relevant data. Manually reading this text and labelling is not time-consuming. Hence, topic modeling overcomes these challenges and provides efficient results in categorizing the topic and in determining the data. In this view, this research work designs an ensemble of artificial intelligence for categorizing the AMR gene data and determine the relationship between the antibiotics. The proposed model includes a weighted voting based ensemble model by the incorporation of Latent Dirichlet Allocation (LDA) and Hierarchical Recurrent Neural Networks (HRNN), shows the novelty of the work. It is used for determining the amount of "topics"that cluster utilizing a multidimensional scaling approach. In addition, the proposed model involves the data pre-processing stage to get rid of stop words, punctuations, lower casing, etc. Moreover, an explanatory data analysis uses word cloud which assures the proper functionality and to proceed with the model training process. Besides, three approaches namely perplexity, Harmonic mean, and Random initialization of K are employed to determine the number of topics. For experimental validation, an openly accessible Bacterial AMR reference gene database is employed. The experimental results reported that the perplexity provided the optimal number of topics from the AMR gene data of more than 6500 samples.

Original languageEnglish
Pages (from-to)541-565
Number of pages25
JournalInternational Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
Volume30
Issue number3
DOIs
StatePublished - 1 Jun 2022
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Antimicrobial Resistance
  • Latent Dirichlet Allocation
  • similarity measure
  • tagging
  • text categorization
  • topic modeling

Fingerprint

Dive into the research topics of 'Ensemble of Artificial Intelligence Techniques for Bacterial Antimicrobial Resistance (AMR) Estimation Using Topic Modeling and Similarity Measure'. Together they form a unique fingerprint.

Cite this