Skip to main navigation Skip to search Skip to main content

Overview of the Mowjaz Multi-Topic Labelling Task

  • Mahmoud Al-Ayyoub
  • , Haitham Seelawi
  • , Mohamed Zaghlol
  • , Hussein T. Al-Natsheh
  • , Samer Suileman
  • , Ali Fadel
  • , Riham Badawi
  • , Ahmed Morsy
  • , Ibraheem Tuffaha
  • , Mohannad Aljarrah
  • Jordan University of Science and Technology
  • Mawdoo3 Ltd

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

Multilabel text classification is an important task in Natural Language Processing (NLP). One use case of such a task is in categorizing news articles, where each article may belong to one or more classes. In this work, we present the ICICS2021 Mowjaz Multi-Topic Labelling Task. Given a piece of news, systems participating in this task are expected to select its topic(s). The systems are evaluated based on the F1 score measure. In total, 46 teams registered on the task's CodaLab page. Out of them, 28 teams submitted 309 runs. The results are surprisingly high. Moreover, they are very close to each other with all teams having systems achieving F1 scores ranging between 0.7965 and 0.8567. Most of these systems used deep learning models, such as Recurrent Neural Networks (RNN), coupled with pretrained word embeddings such as BERT-based models. Few of them experimented with traditional machine learning models such as Support Vector Machine (SVM) and Naive Bayes (NB).

Original languageEnglish
Title of host publication2021 12th International Conference on Information and Communication Systems, ICICS 2021
EditorsMohammad Alsmirat, Abdallah Almaaitah, Yaser Jararweh, Jaime Lloret Mauri
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages502-508
Number of pages7
ISBN (Electronic)9781665433518
DOIs
StatePublished - 24 May 2021
Externally publishedYes
Event12th International Conference on Information and Communication Systems, ICICS 2021 - Virtual, Valencia, Spain
Duration: 24 May 202126 May 2021

Publication series

Name2021 12th International Conference on Information and Communication Systems, ICICS 2021

Conference

Conference12th International Conference on Information and Communication Systems, ICICS 2021
Country/TerritorySpain
CityVirtual, Valencia
Period24/05/2126/05/21

Keywords

  • AraBERT
  • AraVec
  • Arabic BERT
  • GRU
  • GigaBERT
  • LSTM
  • Multi-label Text Classification
  • RNN
  • SVM

Fingerprint

Dive into the research topics of 'Overview of the Mowjaz Multi-Topic Labelling Task'. Together they form a unique fingerprint.

Cite this