Building a neural speech recognizer for Quranic recitations

  • Jordan University of Science and Technology
  • Birmingham City University
  • Faculty of Computers and Artificial Intelligence

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

This work is an effort towards building a neural speech recognizer (NSR) system for Quranic recitations that anyone can use effectively, regardless of gender or age. Although many recitations are available online, most are recorded by professional adult male reciters, which means that an ASR system trained on such datasets would not work for female or child reciters. We address this gap by adopting a benchmark dataset of audio recordings of Quranic recitations by reciters of both genders and different ages. Using this dataset, we build several speaker-independent NSR systems based on the DeepSpeech model and evaluate them using word error rate (WER). The goal is to show how an NSR system trained and tuned on recitations from one gender performs on a test set from the other gender. Unfortunately, the number of female recitations in our dataset is much smaller than the number of male recitations. In the first set of experiments, we avoid this gender imbalance by down-sampling the male part to match the female part. On this small subset, the system trained on male recitations yields 0.968 WER when tested on female recitations and 0.406 WER when tested on male recitations. Conversely, the system trained on female recitations yields 0.966 WER when tested on male recitations and 0.608 WER when tested on female recitations.
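For readers unfamiliar with the metric, the following is a minimal sketch of how WER and the down-sampling step could be computed. It is not the authors' code: the function names `word_error_rate` and `downsample_to_match` are hypothetical, the WER follows the standard definition (word-level edit distance divided by the number of reference words), and random sampling without replacement is an assumption about how the male subset might be reduced.

```python
import random


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Standard WER definition; tokenization by whitespace is an assumption.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def downsample_to_match(larger, smaller, seed=0):
    """Randomly pick len(smaller) items from `larger` (hypothetical helper).

    Raises ValueError if `larger` has fewer items than `smaller`.
    """
    rng = random.Random(seed)
    return rng.sample(list(larger), k=len(smaller))


# Placeholder tokens stand in for transcribed words.
print(word_error_rate("a b c d", "a x c"))  # one substitution + one deletion over 4 words
balanced_male = downsample_to_match(range(10), range(3))
print(len(balanced_male))
```

Because insertions count as errors, WER can exceed 1.0, so values near 0.97 indicate that almost every reference word was misrecognized. A corpus-level score is usually computed by summing edits over all utterances and dividing by the total number of reference words, rather than averaging per-utterance scores.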

Original language: English
Pages (from-to): 1131-1151
Number of pages: 21
Journal: International Journal of Speech Technology
Volume: 26
Issue number: 4
State: Published - Dec 2023
Externally published: Yes

Keywords

  • ASR
  • Dataset
  • DeepSpeech
  • Quran
  • Speech
  • WER
