Amazigh Spoken Digit Recognition using a Deep Learning Approach based on MFCC

Hossam boulal; Mohamed Hamidi; Mustapha Abarkan; Jamal barkani

doi:10.32985/ijeces.14.7.6

Authors

Hossam boulal Sidi Mohamed Ben Abdellah University of Fez Multidisciplinary faculty of Taza, LSI Laboratory Taza, Morocco. https://orcid.org/0009-0005-0749-508X
Mohamed Hamidi Mohamed First University of Oujda, Multidisciplinary faculty of Nador, Team of modeling and scientifc computing Nador, Morocco. https://orcid.org/0000-0003-2487-5517
Mustapha Abarkan Sidi Mohamed Ben Abdellah University of Fez Multidisciplinary faculty of Taza, LSI Laboratory Taza, Morocco. https://orcid.org/0009-0006-4639-6292
Jamal barkani Sidi Mohamed Ben Abdellah University of Fez Multidisciplinary faculty of Taza, LSI Laboratory Taza, Morocco. https://orcid.org/0000-0002-5670-5080

DOI:

https://doi.org/10.32985/ijeces.14.7.6

Keywords:

CNN, Deep learning, digits, Amazigh, Speech, ASR

Abstract

The field of speech recognition has made human-machine voice interaction more convenient. Recognizing spoken digits is particularly useful for communication that involves numbers, such as providing a registration code, cellphone number, score, or account number. This article discusses our experience with Amazigh's Automatic Speech Recognition (ASR) using a deep learning- based approach. Our method involves using a convolutional neural network (CNN) with Mel-Frequency Cepstral Coefficients (MFCC) to analyze audio samples and generate spectrograms. We gathered a database of numerals from zero to nine spoken by 42 native Amazigh speakers, consisting of men and women between the ages of 20 and 40, to recognize Amazigh numerals. Our experimental results demonstrate that spoken digits in Amazigh can be recognized with an accuracy of 91.75%, 93% precision, and 92% recall. The preliminary outcomes we have achieved show great satisfaction when compared to the size of the training database. This motivates us to further enhance the system's performance in order to attain a higher rate of recognition. Our findings align with those reported in the existing literature.

Amazigh Spoken Digit Recognition using a Deep Learning Approach based on MFCC

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Information

Make a Submission

JCR Impact factor for 2024

0.9