FEATURE SELECTION FOR ARABIC MISPRONUNCIATION DETECTION BASED ON SEQUENTIAL FLOATING FORWARD SELECTION AND DATA MINING CLASSIFIERS

Authors

  • M. Maqsood Software Engineering Department, University of Engineering and Technology Taxila, Pakistan

DOI:

https://doi.org/10.57041/pjs.v68i4.230

Keywords:

Sequential Floating Forward Selection and Mispronunciation detection, Acoustic-phonetic features, Feature selection

Abstract

Feature selection process is used to reduce the feature vector length and identify the
discriminative features. Many acoustic-phonetic features including Mel-Frequency Cepstral
Coefficient (MFCC), Energy, Pitch, Zero-crossing, spectrum were tested individually for Arabic
mispronunciation detection using three classifiers; Random Forest, Bayesian classifier, and Bagged
Support Vector Machine (SVM). The results for Bagged SVM were better than the other two
classifiers. Top three individual features with highest accuracies were identified for each isolated
Arabic consonant. To validate the results, a modified form of Sequential Floating Forward Selection
(SFFS) process was used. Results showed that MFCC along with its first and second derivatives,
energy, spectrum, and zero-crossing were the most suitable acoustic features for Arabic
mispronunciation detection system. The proposed approach provided an average accuracy of 94.9%
which was better than the previous best 92.95% for Arabic consonants.

Downloads

Published

2023-01-05

How to Cite

M. Maqsood. (2023). FEATURE SELECTION FOR ARABIC MISPRONUNCIATION DETECTION BASED ON SEQUENTIAL FLOATING FORWARD SELECTION AND DATA MINING CLASSIFIERS. Pakistan Journal of Science, 68(4). https://doi.org/10.57041/pjs.v68i4.230