An automated speech recognition and feature selection approach based on improved Northern Goshawk optimization

Automatic speech recognition (ASR) approach is dependent on optimal speech feature extraction, which attempts to get a parametric depiction of an input speech signal. Feature extraction (FE) strategy combined with a feature selection (FS) approach should capture the most important features of the si...

Full description

Bibliographic Details
Main Authors: Suryakumar, Santosh Kumar, Hiremath, Bharathi S., Mohankumar, Nageswara Guptha
Format: Article in Journal/Newspaper
Language:English
Published: Institute of Advanced Engineering and Science 2024
Subjects:
Online Access:https://ijai.iaescore.com/index.php/IJAI/article/view/22771
https://doi.org/10.11591/ijai.v13.i1.pp296-304
Description
Summary:Automatic speech recognition (ASR) approach is dependent on optimal speech feature extraction, which attempts to get a parametric depiction of an input speech signal. Feature extraction (FE) strategy combined with a feature selection (FS) approach should capture the most important features of the signal while discarding the rest. FS is a crucial process that can affect the pattern classification and recognition system's performance. In this research, we introduce a hybrid supervised learning using metaheuristic technique for optimum FE and FS termed Northern Goshawk optimization (NGO) and opposition-based learning (OBL). Pre-processing, feature extraction and selection, and recognition are the three steps of the proposed technique. The pre-processing is done first to lessen the amount of noise. In the FE stage, we extract features. The OBL-NGO method is used to pick the best collection of extracted characteristics. Finally, these optimised features are utilised to train the k-nearest neighbour (KNN) classifier, and the matching text is shown as the output based on these optimised characteristics of the provided input audio signal. The system's performance is outstanding, and the suggested OBL-NGO is best suited for ASR, according to the testing data.