A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Voice recognition systems have become increasingly important in recent years due to the growing need for more efficient and intuitive human-machine interfaces. The use of Hybrid LSTM networks and deep learning has been very successful in improving speech detection systems. The aim of this paper is to develop a novel approach for the detection of voice pathologies using a hybrid deep learning model that combines the Bidirectional Long Short-Term Memory (BiLSTM) and the Convolutional Neural Network (CNN) architectures. The proposed model uses a combination of temporal and spectral features extracted from speech signals to detect the different types of voice pathologies. The performance of the proposed detection model is evaluated on a publicly available dataset of speech signals from individuals with various voice pathologies(MEEI database). The experimental results showed that the hybrid BiLSTM-CNN model outperforms several classifiers by achieving an accuracy of 98.86%. The proposed model has the potential to assist health care professionals in the accurate diagnosis and treatment of voice pathologies, and improving the quality of life for affected individuals.

Original languageEnglish
Pages (from-to)202-210
Number of pages9
JournalMendel
Volume29
Issue number2
DOIs
StatePublished - 20 Dec 2023

Keywords

  • BiLSTM
  • Convolutional Neural Network
  • Hybrid Systems
  • MEEI Voice Disorders Database
  • Voice Pathology Detection

Fingerprint

Dive into the research topics of 'A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture'. Together they form a unique fingerprint.

Cite this