A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture

Rimah Amami; Rim Amami; Chiraz Trabelsi; Sherin Hassan Mabrouk; Hassan A. Khalil

doi:10.13164/mendel.2023.2.202

Rimah Amami^*, Rim Amami, Chiraz Trabelsi, Sherin Hassan Mabrouk, Hassan A. Khalil

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Voice recognition systems have become increasingly important in recent years due to the growing need for more efficient and intuitive human-machine interfaces. The use of hybrid LSTM networks and deep learning has been very successful in improving speech detection systems. The aim of this paper is to develop a novel approach for the detection of voice pathologies using a hybrid deep learning model that combines the Bidirectional Long Short-Term Memory (BiLSTM) and the Convolutional Neural Network (CNN) architectures. The proposed model uses a combination of temporal and spectral features extracted from speech signals to detect the different types of voice pathologies. The performance of the proposed detection model is evaluated on a publicly available dataset of speech signals from individuals with various voice pathologies (MEEI database). The experimental results showed that the hybrid BiLSTM-CNN model outperforms several classifiers, achieving an accuracy of 98.86%. The proposed model has the potential to assist health care professionals in the accurate diagnosis and treatment of voice pathologies, and to improve the quality of life for affected individuals.

Original language	English
Pages (from-to)	202-210
Number of pages	9
Journal	Mendel
Volume	29
Issue number	2
DOIs	https://doi.org/10.13164/mendel.2023.2.202
State	Published - 20 Dec 2023

Keywords

BiLSTM
Convolutional Neural Network
Hybrid Systems
MEEI Voice Disorders Database
Voice Pathology Detection

Cite this

@article{b4cde88271d84c0a977a483cdf0fb686,
  title = "A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture",
  abstract = "Voice recognition systems have become increasingly important in recent years due to the growing need for more efficient and intuitive human-machine interfaces. The use of hybrid LSTM networks and deep learning has been very successful in improving speech detection systems. The aim of this paper is to develop a novel approach for the detection of voice pathologies using a hybrid deep learning model that combines the Bidirectional Long Short-Term Memory (BiLSTM) and the Convolutional Neural Network (CNN) architectures. The proposed model uses a combination of temporal and spectral features extracted from speech signals to detect the different types of voice pathologies. The performance of the proposed detection model is evaluated on a publicly available dataset of speech signals from individuals with various voice pathologies (MEEI database). The experimental results showed that the hybrid BiLSTM-CNN model outperforms several classifiers, achieving an accuracy of 98.86\%. The proposed model has the potential to assist health care professionals in the accurate diagnosis and treatment of voice pathologies, and to improve the quality of life for affected individuals.",
  keywords = "BiLSTM, Convolutional Neural Network, Hybrid Systems, MEEI Voice Disorders Database, Voice Pathology Detection",
  author = "Rimah Amami and Rim Amami and Chiraz Trabelsi and Mabrouk, {Sherin Hassan} and Khalil, {Hassan A.}",
  year = "2023",
  month = dec,
  day = "20",
  doi = "10.13164/mendel.2023.2.202",
  language = "English",
  volume = "29",
  pages = "202--210",
  journal = "Mendel",
  issn = "1803-3814",
  number = "2",
}

TY - JOUR
T1 - A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture
AU - Amami, Rimah
AU - Amami, Rim
AU - Trabelsi, Chiraz
AU - Mabrouk, Sherin Hassan
AU - Khalil, Hassan A.
PY - 2023/12/20
Y1 - 2023/12/20
AB - Voice recognition systems have become increasingly important in recent years due to the growing need for more efficient and intuitive human-machine interfaces. The use of hybrid LSTM networks and deep learning has been very successful in improving speech detection systems. The aim of this paper is to develop a novel approach for the detection of voice pathologies using a hybrid deep learning model that combines the Bidirectional Long Short-Term Memory (BiLSTM) and the Convolutional Neural Network (CNN) architectures. The proposed model uses a combination of temporal and spectral features extracted from speech signals to detect the different types of voice pathologies. The performance of the proposed detection model is evaluated on a publicly available dataset of speech signals from individuals with various voice pathologies (MEEI database). The experimental results showed that the hybrid BiLSTM-CNN model outperforms several classifiers, achieving an accuracy of 98.86%. The proposed model has the potential to assist health care professionals in the accurate diagnosis and treatment of voice pathologies, and to improve the quality of life for affected individuals.
KW - BiLSTM
KW - Convolutional Neural Network
KW - Hybrid Systems
KW - MEEI Voice Disorders Database
KW - Voice Pathology Detection
UR - https://www.scopus.com/pages/publications/85177026265
U2 - 10.13164/mendel.2023.2.202
DO - 10.13164/mendel.2023.2.202
M3 - Article
AN - SCOPUS:85177026265
SN - 1803-3814
VL - 29
SP - 202
EP - 210
JO - Mendel
JF - Mendel
IS - 2
ER -
