Neural network architectures in data sequence analysis: diagnosis of Reinke's oedema and Laryngeal polyps

Jarosław Szkoła

University of Rzeszow


Abstrakt

In this review paper, we focus our attention on presenting selected neural network architectures dedicated to the analysis of sequential data, in particular to support the diagnosis of Reinke’s oedema and laryngeal polyps. The research discussed here is located in the area of clinical computer decision support systems (CDS) based on the use of artificial recurrent neural networks (RNNs) for speech signal analysis. RNNs are able to predict time series due to their memory and local recurrent connections. In the experimental part, Elman-Jordan artificial neural networks are used, whose characteristics are speed and accuracy in pattern learning allowing real-time decision-making. In the review presented here, one important theme is the use of Bezier curves for preprocessing the speech signal, leading to data reduction and noise elimination. Elman-Jordan networks significantly speed up the learning process and show high classification accuracy in laryngopathy diagnosis.


Słowa kluczowe:

clinical decision support, recurrent neural networks, Reinke’s oedema, laryngeal polyps, Bézier curves, speech signal analysis, temporal patterns, classification


BAKATOR M., RADOSAV D. 2018. Deep Learning and Medical Diagnosis: A Review of Literature. Multimodal Technologies and Interaction, 2(3):47. https://doi.org/10.3390/mti2030047   Google Scholar

ELMAN J. 1993. Distributed Representations, Simple Recurrent Networks, and Grammatical Structure. Machine Learning, 7: 195-225. https://doi.org/10.1007/BF00114844   Google Scholar

JORDAN M.I. 1986. Serial Order: A Parallel Distributed Processing Approach. ICS Report 8604. Institute for Cognitive Science University of California, San Diego.   Google Scholar

MEHRISH A., MAJUMDER N., BHARADWAJ R., MIHALCEA R., PORIA S. 2023. A Review of Deep Learning Techniques for Speech Processing. Information Fusion, 99: 101869. https://doi.org/10.1016/j.inffus.2023.101869   Google Scholar

SFAYYIH A.H., SABRY A.H., JAMEEL S.M., SULAIMAN N., RAAFAT S.M., HUMAIDI A.J., KUBAIAISI Y.M.A. 2023a. Acoustic-Based Deep Learning Architectures for Lung Disease Diagnosis: A Comprehensive Overview. Diagnostics, 13(10):1748. https://doi.org/10.3390/diagnostics13101748   Google Scholar

SFAYYIH A.H., SULAIMAN N., SABRY A.H. 2023b. A Review on Lung Disease Recognition by Acoustic Signal Analysis with Deep Learning Networks. Journal of Big Data, 10(1), Article number 101. https://doi.org/10.1186/s40537-023-00762-z   Google Scholar

SZKOLA J., PANCERZ K., WARCHOL J. 2010a. Computer Diagnosis of Laryngopathies Based on Temporal Pattern Recognition in Speech Signal. Bio-Algorithms and Med-Systems, 6(12): 75-80.   Google Scholar

SZKOŁA J., PANCERZ K., WARCHOŁ J. 2010b. Computer-Based Clinical Decision Support for Laryngopathies Using Recurrent Neural Networks. 10th International Conference on Intelligent Systems Design and Applications, Cairo, Egypt, p. 627-632. https://doi.org/10.1109/ISDA.2010.5687195   Google Scholar

SZKOŁA J., PANCERZ K., WARCHOŁ J. 2011a. Improving Learning Ability of Recurrent Neural Networks: Experiments on Speech Signals of Patients with Laryngopathies. In: F. Babiloni, A. Fred, J. Filipe, H. Gamboa (Eds.) Proceedings of the International Conference on Bio-Inspired Systems and Signal Processing (BIOSIGNALS ‘11). SciTePress – Science and Technology Publications, Setúbal, p. 360-364. https://doi.org/10.5220/0003292603600364   Google Scholar

SZKOŁA J., PANCERZ K., WARCHOŁ J. 2011b. A Bezier Curve Approximation of the Speech Signal in the Classification Process of Laryngopathies. Federated Conference on Computer Science and Information Systems (FedCSIS), Szczecin, p. 141-146. Retrieved from https://annals-csis.org/proceedings/2011/pliks/221.pdf   Google Scholar

TANVEER M., RASTOGI A., PALIWAL V., GANAIE M.A., MALIK A.K., DEL SER J., LIN C.-T. 2023. Ensemble Deep Learning in Speech Signal Tasks: A Review. Neurocomputing, 550: 126436. https://doi.org/10.1016/J.NEUCOM.2023.126436   Google Scholar

ZAIDI B., SELOUANI S., BOUDRAA M., SIDI YAKOUB M. 2021. Deep Neural Network Architectures for Dysarthric Speech Analysis and Recognition. Neural Computing and Applications, 33: 9089-9108.   Google Scholar


Opublikowane
10-06-2024

Cited By /
Share

Szkoła, J. (2024). Neural network architectures in data sequence analysis: diagnosis of Reinke’s oedema and Laryngeal polyps. Technical Sciences, 27(27), 113–125. https://doi.org/10.31648/ts.10228

Jarosław Szkoła 
University of Rzeszow



Licencja

Creative Commons License

Utwór dostępny jest na licencji Creative Commons Uznanie autorstwa 4.0 Międzynarodowe.





-->