A survey of automatic speech recognition deep models performance for Polish medical terms

Marta Zielonka; Wiktor Krasiński; Jakub Nowak; Przemysław Rośleń; Jan Stopiński; Mateusz Żak; Franciszek Górski; Andrzej Czyżewski

Abstract

One of the many applications of speech-to-text technology is supporting the documentation produced by medical personnel. Many speech recognition systems are available to doctors, but their effectiveness in languages such as Polish still needs to be verified. In connection with our project in this field, we checked how well popular speech recognition systems perform when using models trained for general Polish. For this purpose, we selected 100 words from the International Classification of Diseases dictionary, the Polish-language version of the International Statistical Classification of Diseases and Health Problems. The words were read into a microphone by five women and five men, and were also generated with a speech synthesizer using one male and one female voice. This resulted in 1,200 recordings, which were tested with the following systems: Whisper, Google speech-to-text, and Microsoft Azure speech-to-text. Word recognition performance was measured with the following metrics: word error rate (WER), word information lost (WIL), Levenshtein distance, Jaccard distance, match error rate (MER), and character error rate (CER). The results show that Azure speech-to-text achieved the highest efficiency in most cases. However, none of the tested models is ready for filling in medical records by voice, describing cases, or prescribing treatment, because the number of errors made when converting speech to text is too high.
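The metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' evaluation code. It assumes the commonly used definitions: WER, MER, and WIL computed from word-level hits, substitutions, deletions, and insertions; Levenshtein distance and CER computed at character level; and Jaccard distance computed over character sets, which is only one of several possible choices. The function names `align_counts` and `asr_metrics` and the sample phrase are hypothetical and are not taken from the study's data.

```python
def align_counts(ref, hyp):
    """Return (hits, substitutions, deletions, insertions) for one
    minimum-edit alignment of two sequences (lists of words or strings)."""
    n, m = len(ref), len(hyp)
    # dist[i][j] = edit distance between ref[:i] and hyp[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i
    for j in range(1, m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dist[i][j] = min(dist[i - 1][j - 1] + cost,  # match or substitution
                             dist[i - 1][j] + 1,         # deletion
                             dist[i][j - 1] + 1)         # insertion
    # Walk back through the table and count each kind of operation.
    hits = subs = dels = ins = 0
    i, j = n, m
    while i > 0 or j > 0:
        same = i > 0 and j > 0 and ref[i - 1] == hyp[j - 1]
        if i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + (0 if same else 1):
            if same:
                hits += 1
            else:
                subs += 1
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            dels += 1
            i -= 1
        else:
            ins += 1
            j -= 1
    return hits, subs, dels, ins


def asr_metrics(reference, hypothesis):
    """Compute the six metrics for one non-empty reference transcript."""
    h, s, d, i = align_counts(reference.split(), hypothesis.split())
    n_ref, n_hyp = h + s + d, h + s + i
    _, cs, cd, ci = align_counts(reference, hypothesis)  # character level
    lev = cs + cd + ci
    ref_set, hyp_set = set(reference), set(hypothesis)
    return {
        "wer": (s + d + i) / n_ref,
        "mer": (s + d + i) / (h + s + d + i),
        "wil": 1 - (h * h) / (n_ref * n_hyp) if n_hyp else 1.0,
        "levenshtein": lev,
        "cer": lev / len(reference),
        # Assumption: Jaccard distance over sets of characters.
        "jaccard": 1 - len(ref_set & hyp_set) / len(ref_set | hyp_set),
    }


# Recording count described in the abstract: 100 words, 10 speakers, 2 synthetic voices.
n_recordings = 100 * (5 + 5 + 2)

# Illustrative phrase only: the recognizer drops the diacritic in "płuc".
metrics = asr_metrics("zapalenie płuc", "zapalenie pluc")
print(n_recordings, metrics)
```

For a single-word recording, WER is 0 when the word is recognized exactly and at least 1 otherwise, which is why character-level measures such as CER and Levenshtein distance give a finer picture of near misses like a lost diacritic.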

Details

Category:

Conference activity

Type:

publication in a peer-reviewed collective volume (including conference proceedings)

Language:

English

Publication year:

2023

Bibliographic description:

Zielonka M., Krasiński W., Nowak J., Rośleń P., Stopiński J., Żak M., Górski F., Czyżewski A.: A survey of automatic speech recognition deep models performance for Polish medical terms, 2023.

Sources of funding:

Project Adaptive intelligent speech processing system of medical personnel with the structuring of test results and support of therapeutic process

Verified by:

Gdańsk University of Technology
