The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish
Abstract
The article presents preliminary experiments on how foreign accents affect the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved cloning the voices of selected speakers and adding prosodic contours with Russian and German accents. The resulting samples were then transcribed with all available models from the Whisper family, and the output was compared with the original transcription. The results of these initial experiments suggest that Whisper models struggle with foreign accents in Polish speech containing medical terminology. This highlights the need for further research aimed at improving ASR systems for foreign accents and medical terminology.
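The comparison step described in the abstract can be illustrated with a short sketch. The abstract does not say which metric was used, so word error rate (WER), a common choice for ASR evaluation, is assumed here. The function name and the example sentences are hypothetical and are not taken from the study's data.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: WER is used as the comparison metric; the abstract does not
    name one. Text is lowercased and split on whitespace only.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one misrecognized word out of six.
original = "pacjent zgłasza ból w klatce piersiowej"
transcribed = "pacjent zgłasza bul w klatce piersiowej"
wer = word_error_rate(original, transcribed)
```

In a setup like the one described, such a function would be applied to each accented sample and each Whisper model size, so that error rates can be compared across accents and models.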
Full text
- Publication version
- Accepted or Published Version
- DOI:
- 10.62036/ISD.2024.110
Details
- Category:
- Conference activity
- Type:
- publication in a peer-reviewed collective work (including conference proceedings)
- Language:
- English
- Publication year:
- 2024
- Bibliographic description:
- Zaporowski S.: The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish, 2024
- Verified by:
- Gdańsk University of Technology