Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning
Abstrakt
The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection, noise profiling, and an adaptive strategy of selection the modification. The role of the first component is to detect the Lombard speech in the input signal to avoid unnecessary speech modifications when the speech is naturally Lombard in its character. The second module is noise profiling, as the type of noise strongly impacts the selection of the best modification. The last part of the system is the adaptive modification selection component. The selection is made based on the speech signal features, resulting in the most considerable speech quality improvement, measured with objective metrics. To solve the problem posed, machine learning was used in this dissertation – especially deep learning with convolutional neural networks and typical multilayer networks. It was proven that it is possible to create an adaptive system that would improve speech quality in the presence of noise in real-time or near real-time.
Autor (1)
Cytuj jako
Pełna treść
- Wersja publikacji
- Accepted albo Published Version
- Licencja
- Copyright (Author(s))
Słowa kluczowe
Informacje szczegółowe
- Kategoria:
- Doktoraty, rozprawy habilitacyjne, nostryfikacje
- Typ:
- praca doktorska pracowników zatrudnionych w PG oraz studentów studium doktoranckiego
- Język:
- angielski
- Rok wydania:
- 2023
- Weryfikacja:
- Politechnika Gdańska
wyświetlono 106 razy
Publikacje, które mogą cię zainteresować
Noise profiling for speech enhancement employing machine learning models
- K. Kąkol,
- G. Korvel,
- B. Kostek
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
- G. Korvel,
- K. Kąkol,
- P. Treigys
- + 1 autorów
Detecting Lombard Speech Using Deep Learning Approach
- K. Kąkol,
- G. Korvel,
- G. Tamulevicius
- + 1 autorów