Szymon Zaporowski - Research Profile

Szymon Zaporowski, MSc Eng.

Employment

IT Specialist, Department of Multimedia Systems
Assistant, Department of Multimedia Systems

Business contact

Centrum Transferu Wiedzy i Technologii (Centre for Knowledge and Technology Transfer)

Location: Al. Zwycięstwa 27, 80-219 Gdańsk
Phone: +48 58 348 62 62
E-mail: biznes@pg.edu.pl

Selected publications

Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
- D. Korzekwa
- J. Lorenzo-Trueba
- S. Zaporowski
- S. Calamaro
- T. Drugman
- B. Kostek
- Year 2021
A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare them to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

Full text available for download from an external service

Comparison of the Ability of Neural Network Model and Humans to Detect a Cloned Voice
- Electronics - Year 2023
The vulnerability of the speaker identity verification system to attacks using voice cloning was examined. The research project assumed creating a model for verifying the speaker’s identity based on voice biometrics and then testing its resistance to potential attacks using voice cloning. The Deep Speaker Neural Speaker Embedding System was trained, and the Real-Time Voice Cloning system was employed based on the SV2TTS, Tacotron,...

Full text available for download in the portal

Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
- D. Korzekwa
- R. Barra-Chicote
- S. Zaporowski
- G. Beringer
- J. Lorenzo-Trueba
- A. Serafinowicz
- J. Droppo
- T. Drugman
- B. Kostek
- Year 2021
This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

Full text available for download in the portal