mgr inż. Szymon Zaporowski
Zatrudnienie
- Specjalista informatyk w Katedra Systemów Multimedialnych
- Asystent w Katedra Systemów Multimedialnych
Kontakt dla biznesu
- Lokalizacja
- Al. Zwycięstwa 27, 80-219 Gdańsk
- Telefon
- +48 58 348 62 62
- biznes@pg.edu.pl
Media społecznościowe
Kontakt
- szyzapor@pg.edu.pl
Specjalista informatyk
- Miejsce pracy
-
Budynek A Elektroniki
pokój EA 726 otwiera się w nowej karcie - Telefon
- 58-348-63-32
- smck@sound.eti.pg.gda.pl
Asystent
- Miejsce pracy
-
Budynek A Elektroniki
pokój EA 726 otwiera się w nowej karcie - smck@sound.eti.pg.gda.pl
Wybrane publikacje
-
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...
-
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...
-
Comparison of the Ability of Neural Network Model and Humans to Detect a Cloned Voice
The vulnerability of the speaker identity verification system to attacks using voice cloning was examined. The research project assumed creating a model for verifying the speaker’s identity based on voice biometrics and then testing its resistance to potential attacks using voice cloning. The Deep Speaker Neural Speaker Embedding System was trained, and the Real-Time Voice Cloning system was employed based on the SV2TTS, Tacotron,...
wyświetlono 2383 razy