Szymon Zaporowski - Profil naukowy - MOST Wiedzy

Wyszukiwarka

Wybrane publikacje

  • Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling

    - Rok 2021

    A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention

    - Rok 2021

    This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

    Pełny tekst do pobrania

  • Comparison of the Ability of Neural Network Model and Humans to Detect a Cloned Voice

    The vulnerability of the speaker identity verification system to attacks using voice cloning was examined. The research project assumed creating a model for verifying the speaker’s identity based on voice biometrics and then testing its resistance to potential attacks using voice cloning. The Deep Speaker Neural Speaker Embedding System was trained, and the Real-Time Voice Cloning system was employed based on the SV2TTS, Tacotron,...

    Pełny tekst do pobrania

wyświetlono 2330 razy