Szymon Zaporowski - Publications - Bridge of Knowledge

Search

Filters

total: 29

  • Category
  • Year
  • Options

clear Chosen catalog filters disabled

Catalog Publications

Year 2024
Year 2023
Year 2022
Year 2021
  • Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
    Publication

    - Year 2021

    This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

    Full text available to download

  • Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
    Publication

    - Year 2021

    A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

    Full text to download in external service

Year 2020
  • 1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
    Publication

    A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence....

  • Audio Feature Analysis for Precise Vocalic Segments Classification in English
    Publication

    An approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...

    Full text to download in external service

  • Constructing a Dataset of Speech Recordingswith Lombard Effect
    Publication

    - Year 2020

    Thepurpose of therecordings was to create a speech corpus based on the ISLEdataset, extended with video and Lombard speech. Selected from a set of 165sentences, 10, evaluatedas having thehighest possibility to occur in the context ofthe Lombard effect,were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether,15speakers were recorded, and speech parameterswere...

  • Ranking Speech Features for Their Usage in Singing Emotion Classification
    Publication

    - Year 2020

    This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

    Full text available to download

Year 2019
Year 2018
Year 2016
  • Procesor efektów dźwiękowych do gitary na urządzenia mobilne
    Publication

    W rozdziale przedstawiono sposób działania procesora efektów dźwiękowych do gitary, składającego się z układu elektronicznego i aplikacji pracującej w czasie rzeczywistym na urządzeniach mobilnych z systemem Android. W pierwszej części zaprezentowano układ (przejściówkę) w postaci przedwzmacniacza zasilanego z baterii, do którego podłącza się gitarę oraz urządzenie mobilne. W drugiej części referatu przedstawiono zaś proces przetwarzania...

  • Procesor efektów dźwiękowych do gitary na urządzenia oparte na systemie Android
    Publication

    W artykule przedstawiono procesor efektów dźwiękowych do gitary, składający się z układu elektronicznego i aplikacji pracującej w czasie rzeczywistym na urządzeniach mobilnych z systemem Android. W pierwszej części referatu przedstawiono proces przetwarzania dźwięku w aplikacji oraz interfejs użytkownika. Interfejs użytkownika napisany został w języku Java, wspartym językiem znaczników XML, zaś przetwarzanie dźwięku, ze względu...

seen 2295 times