Audio Feature Analysis for Precise Vocalic Segments Classification in English

Szymon Zaporowski; Andrzej Czyżewski

doi:10.1007/978-3-030-59000-0

Audio Feature Analysis for Precise Vocalic Segments Classification in English

Abstrakt

An approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal recorded employing 7 speakers who spoke English at the native or near-native speaker level withing a Standard Southern British English variety accent. The recordings were analyzed by specialists from the field of phonology in order to extract vocalic segments and selected allophones. Then parameterization was made using Mel Frequency Cepstral Coefficients, Delta MFCC, and Delta Delta MFCC. In the next stage, feature vectors were passed to the input of individual algorithms utilized to reduce the size of the vector by previously mentioned algorithms. The vectors prepared in this way have been used for classifying allophones and vocalic segments employing simple Artificial Neural Network (ANN) and Support Vector Machine (SVM). The classification results using both classifiers and methods applied for reducing the number of parameters were presented. The results of the reduction are also shown explicitly, by indicating parameters proven to be significant and those rejected by particular algorithms. Factors influencing the obtained results were discussed. Difficulties associated with obtaining the data set, its labeling, and research on allophones were also analyzed.

Cytowania

3

CrossRef
0

Web of Science
0

Scopus

Autorzy (2)

Cytuj jako

Pełna treść

pełna treść publikacji nie jest dostępna w portalu

pełna treść artykułu zobacz w serwisie zewnętrznym otwiera się w nowej karcie

Słowa kluczowe

Informacje szczegółowe

Kategoria:

Publikacja monograficzna

Typ:

rozdział, artykuł w książce - dziele zbiorowym /podręczniku w języku o zasięgu międzynarodowym

Język:

angielski

Rok wydania:

2020

Opis bibliograficzny:

Zaporowski S., Czyżewski A.: Audio Feature Analysis for Precise Vocalic Segments Classification in English// Multimedia Communications, Services and Security/ : , 2020, s.265-277

DOI:

10.1007/978-3-030-59000-0

Źródła finansowania:

Projekt Metodyka i technologia polimodalnej alofonicznej transkrypcji mowy

Weryfikacja:

Politechnika Gdańska

wyświetlono 153 razy

Publikacje, które mogą cię zainteresować

Ranking Speech Features for Their Usage in Singing Emotion Classification

2020

Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"

2018

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

M. Piotrowska,
A. Czyżewski,
T. Ciszewski
+ 3 autorów

2021

Comparative Study of Self-Organizing Maps vs. Subjective Evaluation of Quality of Allophone Pronunciation for Nonnative English Speakers

2017

Meta Tagi

Audio Feature Analysis for Precise Vocalic Segments Classification in English

Abstrakt

Cytowania

Autorzy (2)

Szymon Zaporowski mgr inż.

Andrzej Czyżewski prof. dr hab. inż.

Cytuj jako

Pełna treść

Słowa kluczowe

Informacje szczegółowe

Publikacje, które mogą cię zainteresować

Ranking Speech Features for Their Usage in Singing Emotion Classification

Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

Comparative Study of Self-Organizing Maps vs. Subjective Evaluation of Quality of Allophone Pronunciation for Nonnative English Speakers

Wyszukiwarka

Audio Feature Analysis for Precise Vocalic Segments Classification in English

Abstrakt

Cytowania

Autorzy (2)

Szymon Zaporowski mgr inż.

Andrzej Czyżewski prof. dr hab. inż.

Cytuj jako

Pełna treść

Słowa kluczowe

Informacje szczegółowe

Publikacje, które mogą cię zainteresować

Ranking Speech Features for Their Usage in Singing Emotion Classification

Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

Comparative Study of Self-Organizing Maps vs. Subjective Evaluation of Quality of Allophone Pronunciation for Nonnative English Speakers