A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Gintautas Tamulevicius; Grazina Korvel; Anil Bora Yayak; Povilas Treigys; Jolita Bernataviciene; Bożena Kostek

doi:10.3390/electronics9101725

Abstract

This research presents a study of cross-linguistic speech emotion recognition. For this purpose, emotional speech data in six languages (English, Lithuanian, German, Spanish, Serbian, and Polish) were collected, resulting in a cross-linguistic speech emotion dataset of more than 10,000 emotional utterances. Although the gathered databases are bi-modal, the focus here is on the acoustic representation only. The assumption is that the speech audio signal carries sufficient emotional information for that emotion to be detected and retrieved. Several two-dimensional acoustic feature spaces, namely cochleagrams, spectrograms, mel-cepstrograms, and a fractal dimension-based space, are employed as representations of emotional speech features. A convolutional neural network (CNN) is used as the classifier. The results show that cochleagrams outperform the other feature spaces examined. In the CNN-based speaker-independent cross-linguistic speech emotion recognition (SER) experiment, an accuracy of over 90% is achieved, which is close to that of monolingual SER.
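To make the idea of a two-dimensional acoustic feature space concrete, the following is a minimal sketch of one such representation, a log-magnitude spectrogram, computed with the Python standard library only. It is an illustration, not the authors' pipeline: the function name `log_spectrogram`, the frame length, the hop size, and the Hann window are all assumptions chosen for the example, and a direct DFT stands in for the FFT that a practical implementation would use.

```python
import cmath
import math


def log_spectrogram(signal, frame_len=64, hop=32, eps=1e-10):
    """Return a 2D list [frame][frequency bin] of log-magnitude values.

    Illustrative only: frame_len, hop, and the Hann window are assumed
    defaults, not parameters taken from the paper.
    """
    # Hann window to reduce spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # keep non-negative frequencies only
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        row = []
        for k in range(n_bins):
            # Direct DFT of a single bin; an FFT would be used in practice.
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            row.append(math.log(abs(acc) + eps))
        frames.append(row)
    return frames


# Usage: a synthetic 1 kHz tone sampled at 8 kHz (hypothetical test signal).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(512)]
spec = log_spectrogram(tone)
# Index of the strongest frequency bin in the first frame.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
```

The resulting time-by-frequency grid is the kind of image-like input a CNN classifier can consume. The cochleagram, mel-cepstrogram, and fractal dimension-based spaces named in the abstract would replace the per-frame DFT with a different per-frame analysis.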

Citations

CrossRef: 27
Web of Science: 0
Scopus: 32

Authors (6)

Dr Gintautas Tamulevicius
Grazina Korvel
Anil Bora Yayak
Dr Povilas Treigys
Jolita Bernataviciene
Prof. Bożena Kostek, DSc, PhD, Eng.

Full text

Publication version: Accepted or Published Version

Details

Category: Journal publication
Type: Journal articles
Published in: Electronics, no. 9, ISSN: 2079-9292
Language: English
Year of publication: 2020
Bibliographic description: Tamulevicius G., Korvel G., Yayak A., Treigys P., Bernataviciene J., Kostek B.: A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces // Electronics, Vol. 9, iss. 10 (2020), p. 1725-
DOI: 10.3390/electronics9101725
Funding sources: Statutory activity/subsidy
Verified by: Politechnika Gdańska (Gdańsk University of Technology)
