Abstract
The study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and three feature spaces were investigated for the frequency domain, namely: Linear Prediction Coefficient (LPC) spectrum, Hartley spectrum, and cochleagram. Due to the fact that deep learning requires an adequate training set size of the corpus and its content may significantly influence the outcome, thus for the data augmentation purpose, the created dataset was extended with mixes of the speech signal with noise with various SNRs (Signal-to-Noise Ratio). In order to evaluate the applicability of the implemented feature spaces for isolated word recognition task, three experiments were conducted, i.e., 10-, 70-, and 111-word cases were analyzed.
Citations
-
5
CrossRef
-
0
Web of Science
-
0
Scopus
Authors (5)
Cite as
Full text
full text is not available in portal
Keywords
Details
- Category:
- Monographic publication
- Type:
- rozdział, artykuł w książce - dziele zbiorowym /podręczniku w języku o zasięgu międzynarodowym
- Language:
- English
- Publication year:
- 2020
- Bibliographic description:
- Treigys P., Korvel G., Tamulevicius G., Bernataviciene J., Kostek B.: Investigating Feature Spaces for Isolated Word Recognition// Data Science: New Issues, Challenges and Applications/ : , 2020, s.165-181
- DOI:
- Digital Object Identifier (open in new tab) 10.1007/978-3-030-39250-5
- Sources of funding:
-
- Statutory activity/subsidy
- Verified by:
- Gdańsk University of Technology
seen 139 times
Recommended for you
Investigating Feature Spaces for Isolated Word Recognition
- G. Korvel,
- G. Tamulevicus,
- P. Treigys
- + 2 authors
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
- G. Tamulevicius,
- G. Korvel,
- A. B. Yayak
- + 3 authors