Multimodal English corpus for automatic speech recognition - Publication - Bridge of Knowledge

Search

Multimodal English corpus for automatic speech recognition

Abstract

A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech recognition system which analyzes many modalities at the same time. The paper describes the process of multimodal material collection and the post-processing procedure applied to this material. Parameterization methods of signals belonging to different modalities are also proposed.

Cite as

Full text

full text is not available in portal

Keywords

Details

Category:
Conference activity
Type:
materiały konferencyjne indeksowane w Web of Science
Title of issue:
Signal Processing Algorithms, Architectures, Arrangements and Applications strony 106 - 111
ISSN:
2326-0262
Language:
English
Publication year:
2013
Bibliographic description:
Kunka B., Kupryjanow A., Dalka P., Bratoszewski P., Szczodrak M., Spaleniak P., Szykulski M., Czyżewski A..: Multimodal English corpus for automatic speech recognition, W: Signal Processing Algorithms, Architectures, Arrangements and Applications, 2013, IEEE,.
Verified by:
Gdańsk University of Technology

seen 134 times

Recommended for you

Meta Tags