dr hab. Tomasz Ciszewski
Zatrudnienie
Słowa kluczowe Pomoc
- allophonic speech transcription
- audio-visual speech recognition
- avsr
- face motion capture
- speech recognition
- sttereovision
- thermovision
- time-of-flight
- transcription
- viseme · parameterization of mouth region · support vector machine · hidden markov model · pattern recognition · audiovisual speech recognition
Kontakt
- tomcisz1@pg.edu.pl
Wybrane publikacje
-
A comparative study of English viseme recognition methods and algorithms
An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...
-
Building Knowledge for the Purpose of Lip Speech Identification
Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...
-
Visual perception of vowels from static and dynamic cues
The purpose of the study was to analyse human identification of Polish vowels from static and dynamic durationally slowed visual cues. A total of 152 participants identified 6 Polish vowels produced by 4 speakers from static (still images) and dynamic (videos) cues. The results show that 59% of static vowels and 63% of dynamic vowels were successfully identified. There was a strong confusion between vowels within front, central,...
wyświetlono 792 razy