Wyniki wyszukiwania dla: avsr

Wyniki wyszukiwania dla: avsr

Znaleźliśmy mało wyników, wypróbuj alternatywnej metody wyszukiwania.

Filtry

wszystkich: 2

wyczyść wszystkie filtry niedostępne

Najlepsze wyniki w katalogu: Potencjał Badawczy Pokaż wszystkie wyniki (2)

Zespół Systemów Multimedialnych
Potencjał Badawczy
- Katedra Systemów Multimedialnych
* technologie archiwizacji, rekonstrukcji i dostępu do nagrań archiwalnych * technologie inteligentnego monitoringu wizyjnego i akustycznego * multimedialne technologie telemedyczne * multimodalne interfejsy komputerowe
Zespół Systemów Multimedialnych
Potencjał Badawczy
- Katedra Systemów Multimedialnych
* technologie archiwizacji, rekonstrukcji i dostępu do nagrań archiwalnych * technologie inteligentnego monitoringu wizyjnego i akustycznego * multimedialne technologie telemedyczne * multimodalne interfejsy komputerowe

Pozostałe wyniki Pokaż wszystkie wyniki (163)

Building Knowledge for the Purpose of Lip Speech Identification
Publikacja
- Advances in Intelligent Systems and Computing - Rok 2017
Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...

Pełny tekst do pobrania w serwisie zewnętrznym
An audio-visual corpus for multimodal automatic speech recognition
Publikacja
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2017
review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Pełny tekst do pobrania w portalu
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publikacja
- Rok 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Pełny tekst do pobrania w serwisie zewnętrznym
KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY
Publikacja
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Rok 2016
W referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus...
MODALITY corpus - SPEAKER 35 - COMMANDS C1
Dane Badawcze
- seria: MODALITY corpus
The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...

Wyszukiwarka

Filtry

Katalog

Najlepsze wyniki w katalogu: Potencjał Badawczy Pokaż wszystkie wyniki (2)

Wyniki wyszukiwania dla: avsr

Zespół Systemów Multimedialnych

Zespół Systemów Multimedialnych

Pozostałe wyniki Pokaż wszystkie wyniki (163)

Wyniki wyszukiwania dla: avsr

Building Knowledge for the Purpose of Lip Speech Identification

An audio-visual corpus for multimodal automatic speech recognition

Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions

KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY

MODALITY corpus - SPEAKER 35 - COMMANDS C1