Search results for: speech-to-text technology

Search results for: speech-to-text technology

Didn't find any results in this catalog!

But we have some results in other catalogs.

Przykład wyników znalezionych w innych katalogach

zobacz wszystkie wyniki

Filters

total: 6623

clear all filters disabled

displaying 1000 best results Help

Time-domain prosodic modifications for text-to-speech synthesizer
Publication
- J. Łopatka
- P. Suchomski
- A. Czyżewski
- Year 2010
An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
Evaluation and Irony in Text in the Light of Speech Act Theory
Publication
- K. Kukowicz-Zarska
- Forum Filologiczne Ateneum - Year 2020
Full text to download in external service
Automatic prosodic modification in a Text-To-Speech synthesizer of Polish language
Publication
- K. Łopatka
- P. Suchomski
- A. Czyżewski
- Elektronika : konstrukcje, technologie, zastosowania - Year 2011
Przedstawiono system syntezy mowy polskiej z funkcją automatycznej modyfikacji prozodii wypowiedzi. Opisane zostały metody automatycznego wyznaczania akcentu i intonacji wypowiedzi. Przedstawiono zastosowanie algorytmów przetwarzania sygnału mowy w procesie kształtowania prozodii. Omówiono wpływ zastosowanych modyfikacji na naturalność brzmienia syntezowanego sygnału. Zastosowana metoda oparta jest na algorytmie TD-PSOLA. Opracowany...
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
Publication
- B. Kostek
- B. Szyca
- Journal of the Acoustical Society of America - Year 2023
The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

Full text available to download
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Full text to download in external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Full text to download in external service
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
Publication
- D. Piotrowski
- R. Korzeniowski
- A. Falai
- S. Cygert
- K. Pokora
- G. Tinchev
- Z. Zhang
- K. Yanagisawa
- Year 2023
In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...

Full text to download in external service
Speech and Language Technology

Journals

ISSN: 1895-0434
SpringerBriefs in Speech Technology

Journals

ISSN: 2191-737X , eISSN: 2191-7388
Text Technology: A Journal of computer Text Processing

Journals

ISSN: 1496-0958

Search

Didn't find any results in this catalog!

Filters

Catalog

Search results for: speech-to-text technology