Marking the Allophones Boundaries Based on the DTW Algorithm - Publikacja - MOST Wiedzy

Wyszukiwarka

Marking the Allophones Boundaries Based on the DTW Algorithm

Abstrakt

The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border between allophones is in some cases very difficult to determine. Nowadays, this task is carried out manually in cooperation with specialists in the field of phonetics. The presented approach allows to build a system that is able to automate this process. The aim of the work currently carried out by the author is a method that facilitates the training material processing for the needs of the development of multimodal speech recognition systems. For this purpose, the difficult problem of marking boundaries of allophones is solved in this report based on the Polish dictionary in the context of the creation of allophone bases for speech synthesis. This is done in this way due to the simplified possibility of organizing critical listening and subjective evaluation of received allophones by a large group of Polish native speakers (75 people). Strengthening the method will allow it to be used for the extraction of allophones for the needs of developed system of automatic transcription of English speech and for its notation according to the IPA standard. The analyzed continuous speech is combined in the DTW algorithm with a synthesized speech signal. The comparison of both signals is perform not in the time domain as in the classical DTW, but in the frequency domain. This allows for a statement that the phonetic content of both signals is compared. The paper describes the process of marking the boundaries of allophones for the Polish language, however after appropriate modifications, this approach can be used to determine the allophones boundaries in other languages, especially for English.

Cytowania

  • 0

    CrossRef

  • 0

    Web of Science

  • 0

    Scopus

Pełna treść

Informacje szczegółowe

Kategoria:
Archiwalna
Typ:
materiały konferencyjne indeksowane w Web of Science
Tytuł wydania:
SPA 2018 Signal Processing Algorithms, Architectures, Arrangements and Applications, Conference Proceedings strony 245 - 249
Język:
angielski
Rok wydania:
2018
Opis bibliograficzny:
Rafałko J..: Marking the Allophones Boundaries Based on the DTW Algorithm, W: SPA 2018 Signal Processing Algorithms, Architectures, Arrangements and Applications, Conference Proceedings, 2018, Poznan University of Technology,.
DOI:
Cyfrowy identyfikator dokumentu elektronicznego (otwiera się w nowej karcie) 10.23919/spa.2018.8563359
Źródła finansowania:
Weryfikacja:
Politechnika Gdańska

wyświetlono 16 razy

Publikacje, które mogą cię zainteresować

Meta Tagi