Marking the Allophones Boundaries Based on the DTW Algorithm - Publication - Bridge of Knowledge

Search

Marking the Allophones Boundaries Based on the DTW Algorithm

Abstract

The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border between allophones is in some cases very difficult to determine. Nowadays, this task is carried out manually in cooperation with specialists in the field of phonetics. The presented approach allows to build a system that is able to automate this process. The aim of the work currently carried out by the author is a method that facilitates the training material processing for the needs of the development of multimodal speech recognition systems. For this purpose, the difficult problem of marking boundaries of allophones is solved in this report based on the Polish dictionary in the context of the creation of allophone bases for speech synthesis. This is done in this way due to the simplified possibility of organizing critical listening and subjective evaluation of received allophones by a large group of Polish native speakers (75 people). Strengthening the method will allow it to be used for the extraction of allophones for the needs of developed system of automatic transcription of English speech and for its notation according to the IPA standard. The analyzed continuous speech is combined in the DTW algorithm with a synthesized speech signal. The comparison of both signals is perform not in the time domain as in the classical DTW, but in the frequency domain. This allows for a statement that the phonetic content of both signals is compared. The paper describes the process of marking the boundaries of allophones for the Polish language, however after appropriate modifications, this approach can be used to determine the allophones boundaries in other languages, especially for English.

Citations

  • 1

    CrossRef

  • 0

    Web of Science

  • 1

    Scopus

Cite as

Full text

full text is not available in portal

Keywords

Details

Category:
Conference activity
Type:
materiały konferencyjne indeksowane w Web of Science
Title of issue:
SPA 2018 Signal Processing Algorithms, Architectures, Arrangements and Applications, Conference Proceedings strony 245 - 249
Language:
English
Publication year:
2018
Bibliographic description:
Rafałko J..: Marking the Allophones Boundaries Based on the DTW Algorithm, W: SPA 2018 Signal Processing Algorithms, Architectures, Arrangements and Applications, Conference Proceedings, 2018, ,.
DOI:
Digital Object Identifier (open in new tab) 10.23919/spa.2018.8563359
Sources of funding:
Verified by:
Gdańsk University of Technology

seen 136 times

Recommended for you

Meta Tags