Laboratorium Akustyki Fonicznej

Publikacje

Rok 2019

An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Publikacja
- G. Korvel
- O. Kurasova
- B. Kostek
- Rok 2019
The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

Pełny tekst do pobrania w portalu
ANALIZA KOLORÓW SCEN FILMOWYCH W KONTEKŚCIE COLOR GRADINGU
Publikacja
- D. Weber
- B. Kostek
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Rok 2019
W artykule przedstawiono zagadnienia związane z kolorowaniem sceny filmowej. W pracy przedyskutowano główne aspekty obróbki koloru obrazu filmowego oraz omówiono definicje pojęć związanych z kolorowaniem sceny, tj.: color correction oraz color gradingu. Opisano teorie psychologii koloru oraz ich praktyczne wykorzystanie w filmie i odniesiono je do podstawowych gatunków filmowych i modeli emocji. Następnie przedyskutowano założenia...

Pełny tekst do pobrania w portalu
ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU
Publikacja
- S. Zaporowski
- B. Kostek
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Rok 2019
Praca dotyczy podejścia do parametryzacji w przypadku klasyfikacji emocji w śpiewie oraz porównania z klasyfikacją emocji w mowie. Do tego celu wykorzystano bazę mowy i śpiewu nacechowanego emocjonalnie RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song), zawierającą nagrania profesjonalnych aktorów prezentujących sześć różnych emocji. Następnie obliczono współczynniki mel-cepstralne (MFCC) oraz wybrane deskryptory...

Pełny tekst do pobrania w portalu
Assessment of the Effectiveness of a Short-term Hearing Aid Use in Patients with Different Degrees of Hearing Loss
Publikacja
- T. Poremski
- P. Szymański
- B. Kostek
- Archives of Acoustics - Rok 2019
The study presents evaluating the effectiveness of the hearing aid fitting process in the short-term use (7 days). The evaluation method consists of a survey based on the APHAB (Abbreviated Profile of Hearing Aid Benefit) questionnaire. Additional criteria such as a degree of hearing loss, number of hours and days of hearing aid use as well as the user’s experience were also taken into consideration. The outcomes of the benefit...

Pełny tekst do pobrania w portalu
Comparison of Lithuanian and Polish Consonant Phonemes Based on Acoustic Analysis – Preliminary Results
Publikacja
- G. Korvel
- O. Kurasova
- B. Kostek
- Archives of Acoustics - Rok 2019
The goal of this research is to find a set of acoustic parameters that are related to differences between Polish and Lithuanian language consonants. In order to identify these differences, an acoustic analysis is performed, and the phoneme sounds are described as the vectors of acoustic parameters. Parameters known from the speech domain as well as those from the music information retrieval area are employed. These parameters are...

Pełny tekst do pobrania w portalu
Comparison of the effectiveness of automatic EEG signal class separation algorithms
Publikacja
- JOURNAL OF INTELLIGENT & FUZZY SYSTEMS - Rok 2019
In this paper, an algorithm for automatic brain activity class identification of EEG (electroencephalographic) signals is presented. EEG signals are gathered from seventeen subjects performing one of the three tasks: resting, watching a music video and playing a simple logic game. The methodology applied consists of several steps, namely: signal acquisition, signal processing utilizing z-score normalization, parametrization and...

Pełny tekst do pobrania w portalu
Discovering Rule-Based Learning Systems for the Purpose of Music Analysis
Publikacja
- G. Korvel
- B. Kostek
- Journal of the Acoustical Society of America - Rok 2019
Music analysis and processing aims at understanding information retrieved from music (Music Information Retrieval). For the purpose of music data mining, machine learning (ML) methods or statistical approach are employed. Their primary task is recognition of musical instrument sounds, music genre or emotion contained in music, identification of audio, assessment of audio content, etc. In terms of computational approach, music databases...

Pełny tekst do pobrania w portalu
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech
Publikacja
- D. Korzekwa
- R. Barra-Chicote
- B. Kostek
- T. Drugman
- M. Łajszczak
- Rok 2019
We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

Pełny tekst do pobrania w portalu
MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES
Publikacja
- M. Piotrowska
- G. Korvel
- B. Kostek
- T. Ciszewski
- A. Czyżewski
- International Journal of Applied Mathematics and Computer Science - Rok 2019
Automatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and selforganizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...

Pełny tekst do pobrania w portalu
Method for Clustering of Brain Activity Data Derived from EEG Signals
Publikacja
- FUNDAMENTA INFORMATICAE - Rok 2019
A method for assessing separability of EEG signals associated with three classes of brain activity is proposed. The EEG signals are acquired from 23 subjects, gathered from a headset consisting of 14 electrodes. Data are processed by applying Discrete Wavelet Transform (DWT) for the signal analysis and an autoencoder neural network for the brain activity separation. Processing involves 74 wavelets from 3 DWT families: Coiflets,...

Pełny tekst do pobrania w portalu
Music information retrieval—The impact of technology, crowdsourcing, big data, and the cloud in art.
Publikacja
- B. Kostek
- Journal of the Acoustical Society of America - Rok 2019
The exponential growth of computer processing power, cloud data storage, and crowdsourcing model of gathering data bring new possibilities to music information retrieval (mir) field. Mir is no longer music content retrieval only; the area also comprises the discovery of expressing feelings and emotions contained in music, incorporating other than hearing modalities for helping this issue, users’ profiling, merging music with social...

Pełny tekst do pobrania w portalu
Recovering Sound Produced by Wind Turbine Structures Employing Video Motion Magnification
Publikacja
- Rok 2019
The recordings were made with a fast video camera and with a microphone. Using fast cameras allowed for observation of the micro vibrations of the object structure. Motion-magnified video recordings of wind turbines on a wind farm were made for the purpose of building a damage prediction system. An idea was to use video to recover sound & vibrations in order to obtain a contactless diagnostic method for wind turbines. The recovered signals...

Pełny tekst do pobrania w serwisie zewnętrznym
Relationship between album cover design and music genres.
Publikacja
- A. Dorochowicz
- B. Kostek
- Rok 2019
The aim of the study is to find out whether there exists a relationship between typographic, compositional and coloristic elements of the music album cover design and music contained in the album. The research study involves basic statistical analysis of the manually extracted data coming from the worldwide album covers. The samples represent 34 different music genres, coming from nine countries from around the world. There are...
Sound engineering as our commitment to its creators in Poland
Publikacja
- B. Kostek
- A. Czyżewski
- Archives of Acoustics - Rok 2019
Sound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other ones constituting multimedia technology. Moreover, machine learning methods applied to the topics...

Pełny tekst do pobrania w serwisie zewnętrznym
Speech Analytics Based on Machine Learning
Publikacja
- Rok 2019
In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Pełny tekst do pobrania w serwisie zewnętrznym
Subjective tests for gathering knowledge for applying color grading to video clips automatically
Publikacja
- D. Weber
- B. Kostek
- Rok 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot, and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with or...

Pełny tekst do pobrania w portalu
Subjective tests for gathering konwledge for applaying color grading to video clips automatically
Publikacja
- D. Weber
- B. Kostek
- Rok 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot,and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with...

Pełny tekst do pobrania w serwisie zewnętrznym

Rok 2018

A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
Publikacja
- Rok 2018
This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...

Pełny tekst do pobrania w serwisie zewnętrznym
A Device for Measuring Auditory Brainstem Responses to Audio
Publikacja
- Rok 2018
Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...

Pełny tekst do pobrania w portalu
A Stand for Measurement and Prediction of Scattering Properties of Diffusers
Publikacja
- Rok 2018
In this paper we present a set of solutions which may be used for prototyping and simulation of acoustic scattering devices. A system proposed is capable of measuring sound field. Also a way to use an open source solution for simulation of scattering phenomena occurring in proximity of acoustic diffusers is shown. The result of our work are measurement procedure and a prototype of the simulation script based on FEniCS - an open source...

Pełny tekst do pobrania w serwisie zewnętrznym

Laboratorium Akustyki Fonicznej

Publikacje

Filtry

Kategoria

Rok

Opcje

Rok 2019

An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics

ANALIZA KOLORÓW SCEN FILMOWYCH W KONTEKŚCIE COLOR GRADINGU

ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU

Assessment of the Effectiveness of a Short-term Hearing Aid Use in Patients with Different Degrees of Hearing Loss

Comparison of Lithuanian and Polish Consonant Phonemes Based on Acoustic Analysis – Preliminary Results

Comparison of the effectiveness of automatic EEG signal class separation algorithms

Discovering Rule-Based Learning Systems for the Purpose of Music Analysis

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES

Method for Clustering of Brain Activity Data Derived from EEG Signals

Music information retrieval—The impact of technology, crowdsourcing, big data, and the cloud in art.

Recovering Sound Produced by Wind Turbine Structures Employing Video Motion Magnification

Relationship between album cover design and music genres.

Sound engineering as our commitment to its creators in Poland

Speech Analytics Based on Machine Learning

Subjective tests for gathering knowledge for applying color grading to video clips automatically

Subjective tests for gathering konwledge for applaying color grading to video clips automatically

Rok 2018

A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems

A Device for Measuring Auditory Brainstem Responses to Audio

A Stand for Measurement and Prediction of Scattering Properties of Diffusers

Wyszukiwarka

Laboratorium Akustyki Fonicznej

Publikacje

Filtry

Kategoria

Rok

Opcje

Katalog Publikacji

Rok 2019

Rok 2018