Bożena Kostek - Publications

Prof. Bożena Kostek, PhD, DSc, Eng.

Employment

Professor at the Audio Acoustics Laboratory

Publications

Year 2021

Skuteczność klasyfikacji gatunków muzycznych za pomocą sieci neuronowej w zależności od typu danych wejściowych [Effectiveness of music genre classification by a neural network depending on the type of input data]
Publication
- Year 2021
Music genre recognition is one of the basic elements of intelligent systems for automatic playlist creation. Streaming platforms offering such a service require solutions that determine as accurately as possible which genre a piece belongs to. According to the current state of knowledge, the most effective classifiers are artificial neural networks (including deep learning variants), for which...

Full text available for download from an external service
Techniki wielokanałowe wykorzystywane w koncertach i nagraniach muzycznych na odległość [Multichannel techniques used in remote concerts and music recordings]
Publication
- Year 2021
During the COVID-19 coronavirus pandemic, the ability to transmit sound together with video took on new significance, especially for remote work, which is a particular challenge for musicians both in joint practice and rehearsals and in concerts. The resulting need for multi-source connections revealed the need to spatialize sound so that sound sources can be localized more easily. Creating remote music recordings became...

Full text available for download from an external service
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publication
- D. Korzekwa
- J. Lorenzo-Trueba
- T. Drugman
- S. Calamaro
- B. Kostek
- Year 2021
We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

Full text available for download on the portal

Year 2020

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
Publication
- G. Tamulevicius
- G. Korvel
- A. B. Yayak
- P. Treigys
- J. Bernataviciene
- B. Kostek
- Electronics - Year 2020
In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10,000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available for download on the portal
Analiza ruchu drogowego z wykorzystaniem analizy akustycznej [Road traffic analysis using acoustic analysis]
Publication
- K. Marciniuk
- B. Kostek
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2020
The paper addresses the acquisition of road traffic information through acoustic monitoring. Basic traffic surveillance techniques are outlined. The assumptions of an acoustic traffic detector are presented, and its effectiveness is examined in three areas: vehicle counting, vehicle type classification, and classification of weather conditions on the road surface.

Full text available for download from an external service
Analyzing the Effectiveness of the Brain–Computer Interface for Task Discerning Based on Machine Learning
Publication
- SENSORS - Year 2020
The aim of the study is to compare electroencephalographic (EEG) signal feature extraction methods in the context of the effectiveness of the classification of brain activities. For classification, electroencephalographic signals were obtained using an EEG device from 17 subjects in three mental states (relaxation, excitation, and solving a logical task). Blind source separation employing independent component analysis (ICA) was...

Full text available for download on the portal
Employing Subjective Tests and Deep Learning for Discovering the Relationship between Personality Types and Preferred Music Genres
Publication
- Electronics - Year 2020
The purpose of this research is two-fold: (a) to explore the relationship between the listeners’ personality trait, i.e., extraverts and introverts and their preferred music genres, and (b) to predict the personality trait of potential listeners on the basis of a musical excerpt by employing several classification algorithms. We assume that this may help match songs according to the listener’s personality in social music networks....

Full text available for download on the portal
Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
Publication
- G. Korvel
- K. Kąkol
- O. Kurasova
- B. Kostek
- IEEE Access - Year 2020
The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...

Full text available for download on the portal
Improving Objective Speech Quality Indicators in Noise Conditions
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Year 2020
This work aims at modifying speech signal samples and testing them with objective speech quality indicators after mixing the original signals with noise or with an interfering signal. Modifications that are applied to the signal are related to the Lombard speech characteristics, i.e., pitch shifting, utterance duration changes, vocal tract scaling, manipulation of formants. A set of words and sentences in Polish, recorded in silence,...

Full text available for download from an external service
Investigating Feature Spaces for Isolated Word Recognition
Publication
- P. Treigys
- G. Korvel
- G. Tamulevicius
- J. Bernataviciene
- B. Kostek
- Year 2020
The study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...

Full text available for download from an external service
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
Publication
- D. Koszewski
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2020
Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Full text available for download on the portal
Ranking Speech Features for Their Usage in Singing Emotion Classification
Publication
- S. Zaporowski
- B. Kostek
- Year 2020
This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available for download on the portal

Year 2019

A Concept of Automatic Film Color Grading Based on Music Recognition and Evoked Emotions
Publication
- D. Weber
- B. Kostek
- Year 2019
The article presents the aspects of the final selection of the color of shots in film production based on the psychology of color. First of all, the elements of color processing, contrast, saturation or white balance in the film shots were presented and the definition of color grading was given. In the second part of the article the analysis of film music was conducted in the context of stimulating appropriate emotions while watching...
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Publication
- G. Korvel
- O. Kurasova
- B. Kostek
- Year 2019
The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

Full text available for download on the portal
ANALIZA KOLORÓW SCEN FILMOWYCH W KONTEKŚCIE COLOR GRADINGU [Analysis of film scene colors in the context of color grading]
Publication
- D. Weber
- B. Kostek
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2019
The article presents issues related to coloring a film scene. It discusses the main aspects of color processing of the film image and reviews the definitions of terms related to scene coloring, i.e., color correction and color grading. Theories of color psychology and their practical use in film are described and related to basic film genres and models of emotion. The article then discusses the assumptions...

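Full text available for download on the portal
ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU [Analysis of speech signal parameters in the context of their usefulness in the automatic assessment of singing expression quality]
Publication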
- S. Zaporowski
- B. Kostek
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2019
The work concerns an approach to parametrization for the classification of emotions in singing and a comparison with the classification of emotions in speech. For this purpose, the RAVDESS database (Ryerson Audio-Visual Database of Emotional Speech and Song) of emotionally expressive speech and singing was used; it contains recordings of professional actors presenting six different emotions. Mel-frequency cepstral coefficients (MFCC) were then calculated, along with selected descriptors...

Full text available for download on the portal
Assessment of the Effectiveness of a Short-term Hearing Aid Use in Patients with Different Degrees of Hearing Loss
Publication
- T. Poremski
- P. Szymański
- B. Kostek
- Archives of Acoustics - Year 2019
The study evaluates the effectiveness of the hearing aid fitting process in short-term use (7 days). The evaluation method consists of a survey based on the APHAB (Abbreviated Profile of Hearing Aid Benefit) questionnaire. Additional criteria such as the degree of hearing loss, the number of hours and days of hearing aid use, as well as the user’s experience were also taken into consideration. The outcomes of the benefit...

Full text available for download on the portal
Comparison of Lithuanian and Polish Consonant Phonemes Based on Acoustic Analysis – Preliminary Results
Publication
- G. Korvel
- O. Kurasova
- B. Kostek
- Archives of Acoustics - Year 2019
The goal of this research is to find a set of acoustic parameters that are related to differences between Polish and Lithuanian language consonants. In order to identify these differences, an acoustic analysis is performed, and the phoneme sounds are described as the vectors of acoustic parameters. Parameters known from the speech domain as well as those from the music information retrieval area are employed. These parameters are...

Full text available for download on the portal
Comparison of the effectiveness of automatic EEG signal class separation algorithms
Publication
- JOURNAL OF INTELLIGENT & FUZZY SYSTEMS - Year 2019
In this paper, an algorithm for automatic brain activity class identification of EEG (electroencephalographic) signals is presented. EEG signals are gathered from seventeen subjects performing one of the three tasks: resting, watching a music video and playing a simple logic game. The methodology applied consists of several steps, namely: signal acquisition, signal processing utilizing z-score normalization, parametrization and...

Full text available for download on the portal
Discovering Rule-Based Learning Systems for the Purpose of Music Analysis
Publication
- G. Korvel
- B. Kostek
- Journal of the Acoustical Society of America - Year 2019
Music analysis and processing aims at understanding information retrieved from music (Music Information Retrieval). For the purpose of music data mining, machine learning (ML) methods or statistical approach are employed. Their primary task is recognition of musical instrument sounds, music genre or emotion contained in music, identification of audio, assessment of audio content, etc. In terms of computational approach, music databases...

Full text available for download on the portal
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech
Publication
- D. Korzekwa
- R. Barra-Chicote
- B. Kostek
- T. Drugman
- M. Łajszczak
- Year 2019
We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model’s key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

Full text available for download on the portal
MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES
Publication
- M. Piotrowska
- G. Korvel
- B. Kostek
- T. Ciszewski
- A. Czyżewski
- International Journal of Applied Mathematics and Computer Science - Year 2019
Automatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and self-organizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...

Full text available for download on the portal
Method for Clustering of Brain Activity Data Derived from EEG Signals
Publication
- FUNDAMENTA INFORMATICAE - Year 2019
A method for assessing separability of EEG signals associated with three classes of brain activity is proposed. The EEG signals are acquired from 23 subjects, gathered from a headset consisting of 14 electrodes. Data are processed by applying Discrete Wavelet Transform (DWT) for the signal analysis and an autoencoder neural network for the brain activity separation. Processing involves 74 wavelets from 3 DWT families: Coiflets,...

Full text available for download on the portal
Music information retrieval—The impact of technology, crowdsourcing, big data, and the cloud in art.
Publication
- B. Kostek
- Journal of the Acoustical Society of America - Year 2019
The exponential growth of computer processing power, cloud data storage, and crowdsourcing model of gathering data bring new possibilities to the music information retrieval (MIR) field. MIR is no longer music content retrieval only; the area also comprises the discovery of expressing feelings and emotions contained in music, incorporating other than hearing modalities for helping this issue, users’ profiling, merging music with social...

Full text available for download on the portal
Recovering Sound Produced by Wind Turbine Structures Employing Video Motion Magnification
Publication
- Year 2019
The recordings were made with a fast video camera and with a microphone. Using fast cameras allowed for observation of the micro vibrations of the object structure. Motion-magnified video recordings of wind turbines on a wind farm were made for the purpose of building a damage prediction system. An idea was to use video to recover sound & vibrations in order to obtain a contactless diagnostic method for wind turbines. The recovered signals...

Full text available for download from an external service
Relationship between album cover design and music genres.
Publication
- A. Dorochowicz
- B. Kostek
- Year 2019
The aim of the study is to find out whether there exists a relationship between typographic, compositional and coloristic elements of the music album cover design and music contained in the album. The research study involves basic statistical analysis of the manually extracted data coming from the worldwide album covers. The samples represent 34 different music genres, coming from nine countries from around the world. There are...
Sound engineering as our commitment to its creators in Poland
Publication
- B. Kostek
- A. Czyżewski
- Archives of Acoustics - Year 2019
Sound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other ones constituting multimedia technology. Moreover, machine learning methods applied to the topics...

Full text available for download from an external service
Speech Analytics Based on Machine Learning
Publication
- Year 2019
In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Full text available for download from an external service
Subjective tests for gathering knowledge for applying color grading to video clips automatically
Publication
- D. Weber
- B. Kostek
- Year 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot, and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with or...

Full text available for download on the portal
Subjective tests for gathering knowledge for applying color grading to video clips automatically
Publication
- D. Weber
- B. Kostek
- Year 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot, and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with...

Full text available for download from an external service

Year 2018

A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
Publication
- Year 2018
This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...

Full text available for download from an external service
A Device for Measuring Auditory Brainstem Responses to Audio
Publication
- Year 2018
Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...

Full text available for download on the portal
A Stand for Measurement and Prediction of Scattering Properties of Diffusers
Publication
- Year 2018
In this paper we present a set of solutions that may be used for prototyping and simulating acoustic scattering devices. The proposed system is capable of measuring the sound field. We also show a way to use an open-source solution for simulating scattering phenomena occurring in the proximity of acoustic diffusers. The results of our work are a measurement procedure and a prototype of the simulation script based on FEniCS - an open source...

Full text available for download from an external service
A Study on Audio Signal Processed by "Instant Mastering"
Publication
- M. Piotrowska
- S. Piotrowski
- B. Kostek
- Year 2018
An increasing amount of music produced in home- and project-studios results in development and growth of "automatic mastering services". The presented investigation explores changes introduced to audio signal by various online mastering platforms. A music set consisting of 10 songs produced in small facilities was processed by eight on-line automatic mastering services. Additionally, some laboratory-constructed signals were tested....
A study of music features derived from audio recordings examples – a quantitative analysis
Publication
- A. Dorochowicz
- B. Kostek
- Archives of Acoustics - Year 2018
The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician, whose style evolved over time. Firstly, the origin and the background of the division of music genres were shortly presented. Then, several objective parameters of an audio signal were recalled that...

Full text available for download on the portal
Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition
Publication
- G. Korvel
- P. Treigys
- G. Tamulevicius
- J. Bernataviciene
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018
convolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Year 2018
The aim of the work is to analyze the Lombard speech effect in recordings and then modify the speech signal in order to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of the Lombard speech, and in particular on the effect of increasing the fundamental frequency...
Aparat słuchowy a alternatywne urządzenia poprawiające słyszenie [Hearing aids versus alternative hearing-improvement devices]
Publication
- T. Poremski
- P. Szymański
- B. Kostek
- Otorynolaryngologia - Przegląd Kliniczny - Year 2018
This paper reviews the available literature on various types of hearing-improvement devices that, in specific cases, can be treated as alternatives to conventional hearing aids. It includes a discussion of a new type of pre-programmed hearing aid that can be distributed to potential users by mail or directly. In addition...

Full text available for download from an external service
AUDIO SIGNAL EQUALIZATION BASED ON IMPULSE RESPONSE OF A LISTENING ROOM AND MUSIC CONTENT REPRODUCED
Publication
- P. Hoffmann
- B. Kostek
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2018
This research study investigates the influence of room acoustics on the frequency characteristic of audio signal playback. First, a concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on a designed equalizer is proposed. The system settings depend on the music genre recognized automatically....
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publication
- Year 2018
In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

Full text available for download from an external service
Automatic Clustering of EEG-Based Data Associated with Brain Activity
Publication
- Year 2018
The aim of this paper is to present a system for automatic assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on skull. Data gathered are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain....

Full text available for download from an external service
Automatic music genre classification based on musical instrument track separation / Automatyczna klasyfikacja gatunku muzycznego wykorzystująca algorytm separacji dźwięku instrumentów muzycznych
Publication
- A. Rosner
- B. Kostek
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2018
The aim of this article is to investigate whether separating music tracks at the pre-processing phase and extending feature vector by parameters related to the specific musical instruments that are characteristic for the given musical genre allow for efficient automatic musical genre classification in case of database containing thousands of music excerpts and a dozen of genres. Results of extensive experiments show that the approach...

Full text available for download on the portal
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
Publication
- Journal of the Acoustical Society of America - Year 2018
A method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Full text available for download from an external service
Classification of Music Genres by Means of Listening Tests and Decision Algorithms
Publication
- Year 2018
The paper compares the results of audio excerpt assignment to a music genre obtained in listening tests and classification by means of decision algorithms. A short review on music description employing music styles and genres is given. Then, assumptions of listening tests to be carried out along with an online survey for assigning audio samples to selected music genres are presented. A framework for music parametrization is created...

Full text available for download from an external service
Comparative analysis of spectral and cepstral feature extraction techniques for phoneme modelling
Publication
- G. Korvel
- O. Kurasova
- B. Kostek
- Year 2018
A phoneme parameter extraction framework based on spectral and cepstral parameters is proposed. Using this framework, the phoneme signal is divided into frames and a Hamming window is applied. The performance is evaluated for recognition of Lithuanian vowel and semivowel phonemes. Different feature sets without noise as well as at different levels of noise are considered. Two classical machine learning methods (Naive Bayes and Support...

Full text available for download from an external service
Comparative analysis of various transformation techniques for voiceless consonants modeling
Publication
- G. Korvel
- B. Kostek
- O. Kurasova
- International Journal of Computers Communications & Control - Year 2018
In this paper, a comparison of various transformation techniques, namely Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT) and Discrete Walsh Hadamard Transform (DWHT), is performed in the context of their application to voiceless consonant modeling. Speech features based on these transformation techniques are extracted. These features are mean and derivative values of cepstrum coefficients, derived from each transformation....

Full text available for download on the portal
Editor's note and 2018 reviewers
Publication
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018
The note refers to papers published in 2018 as well as to a series of articles in the special issue: Special Issue on Augmented and Participatory Sound and Music Interaction Using Semantic Audio.

Full text available for download from an external service
Eksternalizacja w binauralnej ambisonicznej auralizacji źródeł kierunkowych [Externalization in binaural ambisonic auralization of directional sources]
Publication
- B. Mróz
- B. Kostek
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2018
The article presents the most important components of effectively rendering a three-dimensional sound image over headphones. To this end, it examines the degree of influence of the individual factors affecting sound externalization: head tracking, individual head-related transfer functions (HRTF, referring to the mathematical function of sound propagation...

Full text available for download on the portal
EVALUATION OF SOUND QUALITY FEATURES ON ENVIRONMENTAL NOISE EFFECTS – A CASE STUDY APPLIED TO ROAD TRAFFIC NOISE
Publication
- W. Paszkowski
- J. Kotus
- T. Poremski
- B. Kostek
- Metrology and Measurement Systems - Year 2018
The paper shows a study on the relationship between noise measures and sound quality (SQ) features that are related to annoyance caused by the traffic noise. First, a methodology to perform analyses related to the traffic noise annoyance is described including references to parameters of the assessment of road noise sources. Next, the measurement setup, location and results are presented along with the derived sound quality features....

Full text available for download on the portal
Examining Feature Vector for Phoneme Recognition
Publication
- G. Korvel
- B. Kostek
- Year 2018
The aim of this paper is to analyze the usability of descriptors coming from music information retrieval for phoneme analysis. The case study presented consists of several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...
