Publications
Total: 348
Publication Catalog
Year 2016
-
3D Acoustic Field Intensity Probe Design and Measurements
The aim of this paper is twofold. First, some basic notions of acoustic field intensity and its measurement are briefly recalled. Then, the equipment and the measurement procedure used for sound intensity measurements in the research study are described. The second goal is to present details of the design of the engineered 3D intensity probe, as well as the algorithms developed and applied for that purpose. Results of the intensity...
-
A concept of Signal Equalization Method Based on Music Genre and the Listener's Room Characteristics
A research study that investigates the influence of the room acoustics environment on the frequency characteristic of audio signal playback is presented. First, a novel spectral equalization method for room acoustic conditions is introduced. On the basis of the frequency response of the room, a system for room acoustics compensation based on an eight-band equalizer is proposed. The system settings depend on music genre. In...
-
A Study in Experimental Methods of Human-Computer Communication for Patients After Severe Brain Injuries
Experimental research in the domain of multimedia technology applied to medical practice is discussed, employing a prototype of an integrated multimodal system to assist diagnosis and polysensory stimulation of patients after severe brain injury. The system being developed includes, among others, an eye gaze tracker and EEG monitoring of non-communicating patients after severe brain injuries. The proposed solutions are used for collecting...
-
A study on signal processing methods applied to hearing aids
This paper presents a short survey on current technology available in hearing aids with a focus on digital signal processing techniques used. First, factors influencing the hearing aid effectiveness are introduced. Then, examples of the present DSP methods and strategies are provided. Also, a description of current limitations of hearing aids and future trends of development are shown. Finally, the notion of computational auditory...
-
A system for acoustic field measurement employing cartesian robot
A system setup for acoustic field measurements, together with the results of 3D visualisations of acoustic energy flow, is presented in the paper. Spatial sampling of the field is performed by a Cartesian robot. Automation of the measurement process is achieved with the use of a specialized control system. The method is based on measuring the sound pressure (scalar) and particle velocity (vector) quantities. The aim of the...
-
Adaptive Personal Tuning of Sound in Mobile Computers
An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...
-
Analiza stanu nawierzchni i klas pojazdów na podstawie parametrów ekstrahowanych z sygnału fonicznego [Analysis of road surface condition and vehicle classes based on parameters extracted from the audio signal]
The aim of the research is to search for feature vector parameters extracted from the audio signal in the context of automatic recognition of road surface condition and vehicle type. First, the influence of weather conditions on the spectral characteristics of the audio signal recorded near passing vehicles is presented. Next, the audio signal was parameterized and a correlation analysis was carried out in order to...
-
Analiza sygnałów fonicznych w nagraniach pojazdów w zmiennych warunkach pogodowych [Analysis of audio signals in vehicle recordings under changing weather conditions]
Acoustic vehicle detection is the least invasive way of monitoring vehicle traffic volume in cities. It is also more robust to lighting and weather conditions. This paper presents the results of parameterization of audio signals of passing vehicles in the context of changing atmospheric conditions. As part of the research, audio-video recordings of vehicles were made in two...
-
Analysis of soundscape recordings in close proximity to the road in changeable weather conditions
Acoustic vehicle sensing is the least invasive type of traffic detection. Also, acoustic-based vehicle detection technology is insensitive to precipitation and can operate at low light levels. Therefore, this kind of method may be used for automatic detection of vehicle passage events. It can also be employed for measuring vehicle speed and assigning a vehicle to a particular category. In this paper the results...
Year 2011
-
3D Hand Shape Modeling for Automatic Assessing Motor Performance in Parkinson's Disease
In this paper a method for hand pattern processing to create a 3D hand model is presented. By applying a complete hand armature to the model obtained, an interpolation of three motor tests for an individual Parkinson's disease patient can be performed. To obtain the 3D hand model, the top view of the hand from a web cam is analyzed. The hand contour is examined to find characteristic points that allow for dividing the hand image into...
-
A new approach for an automatic assessment of a neurological condition employing hand gesture classification
Year 2015
-
3D Sound Intensity Measurement Around Organ Pipes Using Acoustic Vector Sensors
The aim of the presented paper was to obtain and visualize the sound intensity distribution of radiated acoustic energy around organ pipes. The experimental setup consisted of a multichannel acoustic vector sensor and a specialized Cartesian robot. Measurements were performed in free field with a spatial resolution of 0.1 m. Two organ pipes, one wooden and one metal, were measured during the experiment. The organ pipes were activated...
-
Analiza drgań struny gitarowej z użyciem szybkich kamer [Analysis of guitar string vibrations using high-speed cameras]
The paper presents a method for analyzing and visualizing the motion of a guitar string. The string vibrations were recorded using high-speed cameras. The optical setup used for the recording was chosen so that vibrations could be observed along the string. The images recorded by the high-speed cameras were analyzed using digital signal processing algorithms so as to track with high accuracy the displacements and...
-
Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture
The aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed,...
Year 2018
-
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...
-
A Device for Measuring Auditory Brainstem Responses to Audio
Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...
-
A Stand for Measurement and Prediction of Scattering Properties of Diffusers
In this paper we present a set of solutions which may be used for prototyping and simulation of acoustic scattering devices. The proposed system is capable of measuring the sound field. A way to use an open source solution for simulating scattering phenomena occurring in the proximity of acoustic diffusers is also shown. The results of our work are a measurement procedure and a prototype of the simulation script based on FEniCS - an open source...
-
A Study on Audio Signal Processed by "Instant Mastering"
An increasing amount of music produced in home- and project-studios results in development and growth of "automatic mastering services". The presented investigation explores changes introduced to audio signal by various online mastering platforms. A music set consisting of 10 songs produced in small facilities was processed by eight on-line automatic mastering services. Additionally, some laboratory-constructed signals were tested....
-
A study on of music features derived from audio recordings examples – a quantitative analysis
The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician whose style evolved over time. Firstly, the origin and the background of the division of music genres were briefly presented. Then, several objective parameters of an audio signal were recalled that...
-
Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition
convolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...
-
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
The aim of the work is to analyze the Lombard speech effect in recordings and then modify the speech signal in order to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of Lombard speech, and in particular on the effect of increasing the fundamental frequency...
-
Aparat słuchowy a alternatywne urządzenia poprawiające słyszenie [Hearing aids versus alternative hearing-improvement devices]
The study reviews the available literature on various types of hearing-improvement devices which, in particular cases, may be treated as alternatives to conventional hearing aids. The work includes a discussion of a new type of pre-programmed hearing aid that can be distributed by mail or directly to potential users. In addition...
Year 2019
-
A Concept of Automatic Film Color Grading Based on Music Recognition and Evoked Emotions
The article presents the aspects of the final selection of the color of shots in film production based on the psychology of color. First of all, the elements of color processing, contrast, saturation or white balance in the film shots were presented and the definition of color grading was given. In the second part of the article the analysis of film music was conducted in the context of stimulating appropriate emotions while watching...
-
An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
Speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create mathematical models that allow for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...
-
ANALIZA KOLORÓW SCEN FILMOWYCH W KONTEKŚCIE COLOR GRADINGU [Analysis of film scene colors in the context of color grading]
The article presents issues related to coloring a film scene. The main aspects of color processing of the film image are discussed, and definitions of terms related to scene coloring, i.e. color correction and color grading, are reviewed. Theories of color psychology and their practical use in film are described and related to basic film genres and models of emotion. Next, the paper discusses the assumptions...
-
ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU [Analysis of speech signal parameters in the context of their usefulness in automatic assessment of singing expression quality]
The work concerns an approach to parameterization for emotion classification in singing and a comparison with emotion classification in speech. For this purpose, the RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) database of emotionally charged speech and song was used, containing recordings of professional actors presenting six different emotions. Next, mel-cepstral coefficients (MFCC) were calculated, along with selected descriptors...
Year 2013
-
A new method for measuring the psychoacoustical properties of tinnitus
information, select the tinnitus treatment and quantitatively substantiate its effects, the measurement of the tinnitus psychoacoustic parameters should be made an inherent part of the tinnitus therapy. Methods: For this purpose a multimedia-based sound synthesizer has been proposed for testing tinnitus, and the results obtained this way are compared with the outcome of the audiometer-based Wilcoxon test. The method has been verified...
-
A Study on Influence of Normalization Methods on Music Genre Classification Results Employing kNN Algorithms
This paper presents a comparison of different normalization methods applied to the set of feature vectors of music pieces. Test results show the influence of min-max and Zero-Mean normalization methods, employing different distance functions (Euclidean, Manhattan, Chebyshev, Minkowski) as a pre-processing for genre classification, on k-Nearest Neighbor (kNN) algorithm classification results.
-
Acoustics - new services for urban planning, research and education
The main purpose of the presented design is twofold, namely: providing detailed information about the noise threats that occur every day in city areas and preventing the noise induced hearing loss especially among young people. An experimental system designed for the continuous monitoring of the acoustic climate of urban areas was developed and implemented within the PLGrid Plus project. The assessment of environmental threats...
-
APPLICATION OF THE HIGH FREQUENCY LINEARIZATION OF THE EAR IN PATIENTS WITH TINNITUS . Metoda linearyzacji narządu słuchu u osób cierpiących z szumami usznymi
This paper summarises the problem of tinnitus, hypotheses on its causes and the treatment methods. Moreover, a hypothesis on tinnitus origins is explained, based on the mechanisms of the analog-to-digital conversion and quantization. In addition, this paper describes methods of determining the acoustic intensity and spectra of low-level ultrasonic signals, as well as impedance characteristics of an ultrasound transducer. Furthermore,...
Year 2009
-
A new methodological approach to the noise threat evaluation based on the selected physiological properties of the human hearing system
A new way of assessing noise-induced harmful effects on the human hearing system is presented in the paper. The method takes into consideration selected physiological properties of the human hearing system. On the basis of hearing examinations, noise measurement results, and psychoacoustical noise dosimeter performance, new indicators of noise harmfulness were proposed. The evaluation of the proposed indicators was...
-
A new method of audio-visual correlation analysis
This paper presents a new methodology of conducting audio-visual correlation analysis employing a gaze tracking system. The interaction and mutual reinforcement of two perceptual modalities, seeing and hearing, in a complex relationship has been the subject of many research studies. An earlier stage of the experiments carried out at the Multimedia Systems Department (MSD) showed that there exists a relationship between...
Year 2022
-
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
Objective assessment of speech intelligibility is a complex task that requires taking into account a number of factors, such as the different perception of each speech sub-band by the human sense of hearing or the different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...
Year 2020
-
A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset of more than 10,000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...
-
Analiza ruchu drogowego z wykorzystaniem analizy akustycznej [Road traffic analysis using acoustic analysis]
The work addresses issues of obtaining road traffic information by means of acoustic monitoring. Basic traffic surveillance techniques are outlined. The assumptions of an acoustic traffic detector are presented and its effectiveness is examined in three areas of operation: vehicle counting, vehicle type classification, and classification of weather conditions on the road surface.
-
Analyzing the Effectiveness of the Brain–Computer Interface for Task Discerning Based on Machine Learning
The aim of the study is to compare electroencephalographic (EEG) signal feature extraction methods in the context of the effectiveness of the classification of brain activities. For classification, electroencephalographic signals were obtained using an EEG device from 17 subjects in three mental states (relaxation, excitation, and solving a logical task). Blind source separation employing independent component analysis (ICA) was...
Year 2007
-
A system for singing training
The proposed system is aimed at vocal students and persons who want to improve their voice emission. The goal is not to substitute for a singing teacher but to provide a tool for automatic teaching of voice emission basics. In this way singers can develop their vocal skills and improve them. Through visual feedback a student can control and modify vocal tract maxima (resonances) of a chosen vowel to match the resonances of the...
-
Applying computational intelligence to acoustics.
The article presents an overview of selected issues related to the application of computational intelligence methods in acoustics. The methods presented include, among others, artificial neural networks, rough sets, fuzzy logic, and genetic algorithms. The acoustic problems, in turn, concern the classification of musical sounds, intelligent music processing, intelligent control of the organ action, and objectivization of the method of assessing...
Year 2006
-
Accidental wow defect evaluation using sinusoidal analysis enhanced by artificial neural networks
The article presents a method for determining the characteristics of parasitic frequency modulations (wow) present in archival sound recordings. The presented approach tracks changes in the sinusoidal components of the sound, which reflect the course of the wow. Sinusoidal analysis is used to extract tonal components from distorted sound recordings. Additionally, in order to increase...
-
Applications of computational intelligence techniques to acoustics
The aim of the article is to review selected applications of intelligent methods in acoustics, and in particular in broadly understood sound engineering. The presented studies and experiments were carried out using artificial neural networks, the rough set method, fuzzy logic, Pawlak's flow graphs, and genetic algorithms. The problems addressed concerned the classification of musical sounds, recognition of musical phrases, processing...
Year 2014
-
An Approach to Bass Enhancement in Portable Computers Employing Smart Virtual Bass Synthesis Algorithms
The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The developed algorithms are related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt and to the type of a portable device in use. To find optimum synthesis parameters of the VBS algorithms, subjective listening tests based on a parametric procedure...
-
Application of PL-Grid Platform for Modeling of the Selected Acoustic Phenomena
Domain grids are specific computational environments, developed within the PLGrid Plus project. For the Acoustic domain grid two supercomputer grid based services were prepared. Dedicated software consists of the outdoor sound propagation module and psychoacoustical noise dosimeter. The results are presented in a form of maps of sound level and Temporary Threshold Shift (TTS) values, therefore the services may play an informative...
Year 2017
-
An audio-visual corpus for multimodal automatic speech recognition
review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...
-
Analysis of allophones based on audio signal recordings and parameterization
The aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping...
-
Assessment of hearing in coma patients employing auditory brainstem response, electroencephalography, and eye-gaze-tracking
The results of the study conducted by Tagliaferri et al. in 12 European countries indicate that the ratio of registered brain injury cases in Europe amounts to 150-300 per 100 000 people, with the European mean value of 235 cases per 100 000 people. The project presented in the paper assumes the development of a combined metric of the state of patients remaining in coma by intelligent fusion of GCS (subjective Glasgow Coma Scale or its derivatives)...
Year 2005
-
Analysis and generation of emotionally-charged animated gesticulation
Computer animations depicting emotionally charged gesticulation were prepared. The keyframe animation method was used. A set of parameters describing motion was proposed and tested for its usefulness in classifying emotional content in animation. Rough set analysis methods were used. The possibility of using the results to generate animations with desired emotional features was presented....
Year 2012
-
Analysis of impact of lossy audio compression on the robustness of watermark embedded in the DWT domain for non-blind copyright protection
A methodology of non-blind watermarking of audio content is proposed. The outline of the audio copyright problem and the motivation for practical applications are discussed. The algorithmic theory pertaining to watermarking techniques is briefly introduced. The system architecture together with the employed workflows for embedding and extracting the watermarks are described. The implemented approach is described and the obtained results are reported....
Year 2010
-
Application of gaze tracking technology to quality of experience domain
A new methodological approach to studying subjective assessment results employing gaze tracking technology is shown. Notions of Human-Computer Interaction (HCI) and Quality of Experience (QoE) are briefly introduced in the context of their common application. Then, the gaze tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT) is presented. A series of audio-visual subjective...
Year 2004
-
Application of Soft Computing to Automatic Music Information.
The article presents problems related to the automatic classification of musical instruments. It gives an overview of methods that can serve this purpose, along with examples of experiments.
Year 2023
-
Applying the Lombard Effect to Speech-in-Noise Communication
This study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...