Publications
total: 348
Catalog Publications
Year 2011
-
Content-Based Approach to Automatic Recommendation of Music
This paper presents a content-based approach to music recommendation. For this purpose, a database containing more than 50000 music excerpts acquired from public repositories was built. The datasets contain tracks by distinct performers across several music genres. All music pieces were converted to mp3 format and then parameterized using MPEG-7, mel-cepstral and dedicated time-related parameters. All feature vectors are stored...
Year 2015
-
Creating a numerical model of noise conditions based on the analysis of traffic volume changes in cities with low and medium structure.
This research study analyzes noise conditions in a selected area of the city of Gdańsk using data on traffic volume changes during the day. The daily distribution of noise levels is much more helpful for noise control and reduction than traditional maps showing Lden levels. Calculations are made with a numerical model developed at the Gdansk Univ. of Technology and implemented...
-
Development of the sound field 3D intensity probe based on miniature microphones
The engineered measuring probe uses three pairs of coupled miniature microphones. After initial amplification, the microphone signals are fed to differential circuits. Because the circuit must be symmetric, the electronic components had to be selected very carefully. Moreover, additional digital signal processing techniques were applied to avoid amplitude and phase mismatch. The view of the engineered probe...
-
Matching the sound dynamics characteristic to the listening preferences of mobile device users / Dopasowanie charakterystyki dynamiki dźwięku do preferencji słuchowych użytkownika urządzeń mobilnych
To determine the preferred dynamics characteristic of generated sounds, information is needed on how the user perceives the loudness of sounds at different sound levels. The problem should be considered separately for two groups of users: people with normal hearing and people with hearing loss. In the first case, care should be taken that the determined dynamics characteristic properly processes sounds...
-
Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy
The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to the music genre of an audio excerpt and the type of portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...
-
Graphical Representation of Music Set Based on Mood of Music / Graficzna prezentacja zbioru muzycznego oparta na anotacji nastroju muzyki
One of the features for music recommendation that is useful and intuitive for music listeners is “mood”. The paper presents an approach to the graphical representation of the mood of music pieces. Subjective evaluation based on listening tests is performed to assign mood labels to 150 pieces of music and place them on the 2D mood plane. As a result, a map of songs is created, where music excerpts with similar mood are organized...
Year 2014
-
Creating a Reliable Music Discovery and Recommendation System
The aim of this paper is to show problems related to creating a reliable music discovery system. The SYNAT database, which contains audio files, is used for the experiments. The files are divided into 22 classes of different cardinality corresponding to music genres. Of utmost importance for a reliable music recommendation system are the assignment of audio files to their appropriate genres and optimum parameterization...
-
Eye-Gaze Tracking-Based Telepresence System for Videoconferencing
An approach to a teleimmersive videoconferencing system enhanced by a pan-tilt-zoom (PTZ) camera controlled by an eye-gaze tracking system is presented in this paper. An overview of existing telepresence systems, especially those dedicated to videoconferencing, is included. The presented approach is based on the CyberEye eye-gaze tracking system engineered at the Multimedia Systems Department (MSD) of Gdańsk University of Technology...
-
Frequently updated noise threat maps created with use of supercomputing grid
Innovative supercomputing grid services devoted to noise threat evaluation are presented. The services described in this paper concern two issues: the first is related to noise mapping, while the second focuses on assessment of the noise dose and its influence on the human hearing system. The discussed services were developed within the PL-Grid Plus Infrastructure, which brings together Polish academic supercomputer centers....
Year 2022
-
Creating a Remote Choir Performance Recording Based on an Ambisonic Approach
The aim of this paper is three-fold. First, the basics of binaural and ambisonic techniques are briefly presented. Then, details related to audio-visual recordings of a remote performance of the Academic Choir of the Gdańsk University of Technology are shown. Due to the COVID-19 pandemic, artists had a choice, namely, to stay at home and not perform or stay at home and perform. In fact, staying at home brought in the possibility...
Year 2013
-
Creating Dynamic Maps of Noise Threat Using PL-Grid Infrastructure
The paper presents functionality and operation results of a system for creating dynamic maps of acoustic noise employing the PL-Grid infrastructure extended with a distributed sensor network. The work presented provides a demonstration of the services being prepared within the PLGrid Plus project for measuring, modeling and rendering data related to noise level distribution in city agglomerations. Specific computational environments,...
-
Creating dynamic maps of noise threat using PL-Grid infrastructure; conference proceedings
This paper presents functionality and operation results of the system for creating dynamic maps of noise threat with the use of the PL-Grid infrastructure integrated with a distributed sensor network for measuring, modeling and rendering noise level distribution. The work presented provides a demonstration of the services being prepared within the PLGrid Plus project. Specific computational environments, so-called domain grids,...
-
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM)) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...
-
Gesture-controlled Sound Mixing System With a Sonified Interface
In this paper the authors present a novel approach to sound mixing. It is embodied in a system that enables sound to be mixed with hand gestures recognized in a video stream. The system has been developed in such a way that mixing operations can be performed both with and without visual support. To check the hypothesis that the mixing process needs only an auditory display, the influence of audio information visualization on sound...
Year 2002
-
Decomposition of duet instrument sounds. In: [CD-ROM] International Symposium of Musical Acoustics. ISMA Mexico City. Mexico City, 9-13 December 2002. Mexico City: Escuela Nacional de Musica UNAM, 2002, 10 pp., 4 figs., 2 tables, 15 refs. Polish title: Dekompozycja duetów muzycznych.
The paper presents an algorithm for separating recordings of musical duets. The separation method is based on the FED algorithm, which makes it possible to extract the harmonic parts of the signals. In addition, a fundamental frequency estimation algorithm based on cross-correlation was used to estimate the frequencies of the decomposed harmonics.
-
Digital waveguide models of the panpipes
The article presents the main features of waveguide synthesis. The characteristics of the panpipes are discussed. Two proposed panpipe models differing in computational complexity are examined. Implementation details of these models are shown, along with the results of sound simulations obtained with them. Real sounds are compared with those obtained through waveguide synthesis.
-
Expert media approach to hearing aids fitting
The article presents the problem of fitting hearing prostheses. An expert system is presented that makes it possible to find hearing aid characteristics adequate to the hearing impairment. The system is based on the rough set method and fuzzy logic.
Year 2017
-
Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization
An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...
-
Examining Feature Vector for Phoneme Recognition / Analiza parametrów w kontekście automatycznej klasyfikacji fonemów
The aim of this paper is to analyze the usability of descriptors coming from music information retrieval for phoneme analysis. The case study presented consists of several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...
Year 2007
-
Determining the noise impact on hearing using psychoacoustical noise dosimeter
This research study presents a noise dosimeter designed on the basis of psychoacoustical properties of the human hearing system and, at the same time, an evaluation of the time and frequency characteristics of noise. The designed noise dosimeter enables assessing the temporary threshold shift (TTS) in critical bands in real time. In this way it is possible to monitor the hearing threshold shift continuously for people who stay in the harmful noise...
Year 2006
-
Digital hearing aid with time and spectral transposition.
A consequence of the launch of large-scale hearing screening in Poland is the need to offer help to people suffering from hearing loss through treatment and hearing prosthetics. Meanwhile, currently offered hearing aid solutions cannot meet some specialized fitting needs, among others those of the youngest children, people working in noise, military pilots, and people using...
-
Dithering strategy applied to tinnitus masking.
The paper presents a theory explaining the phenomenon of tinnitus on the grounds of acoustics, electronics and telecommunications. The observation that hearing is in essence an acoustic transmission system prompts a search for an interpretation of tinnitus generation within the general theory of spontaneous noise generation in transmission systems. The formulated hypothesis points to the existence of parasitic quantization, which appears in a situation...
Year 2019
-
Discovering Rule-Based Learning Systems for the Purpose of Music Analysis
Music analysis and processing aim at understanding information retrieved from music (Music Information Retrieval). For the purpose of music data mining, machine learning (ML) methods or statistical approaches are employed. Their primary task is recognition of musical instrument sounds, music genre or emotion contained in music, identification of audio, assessment of audio content, etc. In terms of computational approach, music databases...
Year 2012
-
Distributed System For Noise Threat Evaluation Based On Psychoacoustic Measurements
An innovative system designed for the continuous monitoring of the acoustic climate of urban areas is presented in the paper. The assessment of environmental threats is performed using online data, acquired through a grid of engineered monitoring stations collecting comprehensive information about the acoustic climate of urban areas. The grid of proposed devices provides valuable data for the purpose of long and short time acoustic climate analysis....
-
Editor's Farewell
On this occasion, I would like to mention the major milestones Archives of Acoustics experienced during the last years. For some years, we concentrated our efforts on introducing Archives of Acoustics to the ISI Web of Knowledge and the Journal Citation Report databases. We achieved this aim, and since 2007 Archives of Acoustics has been referenced in the Journal Citation Report. Accordingly, our next objective was to obtain the Impact...
-
Employing a biofeedback method based on hemispheric synchronization in effective learning
In this paper an approach to building a brain computer-based hemispheric synchronization system is presented. The concept utilizes wireless EEG signal registration and acquisition as well as advanced pre-processing methods. The influence of various techniques for filtering EOG artifacts on brain state recognition is examined. The emphasis is put on brain state recognition using band pass filtration for separation of individual...
-
Hand gesture recognition supported by fuzzy rules and Kalman filters
The paper presents a system based on a camera and a multimedia projector that enables a user to control computer applications by dynamic hand gestures. A gesture recognition methodology based on representing the hand movement trajectory by motion vectors analysed using fuzzy rule-based inference is given first. Kalman filters are employed for effective hand position tracking. The engineered system is developed using J2SE and C++/OpenCV technology....
Year 2018
-
Editor's note and 2018 reviewers
The subject of this work is a reference to papers published in 2018, as well as to a series of articles within the special issue: Special Issue on Augmented and Participatory Sound and Music Interaction Using Semantic Audio.
-
Externalization in binaural ambisonic auralization of directional sources / Eksternalizacja w binauralnej ambisonicznej auralizacji źródeł kierunkowych
The article presents the most important components of the process of effectively rendering a three-dimensional sound image over headphones. To this end, the degree of influence of the individual factors affecting sound externalization is examined: head tracking, individual head-related transfer functions (HRTF, Head Related Transfer Function, referring to the mathematical function of sound propagation...
-
Evaluation of Sound Quality Features on Environmental Noise Effects – A Case Study Applied to Road Traffic Noise
The paper shows a study on the relationship between noise measures and sound quality (SQ) features that are related to annoyance caused by the traffic noise. First, a methodology to perform analyses related to the traffic noise annoyance is described including references to parameters of the assessment of road noise sources. Next, the measurement setup, location and results are presented along with the derived sound quality features....
-
Examining Feature Vector for Phoneme Recognition
The aim of this paper is to analyze the usability of descriptors coming from music information retrieval for phoneme analysis. The case study presented consists of several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...
Year 2020
-
Employing Subjective Tests and Deep Learning for Discovering the Relationship between Personality Types and Preferred Music Genres
The purpose of this research is two-fold: (a) to explore the relationship between the listeners’ personality trait, i.e., extraverts and introverts and their preferred music genres, and (b) to predict the personality trait of potential listeners on the basis of a musical excerpt by employing several classification algorithms. We assume that this may help match songs according to the listener’s personality in social music networks....
-
Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...
Year 2009
-
Enhancement of computer character animation utilizing fuzzy rules
The chapter presents a new method of processing computer character animation. It uses fuzzy inference based on rules and membership functions obtained by analyzing the results of subjective animation quality tests. During processing, new motion phases are automatically added to the animation, which improves visual quality and changes the fluidity and stylization of motion in an intended way. In the paper...
-
Gesture recognition framework for multimedia content viewer controlling
In the paper a system for controlling a multimedia content viewer by hand gestures is presented. First, selected methods used for gesture recognition are described. Two different application cases of the system, i.e. for multimedia presentation purposes and for multimedia content viewing are outlined. Moreover, a proposal of improvement of the system combining these approaches is also given. The system work cycle is reviewed. The...
Year 2005
-
Estimation of musical sound separation algorithm effectiveness employing neural networks.
Blind separation of the sounds of musical signals contained in mixed material is a difficult task. This is because sounds that are in harmonic relations may contain colliding sinusoidal components (harmonic components). Evaluating the separation results is also problematic, since energy error analysis often does not reflect the subjective quality of the separated signals. In this publication...
-
Estimation the rhythmic salience of sound with association rules and neural networks
The paper presents experiments aimed at automatically retrieving rhythmic values in a musical phrase. Data mining methods and artificial neural networks were used for this purpose.
Year 2008
-
Evaluation of excessive noise effects on hearing employing psychoacoustic dosimetry
Research results regarding the noise impact on hearing applying the concept of the Psychoacoustic Noise Dosimetry (PND) are presented. The general characteristics of the PND algorithm are discussed. Additionally, the results of hearing examinations conducted under laboratory conditions are shown. The main objective of the research was to determine the time needed for the Temporary Threshold Shift to reverse. The results were used...
-
Hearing aid fitting method based on fuzzy logic processing
An important stage in fitting modern hearing aids is determining the hearing dynamics characteristic. This characteristic is determined from the results of a loudness scaling test. Unfortunately, these results are expressed on a loudness category scale, whereas hearing aids require numerical parameters. This problem can be solved using fuzzy logic. This paper presents a fuzzy processing method...
Year 2010
-
Evaluation of the separation algorithm performance employing ANNs
The aim of this chapter is to present a methodology for separating musical sounds without a priori information about the sounds contained in the musical mix. The work shows that a properly trained artificial neural network (ANN) is able to automatically and correctly classify the sounds contained in a mixed signal. The classification effectiveness of the ANN is comparable to the subjective assessment of experts.
-
Exploiting audio-visual correlation by means of gaze tracking
This paper presents a novel means for increasing audio-visual correlation analysis reliability. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the past history and current research in the area of audio-visual perception analysis are briefly reviewed. Then the methodology employing gaze tracking is presented along with the...
-
Fuzzy rule-based dynamic gesture recognition employing camera & multimedia projector
In the paper a system based on a camera and a multimedia projector that enables a user to control computer applications by dynamic hand gestures is presented. The main objective is to present the gesture recognition methodology, which is based on representing the hand movement trajectory by motion vectors analyzed using fuzzy rule-based inference. The approach was engineered in the system developed with J2SE and C++ / OpenCV technology. OpenCV...
-
Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology
This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction, borrows the relevance feedback through gaze-tracking and applies it to a new area of interest, which is Quality of Experience. Results of subjective...
-
Gesture-based computer control system
In the paper a system for controlling computer applications by hand gestures is presented. First, selected methods used for gesture recognition are described. The system hardware and a way of controlling a computer by gestures are described. The architecture of the software along with hand gesture recognition methods and algorithms used are presented. Examples of basic and complex gestures recognized by the system are given.
-
Gesture-based computer control system applied to the interactive whiteboard
In the paper the gesture-based computer control system coupled with the dedicated touchless interactive whiteboard is presented. The engineered system enables a user to control any top-most computer application by using gestures of one or both hands. First, a review of gesture recognition applications with a focus on methods and algorithms applied is given. Hardware and software solution of the system consisting of a PC, camera, multimedia...
Year 2003
-
Extraction of music information based on artificial neural networks
The article presents the assumptions of a system for automatic music recognition. Based on the experiments carried out, the article presents the effectiveness of the implemented algorithms depending on how the musical data are described. The implemented system is based on artificial neural networks.
Year 2004
-
Forming and Ranking Musical Rhythm Hypotheses.
The paper presents basic concepts and definitions related to retrieving rhythmic information in musical pieces. In musicology, it is assumed that sound attributes such as duration, frequency and amplitude determine the rhythmic salience of a sound. The article examines these physical properties of sound in the context of determining rhythmic salience, that is, a measure describing the tendency of a sound to be located at the beginning...
Year 2016
-
Guitar String Sound Retrieved from Moving Pixels
The aim of this study was to develop a method of visual recording and analyzing the vibrations of guitar strings using high-speed cameras and dedicated video processing algorithms. The recording of a plucked string reveals the way in which the deformations propagate, composing the standing and travelling wave. The paper compares the results for a few selected models of classical and acoustic guitars, and it involves processing...