Publications
total: 894
Catalog Publications
Year 2014
-
Detection of vehicles stopping in restricted zones in video from surveillance cameras
An algorithm for detection of vehicles that stop in restricted areas, e.g. excluded by traffic rules, is proposed. Classic approaches based on object tracking are inefficient in high traffic scenes because of tracking errors caused by frequent object merging and splitting. The proposed algorithm uses the background subtraction results for detection of moving objects, then pixels belonging to moving objects are tested for stability....
-
Employing flowgraphs for forward route reconstruction in video surveillance system
Pawlak’s flowgraphs were utilized as a base idea and knowledge container for prediction and decision-making algorithms applied to an experimental video surveillance system. The system is used for tracking people inside buildings in order to obtain information about their appearance and movement. The fields of view of the cameras did not overlap. Therefore, when an object was moving through unsupervised areas, prediction was needed...
-
Evaluation of sound event detection, classification and localization in the presence of background noise for acoustic surveillance of hazardous situations
An evaluation of the sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for separating foreground events from the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the classifier...
-
Examining Acoustic Emission of Engineered Ultrasound Loudspeakers
Measurement results of the sound emitted from a custom-made ultrasound system with high spatial directivity are presented. The proposed system uses modulated ultrasound waves which demodulate in a nonlinear medium, resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides the realistic reproduction of...
-
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
-
Examining Quality of Hand Segmentation Based on Gaussian Mixture Models
Results of an examination of various implementations of Gaussian mixture models are presented in the paper. Two of the implementations belonged to Intel’s OpenCV 2.4.3 library and utilized the BackgroundSubtractorMOG and BackgroundSubtractorMOG2 classes. The third implementation presented in the paper was created by the authors and extended BackgroundSubtractorMOG2 with the possibility of operating on the scaled version of...
-
Eye-Gaze Tracking-Based Telepresence System for Videoconferencing
An approach to a teleimmersive videoconferencing system enhanced by a pan-tilt-zoom (PTZ) camera, controlled by an eye-gaze tracking system, is presented in this paper. An overview of the existing telepresence systems, especially those dedicated to videoconferencing, is included. The presented approach is based on the CyberEye eye-gaze tracking system engineered at the Multimedia Systems Department (MSD) of Gdańsk University of Technology...
-
Fitting the mobile device characteristics to the user's hearing preferences
A method for fitting the mobile computer audio characteristics to the user's hearing preferences is proposed. The process consists of two stages: calibration and dynamics processing. During the calibration phase the user performs a loudness scaling test, giving their response regarding the perceived loudness. The dynamics processing, performed on that basis, sets the loudness to the most comfortable level. The processing accounts both...
-
Frequently updated noise threat maps created with use of supercomputing grid
Innovative supercomputing grid services devoted to noise threat evaluation are presented. The services described in this paper concern two issues: the first is related to noise mapping, while the second focuses on assessment of the noise dose and its influence on the human hearing system. The discussed services were developed within the PL-Grid Plus Infrastructure, which brings together Polish academic supercomputer centers....
-
Further Developments of the Online Sound Restoration System for Digital Library Applications
New signal processing algorithms were introduced to the online service for audio restoration available at the web address: www.youarchive.net. Missing or distorted audio samples are estimated using a specific implementation of the Janssen interpolation method. The algorithm is based on the autoregressive model (AR) combined with the iterative complementation of signal samples. Since the interpolation algorithm is computationally...
-
Inteligentna Synteza Niskich Częstotliwości w urządzeniach mobilnych [Smart Low-Frequency Synthesis in Mobile Devices]
The paper presents an algorithm for smart adaptation of low-frequency synthesis parameters in portable devices depending on the genre of the music being played (Smart VBS). The proposed algorithm uses harmonic generation methods based on a nonlinear function generator (NLD) and a phase vocoder (PV). To find the optimal synthesis parameters, subjective tests were carried out to examine the relationship between the parameters...
-
Modelling Object Behaviour in a Video Surveillance System Using Pawlak's Flowgraph
In this paper, a methodology of acquisition and processing of video streams for the purpose of modelling object behaviour is presented. Multilevel contextual video processing is also mentioned. Pawlak’s flowgraph is used as a container for the knowledge related to the behaviour of objects in the area supervised by a video surveillance system. Spatio-temporal dependencies in transitions between cameras can be easily changed in...
-
MODELOWANIE PROPAGACJI HAŁASU I JEGO WPŁYWU NA SŁUCH Z WYKORZYSTANIEM PLATFORMY OBLICZENIOWEJ PL GRID PLUS [Modelling noise propagation and its impact on hearing using the PL Grid Plus computing platform]
The paper presents the services available in the Acoustics domain grid, developed within the PL Grid Plus project. The services make it possible to model the propagation of noise in an urban agglomeration originating from line sources (roads) and from point or area sources (industrial noise, outdoor events) using computing clusters. Based on the resulting noise level distribution it is possible...
-
Multimodal human-computer interfaces based on advanced video and audio analysis
The history of multimodal interface development is briefly reviewed in the introduction. Some applications of multimodal interfaces to education software for disabled people are presented. One of them, LipMouse, is a novel vision-based human-computer interface that tracks the user’s lip movements and detects lip gestures. A new approach to diagnosing Parkinson’s disease is also shown. The progression of the disease can be measured employing...
-
Music Data Processing and Mining in Large Databases for Active Media
The aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large database that included approximately 30000 audio files divided into 11 classes corresponding to music genres with different cardinalities. Every audio file was described by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Music Recommendation System
The paper focuses on optimizing the content of the feature vector for the music recommendation system. For the purpose of experiments a database is created consisting of excerpts of music files. They are assigned to 22 classes corresponding to different music genres. Various feature vectors based on low-level signal descriptors are tested and then optimized using correlation analysis and Principal Component Analysis (PCA). Results of the experiments...
-
New methods for assessment and stimulation of non-communicative patients employing advanced multimodal HCI. Nowe metody oceny i stymulacji pacjentów niekomunikatywnych z wykorzystaniem zaawansowanych interfejsów multimodalnych człowiek-komputer
In most cases of patients with locomotor system damage it is possible to find a solution to the medical problems originating from the injury. However, it is much more difficult to prevent cognitive and emotional impairments. Therefore, we believe that the technological support of therapists working with such patients on an everyday basis may be essential. We have acquired experience in designing and providing diagnostic and therapeutic...
-
OBRAZOWANIE ROZKŁADU NATĘŻENIA DŹWIĘKU W OTOCZENIU URZĄDZENIA MOBILNEGO Z WYKORZYSTANIEM WEKTOROWYCH CZUJNIKÓW AKUSTYCZNYCH [Imaging the sound intensity distribution around a mobile device using acoustic vector sensors]
The paper presents the results of measurements of the sound intensity distribution around a mobile device. The acoustic energy distribution was obtained in the free field using the intensity method in the spectral domain. For this purpose, an integrated intensity probe consisting of air particle velocity sensors and an acoustic pressure sensor was used. A single air particle velocity sensor is sensitive in one plane....
-
Parallel Background Subtraction in Video Streams Using OpenCL on GPU Platforms
An implementation of the background subtraction algorithm using the OpenCL platform is presented. The algorithm processes a live stream of video frames from a surveillance camera in on-line mode. Processing is performed using a host machine and a parallel computing device. The work focuses on optimizing an OpenCL algorithm implementation for GPU devices by taking into account specific features of the GPU architecture, such as memory access,...
-
Pomiary i analiza dźwięku w filmie oraz w reklamach filmowych z wykorzystaniem modelu głośności LKFS [Measurement and analysis of sound in films and cinema advertisements using the LKFS loudness model]
The aim of this work was to measure the sound in films, film trailers and the advertisements preceding a film screening. The paper first recalls the problems associated with sound measurement, the recommendations that specify permissible levels for film screenings, and the units used to express screening loudness. Next, the measurements, the calibration of the measuring equipment and the analysis...
-
Prace badawcze i wdrożeniowe Zespołu Katedry Systemów Multimedialnych oraz Laboratorium Akustyki Fonicznej, Wydział Elektroniki, Telekomunikacji i Informatyki, Politechniki Gdańskiej [Research and implementation work of the team of the Multimedia Systems Department and the Audio Acoustics Laboratory, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology]
The extensive body of scientific and research-and-implementation work in acoustics of the Multimedia Systems Department, headed by Prof. Andrzej Czyżewski, and of the Audio Acoustics Laboratory, associated with Prof. Bożena Kostek, includes a strand of work devoted to acoustic monitoring of the environment. In 2009-2012 the staff of these units carried out a research project (grant...
-
Psychoakustyka realizowana na Politechnice Gdańskiej [Psychoacoustics at Gdańsk University of Technology]
In the research, implementation and teaching activities of the teams working in the Multimedia Systems Department and the Audio Acoustics Laboratory (WETI, PG), several thematic strands can be distinguished that concern psychoacoustics and its applications in audio acoustics and in sound and image engineering (lying at the intersection of acoustics, telecommunications, cognitive science, computer science and biomedical engineering). These works resulted in...
-
Selection of Visual Descriptors for the Purpose of Multi-camera Object Re-identification
A comparative analysis of various visual descriptors is presented in this chapter. The descriptors utilize many aspects of image data: colour, texture, gradient, and statistical moments. The descriptor list is supplemented with local features calculated in close vicinity of key points found automatically in the image. The goal of the analysis is to find descriptors that are best suited for a particular task, i.e. re-identification...
-
Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification
The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm employs automatic music genre recognition to determine the optimum parameters for the synthesis of additional frequencies. The synthesis is carried out using the non-linear device (NLD) and phase vocoder (PV) methods depending on the music excerpt genre. Classification of musical...
-
Sound Field Intensity Measurements and Visualization around the Human Head Model. Rozkład natężenia pola akustycznego w komorze bezechowej w obecności sztucznej głowy i w przypadku braku jej obecności
The main goal of this research study was to measure and visualize the sound field intensity distribution with and without the presence of a human head model. Measurements were performed in an anechoic chamber on a 5 cm grid. The experimental setup consisted of a multitone generator, two loudspeakers, a human head model, an intensimetric probe, a Cartesian robot applied for precise positioning of the acoustic sensor, and an analyzer. Based...
-
Square Root Raised Cosine Fractionally Delaying Nyquist Filter - Design and Performance Evaluation
In this paper we propose a discrete-time FIR (Finite Impulse Response) filter which is applied as a square root Nyquist filter and a fractional delay filter simultaneously. The filter can replace a cascade of a square root Nyquist filter and a fractional delay filter with a single device/algorithm. The aim is to compensate for transmission delay in a digital communication system. Performance of the filter as a matched filter is...
-
SUBJECTIVE PERCEPTION OF MUSIC GENRES IN THE FIELD OF MUSIC INFORMATION RETRIEVAL SYSTEMS
The aim of this paper is to evaluate the relationship between perception of music genres and subjective features of music that can be assigned to them. For this purpose a group of subjective features such as loudness, melody, rhythm, volume and instrumentation was chosen to describe music genres. A group of 30 listeners with normal hearing, aged 20 to 40, was created. Each subject participating in listening tests was asked...
-
Supercomputing Grid-Based Services for Hearing Protection and Acoustical Urban Planning, Research and Education
Specific computational environments, so-called domain grids, are developed within the PLGrid Plus project in order to prepare specialized IT solutions, i.e., dedicated software implementations and hardware (infrastructure adaptation), suited for particular research group demands. One of the PLGrid Plus domain grids, presented in this paper, is Acoustics. The article describes in detail two kinds of the acoustic domain services....
-
Technika cyfrowego przetwarzania sygnałów [Digital signal processing techniques]
The textbook is intended for students of Electronics and Telecommunications, Biomedical Engineering, and Automation and Robotics. It covers topics in digital signal processing taught in courses such as Signal Processing, Digital Filters and Applications of Signal Processors. It is meant as an aid for blackboard exercise classes, laboratory classes and the project in applications of...
-
Technika sygnałów analogowych. - Tom 1,2 [Analogue signal techniques, Volumes 1 and 2]
Volume I consists of six chapters. Chapter 1 characterizes analogue signals, elements, circuits and systems. Knowing the properties of the elements is crucial for predicting the properties of the electronic circuits built from them. Similarly, knowledge of the basic laws governing the distribution of currents and voltages is essential for understanding the methods of analysing electronic circuits. Chapter 2 is devoted to linear...
-
Towards Cognitive and Perceptive Video Systems
In this chapter we cover research and development issues related to smart cameras. We discuss challenges, new technologies and algorithms, applications and the evaluation of today’s technologies. We cover problems related to software, hardware, communication, embedded and distributed systems, multi-modal sensors, privacy and security. We also discuss future trends and market expectations from the customer’s point of view.
-
Visual Lip Contour Detection for the Purpose of Speech Recognition
A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for statistical description of lip shape and texture. A search initialization procedure is proposed and error measure values are...
Year 2013
-
A new method for measuring the psychoacoustical properties of tinnitus
...information, select the tinnitus treatment and quantitatively substantiate its effects, the measurement of the tinnitus psychoacoustic parameters should be made an inherent part of the tinnitus therapy. Methods: For this purpose a multimedia-based sound synthesizer has been proposed for testing tinnitus, and the results obtained this way are compared with the outcome of the audiometer-based Wilcoxon test. The method has been verified...
-
A Nyquist filter of fractional delay
In the paper a novel special discrete-time FIR fractional delay filter is investigated. This is a Nyquist filter which, besides its traditional attribute (the intersymbol interference (ISI) free property), has the ability to compensate for the subsample transmission delay involved, for example, in a multipath propagation channel. The performance of the filter is analysed and illustrated.
-
A Study on Influence of Normalization Methods on Music Genre Classification Results Employing kNN Algorithms
This paper presents a comparison of different normalization methods applied to the set of feature vectors of music pieces. Test results show the influence of min-max and Zero-Mean normalization methods, employing different distance functions (Euclidean, Manhattan, Chebyshev, Minkowski) as a pre-processing for genre classification, on k-Nearest Neighbor (kNN) algorithm classification results.
-
Acoustics - new services for urban planning, research and education
The main purpose of the presented design is twofold, namely: providing detailed information about the noise threats that occur every day in city areas and preventing noise-induced hearing loss, especially among young people. An experimental system designed for the continuous monitoring of the acoustic climate of urban areas was developed and implemented within the PLGrid Plus project. The assessment of environmental threats...
-
Adaptive Method of Adjusting Flowgraph for Route Reconstruction in Video Surveillance Systems
Pawlak’s flowgraph has been applied as a suitable data structure for description and analysis of human behaviour in the area supervised by a multicamera video surveillance system. Information contained in the flowgraph can be easily used to predict consecutive movements of a particular object. Moreover, utilization of the flowgraph can support reconstructing object route from the past video images. However, such a flowgraph with...
-
An Approach to the Detection of Bank Robbery Acts Employing Thermal Image Analysis
A novel approach to the detection of selected security-related events in bank monitoring systems is presented. Thermal camera images are used for the detection of people in difficult lighting conditions. Next, the algorithm analyses movement of objects detected in thermal or standard monitoring cameras using a method evolved from the motion history images algorithm. At the same time, thermal images are analyzed in order to detect...
-
APPLICATION OF THE HIGH FREQUENCY LINEARIZATION OF THE EAR IN PATIENTS WITH TINNITUS. Metoda linearyzacji narządu słuchu u osób cierpiących z szumami usznymi
This paper summarises the problem of tinnitus, hypotheses on its causes and the treatment methods. Moreover, a hypothesis on tinnitus origins is explained, based on the mechanisms of the analog-to-digital conversion and quantization. In addition, this paper describes methods of determining the acoustic intensity and spectra of low-level ultrasonic signals, as well as impedance characteristics of an ultrasound transducer. Furthermore,...
-
Audio-visual surveillance system for application in bank operating room
An audio-visual surveillance system able to detect, classify and localize acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots, are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic...
-
AUDITORY DISPLAY FROM THE MUSIC TECHNOLOGY PERSPECTIVE. Obecność wirtualnego środowiska dźwiękowego w technologiach muzycznych
This paper presents some applications of Auditory Displays (AD) in the domain of music technology. First, the scope of music technology and auditory display areas are briefly outlined. Then, the research trends and system solutions within the fields of music technology, music information retrieval and music recommendation are discussed. Finally, an example of an auditory display that facilitates the music annotation process based on...
-
Auditory-visual attention stimulator
A new approach to lateralization irregularities formation is proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time-scale-modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, the displayed text is modified using several techniques, e.g. zooming and highlighting. In the experimental part of...
-
Cartographic Representation of Route Reconstruction Results in Video Surveillance System
The video streams available in a surveillance system distributed over a wide area may be accompanied by metadata obtained as a result of video processing. Many algorithms applied to surveillance systems, e.g. event detection or object tracking, are strictly connected with localization of the object and reconstruction of its route. Drawing related information on a plan of a building or on a map of the city can facilitate the...
-
Creating Dynamic Maps of Noise Threat Using PL-Grid Infrastructure
The paper presents functionality and operation results of a system for creating dynamic maps of acoustic noise employing the PL-Grid infrastructure extended with a distributed sensor network. The work presented provides a demonstration of the services being prepared within the PLGrid Plus project for measuring, modeling and rendering data related to noise level distribution in city agglomerations. Specific computational environments,...
-
Creating dynamic maps of noise threat using PL-Grid infrastructure; conference proceedings
This paper presents functionality and operation results of the system for creating dynamic maps of noise threat with the use of the PL-Grid infrastructure integrated with a distributed sensor network for measuring, modeling and rendering noise level distribution. The work presented provides a demonstration of the services being prepared within the PLGrid Plus project. Specific computational environments, so-called domain grids,...
-
Detection of moving objects in images combined from video and thermal cameras
An algorithm for detection of moving objects in video streams from monitoring cameras is presented. A system composed of a standard video camera and a thermal camera, mounted in close proximity to each other, is used for object detection. First, background subtraction is performed in both video streams separately, using the popular Gaussian Mixture Models method. For the next processing stage, the authors propose an algorithm...
-
Development of Domain-Specific Solutions within the Polish Infrastructure for Advanced Scientific Research
The Polish Grid computing infrastructure was established during the PL-Grid project (2009-2012). The main purpose of this project was to provide Polish scientists with a basic IT platform, allowing them to conduct interdisciplinary research on a national scale, and giving them transparent access to international grid resources via international grid infrastructures. Currently, the infrastructure is maintained and extended...
-
Drum Replacement Using Wavelet Filtering. Podmienianie próbek perkusyjnych przy zastosowaniu filtracji falkowej
The paper presents a solution that can be used to unify the snare drum sound within a chosen fragment. The algorithm is based on the wavelet transformation and allows replacement of sub-bands of particular sounds which are outside a certain range. Five experienced sound engineers put the algorithm to the test using samples of five different snare drums. Wavelet filtering seems to be useful in terms of drum replacement, while...