Search results for: SOUND EXTRACTED FROM VIDEO
-
Adaptive Personal Tuning of Sound in Mobile Computers
Publication: An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...
-
Analysis of the parameters of respiration patterns extracted from thermal image sequences
Publication: Remote estimation of vital signs is an important and active area of research. The goal of this work was to analyze the feasibility of estimating respiration parameters from video sequences of faces recorded using a mobile thermal camera. Different estimators were analyzed and experimentally verified. It was demonstrated that the respiration rate, periodicity of respiration, and presence and length of apnea periods could be reliably...
-
Classifying type of vehicles on the basis of data extracted from audio signal characteristics
Publication: The aim of this study is to find and optimize a feature vector for automatic recognition of the type of vehicles, extracted from an audio signal. First, the influence of weather-based conditions of the road surface on the spectral characteristics of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...
-
Image Classification Based on Video Segments
Publication: In the dissertation a new method for improving the quality of classifications of images in video streams has been proposed and analyzed. In multiple fields concerning such a classification, the proposed algorithms focus on the analysis of single frames. This class of algorithms has been named OFA (One Frame Analyzed). In the dissertation, small segments of the video are considered and each image is analyzed in the context of its...
-
Sound intensity distribution around organ pipe
Publication: The aim of the paper was to compare the acoustic field around open and stopped organ pipes. The wooden organ pipe was located in the anechoic chamber and activated with a constant air flow, produced by an external air compressor. Thus, it was possible to obtain a long-term steady-state response. A multichannel acoustic vector sensor was used to measure the sound intensity distribution of radiated acoustic energy. Measurements have been...
-
System for prototyping wireless intelligent audio-video monitoring devices (System do prototypowania bezprzewodowych inteligentnych urządzeń monitoringu audio-video)
Publication: The paper presents a system for prototyping wireless audio-video monitoring devices. The system is based on Virtex6 FPGA devices and many additional supporting components such as fast DDR3 memory, a small HD camera, a microphone with an A/D converter, a WiFi radio module, etc. The functionality of the system is described in detail in the paper. The system has been optimized to run under the Linux operating system, ...
-
Sound engineering as our commitment to its creators in Poland
Publication: Sound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other ones constituting multimedia technology. Moreover, machine learning methods applied to the topics...
-
Design Elements of Affect Aware Video Games
Publication: In this paper issues of design and development process of affect-aware video games are presented. Several important design aspects of such games are pointed out. A concept of a middleware framework is proposed that separates the development of affect-aware video games from emotion recognition algorithms and support from input sensors. Finally, two prototype affect-aware video games are presented that conform to the presented architecture...
-
Emotion Recognition for Affect Aware Video Games
Publication: In this paper the idea of affect-aware video games is presented. A brief review of automatic multimodal affect recognition of facial expressions and emotions is given. The first results of emotion recognition using depth data, as well as a prototype affect-aware video game, are presented.
-
Features of Nonlinear Sound Propagation in Vibrationally Excited Gases
Publication: Weakly nonlinear sound propagation in a gas where molecular vibrational relaxation takes place is studied. New equations which govern the sound in media where the irreversible relaxation may take place are derived and discussed. Their form depends on the regime of excitation of oscillatory degrees of freedom, equilibrium (reversible) or non-equilibrium (irreversible), and on the comparative frequency of the sound in relation to...
-
Comparison of sound of organ pipes in contemporary and historical instruments
Publication: The aim of this research is to examine the differences in the timbre of organ pipes’ sound between a historical and a contemporary organ instrument. The historical instrument is the Oliwa organ from Gdansk, Poland, and the contemporary one is from Kartuzy, Poland. Recordings are made of single notes played by an open labial pipe that belongs to the Principal rank. The analyses and comparison of several sound features compatible...
-
Online Sound Restoration for Digital Library Applications
Publication: A system for sound restoration was conceived and engineered having the following features: no special sound restoration software is needed to perform audio restoration by the user, the process of restoration employs automatic reduction of noise, wow and impulse distortions performed in the online mode, no skills in digital signal processing from the user are needed. The principles of the created system and its features as well...
-
New applications of sound and vision engineering
Publication: Multimedia, Sound & Vision Engineering are relatively new fields within the area of science and technology, but teaching and research in this area have been carried out at Gdansk University of Technology (Gdansk, Poland) for nearly 5 decades. Current projects carried out in the Multimedia Systems Department are within the scope of the paper.
-
Virtual touchpad - video-based multimodal interface
Publication: A new computer interface named Virtual-Touchpad (VTP) is presented. The Virtual-Touchpad provides a multimodal interface which enables controlling computer applications by hand gestures captured with a typical webcam. The video stream is processed in the software layer of the interface. Hitherto existing video-based interfaces analyzing frames of hand gestures are presented. Then, the hardware configuration and software features...
-
Numerical modeling of sound intensity distributions around acoustic transducer
Publication: The aim of this research study is to measure, simulate and compare sound intensity distribution generated by the acoustic transducers of the loudspeaker. The comparison of the gathered data allows for validating the numerical model of the acoustic radiation. An accurate model of a sound source is necessary in mathematical modeling of the sound field distribution near the scattering obstacles. An example of such an obstacle is a human...
-
Sound quality metrics applied to road noise evaluation
Publication: Road noise monitoring systems typically measure sound levels in specific time periods. A more insightful approach is to also measure the nature of the noise. The sound quality of sounds such as car noise can be objectively evaluated by several parameters. One of them is psychoacoustic annoyance, described by loudness, tone color, and the temporal structure of sound. In this paper the assessment of several sound quality parameters, such...
-
Endoscopic Video Classification with the Consideration of Temporal Patterns
Publication: The article describes a novel approach to automatic recognition and classification of diseases in endoscopic videos. Current directions of research in this field are discussed. Most presented methods focus on processing single frames and do not take into consideration the temporal relationship between continuous classifications. Existing approaches that consider the temporal structure of an incoming frame sequence are focused on...
-
Sound signals generated during lapping of technical ceramics using electroplated tools with diamond grains
Open Research Data: The data contain recordings of sound generated during single-sided lapping with the use of electroplated diamond tools. This relationship was examined with the use of spectral analysis of the sound signal in the frequency domain with a focus on the Ra parameter of the surface roughness. The estimated sound coefficient increased as the surface roughness...
-
Video of LEGO Bricks on Conveyor Belt Dataset Series
Publication: The dataset series titled Video of LEGO bricks on conveyor belt is composed of 14 datasets containing video recordings of a moving white conveyor belt. The recordings were created using a smartphone camera in Full HD resolution. The dataset allows for the preparation of data for neural network training, and building of a LEGO sorting machine that can help builders to organise their collections.
-
Video content analysis in the urban area telemonitoring system
Publication: The task of constant monitoring of video streams from a large number of cameras and reviewing the recordings in order to find a specified event requires a considerable amount of time and effort from the system operators and it is prone to errors. A solution to this problem is an automatic system for constant analysis of camera images being able to raise an alarm if a predefined event is detected. The chapter presents various aspects...
-
Acoustic Detector of Road Vehicles Based on Sound Intensity
Publication: A method of detecting and counting road vehicles using an acoustic sensor placed by the road is presented. The sensor measures sound intensity in two directions: parallel and perpendicular to the road. The sound intensity analysis performs acoustic event detection. A normalized position of the sound source is tracked and used to determine if the detected event is related to a moving vehicle and to establish the direction of movement....
-
The vortex flow caused by sound in a bubbly liquid
Publication: Generation of vorticity in the field of intense sound in a bubbly liquid in the free half-space is considered. The reasons for generation of vorticity are nonlinearity, diffraction, and dispersion. Acoustic streaming differs from that in a Newtonian fluid. Under some conditions, the vortex flow changes its direction. Conclusions concern streaming induced by a harmonic or an impulse Gaussian beam.
-
Localization of sound sources with dual acoustic vector sensor
Publication: The aim of the work is to estimate the position of sound sources. The proposed method uses a setup of two acoustic vector sensors (AVS). The intersection of azimuth rays from each AVS should indicate the position of a source. In practice, the result of position estimation using this method is an area rather than a point. This is a result of inaccuracy of the individual sensors, but more importantly, of the influence of a source...
-
Automatic sound recognition for security purposes
Publication: In the paper an automatic sound recognition system is presented. It forms a part of a bigger security system developed in order to monitor outdoor places for non-typical audio-visual events. The analyzed audio signal is recorded from a microphone mounted in an outdoor place, thus non-stationary noise of significant energy is present in it. In the paper a specially designed algorithm for outdoor noise reduction is presented,...
-
Multi-Stage Video Analysis Framework
Publication: The chapter is organized as follows. Section 2 presents the general structure of the proposed framework and a method of data exchange between system elements. Section 3 describes the low-level analysis modules for detection and tracking of moving objects. In Section 4 we present the object classification module. Sections 5 and 6 describe specialized modules for detection and recognition of faces and license plates, respectively....
-
Measurement and visualization of sound intensity vector distribution in proximity of acoustic diffusers
Publication: In this work, we would like to present analyses and visualizations of sound intensity distribution measured in proximity of an acoustic diffuser. Such distribution may be used for estimation of basic acoustic parameters of a diffuser. Measurement is performed with the use of a logarithmic sine sweep which allows for the analysis of waves scattered by the diffuser and rejecting the direct sound signal component. Pressure and sound...
-
Detection of the Incoming Sound Direction Employing MEMS Microphones and the DSP
Publication: A 3D acoustic vector sensor based on MEMS microphones and its application to road traffic monitoring is presented in the paper. The sensor is constructed from three pairs of digital MEMS microphones, mounted on the orthogonal axes. Signals obtained from the microphones are used to compute sound intensity vectors in each direction. With this data, it is possible to compute the horizontal and vertical angle of an incoming sound....
-
ISSUES OF CLASSIFICATION FUNCTION CONTINUITY IN ENDOSCOPIC VIDEO CLASSIFICATION
Publication: In the article a new way of analyzing the properties of feature vector functions (FVF) and classifiers of images in a video stream is proposed. The general idea is based on focusing on the perceived continuity of the FVF and classifier functions. Issues related to creating an exact mathematical model are discussed and a simplified solution is proposed. An exemplary algorithm is evaluated on three exemplary video sequences. The acquired...
-
AffecTube — Chrome extension for YouTube video affective annotations
Publication: The shortage of emotion-annotated video datasets suitable for training and validating machine learning models for facial expression-based emotion recognition stems primarily from the significant effort and cost required for manual annotation. In this paper, we present AffecTube as a comprehensive solution that leverages crowdsourcing to annotate videos directly on the YouTube platform, resulting in ready-to-use emotion-annotated...
-
Measurements and visualization of sound field distribution around organ pipe
Publication: Measurements and visualization of the acoustic field around an organ pipe are presented. The sound intensity technique was applied for this purpose. Measurements were performed in free field. The organ pipe was activated with a constant air flow, produced by an external compressor, aimed at obtaining long-term steady-state responses of the generated acoustic signal. Sound energy distribution was measured in a defined fixed grid of points...
-
A Method of MOS Evaluation for Video Based Services
Publication: This paper deals with a method for QoE evaluation for services transmitting large amounts of data perceived by the end user in relatively short time periods, e.g. streaming video in mobile operator...
-
Organised Sound
Journals
-
SIGHT AND SOUND
Journals
-
Sound Studies
Journals
-
SOUND AND VIBRATION
Journals
-
Video analytics-based algorithm for monitoring egress from buildings
Publication: A concept and a practical implementation of the algorithm for detecting potentially dangerous situations related to crowding in passages are presented. An example of such a situation is a crush which may be caused by an obstructed pedestrian pathway. The surveillance video camera signal analysis performed in the online mode is employed in order to detect hold-ups near bottlenecks like doorways or staircases. The details of the...
-
Intelligent video and audio applications for learning enhancement
Publication: The role of computers in school education is briefly discussed. The history of multimodal interface development is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with facial expression, and a speech stretching audio interface representing the audio modality....
-
Nonlinear increase in bubbles radii caused by sound in a bubbly liquid
Publication: The nonlinear interaction of acoustic and entropy modes in a bubbly liquid is considered. The reasons for interaction are both nonlinearity and dispersion. In the field of intense sound, a decrease in the mixture density is predicted. That corresponds to the well-established growth of bubble volumes due to rectified diffusion. The nonlinear interaction of modes as a reason for a bubble to grow due to sound is discovered. The...
-
Video Analytics-Based Algorithm for Monitoring Egress from Buildings
Publication: A concept and practical implementation of the algorithm for detecting potentially dangerous situations of crowding in passages are presented. An example of such a situation is a crush which may be caused by an obstructed pedestrian pathway. Surveillance video camera signal analysis performed online is employed in order to detect hold-ups near bottlenecks like doorways or staircases. The details of the implemented algorithm which uses...
-
Estimation of Average Speed of Road Vehicles by Sound Intensity Analysis
Publication: Constant monitoring of road traffic is an important part of modern smart city systems. The proposed method estimates the average speed of road vehicles in the observation period, using a passive acoustic vector sensor. Speed estimation based on sound intensity analysis is a novel approach to the described problem. Sound intensity in two orthogonal axes is measured with a sensor placed alongside the road. Position of the apparent sound...
-
Pawlak's flow graph extensions for video surveillance systems
Publication: The idea of Pawlak's flow graphs is applicable to many problems in various fields related to decision algorithms or data mining. Flow graphs can also be used in video surveillance systems, especially in distributed multi-camera systems, which are difficult for human operators to handle because of their limited perception. In such systems automated video analysis needs to be implemented. An important part of this analysis...
-
Art of Space – Art in Space / The Role of Sound Art in Public
Publication: The article presents a discussion of the crossroad between art and architecture. It sketches out the theoretical and practical aspects of space revitalisation and improving its quality in the context of multisensory dimension of public spaces. The unique manifestation of art and creativity in public space could be detected and stimulated using participatory architecture, for instance interactive installations, projects related...
-
"Ash" [æ] sound then and now: an overview of the current state of knowledge
Publication: The objective of this article is to review the existing studies on the British Received Pronunciation “ash” [æ] sound, as well as its variations outside the United Kingdom. It starts with a short analysis of sociolinguistic aspects of the Received Pronunciation accent, then it points out the most conspicuous differences between the Received Pronunciation and General American vowel systems. Then, it presents the early beginnings...
-
Non-Wave Variations in Temperature Caused by Sound in a Chemically Reacting Gas
Publication: A weakly nonlinear generation of non-acoustic modes in the field of sound in a gas is considered. An exothermic chemical reaction of A->B type, which takes place in a gas, may be reversible or not. Two types of sound are considered, low-frequency and high-frequency as compared with the characteristic time of a chemical reaction. For both these cases, the governing equations of non-acoustic modes are derived and conclusions of the efficiency of...
-
A video monitoring system using ontology-driven identification of threats
Publication: In this paper, we present a video monitoring system that leverages image recognition and ontological reasoning about threats. In the solution, an image processing subsystem uses video recording of a monitored area and recognizes known concepts in scenes. Then, a reasoning subsystem uses an ontological description of security conditions and information from image recognition to check if a violation of a condition has occurred. If a threat...
-
METHOD OF TRAINING THE ENDOSCOPIC VIDEO ANALYSIS ALGORITHMS TO MAXIMIZE BOTH ACCURACY AND STABILITY
Publication: In the article a new training and testing method for endoscopic video analysis algorithms is presented. Classical methods take into account only the efficiency of recognizing objects in single video frames. The proposed method additionally considers the stability of the classifiers' output for real video input. The method is simple and can be trained on data sets created for other solutions. Therefore, it is easily applicable to existing endoscopic video...
-
Vortex flow caused by periodic and aperiodic sound in a relaxing Maxwell fluid
Publication: This paper concerns the description of vortex flow generated by periodic and aperiodic sound in a relaxing Maxwell fluid. The analysis is based on the governing equation of the vorticity mode, which is a result of decomposition of the hydrodynamic equations for fluid flow with relaxation and thermal conductivity into acoustical and non-acoustical parts. The equation governing the vorticity mode uses only instantaneous, not averaged over sound...
-
Acoustic streaming caused by some types of aperiodic sound. Buildup of acoustic streaming
Publication: The analysis of streaming caused by aperiodic sound of different types (switched on at transducer sound or sound determined by initial conditions) is undertaken. The analysis is based on an analytical governing equation for the streaming Eulerian velocity, which is a result of decomposition of the hydrodynamic equations into acoustic and non-acoustic parts. Its driving force (of acoustic nature) represents a sum of two terms; one is the...
-
Modelling Object Behaviour in a Video Surveillance System Using Pawlak's Flowgraph
Publication: In this paper, a methodology of acquisition and processing of video streams for the purpose of modelling object behaviour is presented. Multilevel contextual video processing is also mentioned. The Pawlak’s flowgraph is used as a container for the knowledge related to the behaviour of objects in the area supervised by a video surveillance system. Spatio-temporal dependencies in transitions between cameras can be easily changed in...
-
Cartographic Representation of Route Reconstruction Results in Video Surveillance System
Publication: The video streams available in a surveillance system distributed over a wide area may be accompanied by metadata obtained as a result of video processing. Many algorithms applied to surveillance systems, e.g. event detection or object tracking, are strictly connected with localization of the object and reconstruction of its route. Drawing related information on a plan of a building or on a map of the city can facilitate the...