Search results for: sound extracted from video

Search results for: sound extracted from video

results on page:
embed this view on your website

Filters

total: 1385

clear all filters disabled

displaying 1000 best results Help

Comparison of two methods of sound extraction from guitar string video recordings
Publication
- M. Zaporowska
- A. Czyżewski
- Year 2020
A comparison of two sound extraction methods from guitar string video recordings is presented in the paper. A brief overview of highframe rate camera technology and possible applications are included. The method using the image analysis from two such cameras is presented. The cameras are placed at the angle of 90 degrees for recording the image in three planes. The results achieved...
Recovering Sound Produced by Wind Turbine Structures Employing Video Motion Magnification
Publication
- Year 2019
The recordings were made with a fast video camera and with a microphone. Using fast cameras allowed for observation of the micro vibrations of the object structure. Motion-magnified video recordings of wind turbines on a wind farm were made for the purpose of building a damage prediction system. An idea was to use video to recover sound & vibrations in order to obtain a contactless diagnostic method for wind turbines. The recovered signals...

Full text to download in external service
New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception
Publication
- B. Kunka
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013
The influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...

Full text available to download
Piotr Szczuko dr hab. inż.

People

Department of Multimedia Systems

Piotr Szczuko received his M.Sc. degree in 2002. His thesis was dedicated to examination of correlation phenomena between perception of sound and vision for surround sound and digital image. He finished Ph.D. studies in 2007 and one year later completed a dissertation "Application of Fuzzy Rules in Computer Character Animation" that received award of Prime Minister of Poland. His interests include: processing of audio and video, computer...
Piotr Odya dr inż.

People

Department of Multimedia Systems

Piotr Odya was born in Gdansk in 1974. He received his M.Sc. in 1999 from the Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Poland. His thesis was related to the problem of sound quality improvement in the contemporary broadcasting studio. He is interested in video editing and multichannel sound systems. The goal of Mr. Odya Ph.D. thesis concerned methods and algorithms for correcting...
Grzegorz Szwoch dr hab. inż.

People

Department of Multimedia Systems

Grzegorz Szwoch was born in 1972 in Gdansk. In 1991-1996 he studied at the Technical University of Gdansk. In 1996 he graduated as a student from the Sound Engineering Department. His thesis was related to physical modeling of musical instruments. Since that time he has been a member of the research staff at the Multimedia Systems Department as a PhD student (1996-2001), Assistant (2001-2004), Assistant professor (2004-2020) and...
Automatic sound source localization in disturbing conditions using acoustic vector sensors
Publication
- A. Czyżewski
- J. Kotus
- Elektronika Ir Elektrotechnika - Year 2011
A concept, practical realization and applications of a passive acoustic radar to automatic localization and tracking of sound sources in disturbing conditions were presented in the paper. The device consists of the new kind of multichannel miniature sound intensity sensors and a group of digital signal processing algorithms. The sensitivity of the realized acoustic radar was examined in free sound field. Several kinds of sound...

Full text to download in external service
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Detection of moving objects in images combined from video and thermal cameras
Publication
- G. Szwoch
- M. Szczodrak
- Year 2013
An algorithm for detection of moving objects in video streams from the monitoring cameras is presented. A system composed of a standard video camera and a thermal camera, mounted in close proximity to each other, is used for object detection. First, a background subtraction is performed in both video streams separately, using the popular Gaussian Mixture Models method. For the next processing stage, the authors propose an algorithm...

Full text to download in external service
Gesture-controlled Sound Mixing System With a Sonified Interface
Publication
- M. Lech
- B. Kostek
- Year 2013
In this paper the Authors present a novel approach to sound mixing. It is materialized in a system that enables to mix sound with hand gestures recognized in a video stream. The system has been developed in such a way that mixing operations can be performed both with or without visual support. To check the hypothesis that the mixing process needs only an auditory display, the influence of audio information visualization on sound...

Full text to download in external service
Guitar String Sound Retrieved from Moving Pixels
Publication
- Year 2016
The aim of this study was to develop a method of visual recording and analyzing the vibrations of guitar strings using high-speed cameras and dedicated video processing algorithms. The recording of a plucked string reveals the way in which the deformations propagate, composing the standing and travelling wave. The paper compares the results for a few selected models of classical and acoustic guitars, and it involves processing...

Full text to download in external service
Improving automatic surveillance by sound analysis
Publication
- Year 2010
An automatic surveillance system, based on event detection in the video image can be improved by implementing algorithms for audio analysis. Dangerous or illegal actions are often connected with distinctive sound events like screams or sudden bursts of energy. A method for detection and classification of alarming sound events is presented. Detection is based on the observation of sudden changes in sound level in distinctive sub-bands...
Subjective tests for gathering knowledge for applying color grading to video clips automatically
Publication
- D. Weber
- B. Kostek
- Year 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot, and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with or...

Full text available to download
Subjective tests for gathering konwledge for applaying color grading to video clips automatically
Publication
- D. Weber
- B. Kostek
- Year 2019
The analysis of film music concerning caused emotions may allow for a more accurate adaptation of the color of the film in the context of color grading. Therefore, this paper aims to gather knowledge on the correlation between the applied color palette to a video clip, music associated with a particular shot,and emotions evoked. For that purpose, subjective tests are prepared in which several video clips are presented with...

Full text to download in external service
Objectivization of audio-video correlation assessment experiments
Publication
- B. Kunka
- B. Kostek
- Year 2010
The purpose of this paper is to present a new method of conducting an audio-visual correlation analysis employing a head-motion-free gaze tracking system. First, a review of related works in the domain of sound and vision correlation is presented. Then assumptions concerning audio-visual scene creation are shortly described. The objectivization process of carrying out correlation tests employing gaze-tracking system is outlined....

Full text to download in external service
Postprodukcja nagrania wideo z dzwiekiem dookolnym
Publication
- Year 2009
One of the aims of this paper is to present issues related to audio-video correlation. This is presented on the basis of a short film realization employing surround microphone techniques. First, some related works in the domain of sound and vision correlation are presented. Then assumptions concerning scene creation related to both audio and video are shortly described. Another objective is to discuss results of subjective tests...
Detection of vehicles stopping in restricted zones in video from surveillance cameras
Publication
- G. Szwoch
- P. Dalka
- Year 2014
An algorithm for detection of vehicles that stop in restricted areas, e.g. excluded by traffic rules, is proposed. Classic approaches based on object tracking are inefficient in high traffic scenes because of tracking errors caused by frequent object merging and splitting. The proposed algorithm uses the background subtraction results for detection of moving objects, then pixels belonging to moving objects are tested for stability....

Full text to download in external service
Acoustic radar employing particle velocity sensors
Publication
- J. Kotus
- A. Czyżewski
- Year 2010
A concept, practical realization and applications of a passive acoustic radar to automatic localization, tracking of sound sources were presented in the paper. The device consist of the new kind of multichannel miniature sound intensity sensors and a group of digital signal processing algorithms. Contrary to active radars, it does not emit the scanning beam but after receiving surroundings sounds it provide information about the...
Eulerian motion magnification applied to structural health monitoring of wind turbines
Publication
- S. Cygert
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2018
Several types of defects may occur in wind turbines, as physical damage of blades or gearbox malfunction. A wind farm monitoring and damage prediction system is built to observe abnormal vibrations of elements of wind turbine: blades, nacelle, and tower. Contactless methods are developed which do not require turbine stopping. In this work, structural health monitoring of a wind turbine is evaluated using a conversion from the captured...

Full text to download in external service
Standards on Cyber Security Assessment of Smart Grid
Publication
- R. Leszczyna
- International Journal of Critical Infrastructure Protection - Year 2018
Security evaluation of communication systems in smart grid poses a great challenge to the developers and operators. In recent years many new smart grid standards were proposed, which paradoxically results in the difficulty in finding a relevant publication in this plethora of literature. This paper presents the results of a systematic analysis which aimed at addressing this issue by identifying standards that present sound security...

Full text available to download
Visual Traffic Noise Monitoring in Urban Areas
Publication
- A. Czyżewski
- P. Dalka
- International Journal of Multimedia and Ubiquitous Engineering - Year 2007
The paper presents an advanced system for railway and road traffic noise monitoring in metropolitan areas. This system is a functional part of a more complex solution designed for environmental monitoring in cities utilizing analyses of sound, vision and air pollution, based on a ubiquitous computing approach. The system consists of many autonomous, universal measuring units and a multimedia server, which gathers, processes and...

Full text to download in external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Full text to download in external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Full text to download in external service
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publication
- Year 2014
A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...
Traffic Noise Analysis Applied to Automatic Vehicle Counting and Classification
Publication
- Communications in Computer and Information Science - Year 2017
Problems related to determining traffic noise characteristics are discussed in the context of automatic dynamic noise analysis based on noise level measurements and traffic prediction models. The obtained analytical results provide the second goal of the study, namely automatic vehicle counting and classification. Several traffic prediction models are presented and compared to the results of in-situ noise level measurements. Synchronized...
The Innovative Faculty for Innovative Technologies
Publication
- Year 2013
A leaflet describing Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology. Multimedia Systems Department described laboratories and prototypes of: Auditory-visual attention stimulator, Automatic video event detection, Object re-identification application for multi-camera surveillance systems, Object Tracking and Automatic Master-Slave PTZ Camera Positioning System, Passive Acoustic Radar,...

Full text to download in external service
Extraction of stable foreground image regions for unattended luggage detection
Publication
- G. Szwoch
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016
A novel approach to detection of stationary objects in the video stream is presented. Stationary objects are these separated from the static background, but remaining motionless for a prolonged time. Extraction of stationary objects from images is useful in automatic detection of unattended luggage. The proposed algorithm is based on detection of image regions containing foreground image pixels having stable values in time and...

Full text available to download
Layered background modeling for automatic detection of unattended objects in camera images
Publication
- G. Szwoch
- P. Dalka
- Year 2011
An algorithm for automatic detection of unattended objects in video camera images is presented. First, background subtraction is performed, using an approach based on the codebook method. Results of the detection are then processed by assigning the background pixels to time slots, based on the codeword age. Using this data, moving objects detected during a chosen period may be extracted from the background model. The proposed approach...

Full text to download in external service
Counting and tracking vehicles using acoustic vector sensors
Publication
- J. Kotus
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2018
A method is presented for counting vehicles and for determining their movement direction by means of acoustic vector sensor application. The assumptions of the method employing spatial distribution of sound intensity determined with the help of an integrated 3D intensity probe are discussed. The intensity probe developed by the authors was used for the experiments. The mode of operation of the algorithm is presented in conjunction...

Full text to download in external service
Application of autoencoder to traffic noise analysis
Publication
- Journal of the Acoustical Society of America - Year 2019
The aim of an autoencoder neural network is to transform the input data into a lower-dimensional code and then to reconstruct the output from this code representation. Applications of autoencoders to classifying sound events in the road traffic have not been found in the literature. The presented research aims to determine whether such an unsupervised learning method may be used for deploying classification algorithms applied to...

Full text available to download
Multimodal English corpus for automatic speech recognition
Publication
- Year 2013
A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
Broadening the scope of measurement and analysis of vibrations of an organ pipe employing intensity probe, simulations, and highspeed camera
Publication
- P. Bordoni
- J. Kotus
- P. Odya
- F. Antonacci
- B. Kostek
- Journal of the Acoustical Society of America - Year 2022
This paper shows an integrated approach to measure, analyze, and model phenomena occurring in an organ pipe driven by pressurized air. The aim of this paper is two-fold, i.e., to measure the pressure signal and the intensity field around the mouth by means of an intensity probe and to visualize and observe the motion of the air jet, which represents the excitation mechanism of the system. This is realized through two techniques,...

Full text to download in external service
On the Consumption of Multimedia Content Using Mobile Devices: a Year to Year User Case Study
Publication
- P. Falkowski-Gilski
- Archives of Acoustics - Year 2020
In the early days, consumption of multimedia content related with audio signals was only possible in a stationary manner. The music player was located at home, with a necessary physical drive. An alternative way for an individual was to attend a live performance at a concert hall or host a private concert at home. To sum up, audio-visual effects were only reserved for a narrow group of recipients. Today, thanks to portable players,...

Full text available to download
A comparative study of English viseme recognition methods and algorithm
Publication
- D. Jachimski
- A. Czyżewski
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018
An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...

Full text available to download
A comparative study of English viseme recognition methods and algorithms
Publication
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018
An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...

Full text available to download
Monitoring of Caged Bluefin Tuna Reactions to Ship and Offshore Wind Farm Operational Noises
Publication
- V. Puig-Pons
- E. Soliveres
- I. Pérez-Arjona
- V. Espinosa
- P. Poveda-Martínez
- J. Ramis-Soriano
- P. Ordoñez-Cebrián
- M. Moszyński
- F. de la Gándara
- M. Bou-Cabo... and 2 others
- SENSORS - Year 2021
Underwater noise has been identified as a relevant pollution affecting marine ecosystems in different ways. Despite the numerous studies performed over the last few decades regarding the adverse effect of underwater noise on marine life, a lack of knowledge and methodological procedures still exists, and results are often tentative or qualitative. A monitoring methodology for the behavioral response of bluefin tuna (Thunnus thynnus)...

Full text available to download
Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger
Publication
- P. Żwan
- A. Czyżewski
- Journal of Digital Forensic Practice - Year 2010
W artykule opisano aplikację, która automatycznie wykrywa zdarzenia dźwiękowe takie jak: rozbita szyba, wystrzał, wybuch i krzyk. Opisany system składa się z bloku parametryzacji i klasyfikatora. W artykule dokonano porównania parametrów dedykowanych dla tego zastosowania oraz standardowych deskryptorów MPEG-7. Porównano też dwa klasyfikatory: Jeden oparty o Percetron (sieci neuronowe) i drugi oparty o Maszynę wektorów wspierających....

Full text to download in external service
An Empirical Study on the Impact of Gender on Mobile Applications Usability
Publication
- P. Weichbroth
- IEEE Access - Year 2022
In the area of broadband wireless Internet, mobile applications have already replaced their desktop equivalents and are recognized as valuable tools for any size of businesses and for private use. With the emergence of millions of apps, the quality of their interaction with the user remains an open question for software vendors. While female and male requirements and preferences are not always similar, to the best of our knowledge,...

Full text available to download
Audio content analysis in the urban area telemonitoring system
Publication
- Year 2010
Artykuł przedstawia możliwości rozwinięcie monitoringu miejskiego o automatyczną analizę dźwięku. Przedstawiono metody parametryzacji dźwięku, które możliwe są do zastosowania w takim systemie oraz omówiono aspekty techniczne implementacji. W kolejnej części przedstawiono system decyzyjny oparty na drzewach zastosowany w systemie. System ten rozpoznaje dźwięki niebezpieczne (strzał, rozbita szyba, krzyk) wśród dźwięków zarejestrowanych...

Full text to download in external service
JOURNAL OF SOUND AND VIBRATION

Journals

ISSN: 0022-460X , eISSN: 1095-8568
Michał Lech dr inż.

People

Michał Lech was born in Gdynia in 1983. In 2007 he graduated from the faculty of Electronics, Telecommunications and Informatics of Gdansk University of Technology. In June 2013, he received his Ph.D. degree. The subject of the dissertation was: “A Method and Algorithms for Controlling the Sound Mixing Processes by Hand Gestures Recognized Using Computer Vision”. The main focus of the thesis was the bias of audio perception caused...
Data recorded for the purpose of the 3D sound intensity visualization around the organ pipe (des sound)
Open Research Data
open access
- P. Odya
The set contains data recorded using the Cartesian robot and multichannel acoustic vector sensor (from Microflown) for the purpose of the 3D sound intensity visualization of radiated acoustic energy around the organ pipe.
Affect aware video games
Publication
- M. Szwoch
- Year 2022
In this chapter a problem of affect aware video games is described, including such issue as: emotional model of the player, design, development and UX testing of affect-aware video games, multimodal emotion recognition and a featured review of affect-aware video games.

Full text to download in external service
Reversible Video Stream Anonymization for Video Surveillance Systems Based on Pixels Relocation and Watermarking
Publication
- J. Cichowski
- A. Czyżewski
- Year 2013
A method of reversible video image regions of interest anonymization for applications in video surveillance systems is described. A short introduction to theanonymization procedures is presented together with the explanation of its relation to visual surveillance. A short review of state of the art of sensitive data protection in media is included. An approach to reversible Region of Interest (ROI) hiding in video is presented,...

Full text to download in external service
Superresolution algorithm to video surveillance system
Publication
- T. Merta
- A. Czyżewski
- Year 2010
An application of a multiframe SR (superresolution) algorithm applied to video monitoring is described. The video signal generated by various types of video cameras with different parameters and signal distortions which may be very problematic for superresolution algorithms. The paper focuses on disadvantages in video signal which occur in video surveillance systems. Especially motion estimation and its influence on superresolution...
The nonlinear effects of sound in a liquid with relaxation losses
Publication
- A. Perelomova
- CANADIAN JOURNAL OF PHYSICS - Year 2015
The nonlinear effects of sound in electrolyte with a chemical reaction are examined. The dynamic equations that govern non-wave modes in the field of intense sound are derived, and acoustic forces of vortex, entropy, and relaxation modes are determined in the cases of low-frequency sound and high-frequency sound. The difference in the nonlinear effects of sound in electrolyte and in a gas with excited vibrational degrees of molecules,...

Full text available to download
generation of the vorticity mode by sound in a bingham plastic
Publication
- A. Perelomova
- P. Wojda
- CENTRAL EUROPEAN JOURNAL OF PHYSICS - Year 2011
This study investigates interaction between acoustic and non-acoustic modes, such as vorticity mode,in some class of a non-newtonian fluid called Bingham plastic. The instantaneous equations describinginteraction between different modes are derived. The attention is paid to the nonlinear effects in the fieldof intense sound. The resulting equations which describe dynamics of both sound and the vorticity modeapply to both periodic...

Full text available to download
Multi-task Video Enhancement for Dental Interventions
Publication
- E. Katsaros
- P. Kopa Ostrowski
- K. P. Włódarczak
- E. Lewandowska
- J. Rumiński
- D. Siupka-Mróz
- Ł. Lassmann
- A. Jezierska
- D. Węsierski
- Year 2022
A microcamera firmly attached to a dental handpiece allows dentists to continuously monitor the progress of conservative dental procedures. Video enhancement in video-assisted dental interventions alleviates low-light, noise, blur, and camera handshakes that collectively degrade visual comfort. To this end, we introduce a novel deep network for multi-task video enhancement that enables macro-visualization of dental scenes. In particular,...

Full text to download in external service
Systematic approach to binary classification of images in video streams using shifting time windows
Publication
- A. Blokus
- H. Krawczyk
- Signal Image and Video Processing - Year 2019
in the paper, after pointing out of realistic recordings and classifications of their frames, we propose a new shifting time window approach for improving binary classifications. We consider image classification in tewo steps. in the first one the well known binary classification algorithms are used for each image separately. In the second step the results of the previous step mare analysed in relatively short sequences of consecutive...

Full text available to download

Search

Filters

Catalog

Search results for: sound extracted from video

Piotr Szczuko dr hab. inż.

Piotr Odya dr inż.

Grzegorz Szwoch dr hab. inż.

Michał Lech dr inż.