Search results for: audio processing

Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger

Publication

- Journal of Digital Forensic Practice - Year 2010

The article describes an application that automatically detects sound events such as breaking glass, gunshots, explosions, and screams. The described system consists of a parameterization block and a classifier. The article compares parameters dedicated to this application with standard MPEG-7 descriptors. Two classifiers are also compared: one based on a perceptron (neural network) and the other based on a support vector machine...

Full text available for download from an external service

Comparative study on the effectiveness of various types of road traffic intensity detectors

Publication

A. Czyżewski
A. Sroczynski
T. Smialkowski
P. Hoffmann
S. Cygert
G. Szwoch
J. Kotus
D. Weber
M. Szczodrak
D. Koszewski... and 2 others

- Year 2019

Vehicle detection and speed measurements are crucial tasks in traffic monitoring systems. In this work, we focus on several types of electronic sensors, operating on different physical principles in order to compare their effectiveness in real traffic conditions. Commercial solutions are based on road tubes, microwave sensors, LiDARs, and video cameras. Distributed traffic monitoring systems require a high number of monitoring...

Full text available for download from an external service

Rough Sets Applied to Mood of Music Recognition

Publication

- Year 2016

With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered as one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt at music emotion recognition...

English Language Learning Employing Developments in Multimedia IS

Publication

- Year 2024

In the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...

Full text available for download from an external service

Study Analysis of Transmission Efficiency in DAB+ Broadcasting System

Publication

P. Falkowski-Gilski

- Year 2018

DAB+ is a very innovative and universal multimedia broadcasting system. Thanks to its updated multimedia technologies and metadata options, digital radio keeps pace with changing consumer expectations and the impact of media convergence. Broadcasting analog and digital radio services does vary, concerning devices on both transmitting and receiving side, as well as content processing mechanisms. However, the biggest difference is...

Full text available for download in the portal

Classifying type of vehicles on the basis of data extracted from audio signal characteristics

Publication

- Journal of the Acoustical Society of America - Year 2017

The aim of this study is to find and optimize a feature vector for an automatic recognition of the type of vehicles, extracted from an audio signal. First, the influence of weather-based conditions of road surface on spectral characteristic of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...

Full text available for download from an external service

Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.

Publication

- Year 2018

In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

Full text available for download from an external service

Improving automatic surveillance by sound analysis

Publication

- Year 2010

An automatic surveillance system, based on event detection in the video image can be improved by implementing algorithms for audio analysis. Dangerous or illegal actions are often connected with distinctive sound events like screams or sudden bursts of energy. A method for detection and classification of alarming sound events is presented. Detection is based on the observation of sudden changes in sound level in distinctive sub-bands...

Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study

Publication

- Archives of Acoustics - Year 2022

This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

Full text available for download in the portal

Exploiting audio-visual correlation by means of gaze tracking

Publication

- International Journal of Computer Science and Applications - Year 2010

This paper presents a novel means for increasing audio-visual correlation analysis reliability. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the past history and current research in the area of audio-visual perception analysis are shortly reviewed. Then the methodology employing gaze tracking is presented along with the...

Full text available for download in the portal

Automatic sound recognition for security purposes

Publication

P. Żwan

- Year 2008

In the paper an automatic sound recognition system is presented. It forms a part of a bigger security system developed in order to monitor outdoor places for non-typical audio-visual events. The analyzed audio signal is recorded by a microphone mounted outdoors, so non-stationary noise of significant energy is present in it. In the paper a specially designed algorithm for outdoor noise reduction is presented,...

Testing Watermark Robustness against Application of Audio Restoration Algorithms

Publication

- Year 2013

The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to the application of audio restoration algorithms. Several restoration routines such as noise reduction, spectrum expansion, clipping or clicks reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...

Full text available for download from an external service

QoS/QoE in the Heterogeneous Internet of Things (IoT)

Publication

K. Nowicki
T. Uhl

- Year 2017

Applications provided in the Internet of Things can generally be divided into three categories: audio, video and data. This has given rise to the popular term Triple Play Services. The most important audio applications are VoIP and audio streaming. The most notable video applications are VToIP, IPTV, and video streaming, and the service WWW is the most prominent example of data-type services. This chapter elaborates on the most...

Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model

Publication

K. Cisowski

- Year 2008

The paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modeling of signals was introduced. The algorithm of the noise detection, based on a discrete-time hidden Markov model (HMM) of the whitened audio signal, is elaborated.

Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances

Publication

- Year 2015

Archive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...

Full text available for download from an external service

Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions

Publication

- Year 2016

Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Full text available for download from an external service

EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY

Publication

- Year 2014

The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...

EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY

Publication

- Year 2014

The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...

Testing A Novel Gesture-Based Mixing Interface

Publication

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013

With a digital audio workstation, in contrast to the traditional mouse-keyboard computer interface, hand gestures can be used to mix audio with eyes closed. Mixing with a visual representation of audio parameters during experiments led to broadening the panorama and a more intensive use of shelving equalizers. Listening tests proved that the use of hand gestures produces mixes that are aesthetically as good as those obtained using...

Full text available for download in the portal

Elimination of impulsive disturbances from stereo audio recordings

Publication

- Year 2014

This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. On-line tracking of signal model parameters is performed using the stability-preserving Whittle-Wiggins-Robinson algorithm with exponential data weighting. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples...

Full text available for download from an external service

An new method of audio-visual correlation analysis

Publication

- Year 2009

This paper presents a new methodology of conducting the audio-visual correlation analysis employing the gaze tracking system. The interaction between two perceptual modalities, seeing and hearing, and their mutual reinforcement in a complex relationship have been the subject of many research studies. An earlier stage of the experiments carried out at the Multimedia Systems Department (MSD) showed that there exists a relationship between...

Full text available for download from an external service

Objectivization of audio-video correlation assessment experiments

Publication

- Year 2010

The purpose of this paper is to present a new method of conducting an audio-visual correlation analysis employing a head-motion-free gaze tracking system. First, a review of related works in the domain of sound and vision correlation is presented. Then assumptions concerning audio-visual scene creation are shortly described. The objectivization process of carrying out correlation tests employing gaze-tracking system is outlined....

Full text available for download from an external service

Examining Acoustic Emission of Engineered Ultrasound Loudspeakers

Publication

- Year 2014

Measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides the realistic reproduction of...

Digital Audio Broadcasting or Webcasting: A Network Quality Perspective

Publication

- Journal of Telecommunications and Information Technology - Year 2016

In recent years, many alternative technologies of delivering audio content have emerged, with different advantages and disadvantages. In this paper pros and cons of digital audio broadcasting and webcasting transmission techniques in a network quality perspective are described. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...

Full text available for download in the portal

A concept of Signal Equalization Method Based on Music Genre and the Listener's Room Characteristics

Publication

- Year 2016

A research study that investigates the influence of the room acoustics environment on the frequency characteristic of the audio signal playback is presented. First, a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the frequency response of the room, a system for room acoustics compensation based on eight-band equalizer is proposed. The system settings depend on music genre. In...

Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology

Publication

- Intelligent Decision Technologies-Netherlands - Year 2010

This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective...

Full text available for download from an external service

Measurements and Simulations of Engineered Ultrasound Loudspeakers

Publication

- Computational Methods in Science and Technology - Year 2015

Simulation and measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides realistic reproduction...

Full text available for download from an external service

Intelligent multimedia solutions supporting special education needs.

Publication

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2011

The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....

Intelligent video and audio applications for learning enhancement

Publication

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2011

The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....

Full text available for download in the portal

Bimodal deep learning model for subjectively enhanced emotion classification in films

Publication

D. Weber
B. Kostek

- INFORMATION SCIENCES - Year 2024

This research delves into the concept of color grading in film, focusing on how color influences the emotional response of the audience. The study commenced by recalling state-of-the-art works that process audio-video signals and associated emotions by machine learning. Then, assumptions of subjective tests for refining and validating an emotion model for assigning specific emotional labels to selected film excerpts were presented....

Full text available for download from an external service

Wow defect reduction based on interpolation techniques

Publication

P. Maziewski

- Year 2005

The paper presents the results of a study of various interpolation techniques used in wow reduction. The study used linear interpolation, two polynomial interpolation techniques (Hermite and spline), and a technique based on summing windowed sinc functions. The reconstruction was performed on an artificially prepared audio signal, reconstructed with the interpolation methods listed above. The reconstruction quality was assessed using...

Akustyczna analiza natężenia ruchu drogowego dla systemów zarządzania ruchem [Acoustic analysis of road traffic intensity for traffic management systems]

Publication

K. Marciniuk

- Year 2019

The paper outlines selected issues in road transport management in Poland and worldwide. In this context, market needs, requirements, and capabilities for acquiring information about the current state of road networks are presented. An acoustic method of road traffic monitoring is proposed, together with its capabilities in the context of traffic management systems. The signal acquisition scheme, along with the reference data, is presented...

Transmitting Alarm Information in DAB+ Broadcasting System

Publication

P. Falkowski-Gilski

- Year 2018

The main goal of digital broadcasting is to deliver high-quality content with the lowest possible bitrate. This paper is focused on transmitting alarm information, such as emergency warning and alerting, in the DAB+ (Digital Audio Broadcasting plus) broadcasting system. These additional services should be available at the lowest possible bitrate, in order to provide a clear and understandable voice message to people. Furthermore, additional...

Detection of impulsive disturbances in archive audio signals

Publication

- Year 2017

In this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...

Full text available for download in the portal

Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification

Publication

A. Rosner
F. Weninger
B. Schuller
M. Michalak
B. Kostek

- Year 2013

We present a comprehensive evaluation of the influence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classification. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...

Automatic system for audio-video material reconstruction and archiving

Publication

- Year 2008

The paper presents a proposed model of a system for automatic archiving and reconstruction of audio-video recordings. The premise of this solution is to make the reconstruction process less dependent on human involvement. This is intended to reduce the cost of reconstructing the processed recordings. Because of the large number of archival audio-video recordings, there is a need for a system that enables automatic indexing of their content. This will help...

In Memoriam Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering

Publication

- Archives of Acoustics - Year 2018

Biography and scientific achievements of Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering.

Full text available for download in the portal

Building Knowledge for the Purpose of Lip Speech Identification

Publication

- Advances in Intelligent Systems and Computing - Year 2017

Consecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...

Full text available for download from an external service

Postprodukcja nagrania wideo z dzwiekiem dookolnym [Post-production of a video recording with surround sound]

Publication

- Year 2009

One of the aims of this paper is to present issues related to audio-video correlation. This is presented on the basis of a short film realization employing surround microphone techniques. First, some related works in the domain of sound and vision correlation are presented. Then assumptions concerning scene creation related to both audio and video are shortly described. Another objective is to discuss results of subjective tests...

1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type

Publication

- Year 2020

A network architecture that may be employed for sensing and recognizing the type of a vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain sounds of passing vehicles are also taken into account and marked as ones containing silence...

Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy

Publication

- Year 2015

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...

Full text available for download from an external service

Production of six-degrees-of-freedom (6DoF) navigable audio using 30 Ambisonic microphones

Publication

B. Mróz
M. Kabaciński
T. Ciotucha
A. Rumiński
T. Żernicki

- Year 2021

This paper describes a method for planning, recording, and post-production of six-degrees-of-freedom audio recorded with multiple 3rd order Ambisonic microphone arrays. The description is based on the example of recordings conducted in August 2020 with the Poznan Philharmonic Orchestra using 30 units of Zylia ZM-1S. A convenient way to prepare and organize such a big project is proposed – this involves details of stage planning,...

Full text available for download from an external service

Classification of Music Genres by Means of Listening Tests and Decision Algorithms

Publication

- Year 2018

The paper compares the results of audio excerpt assignment to a music genre obtained in listening tests and classification by means of decision algorithms. A short review on music description employing music styles and genres is given. Then, assumptions of listening tests to be carried out along with an online survey for assigning audio samples to selected music genres are presented. A framework for music parametrization is created...

Full text available for download from an external service

A study on of music features derived from audio recordings examples – a quantitative analysis

Publication

- Archives of Acoustics - Year 2018

The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician, whose style evolved over time. Firstly, the origin and the background of the division of music genres were shortly presented. Then, several objective parameters of an audio signal were recalled that...

Full text available for download in the portal

Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2015

The aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed,...

Full text available for download in the portal

Music genre classification applied to bass enhancement for mobile technology

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2015

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm is related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt. The classification of music genres is automatically executed employing MPEG 7 parameters and the Principal Component Analysis method applied to reduce information...

Full text available for download from an external service

Machine learning applied to acoustic-based road traffic monitoring

Publication

- Year 2022

The motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...

Full text available for download in the portal

Machine learning applied to acoustic-based road traffic monitoring

Publication

- Procedia Computer Science - Year 2022

The motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...

Full text available for download in the portal

Analysis of impact of lossy audio compression on the robustness of watermark embedded in the DWT domain for non-blind copyright protection

Publication

- Year 2012

A methodology of non-blind watermarking of the audio content is proposed. The outline of audio copyright problem and motivation for practical applications are discussed. The algorithmic theory pertaining to watermarking techniques is briefly introduced. The system architecture together with employed workflows for embedding and extracting the watermarks are described. The implemented approach is described and obtained results are reported...

Full text available for download from an external service

Localization of impulsive disturbances in archive audio signals using predictive matched filtering

Publication

- Year 2014

The problem of elimination of impulsive disturbances from archive audio signals is considered and its new solution, called predictive matched filtering, is proposed. The new approach is based on the observation that a large percentage of noise pulses corrupting archive audio recordings have highly repetitive shapes that match several typical “patterns”, called click templates. To localize noise pulses, click templates can be correlated...

Full text available for download from an external service
