Search results for: audio

total: 458

Multimodal human-computer interfaces based on advanced video and audio analysis
Publication
- Year 2013
The development history of multimodal interfaces is reviewed briefly in the introduction. Examples of applications of multimodal interfaces in education software and for disabled people are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with mouth gestures and an audio interface for speech stretching for hearing impaired and stuttering people. The Smart...

Full text to download in external service
Multimodal human-computer interfaces based on advanced video and audio analysis
Publication
- Advances in Intelligent Systems and Computing - Year 2014
The development history of multimodal interfaces is reviewed briefly in the introduction. Some applications of multimodal interfaces to education software for disabled people are presented. One of them, the LipMouse, is a novel, vision-based human-computer interface that tracks the user’s lip movements and detects lip gestures. A new approach to diagnosing Parkinson’s disease is also shown. The progression of the disease can be measured employing...

Full text to download in external service
Noise reduction in audio employing spectral unpredictability measure and neural net.
Publication
- A. Czyżewski
- M. Dziubiński
- Year 2004
... psychoacoustic model were discussed. A learning decision algorithm based on an artificial neural network was used to classify components as parasitic or useful. A new iterative procedure for computing the masking threshold was also presented. The paper includes the results of experiments and conclusions concerning the presented algorithms.
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
Publication
- M. Niedźwiecki
- M. Ciołek
- IEEE Transactions on Audio Speech and Language Processing - Year 2013
In this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...

Full text available to download
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
Publication
- B. Kostek
- Year 2022
In this paper, examples of intelligent audio signal processing are briefly described. The focus is, however, on the machine learning approach and the datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, has not yet been achieved. Therefore, a review of state-of-the-art concerning...

Full text available to download
Classifying type of vehicles on the basis of data extracted from audio signal characteristics
Publication
- Journal of the Acoustical Society of America - Year 2017
The aim of this study is to find and optimize a feature vector, extracted from an audio signal, for automatic recognition of the type of vehicle. First, the influence of weather-based conditions of the road surface on the spectral characteristics of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...

Full text to download in external service
RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING
Publication
- M. Niedźwiecki
- M. Ciołek
- Year 2013
The paper presents a new approach to elimination of broadband noise and impulsive disturbances from archive audio recordings. The proposed adaptive Kalman-like algorithm, based on a sparse autoregressive model of the audio signal, simultaneously detects noise pulses, interpolates the irrevocably distorted samples and performs signal smoothing. It is shown that bidirectional (forward-backward) processing of the archive signal improves...

Full text to download in external service
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publication
- Year 2018
In this paper, an analysis of audio-visual recordings of the Lombard effect is shown. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect that are present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

Full text to download in external service
A study of music features derived from audio recordings examples – a quantitative analysis
Publication
- A. Dorochowicz
- B. Kostek
- Archives of Acoustics - Year 2018
The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician whose style evolved over time. Firstly, the origin and the background of the division of music genres were briefly presented. Then, several objective parameters of an audio signal were recalled that...

Full text available to download
Localization of impulsive disturbances in archive audio signals using predictive matched filtering
Publication
- M. Niedźwiecki
- M. Ciołek
- Year 2014
The problem of elimination of impulsive disturbances from archive audio signals is considered and its new solution, called predictive matched filtering, is proposed. The new approach is based on the observation that a large percentage of noise pulses corrupting archive audio recordings have highly repetitive shapes that match several typical “patterns”, called click templates. To localize noise pulses, click templates can be correlated...

Full text to download in external service
Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online
Publication
- P. Falkowski-Gilski
- Year 2023
Radio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...

Full text to download in external service
Intelligent acquisition of audio signals, employing neural networks and rough set algorithms
Publication
- A. Czyżewski
- Year 2003
Algorithms based on artificial neural networks and the rough set method were applied to the localization of audio signals corrupted by parasitic noise and reverberation. Information about the direction of sound arrival was obtained at the outputs of these algorithms on the basis of a parametric representation. Experimental results are presented and discussed.
Pomiary wartości opóźnień w torze audio urządzeń z systemem Android
Publication
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2018
This article describes methods for measuring latency in the audio path of devices running various versions of the Android system. The first part of the article gives a brief characterization of the Android environment in the context of audio path latency. Next, it presents how latency in the audio path is measured using the SuperPowered Latency and Dr. Rick O’Rang Loopback applications. In the final...

Full text available to download
Production of six-degrees-of-freedom (6DoF) navigable audio using 30 Ambisonic microphones
Publication
- B. Mróz
- M. Kabaciński
- T. Ciotucha
- A. Rumiński
- T. Żernicki
- Year 2021
This paper describes a method for planning, recording, and post-production of six-degrees-of-freedom audio recorded with multiple 3rd order Ambisonic microphone arrays. The description is based on the example of recordings conducted in August 2020 with the Poznan Philharmonic Orchestra using 30 units of Zylia ZM-1S. A convenient way to prepare and organize such a big project is proposed – this involves details of stage planning,...

Full text to download in external service
Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model
Publication
- K. Cisowski
- Year 2008
The paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using signal modeling was introduced. The noise detection algorithm, based on a discrete-time hidden Markov model (HMM) of the whitened audio signal, is elaborated.
Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances
Publication
- M. Niedźwiecki
- M. Ciołek
- Year 2015
Archive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...

Full text to download in external service
Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology
Publication
- Intelligent Decision Technologies-Netherlands - Year 2010
This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction, borrows the relevance feedback through gaze-tracking and applies it to a new area of interest, which is Quality of Experience. Results of subjective...

Full text to download in external service
AUDIO SIGNAL EQUALIZATION BASED ON IMPULSE RESPONSE OF A LISTENING ROOM AND MUSIC CONTENT REPRODUCED
Publication
- P. Hoffmann
- B. Kostek
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2018
This research study presents investigations of the influence of room acoustics on the frequency characteristic of audio signal playback. First, a concept of a novel method for spectral equalization of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on a designed equalizer is proposed. The system settings depend on the automatically recognized music genre....
Energy Efficiency Study of Audio-video Content Consumption on Selected Android Mobile Terminals
Publication
- P. Falkowski-Gilski
- M. Pańkowski
- Year 2021
Mobile devices are widely used by billions of users worldwide. Thanks to their main advantage, which is portability, they should be fully operational as long as possible, without the need to recharge or connect them to external power sources. This paper describes a study, carried out on four different mobile devices, with different hardware and software parameters, running the Android operating system. The research campaign involved...

Full text to download in external service
Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study
Publication
- B. Mróz
- B. Kostek
- Archives of Acoustics - Year 2022
This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

Full text available to download
Automatic audio signal mixing system based on one-dimensional Wave-U-Net autoencoders
Publication
- D. Koszewski
- Year 2023
The purpose of this dissertation is to develop an automatic song mixing system that is capable of mixing a song with good quality in any music genre. This work first reviews the audio signal processing methods used in audio mixing and describes selected methods for automatic audio mixing. Then, a novel architecture based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. Models...

Full text available to download
In Memoriam Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering
Publication
- A. Czyżewski
- B. Kostek
- Archives of Acoustics - Year 2018
Biography and scientific achievements of Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering.

Full text available to download
Towards Audio Signal Equalization Based on Spectral Characteristics of a Listening Room and Music Content Reproduced
Publication
- P. Hoffmann
- B. Kostek
- Year 2018
This study presents investigations of the influence of room acoustics on the frequency characteristic of audio signal playback. First, the concept of a novel method for spectral equalization of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on a designed equalizer is proposed. The system settings depend on the automatically recognized music genre....

Full text to download in external service
New semi-causal and noncausal techniques for detection of impulsive disturbances in multivariate signals with audio applications
Publication
- M. Niedźwiecki
- M. Ciołek
- IEEE TRANSACTIONS ON SIGNAL PROCESSING - Year 2017
This paper deals with the problem of localization of impulsive disturbances in nonstationary multivariate signals. Both unidirectional and bidirectional (noncausal) detection schemes are proposed. It is shown that the strengthened pulse detection rule, which combines analysis of one-step-ahead signal prediction errors with critical evaluation of leave-one-out signal interpolation errors, allows one to noticeably improve detection results...

Full text available to download
Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"
Publication
- Year 2018
The purpose of this study is to involve both Convolutional Neural Networks and a typical learning algorithm in the allophone classification process. A list of words including aspirated and non-aspirated allophones pronounced by native and non-native English speakers is recorded and then edited and analyzed. Allophones extracted from English speakers’ recordings are presented in the form of two-dimensional spectrogram images and...

Full text to download in external service
IEEE-ACM Transactions on Audio Speech and Language Processing

Journals

ISSN: 2329-9290
Data obtained via parametrization of differently mixed audio signals
Open Research Data
open access
- J. Stefański
- K. Marciniuk
The dataset consists of audio samples and the results of their parametrization. The extraction of music parameters was performed using MIRToolbox. Information extracted from the samples was used as a database for a master's thesis titled 'The influence of audio signal processing chain in mixing on the emotional state of a music piece'.
Auditory Brainstem Responses recorded employing Audio ABR device
Open Research Data
open access
- P. Odya
- A. Czyżewski
The dataset consists of ABR measurements employing click, burst and speech stimuli. Parameters of the particular stimuli were as follows:
Elimination of impulsive disturbances from archive audio files – comparison of three noise pulse detection schemes
Publication
- M. Niedźwiecki
- M. Ciołek
- Year 2014
The problem of elimination of impulsive disturbances (such as clicks, pops, ticks, crackles, and record scratches) from archive audio recordings is considered and solved using autoregressive modeling. Three classical noise pulse detection schemes are examined and compared: the approach based on open-loop multi-step-ahead signal prediction, the approach based on decision-feedback signal prediction, and the double threshold approach,...

Full text to download in external service
Evaluation of Six Degrees of Freedom 3D Audio Orchestra Recording and Playback using multi-point Ambisonic interpolation
Publication
- T. Ciotucha
- A. Rumiński
- T. Żernicki
- B. Mróz
- Scopus - Year 2021
This paper describes a strategy for recording sound and enabling six-degrees-of-freedom playback, making use of multiple simultaneous and synchronized Higher Order Ambisonics (HOA) recordings. Such a strategy enables users to navigate in a simulated 3D space and listen to the six-degrees-of-freedom recordings from different perspectives. For the evaluation of the proposed approach, an Unreal Engine-based navigable 3D audiovisual...

Full text to download in external service
Analysis of impact of lossy audio compression on the robustness of watermark embedded in the DWT domain for non-blind copyright protection
Publication
- Year 2012
A methodology for non-blind watermarking of audio content is proposed. An outline of the audio copyright problem and the motivation for practical applications are discussed. The algorithmic theory pertaining to watermarking techniques is briefly introduced. The system architecture, together with the workflows employed for embedding and extracting the watermarks, is described. The implemented approach is described and the obtained results are reported....

Full text to download in external service
Analiza jakości transmisji treści audio-wideo w symulowanym łączu telekomunikacyjnym z wykorzystaniem techniki OFDM
Publication
- M. Zamłyńska
- P. Falkowski-Gilski
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2022
Deploying a reliable audio-video communication system brings many benefits. Because the amount of available bandwidth is steadily shrinking, researchers are focusing on novel transmission methods. Currently, the OFDM (Orthogonal Frequency Division Multiplexing) technique is widely used in both wired and wireless media. The paper presents a study of the QoS (Quality of Service) of a simulated transmission link...

Full text to download in external service
A commonly-accessible toolchain for live streaming music events with higher-order ambisonic audio and 4k 360 vision
Publication
- B. Mróz
- P. Odya
- P. Danowski
- M. Kabaciński
- Year 2023
An immersive live stream is especially interesting in the ongoing development of telepresence tools, especially in the virtual reality (VR) or mixed reality (MR) domain. This paper explores the remote and immersive way of enabling telepresence for the audience to high-fidelity music performance using freely-available and easily-accessible tools. A functional VR live-streaming toolchain, comprising 360 vision and higher-order ambisonic...

Full text available to download
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
Publication
- IEEE Transactions on Audio Speech and Language Processing - Year 2015
This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponentially weighted least squares algorithm. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples is realized using an adaptive, variable-order...

Full text available to download
Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams
Publication
- K. Łopatka
- Year 2015
A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classification of acoustic events are introduced. The recognition engine is based on threshold-based detection with adaptive threshold and Support Vector Machine classification. Spectral, temporal and mel-frequency descriptors are used as signal features. The...
EURASIP Journal on Audio Speech and Music Processing

Journals

ISSN: 1687-4714 , eISSN: 1687-4722
Digital Audio Effects Conference

Conferences
Measuring and Analyzing Audio Levels in Film, Commercials, and Movie Trailers Using Leq(A) Values and the LUFS Loudness Model . Analiza pomiarów dźwięku w filmie oraz w reklamach filmowych z wykorzystaniem modelu głośności
Publication
- Year 2015
The purpose of this paper is to describe the measurement of loudness levels in movies, movie trailers, and commercials displayed before feature films at movie theaters. In the initial section, the paper discusses the issues related to measurement of loudness levels, provides recommendations regarding permissible loudness levels during movie screenings, and mentions the applied units of measurement. The following section of the...
Language material for English audiovisual speech recognition system development. Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
Publication
- A. Czyżewski
- B. Kostek
- T. Ciszewski
- D. Majewicz
- Year 2013
The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
International Symposium on Audio, Video, Image Processing and Intelligent Applications

Conferences
Nowa Audiofonologia

Journals

ISSN: 2084-946X
Network and Operating System Support for Digital Audio and Video (Network and OS Support for Digital A/V)

Conferences
Computer modeling of perceptual masking and its audiology applications
Publication
- A. Czyżewski
- Year 2005
The paper presents the perceptual foundations of hearing that make it possible to create new sound coding models, in particular for use in hearing prostheses.
Technika komputerowa w audiologii, foniatrii i logopedii
Publication
- A. Czyżewski
- B. Kostek
- H. Skarżyński
- Year 2002
The book presents studies resulting from several years of cooperation among researchers in computer science, telecommunications, otolaryngology, audiology, psychology, pedagogy, speech therapy and phoniatrics. It presents applications of computer technology in the fields named in its title.
Audiology

Journals

ISSN: 0020-6091
Koncepcja kształtowania audiosfery miejsca pracy. Między sztuką a zarządzaniem
Publication
- P. Mizera-Pęczek
- Edukacja Ekonomistów i Menedżerów - Year 2023
Full text to download in external service
Development of an AI-based audiogram classification method for patient referral
Publication
- M. Kassjański
- M. Kulawiak
- T. Przewoźny
- Year 2022
Hearing loss is one of the most significant sensory disabilities. It can have various negative effects on a person's quality of life, ranging from impeded school and academic performance to total social isolation in severe cases. It is therefore vital that early symptoms of hearing loss are diagnosed quickly and accurately. Audiology tests are commonly performed with the use of tonal audiometry, which measures a patient's hearing...

Full text to download in external service
Audiovisual speech recognition for training hearing impaired patients
Publication
- Year 2006
The paper presents a system for recognizing isolated speech sounds using visual and acoustic data. Active Shape Models were used to determine visual parameters based on analysis of the shape and movement of the lips in video recordings. The acoustic parameters are based on mel-cepstral coefficients. A neural network was used to recognize the spoken sounds on the basis of a feature vector containing both types...
Automated hearing loss type classification based on pure tone audiometry data
Publication
- M. Kassjański
- M. Kulawiak
- T. Przewoźny
- D. Tretiakow
- J. Kuryłowicz
- A. Molisz
- K. Koźmiński
- A. Kwaśniewska
- P. Mierzwińska-Dolny
- M. Grono
- Scientific Reports - Year 2024
Hearing problems are commonly diagnosed with the use of tonal audiometry, which measures a patient’s hearing threshold in both air and bone conduction at various frequencies. Results of audiometry tests, usually represented graphically in the form of an audiogram, need to be interpreted by a professional audiologist in order to determine the exact type of hearing loss and administer proper treatment. However, the small number of...

Full text to download in external service
Comparing noise levels and audiometric testing results employing it based diagnostic systems.
Publication
- Year 2004
The paper presents an Internet-based system designed for conducting hearing screening tests. An information system designed for monitoring environmental noise is also presented. Both Internet applications may help reduce the incidence of hearing disorders caused by environmental and industrial noise. The results of audiometric tests were compared with noise measurements on the basis of the content...
