Wyniki wyszukiwania dla: AUDIO ENGINEERING, SEMANTIC AUDIO

Wyniki wyszukiwania dla: AUDIO ENGINEERING, SEMANTIC AUDIO

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 447

wyczyść wszystkie filtry niedostępne

JOURNAL OF THE AUDIO ENGINEERING SOCIETY

Czasopisma

ISSN: 1549-4950
Retrospecting Polish Audio Engineering Society Membership on 20th Anniversary of the Polish Section of the Audio Engineering Society
Publikacja
- B. Kostek
- M. Sankiewicz
- Archives of Acoustics - Rok 2011
In this article some key events concerning founding Polish Section of the Audio Engineering Society were presented. In addition, the history covering International Symposia on Sound Engineering and Mastering was outlined. Also, papers contained in this issue were shortly reviewed.

Pełny tekst do pobrania w portalu
Journal of the Audio Engineering Society

Czasopisma

ISSN: 0004-7554
In Memoriam Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering
Publikacja
- A. Czyżewski
- B. Kostek
- Archives of Acoustics - Rok 2018
Biography and scientific achievements of Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering.

Pełny tekst do pobrania w portalu
Editor's note and 2018 reviewers
Publikacja
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2018
Przedmiotem pracy jest odniesienie do prac opublikowanych w 2018 roku, jak również do serii artykułów w ramach specjalnego wydania: Special Issue on Augmented and Participatory Sound and Music Interaction Using Semantic Audio.

Pełny tekst do pobrania w serwisie zewnętrznym
A double-talk detector using audio watermarking
Publikacja
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2009
a novel approach to double-talk detection in the acoustic echo canceler is proposed. a hidden signature is embedded into the arriving signal, using the echo-hiding method. next detection of the presence of this signature in the microphone signal is performed. the results of the signature detection may be used by the acoustic echo canceler to stop or restart the adaptation process.

Pełny tekst do pobrania w serwisie zewnętrznym
Testing A Novel Gesture-Based Mixing Interface
Publikacja
- M. Lech
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2013
With a digital audio workstation, in contrast to the traditional mouse-keyboard computer interface, hand gestures can be used to mix audio with eyes closed. Mixing with a visual representation of audio parameters during experiments led to broadening the panorama and a more intensive use of shelving equalizers. Listening tests proved that the use of hand gestures produces mixes that are aesthetically as good as those obtained using...

Pełny tekst do pobrania w portalu
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
Publikacja
- D. Koszewski
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2020
Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Pełny tekst do pobrania w portalu
Adaptive Personal Tuning of Sound in Mobile Computers
Publikacja
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2016
An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...

Pełny tekst do pobrania w portalu
Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition
Publikacja
- G. Korvel
- P. Treigys
- G. Tamulevicus
- J. Bernataviciene
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2018
convolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...
Bass Enhancement Settings in Portable Devices Based on Music Genre Recognition
Publikacja
- P. Hoffmann
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2015
The paper presents a novel approach to the Virtual Bass Synthesis (VBS) applied to mobile devices, called Smart VBS (SVBS). The proposed algorithm uses an intelligent, rule-based setting of bass synthesis parameters adjusted to the particular music genre. Harmonic generation is based on a nonlinear device (NLD) method with the intelligent controlling system adapting to the recognized music genre. To automatically classify music...

Pełny tekst do pobrania w portalu
DSP techniques for determining ''Wow'' distortions
Publikacja
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2007
Artykuł przedstawia opis algorytmów do wyznaczania charakterystyki zniekształceń kołysania dźwięku. Są to algorytmy: śledzenia przydźwięku sieciowego, śledzenia pozostałości magnetycznej prądu podkładu wielkich częstotliwości, adaptacyjnej analizy środka ciężkości widma dla wybranej części zniekształconego sygnału. Przedstawione algorytmy pozwalają na implementację programową i sprzętową.
Expert system for automatic classification and quality assessment of singing voices
Publikacja
- P. Żwan
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2006
.

Pełny tekst do pobrania w serwisie zewnętrznym
System for automatic singing voice recognition
Publikacja
- P. Żwan
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2008
W artykule przedstawiono system automatycznego rozpoznawania jakości i typu głosu śpiewaczego. Przedstawiono bazę danych oraz zaimplementowane parametry. Algorytmem decyzyjnym jest algorytm sztucznych sieci neuronowych. Wytrenowany system decyzyjny osiąga skuteczność ok. 90% w obydwu kategoriach rozpoznawania. Dodatkowo wykazano przy pomocy metod statystycznych, że wyniki działania systemu automatycznej oceny jakości technicznej...
Tonality Estimation and Frequency Tracking of Modulated Tonal Components
Publikacja
- M. Kulesza
- A. Czyżewski
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2009
A novel method for tonality estimation and frequency tracking of tonal components modulated in frequency and amplitude is presented. The algorithm detects the local maxima of magnitude spectra corresponding to three contiguous frames of a signal and matches them into the tonal track candidates. The magnitude-based and phase-based methods are used to estimate the frequency jumps between spectrum maxima belonging to the tonal track...

Pełny tekst do pobrania w serwisie zewnętrznym
Measurements and Visualization of Sound Intensity Around the Human Head in Free Field Using Acoustic Vector Sensor
Publikacja
- J. Kotus
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2015
This paper presents measurements and visualization of sound intensity around the human head simulator in a free field. A Cartesian robot, applied for precise positioning of the acoustic vector sensor, was used to measure sound intensity. Measurements were performed in a free field using a head and torso simulator and the setup consisting of four different loudspeaker configurations. The acoustic vector sensor was positioned around...

Pełny tekst do pobrania w portalu
New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception
Publikacja
- B. Kunka
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2013
The influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...

Pełny tekst do pobrania w portalu
Bożena Kostek prof. dr hab. inż.

Osoby

Laboratorium Akustyki Fonicznej
Journal of Radio & Audio Media

Czasopisma

ISSN: 1937-6529 , eISSN: 1937-6537
Piotr Szczuko dr hab. inż.

Osoby

Katedra Systemów Multimedialnych

Dr hab. inż. Piotr Szczuko w 2002 roku ukończył studia na Wydziale Elektroniki, Telekomunikacji i Informatyki Politechniki Gdańskiej zdobywając tytuł magistra inżyniera. Tematem pracy dyplomowej było badanie zjawisk jednoczesnej percepcji obrazu cyfrowego i dźwięku dookólnego. W roku 2008 obronił rozprawę doktorską zatytułowaną "Zastosowanie reguł rozmytych w komputerowej animacji postaci", za którą otrzymał nagrodę Prezesa Rady...
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING

Czasopisma

ISSN: 1063-6676
IEEE Transactions on Audio Speech and Language Processing

Czasopisma

ISSN: 1558-7916
Józef Kotus dr hab. inż.

Osoby

Katedra Systemów Multimedialnych
IEEE-ACM Transactions on Audio Speech and Language Processing

Czasopisma

ISSN: 2329-9290
Automatic system for audio-video material reconstruction and archiving
Publikacja
- A. Kupryjanow
- A. Czyżewski
- Rok 2008
Referat przedstawia propozycję modelu systemu automatycznej archiwizacji i rekonstrukcji nagrań audio-wideo. Założeniem tego rozwiązania jest uczynienie procesu rekonstrukcji nagrań bardziej niezależnym od człowieka. Ma to na celu redukcję kosztów rekonstrukcji przetwarzanych nagrań. Z powodu dużej liczby archiwalnych nagrań audio-wideo istnieje potrzeba stworzenia systemu który umożliwi automatyczną indeksację ich treści. Pomoże...
EURASIP Journal on Audio Speech and Music Processing

Czasopisma

ISSN: 1687-4714 , eISSN: 1687-4722
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
Publikacja
- M. Niedźwiecki
- M. Ciołek
- IEEE Transactions on Audio Speech and Language Processing - Rok 2013
In this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...

Pełny tekst do pobrania w portalu
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
Publikacja
- IEEE Transactions on Audio Speech and Language Processing - Rok 2015
This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...

Pełny tekst do pobrania w portalu
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publikacja
- Rok 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recogni-tion is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publikacja
- Rok 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Exploiting audio-visual correlation by means of gaze tracking
Publikacja
- B. Kunka
- B. Kostek
- International Journal of Computer Science and Applications - Rok 2010
This paper presents a novel means for increasing audio-visual correlation analysis reliability. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the past history and current research in the area of audio-visual perception analysis are shortly reviewed. Then the methodology employing gaze tracking is presented along with the...

Pełny tekst do pobrania w portalu
Digital Audio Effects Conference

Konferencje
Testing Watermark Robustness against Application of Audio Restoration Algorithms
Publikacja
- Rok 2013
The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to audio restoration algorithm performing. Several restoration routines such as noise reduction, spectrum expansion, clipping or clicks reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...

Pełny tekst do pobrania w serwisie zewnętrznym
An new method of audio-visual correlation analysis
Publikacja
- B. Kunka
- B. Kostek
- Rok 2009
This paper presents a new methodology of conducting the audio-visual correlation analysis employing the gaze tracking system. Interaction between two perceptual modalities, seeing and hearing, their interaction and mutual reinforcement in a complex relationship was a subject of many research studies. Earlier stage of the carried out experiments at the Multimedia Systems Department (MSD) showed that there exists a relationship between...

Pełny tekst do pobrania w serwisie zewnętrznym
Elimination of impulsive disturbances from stereo audio recordings
Publikacja
- M. Niedźwiecki
- M. Ciołek
- Rok 2014
This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. On-line tracking of signal model parameters is performed using the stability-preserving Whittle-Wiggins-Robinson algorithm with exponential data weighting. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples...

Pełny tekst do pobrania w serwisie zewnętrznym
Objectivization of audio-video correlation assessment experiments
Publikacja
- B. Kunka
- B. Kostek
- Rok 2010
The purpose of this paper is to present a new method of conducting an audio-visual correlation analysis employing a head-motion-free gaze tracking system. First, a review of related works in the domain of sound and vision correlation is presented. Then assumptions concerning audio-visual scene creation are shortly described. The objectivization process of carrying out correlation tests employing gaze-tracking system is outlined....

Pełny tekst do pobrania w serwisie zewnętrznym
Michał Lech dr inż.

Osoby

Michał Lech was born in Gdynia in 1983. In 2007 he graduated from the faculty of Electronics, Telecommunications and Informatics of Gdansk University of Technology. In June 2013, he received his Ph.D. degree. The subject of the dissertation was: “A Method and Algorithms for Controlling the Sound Mixing Processes with Hand Gestures Recognized Using Computer Vision”. The main focus of the thesis was the bias of audio perception caused...
Measurement of Latency in the Android Audio Path
Publikacja
- Rok 2018
This paper provides a description of experimental investigations concerning comparison between the audio path characteristics of various Android versions. First, information about the changes in each system version in the context of latency caused by them is presented. Then, a measurement procedure employing available applications to measure latency is described comparing to results contained in the Internet. Finally, a comparison...

Pełny tekst do pobrania w serwisie zewnętrznym
Intelligent video and audio applications for learning enhancement
Publikacja
- A. Czyżewski
- B. Kostek
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2011
The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....

Pełny tekst do pobrania w portalu
Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study
Publikacja
- B. Mróz
- B. Kostek
- Archives of Acoustics - Rok 2022
This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

Pełny tekst do pobrania w portalu
Digital Audio Broadcasting or Webcasting: A Network Quality Perspective
Publikacja
- P. Falkowski-Gilski
- J. Stefański
- Journal of Telecommunications and Information Technology - Rok 2016
In recent years, many alternative technologies of delivering audio content have emerged, with different advantages and disadvantages. In this paper pros and cons of digital audio broadcasting and webcasting transmission techniques in a network quality perspective are described. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...

Pełny tekst do pobrania w portalu
Detection of impulsive disturbances in archive audio signals
Publikacja
- M. Ciołek
- M. Niedźwiecki
- Rok 2017
In this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...

Pełny tekst do pobrania w portalu
Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model
Publikacja
- K. Cisowski
- Rok 2008
The paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modelingof signals was introduced. the algorithm of the noise detection, based on discrete-time hidden Markow model (HMM)of whitened audio signal is elaborated
Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances
Publikacja
- M. Niedźwiecki
- M. Ciołek
- Rok 2015
Archive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...

Pełny tekst do pobrania w serwisie zewnętrznym
An audio-visual corpus for multimodal automatic speech recognition
Publikacja
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2017
review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Pełny tekst do pobrania w portalu
Personal adaptive tuning of mobile computer audio
Publikacja
- Rok 2015
An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....
Localization of impulsive disturbances in audio signals using template matching
Publikacja
- M. Niedźwiecki
- M. Ciołek
- DIGITAL SIGNAL PROCESSING - Rok 2015
In this paper, a new solution to the problem of elimination of impulsive disturbances from audio signals, based on the matched filtering technique, is proposed. The new approach stems from the observation that a large proportion of noise pulses corrupting audio recordings have highly repetitive shapes that match several typical “patterns”. In many cases a representative set of exemplary pulse waveforms can be extracted from the...

Pełny tekst do pobrania w portalu
Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology
Publikacja
- Intelligent Decision Technologies-Netherlands - Rok 2010
This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective...

Pełny tekst do pobrania w serwisie zewnętrznym
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
Publikacja
- B. Kostek
- Rok 2022
In this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

Pełny tekst do pobrania w portalu
Classifying type of vehicles on the basis of data extracted from audio signal characteristics
Publikacja
- Journal of the Acoustical Society of America - Rok 2017
The aim of this study is to find and optimize a feature vector for an automatic recognition of the type of vehicles, extracted form an audio signal. First, the influence of weather-based conditions of road surface on spectral characteristic of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...

Pełny tekst do pobrania w serwisie zewnętrznym

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: AUDIO ENGINEERING, SEMANTIC AUDIO

Bożena Kostek prof. dr hab. inż.

Piotr Szczuko dr hab. inż.

Józef Kotus dr hab. inż.

Michał Lech dr inż.