Wyniki wyszukiwania dla: audio processing

Wow detection and compensation employing spectral processing of audio.

Publikacja

- Rok 2004

Praca zawiera opis opracowanych algorytmów detekcji i kompensacji pasożytniczych modulacji częstotliwości wynikających z nierównomiernego przesuwu nośnika dźwięku. Proponowane metody opracowano ze szczególnym uwzględnieniem przypadkowych zniekształceń drżenia obecnych w archiwalnych filmowych ścieżkach dźwiękowych. Dodatkowo algorytmy badają wpływ zniekształceń na strukturę formantową sygnałów. Analiza zmian położenia formantów...

Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing

Publikacja

- IEEE Transactions on Audio Speech and Language Processing - Rok 2013

In this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...

Pełny tekst do pobrania w portalu

RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING

Publikacja

- Rok 2013

The paper presents a new approach to elimination of broadband noise and impulsive disturbances from archive audio recordings. The proposed adaptive Kalman-like algorithm, based on a sparse autoregressive model of the audio signal, simultaneously detects noise pulses, interpolates the irrevocably distorted samples and performs signal smoothing. It is shown that bidirectional (forward-backward) processing of the archive signal improves...

Pełny tekst do pobrania w serwisie zewnętrznym

Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?

Publikacja

B. Kostek

- Rok 2022

In this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

Pełny tekst do pobrania w portalu

Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams

Publikacja

K. Łopatka

- Rok 2015

A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classication of acoustic events are introduced. The recognition engine is based onthreshold-based detection with adaptive threshold and Support Vector Machine classifcation. Spectral, temporal and mel-frequency descriptors are used as signal features. The...

Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering

Publikacja

- IEEE Transactions on Audio Speech and Language Processing - Rok 2015

This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...

Pełny tekst do pobrania w portalu

Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders

Publikacja

D. Koszewski
T. Görne
G. Korvel
B. Kostek

- EURASIP Journal on Audio Speech and Music Processing - Rok 2023

The purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...

Pełny tekst do pobrania w portalu

Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling

Publikacja

S. Raczyński
E. Vincent
S. Sagayama

- IEEE Transactions on Audio Speech and Language Processing - Rok 2013

Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...

Pełny tekst do pobrania w serwisie zewnętrznym

Estimation of the short-term predictor parameters of speech under noisy conditions

Publikacja

M. Kuropatwinski
W. Kleijn
M. Kuropatwiński

- IEEE Transactions on Audio Speech and Language Processing - Rok 2006

Pełny tekst do pobrania w serwisie zewnętrznym

Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation

Publikacja

S. Raczyński
E. Vincent

- IEEE Transactions on Audio Speech and Language Processing - Rok 2014

In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic model,...

Pełny tekst do pobrania w serwisie zewnętrznym

New approach for determining the QoS of MP3-coded voice signals in IP networks

Publikacja

T. Uhl
S. Paulsen
K. Nowicki

- EURASIP Journal on Audio Speech and Music Processing - Rok 2017

Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...

Pełny tekst do pobrania w portalu

Personal adaptive tuning of mobile computer audio

Publikacja

- Rok 2015

An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....

Measurement of Latency in the Android Audio Path

Publikacja

- Rok 2018

This paper provides a description of experimental investigations concerning comparison between the audio path characteristics of various Android versions. First, information about the changes in each system version in the context of latency caused by them is presented. Then, a measurement procedure employing available applications to measure latency is described comparing to results contained in the Internet. Finally, a comparison...

Pełny tekst do pobrania w serwisie zewnętrznym

Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing

Publikacja

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2020

Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Pełny tekst do pobrania w portalu

Music Data Processing and Mining in Large Databases for Active Media

Publikacja

- Rok 2014

The aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...

Pełny tekst do pobrania w serwisie zewnętrznym

A Study on Audio Signal Processed by "Instant Mastering"

Publikacja

M. Piotrowska
S. Piotrowski
B. Kostek

- Rok 2018

An increasing amount of music produced in home- and project-studios results in development and growth of "automatic mastering services". The presented investigation explores changes introduced to audio signal by various online mastering platforms. A music set consisting of 10 songs produced in small facilities was processed by eight on-line automatic mastering services. Additionally, some laboratory-constructed signals were tested....

Fitting the mobile device characteristics to the user's hearing preferences

Publikacja

- Rok 2014

A method for fitting the mobile computer audio characteristics to the user's hearing preferences is proposed. The process consists of two stages: calibration and dynamics processing. During the calibration phase the user performs a loudness scaling test giving their response regarding the perceived loudness. The dynamics processing made on above basis sets the loudness to the most comfortable level. The processing accounts both...

Pełny tekst do pobrania w serwisie zewnętrznym

Adaptive Personal Tuning of Sound in Mobile Computers

Publikacja

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2016

An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...

Pełny tekst do pobrania w portalu

An audio-visual corpus for multimodal automatic speech recognition

Publikacja

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2017

review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Pełny tekst do pobrania w portalu

Data, Information, Knowledge, Wisdom Pyramid Concept Revisited in the Context of Deep Learning

Publikacja

B. Kostek

- Rok 2023

In this paper, the data, information, knowledge, and wisdom (DIKW) pyramid is revisited in the context of deep learning applied to machine learningbased audio signal processing. A discussion on the DIKW schema is carried out, resulting in a proposal that may supplement the original concept. Parallels between DIWK pertaining to audio processing are presented based on examples of the case studies performed by the author and her collaborators....

Pełny tekst do pobrania w serwisie zewnętrznym

Filtry

Katalog

Kategoria

Rok

Opcje

Wow detection and compensation employing spectral processing of audio.

Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing

RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING

Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?

Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams

Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering

Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders

Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling

Estimation of the short-term predictor parameters of speech under noisy conditions

Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation

New approach for determining the QoS of MP3-coded voice signals in IP networks

Personal adaptive tuning of mobile computer audio

Measurement of Latency in the Android Audio Path

Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing

Music Data Processing and Mining in Large Databases for Active Media

A Study on Audio Signal Processed by "Instant Mastering"

Fitting the mobile device characteristics to the user's hearing preferences

Adaptive Personal Tuning of Sound in Mobile Computers

An audio-visual corpus for multimodal automatic speech recognition

Data, Information, Knowledge, Wisdom Pyramid Concept Revisited in the Context of Deep Learning

Wyszukiwarka

Filtry

Katalog

Kategoria

Rok

Opcje

Wyniki wyszukiwania dla: audio processing