Wyniki wyszukiwania dla: AUDIO PROCESSING

Wyniki wyszukiwania dla: AUDIO PROCESSING

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 220

wyczyść wszystkie filtry niedostępne

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING

Czasopisma

ISSN: 1063-6676
Wow detection and compensation employing spectral processing of audio.
Publikacja
- Rok 2004
Praca zawiera opis opracowanych algorytmów detekcji i kompensacji pasożytniczych modulacji częstotliwości wynikających z nierównomiernego przesuwu nośnika dźwięku. Proponowane metody opracowano ze szczególnym uwzględnieniem przypadkowych zniekształceń drżenia obecnych w archiwalnych filmowych ścieżkach dźwiękowych. Dodatkowo algorytmy badają wpływ zniekształceń na strukturę formantową sygnałów. Analiza zmian położenia formantów...
IEEE Transactions on Audio Speech and Language Processing

Czasopisma

ISSN: 1558-7916
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
Publikacja
- M. Niedźwiecki
- M. Ciołek
- IEEE Transactions on Audio Speech and Language Processing - Rok 2013
In this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...

Pełny tekst do pobrania w serwisie zewnętrznym
RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING
Publikacja
- M. Niedźwiecki
- M. Ciołek
- Rok 2013
The paper presents a new approach to elimination of broadband noise and impulsive disturbances from archive audio recordings. The proposed adaptive Kalman-like algorithm, based on a sparse autoregressive model of the audio signal, simultaneously detects noise pulses, interpolates the irrevocably distorted samples and performs signal smoothing. It is shown that bidirectional (forward-backward) processing of the archive signal improves...

Pełny tekst do pobrania w serwisie zewnętrznym
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
Publikacja
- B. Kostek
- Rok 2022
In this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

Pełny tekst do pobrania w portalu
IEEE-ACM Transactions on Audio Speech and Language Processing

Czasopisma

ISSN: 2329-9290
Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams
Publikacja
- K. Łopatka
- Rok 2015
A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classication of acoustic events are introduced. The recognition engine is based onthreshold-based detection with adaptive threshold and Support Vector Machine classifcation. Spectral, temporal and mel-frequency descriptors are used as signal features. The...
EURASIP Journal on Audio Speech and Music Processing

Czasopisma

ISSN: 1687-4714 , eISSN: 1687-4722
International Symposium on Audio, Video, Image Processing and Intelligent Applications

Konferencje
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
Publikacja
- IEEE Transactions on Audio Speech and Language Processing - Rok 2015
This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...

Pełny tekst do pobrania w portalu
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
Publikacja
- D. Koszewski
- T. Görne
- G. Korvel
- B. Kostek
- EURASIP Journal on Audio Speech and Music Processing - Rok 2023
The purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...

Pełny tekst do pobrania w portalu
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
Publikacja
- S. Raczyński
- E. Vincent
- S. Sagayama
- IEEE Transactions on Audio Speech and Language Processing - Rok 2013
Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...

Pełny tekst do pobrania w serwisie zewnętrznym
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
Publikacja
- S. Raczyński
- E. Vincent
- IEEE Transactions on Audio Speech and Language Processing - Rok 2014
In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic model,...

Pełny tekst do pobrania w serwisie zewnętrznym
Estimation of the short-term predictor parameters of speech under noisy conditions
Publikacja
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- IEEE Transactions on Audio Speech and Language Processing - Rok 2006
Pełny tekst do pobrania w serwisie zewnętrznym
New approach for determining the QoS of MP3-coded voice signals in IP networks
Publikacja
- T. Uhl
- S. Paulsen
- K. Nowicki
- EURASIP Journal on Audio Speech and Music Processing - Rok 2017
Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...

Pełny tekst do pobrania w portalu
Piotr Szczuko dr hab. inż.

Osoby

Katedra Systemów Multimedialnych

Dr hab. inż. Piotr Szczuko w 2002 roku ukończył studia na Wydziale Elektroniki, Telekomunikacji i Informatyki Politechniki Gdańskiej zdobywając tytuł magistra inżyniera. Tematem pracy dyplomowej było badanie zjawisk jednoczesnej percepcji obrazu cyfrowego i dźwięku dookólnego. W roku 2008 obronił rozprawę doktorską zatytułowaną "Zastosowanie reguł rozmytych w komputerowej animacji postaci", za którą otrzymał nagrodę Prezesa Rady...
Marek Blok dr hab. inż.

Osoby

Marek Blok w 1994 roku ukończył studia na kierunku Telekomunikacja wydziału Elektroniki Politechniki Gdańskiej i uzyskał tytuł mgra inżyniera. Doktorat w zakresie telekomunikacji uzyskał w 2003 roku na Wydziale Elektroniki, Telekomunikacji i Informatyki Politechniki Gdańskiej. W 2017 roku uzyskał stopień naukowy dra habilitowanego w dyscyplinie telekomunikacja. Jego zainteresowania badawcze ukierunkowane są na telekomunikacyjne...
Personal adaptive tuning of mobile computer audio
Publikacja
- Rok 2015
An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....
Michał Lech dr inż.

Osoby

Michał Lech was born in Gdynia in 1983. In 2007 he graduated from the faculty of Electronics, Telecommunications and Informatics of Gdansk University of Technology. In June 2013, he received his Ph.D. degree. The subject of the dissertation was: “A Method and Algorithms for Controlling the Sound Mixing Processes with Hand Gestures Recognized Using Computer Vision”. The main focus of the thesis was the bias of audio perception caused...

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: AUDIO PROCESSING

Piotr Szczuko dr hab. inż.

Marek Blok dr hab. inż.

Michał Lech dr inż.