Wyniki wyszukiwania dla: automatic audio mixing - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: automatic audio mixing

Wyniki wyszukiwania dla: automatic audio mixing

  • 2022/2023_zima SCADA Systems in Automatic Control

    Kursy Online
    • P. A. Kaczmarek

    SCADA Systems in Automatic Control - project materials

  • 2021/2022_zima SCADA Systems in Automatic Control

    Kursy Online
    • P. A. Kaczmarek

    SCADA Systems in Automatic Control - project materials

  • Objectivization of audio-video correlation assessment experiments

    Publikacja

    - Rok 2010

    The purpose of this paper is to present a new method of conducting an audio-visual correlation analysis employing a head-motion-free gaze tracking system. First, a review of related works in the domain of sound and vision correlation is presented. Then assumptions concerning audio-visual scene creation are shortly described. The objectivization process of carrying out correlation tests employing gaze-tracking system is outlined....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Intelligent video and audio applications for learning enhancement

    The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....

    Pełny tekst do pobrania w portalu

  • System for automatic singing voice recognition

    W artykule przedstawiono system automatycznego rozpoznawania jakości i typu głosu śpiewaczego. Przedstawiono bazę danych oraz zaimplementowane parametry. Algorytmem decyzyjnym jest algorytm sztucznych sieci neuronowych. Wytrenowany system decyzyjny osiąga skuteczność ok. 90% w obydwu kategoriach rozpoznawania. Dodatkowo wykazano przy pomocy metod statystycznych, że wyniki działania systemu automatycznej oceny jakości technicznej...

  • Detection of impulsive disturbances in archive audio signals

    Publikacja

    In this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...

    Pełny tekst do pobrania w portalu

  • In situ soil mixing.

    Publikacja

    - Rok 2004

    Opracowanie stanowi 9-ty rozdział w książce ''Ground Improvement'' (Wzmacnianie gruntu).W podrozdziałach 9.1 i 9.2 przedstawiono rys historyczny oraz aktualny stan zaawansowania technologii mieszania gruntu na sucho i na mokro. Wprowadzono także klasyfikację metod wdrożonych na świecie przez firmy specjalistyczne. W podrozdziale 9.3 opisano sprzęt i technologię i przebieg mieszania, z podziałem na urządzenia do mieszania głębokiego...

  • Exploiting audio-visual correlation by means of gaze tracking

    This paper presents a novel means for increasing audio-visual correlation analysis reliability. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the past history and current research in the area of audio-visual perception analysis are shortly reviewed. Then the methodology employing gaze tracking is presented along with the...

    Pełny tekst do pobrania w portalu

  • Personal adaptive tuning of mobile computer audio

    An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....

  • Elimination of impulsive disturbances from stereo audio recordings

    Publikacja

    This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. On-line tracking of signal model parameters is performed using the stability-preserving Whittle-Wiggins-Robinson algorithm with exponential data weighting. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Digital Audio Broadcasting or Webcasting: A Network Quality Perspective

    In recent years, many alternative technologies of delivering audio content have emerged, with different advantages and disadvantages. In this paper pros and cons of digital audio broadcasting and webcasting transmission techniques in a network quality perspective are described. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...

    Pełny tekst do pobrania w portalu

  • System do prototypowania bezprzewodowych inteligentnych urządzeń monitoringu audio-video

    Publikacja

    - Rok 2013

    W komunikacie przedstawiono system prototypowania bezprzewodowych urządzeń do monitoringu audio-video. System bazuje na układach FPGA Virtex6 i wielu dodatkowych wspierających urządzeniach jak: szybka pamięć DDR3, mała kamera HD, mikrofon z konwerterem A/C, moduł radiowy WiFi, itp. Funkcjonalność systemu została szczegółowo opisana w komunikacie. System został zoptymalizowany do pracy pod kontrolą systemu operacyjnego Linux, zostały...

  • Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering

    This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...

    Pełny tekst do pobrania w portalu

  • Testing Watermark Robustness against Application of Audio Restoration Algorithms

    Publikacja

    The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to audio restoration algorithm performing. Several restoration routines such as noise reduction, spectrum expansion, clipping or clicks reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • A double-talk detector using audio watermarking

    a novel approach to double-talk detection in the acoustic echo canceler is proposed. a hidden signature is embedded into the arriving signal, using the echo-hiding method. next detection of the presence of this signature in the microphone signal is performed. the results of the signature detection may be used by the acoustic echo canceler to stop or restart the adaptation process.

    Pełny tekst do pobrania w serwisie zewnętrznym

  • SYNAT Music Genre Parameters PCA 19

    Dane Badawcze

    The dataset contains feature vector after  Principal Component Analysis (PCA) performing, so there are 11 music genres and 19-element vector derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of 52532 music excerpts described...

  • SYNAT_PCA_48

    Dane Badawcze

    There is a series of datasets containing feature vectors derived from music tracks. The dataset contains 51582 music tracks (22 music genres) and feature vector after  Principal Component Analysis (PCA) performing, so there are 48-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier...

  • SYNAT_PCA_11

    Dane Badawcze

    The dataset contains 51582 music tracks (22 music genres) and feature vector after  Principal Component Analysis (PCA) performing, so there are 11-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of more than...

  • Bożena Kostek prof. dr hab. inż.

  • Localization of impulsive disturbances in audio signals using template matching

    In this paper, a new solution to the problem of elimination of impulsive disturbances from audio signals, based on the matched filtering technique, is proposed. The new approach stems from the observation that a large proportion of noise pulses corrupting audio recordings have highly repetitive shapes that match several typical “patterns”. In many cases a representative set of exemplary pulse waveforms can be extracted from the...

    Pełny tekst do pobrania w portalu

  • Objectivization of Audio-Visual Correlation analysis

    Publikacja

    Simultaneous perception of audio and visual stimuli often causes the concealment or misrepresentation of information actually contained in these stimuli. Such effects are called the ''image proximity effect'' or the ''ventriloquism effect'' in literature. Until recently, most research carried out to understand their nature was based on subjective assessments. The Authors of this paper propose a methodology based on both subjective...

    Pełny tekst do pobrania w portalu

  • On mixing in the class of quadratic stochastic operators

    We study different types of limit behavior of quadratic stochastic operators acting on ℓ^1 (or ℓ^1_d) spaces in both strong and uniform topologies. The main motif of the paper is to express the uniform and strong asymptotic stability of the quadratic stochastic operator in terms of convergence of the associated (linear) nonhomogeneous Markov chain. We also examine which type of uniform convergence of iterates of the quadratic...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Ground improvement with in situ Soil Mixing

    Publikacja

    - Rok 2005

    Omówiono metodę wzmacniania słabego gruntu za pomocą technologii wgłębnego mieszania na sucho i mokro. W metodzie mokrej stosuje się składniki wiążące (głownie cementy) wymieszane z wodą, podawane w formie zaczynu. W metodzie suchej materiały wiążące podawane są w postaci sproszkowanej z udziałem sprężonego powietrza. Przedstawiono główne obszary zastosowania w geotechnice oraz podano przykłady kilku realizacji.

  • Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture

    The aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed,...

    Pełny tekst do pobrania w portalu

  • Text classifiers for automatic articles categorization

    Publikacja

    The article concerns the problem of automatic classification of textual content. We present selected methods for generation of documents representation and we evaluate them in classification tasks. The experiments have been performed on Wikipedia articles classified automatically to their categories made by Wikipedia editors.

  • Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization

    Publikacja

    - Rok 2017

    An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...

  • Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model

    Publikacja

    - Rok 2008

    The paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modelingof signals was introduced. the algorithm of the noise detection, based on discrete-time hidden Markow model (HMM)of whitened audio signal is elaborated

  • Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?

    Publikacja

    - Rok 2022

    In this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

    Pełny tekst do pobrania w portalu

  • Using concentrated spectrogram for analysis of audio acoustic signals

    Publikacja

    The paper presents results of time-frequency analysis of audio acoustic signals using the method of Concentrated Spectrograph also known as ''Cross-spectral method'' or ''Reassignment method''. Presented algorithm involves signal's local group delay and channelized instantaneous frequency to relevantly redistribute all Short-time Fourier transform lines in time-frequency plain. The main intention of the paper is to compare various...

    Pełny tekst do pobrania w portalu

  • RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING

    Publikacja

    The paper presents a new approach to elimination of broadband noise and impulsive disturbances from archive audio recordings. The proposed adaptive Kalman-like algorithm, based on a sparse autoregressive model of the audio signal, simultaneously detects noise pulses, interpolates the irrevocably distorted samples and performs signal smoothing. It is shown that bidirectional (forward-backward) processing of the archive signal improves...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances

    Publikacja

    Archive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Objectivization of phonological evaluation of speech elements by means of audio parametrization

    This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...

  • Quality Analysis of Audio-Video Transmission in an OFDM-Based Communication System

    Publikacja

    - Rok 2022

    Application of a reliable audio-video communication system, brings many advantages. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. With the availability of visual information one can monitor the surrounding, working environment, etc. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission. Currently, orthogonal frequency...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study

    Publikacja

    This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

    Pełny tekst do pobrania w portalu

  • Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.

    In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, audio signal is analyzed indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video , i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • A study on of music features derived from audio recordings examples – a quantitative analysis

    The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician, whose style evolved over time. Firstly, the origin and the background of the division of music genres were shortly presented. Then, several objective parameters of an audio signal were recalled that...

    Pełny tekst do pobrania w portalu

  • Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology

    This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Localization of impulsive disturbances in archive audio signals using predictive matched filtering

    Publikacja

    The problem of elimination of impulsive disturbances from archive audio signals is considered and its new solution, called predictive matched filtering, is proposed. The new approach is based on the observation that a large percentage of noise pulses corrupting archive audio recordings have highly repetitive shapes that match several typical “patterns”, called click templates. To localize noise pulses, click templates can be correlated...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Ground improvement with in-situ wet soil mixing

    Publikacja

    - Rok 2003

    Opisano rozwój technologii mieszania wgłębnego gruntu na świecie i w Polsce. Przedstawiono bliżej aktualnie wykorzystywane metody mieszania, wprowadzane przez specjalistyczne firmy. W syntetyczny sposób podsumowano zasady projektowania oraz doświadczenia wykonawcze. Zamieszczono bliższe informacje odnośnie kilku przykładów zastosowania w Polsce.

  • Strong mixing Markov semigroups on C1 are meager

    Publikacja

    Dowodzi się, że zbiór tych półgrup operatorów Markowa na klasie Schattena C1, dla których w mocnej topologii operatorowej T(t) jest zbieżne do operatora Markowa Q, gdzie Q jest 1-wymiarową projekcją, jest zbiorem rzadkim w zbiorze wszystkich półgrup Markowa.

  • Music Mixing Process Controlled by Hand Gestures

    Publikacja

    W referacie przedstawiono system umożliwiający sterowanie procesami miksowania śladów nagrania muzycznego za pomocą gestów rąk. Przybliżono podstawy wielomodalnej percepcji argumentujące potrzebę powstania tego typu systemu oraz założenia przyjęte w trakcie jego tworzenia. Część sprzętowa systemu składa się z rzutnika multimedialnego, kamery internetowej, komputera klasy PC z zainstalowanym oprogramowaniem systemu oraz ekranu dla...

  • Automatic Rhythm Retrieval from Musical Files

    Publikacja

    - Rok 2008

    This paper presents a comparison of the effectiveness of two computational intelligence approaches applied to the task of retrieving rhythmic structure from musical files. The method proposed by the authors of this paper generates rhythmic levels first, and then uses these levels to compose rhythmic hypotheses. Three phases: creating periods, creating simplified hypotheses and creating full hypotheses are examined within this study....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking

    Publikacja

    Echo cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Automatic Classification of Polish Sign Language Words

    In the article we present the approach to automatic recognition of hand gestures using eGlove device. We present the research results of the system for detection and classification of static and dynamic words of Polish language. The results indicate the usage of eGlove allows to gain good recognition quality that additionally can be improved using additional data sources such as RGB cameras.

    Pełny tekst do pobrania w portalu

  • Automatic Analysis of Trajectories of Moving Objects

    Publikacja

    Ongoing monitoring is essential to providing security and safety of maritime and air operations. This paper presents the research in the area of automatic analysis of movement of unrestricted vehicles like ships and air-planes. The analysis is aimed at extraction of trajectory information, and the results can be used to identify anomalous behaviour in archived and real-time data. In this paper we focus on data acquired using the...

    Pełny tekst do pobrania w portalu

  • Production of six-degrees-of-freedom (6DoF) navigable audio using 30 Ambisonic microphones

    Publikacja
    • B. Mróz
    • M. Kabaciński
    • T. Ciotucha
    • A. Rumiński
    • T. Żernicki

    - Rok 2021

    This paper describes a method for planning, recording, and post-production of six-degrees-of-freedom audio recorded with multiple 3rd order Ambisonic microphone arrays. The description is based on the example of recordings conducted in August 2020 with the Poznan Philharmonic Orchestra using 30 units of Zylia ZM-1S. A convenient way to prepare and organize such a big project is proposed – this involves details of stage planning,...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Multimodal English corpus for automatic speech recognition

    A multimodal corpus developed for research of speech recognition based on audio-visual data is presented. Besides usual video and sound excerpts, the prepared database contains also thermovision images and depth maps. All streams were recorded simultaneously, therefore the corpus enables to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...

  • Application of smart glasses for fast and automatic color correction in health care

    Publikacja

    In recent years different applications of smart glasses in health care have been proposed. In this paper we present the experiments related to automatic color correction using smart glasses platform developed within the eGlasses project. The color pattern is proposed and tested enabling the automatic detection of the pattern and automatic correction of colors. Additionally, the method for encoding and decoding of patient ID in...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Impact of Climate Change on Water Sources and River‐Floodplain Mixing in the Natural Wetland Floodplain of Biebrza River

    Publikacja

    - WATER RESOURCES RESEARCH - Rok 2023

    The origins of river and floodplain waters (groundwater, rainfall, and snowmelt) and their extent during overbank flow events strongly impact ecological processes such as denitrification and vegetation development. However, the long-term sensitivity of floodplain water signatures to climate change remains elusive. We examined how the integrated hydrological model HydroGeoSphere and the Hydraulic Mixing-Cell method could help us...

    Pełny tekst do pobrania w portalu

  • Analysis of the Usefulness of Cheap Audio Recorders for Spectral Measurement of Environmental Noise

    Environmental noise pollution is nowadays one of the most serious health threats. The impact of noise on the human body depends not only on the sound level but also on its spectral distribution. Reliable measurements of the environmental noise spectrum are often hampered by the very high price of top quality measuring devices. This paper explores the possibility of using much cheaper audio recorders for the frequency analysis....

    Pełny tekst do pobrania w portalu