Wyniki wyszukiwania dla: AUDIO CODING

Wyniki wyszukiwania dla: AUDIO CODING

wyników na stronę:
osadź ten widok na swojej stronie

Wyświetlane wyniki pochodzą z wyszukiwania alternatywnego.

Filtry

wszystkich: 3200

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

Two-stage method of impulsive noise detection for audio signals
Publikacja
- K. Cisowski
- Poznan University of Technology Academic Journals. Electrical Engineering - Rok 2007
Przedstawiono nowa dwuetapową metodę detekcji zakłóceń impulsowych opartą na analizie funkcji gęstości rozkładu prawdopodobieństwa zakłóconego sygnału. Opisano algorytm określania poziomu wyzwalania detektora progowego.
Exploring Neural Networks for Musical Instrument Identification in Polyphonic Audio
Publikacja
- M. Blaszke
- G. Korvel
- B. Kostek
- IEEE INTELLIGENT SYSTEMS - Rok 2024
The purpose of this paper is to introduce neural network-based methods that surpass state-of-the-art (SOTA) models, either by training faster or having simpler architecture, while maintaining comparable effectiveness in musical instrument identification in polyphonic music. Several approaches are presented, including two authors’ proposals, i.e., spiking neural networks (SNN) and a modular deep learning model named FMCNN (Fully...

Pełny tekst do pobrania w serwisie zewnętrznym
Localization of impulsive disturbances in audio signals using template matching
Publikacja
- M. Niedźwiecki
- M. Ciołek
- DIGITAL SIGNAL PROCESSING - Rok 2015
In this paper, a new solution to the problem of elimination of impulsive disturbances from audio signals, based on the matched filtering technique, is proposed. The new approach stems from the observation that a large proportion of noise pulses corrupting audio recordings have highly repetitive shapes that match several typical “patterns”. In many cases a representative set of exemplary pulse waveforms can be extracted from the...

Pełny tekst do pobrania w portalu
Audio-visual surveillance system for application in bank operating room
Publikacja
- J. Kotus
- K. Łopatka
- A. Czyżewski
- G. Bogdanis
- Communications in Computer and Information Science - Rok 2013
An audio-visual surveillance system able to detect, classify and to localize acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic...
Testing Watermark Robustness against Application of Audio Restoration Algorithms
Publikacja
- Rok 2013
The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to audio restoration algorithm performing. Several restoration routines such as noise reduction, spectrum expansion, clipping or clicks reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...

Pełny tekst do pobrania w serwisie zewnętrznym
Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking
Publikacja
- A. Ciarkowski
- A. Czyżewski
- Rok 2011
Echo cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...

Pełny tekst do pobrania w serwisie zewnętrznym
Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publikacja
- Rok 2018
This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
Audio Feature Analysis for Precise Vocalic Segments Classification in English
Publikacja
- S. Zaporowski
- A. Czyżewski
- Rok 2020
An approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...

Pełny tekst do pobrania w serwisie zewnętrznym
IEEE Transactions on Audio Speech and Language Processing

Czasopisma

ISSN: 1558-7916
Moving Image

Czasopisma

ISSN: 1532-3978 , eISSN: 1542-4235
Structure and Bonding

Czasopisma

ISSN: 0081-5993
Adaptive resolution-constrained scalar multiple-description coding
Publikacja
- J. Klejsa
- M. Kuropatwinski
- W. Bastiaan
- M. Kuropatwiński
- Rok 2008
Pełny tekst do pobrania w serwisie zewnętrznym
Further developments of parameterization methods of audio stream analysis for secuirty purposes
Publikacja
- P. Żwan
- A. Czyżewski
- Rok 2009
The paper presents an automatic sound recognition algorithm intended for application in an audiovisual security monitoring system. A distributed character of security systems does not allow for simultaneous observation of multiple multimedia streams, thus an automatic recognition algorithm must be introduced. In the paper, a module for the parameterization and automatic detection of audio events is described. The spectral analyses...
Noise reduction in audio employing spectral unpredictability measure and neural net.
Publikacja
- A. Czyżewski
- M. Dziubiński
- Rok 2004
modelu psychoakustycznym zostały przedyskutowane. Uczący się algorytm decyzjny, działający w opraciu o sztuczną sieć neuronową wykorzystany został w klasyfikacji składowych na pasożytnicze i użyteczne. Przedstawiona została również nowa iteracyjna procedura obliczania progu maskowania. W pracy zawarte zostały wyniki eksperymentów, oraz konkluzje odnoszące się do przedstawionych algorytmów.
Analysis of the Usefulness of Cheap Audio Recorders for Spectral Measurement of Environmental Noise
Publikacja
- J. Skibicki
- R. Licow
- N. Karkosińska-Brzozowska
- K. Daliga
- P. Chrostowski
- A. Wilk
- K. Karwowski
- M. Szafrański
- T. Widerski
- L. Jarzębowicz... i 4 innych
- Metrology - Rok 2023
Environmental noise pollution is nowadays one of the most serious health threats. The impact of noise on the human body depends not only on the sound level but also on its spectral distribution. Reliable measurements of the environmental noise spectrum are often hampered by the very high price of top quality measuring devices. This paper explores the possibility of using much cheaper audio recorders for the frequency analysis....

Pełny tekst do pobrania w portalu
Multimodal human-computer interfaces based on advanced video and audio analysis
Publikacja
- Rok 2013
Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart...

Pełny tekst do pobrania w serwisie zewnętrznym
System do prototypowania bezprzewodowych inteligentnych urządzeń monitoringu audio-video
Publikacja
- M. Kłosowski
- Rok 2013
W komunikacie przedstawiono system prototypowania bezprzewodowych urządzeń do monitoringu audio-video. System bazuje na układach FPGA Virtex6 i wielu dodatkowych wspierających urządzeniach jak: szybka pamięć DDR3, mała kamera HD, mikrofon z konwerterem A/C, moduł radiowy WiFi, itp. Funkcjonalność systemu została szczegółowo opisana w komunikacie. System został zoptymalizowany do pracy pod kontrolą systemu operacyjnego Linux, zostały...
Multimodal human-computer interfaces based on advanced video and audio analysis
Publikacja
- Advances in Intelligent Systems and Computing - Rok 2014
Multimodal interfaces development history is reviewed briefly in the introduction. Some applications of multimodal interfaces to education software for disabled people are presented. One of them, the LipMouse is a novel, vision-based human-computer interface that tracks user’s lip movements and detect lips gestures. A new approach to diagnosing Parkinson’s disease is also shown. The progression of the disease can be measured employing...

Pełny tekst do pobrania w serwisie zewnętrznym
Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture
Publikacja
- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2015
The aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed,...

Pełny tekst do pobrania w portalu
Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization
Publikacja
- B. Kostek
- M. Piotrowska
- T. Ciszewski
- A. Czyżewski
- Rok 2017
An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...
Quality Analysis of Audio-Video Transmission in an OFDM-Based Communication System
Publikacja
- M. Zamłyńska
- G. Debita
- P. Falkowski-Gilski
- Rok 2022
Application of a reliable audio-video communication system, brings many advantages. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. With the availability of visual information one can monitor the surrounding, working environment, etc. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission. Currently, orthogonal frequency...

Pełny tekst do pobrania w serwisie zewnętrznym
Speech codec enhancements utilizing time compression and perceptual coding
Publikacja
- M. Kulesza
- A. Czyżewski
- Rok 2007
A method for encoding wideband speech signal employing standardized narrowband speech codecs is presented as well as experimental results concerning detection of tonal spectral components. The speech signal sampled with a higher sampling rate than it is suitable for narrowband coding algorithm is compressed in order to decrease the amount of samples. Next, the time-compressed representation of a signal is encoded using a narrowband...
A hybrid speech codec employing parametric and perceptual coding techniques
Publikacja
- Rok 2006
W referacie przedstawiono hybrydowy kodek mowy dla zastosowan w komunikacji VoIP wykorzystujący kodowanie parametryczne i percetualne. Sygnał mowy jest dzielony na składowe dźwięczne, które podlegają kodowania perceptualnemu, składowe bezdźwięczne, które kodowane są metodą parametryczną oraz transjenty, które nie są kodowane żadną stratną metodą. Dodatkowo przedstawiono architekturę kodeka, w której perceptualnie kodowana i przesyłana...
Underwater acoustic communications system with error correction and synchronization coding
Publikacja
- Rok 2006
Niezawodna transmisja danych w wielodrogowym i niestacjonarnym płytkim kanale podwodnym wymaga zastosowania efektywnej techniki modulacji oraz equalizacji adaptacyjnej. W zaproponowanym artykule w systemie transmisji danych zastosowano modulację OFDM oraz equalizację adaptacyjną opartą o filtrację Kalmana. Ponadto zaimplementowano dwie techniki kodowania: FEC w celu eliminacji błędów transmisji oraz kodowanie pseudoszumowe, którego...
Intelligent acquisition of audio signals, employing neutral networks and rough set algorithms
Publikacja
- A. Czyżewski
- Rok 2003
Algorytmy oparte na sztucznych sieciach neuronowych i metodzie zbiorówprzybliżonych zostały zastosowane do lokalizacji sygnałów fonicznych obar-czonych pasożytniczym szumem i rewerberacjami. Informacja o kierunku napły-wania dźwięku była uzyskiwana na wyjściach tych algorytmów na podstawie re-prezentacji parametrycznej. Przedstawiono wyniki eksperymentalne i przepro-wadzono ich dyskusję.
Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online
Publikacja
- P. Falkowski-Gilski
- Rok 2023
Radio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...

Pełny tekst do pobrania w serwisie zewnętrznym
Classifying type of vehicles on the basis of data extracted from audio signal characteristics
Publikacja
- Journal of the Acoustical Society of America - Rok 2017
The aim of this study is to find and optimize a feature vector for an automatic recognition of the type of vehicles, extracted form an audio signal. First, the influence of weather-based conditions of road surface on spectral characteristic of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...

Pełny tekst do pobrania w serwisie zewnętrznym
Localization of impulsive disturbances in archive audio signals using predictive matched filtering
Publikacja
- M. Niedźwiecki
- M. Ciołek
- Rok 2014
The problem of elimination of impulsive disturbances from archive audio signals is considered and its new solution, called predictive matched filtering, is proposed. The new approach is based on the observation that a large percentage of noise pulses corrupting archive audio recordings have highly repetitive shapes that match several typical “patterns”, called click templates. To localize noise pulses, click templates can be correlated...

Pełny tekst do pobrania w serwisie zewnętrznym
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publikacja
- Rok 2018
In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, audio signal is analyzed indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video , i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

Pełny tekst do pobrania w serwisie zewnętrznym
A study on of music features derived from audio recordings examples – a quantitative analysis
Publikacja
- A. Dorochowicz
- B. Kostek
- Archives of Acoustics - Rok 2018
The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician, whose style evolved over time. Firstly, the origin and the background of the division of music genres were shortly presented. Then, several objective parameters of an audio signal were recalled that...

Pełny tekst do pobrania w portalu
Pomiary wartości opóźnień w torze audio urządzeń z systemem Android
Publikacja
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Rok 2018
Poniższy artykuł opisuje metody pomiarów wartości opóźnienia w torze fonicznym urządzeń pracujących na różnych wersjach systemu Android. W pierwszej części artykułu podano krótką charakterystykę środowiska Android w kontekście opóźnień w torze fonicznym. Następnie przedstawiono sposób pomiaru opóźnienia w torze fonicznym za pomocą aplikacji SuperPowered Latency oraz Dr. Rick O’Rang Loopback. W końcowej...

Pełny tekst do pobrania w portalu
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
Publikacja
- B. Kostek
- Rok 2022
In this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

Pełny tekst do pobrania w portalu
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
Publikacja
- M. Niedźwiecki
- M. Ciołek
- IEEE Transactions on Audio Speech and Language Processing - Rok 2013
In this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...

Pełny tekst do pobrania w portalu
RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING
Publikacja
- M. Niedźwiecki
- M. Ciołek
- Rok 2013
The paper presents a new approach to elimination of broadband noise and impulsive disturbances from archive audio recordings. The proposed adaptive Kalman-like algorithm, based on a sparse autoregressive model of the audio signal, simultaneously detects noise pulses, interpolates the irrevocably distorted samples and performs signal smoothing. It is shown that bidirectional (forward-backward) processing of the archive signal improves...

Pełny tekst do pobrania w serwisie zewnętrznym
Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model
Publikacja
- K. Cisowski
- Rok 2008
The paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modelingof signals was introduced. the algorithm of the noise detection, based on discrete-time hidden Markow model (HMM)of whitened audio signal is elaborated
Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology
Publikacja
- Intelligent Decision Technologies-Netherlands - Rok 2010
This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective...

Pełny tekst do pobrania w serwisie zewnętrznym
Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances
Publikacja
- M. Niedźwiecki
- M. Ciołek
- Rok 2015
Archive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...

Pełny tekst do pobrania w serwisie zewnętrznym
AUDIO SIGNAL EQUALIZATION BASED ON IMPULSE RESPONSE OF A LISTENING ROOM AND MUSIC CONTENT REPRODUCED
Publikacja
- P. Hoffmann
- B. Kostek
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2018
A research study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, a concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....
Production of six-degrees-of-freedom (6DoF) navigable audio using 30 Ambisonic microphones
Publikacja
- B. Mróz
- M. Kabaciński
- T. Ciotucha
- A. Rumiński
- T. Żernicki
- Rok 2021
This paper describes a method for planning, recording, and post-production of six-degrees-of-freedom audio recorded with multiple 3rd order Ambisonic microphone arrays. The description is based on the example of recordings conducted in August 2020 with the Poznan Philharmonic Orchestra using 30 units of Zylia ZM-1S. A convenient way to prepare and organize such a big project is proposed – this involves details of stage planning,...

Pełny tekst do pobrania w serwisie zewnętrznym
Energy Efficiency Study of Audio-video Content Consumption on Selected Android Mobile Terminals
Publikacja
- P. Falkowski-Gilski
- M. Pańkowski
- Rok 2021
Mobile devices are widely used by billions of users worldwide. Thanks to their main advantage, which is portability, they should be fully operational as long as possible, without the need to recharge or connect them to external power sources. This paper describes a study, carried out on four different mobile devices, with different hardware and software parameters, running the Android operating system. The research campaign involved...

Pełny tekst do pobrania w serwisie zewnętrznym
Technology audit tool for strategic innovation in SME
Publikacja
- J. Wojciechowski
- W. Przybylski
- Rok 2006
Wymagania stawiane przez współczesny rynek wraz z jego wyzwaniami i zagrożeniami potrzebują ciągłej aktywności małych i średnich przedsiębiorstw w obszarze doskonalenia ustawicznego oraz innowacji po to, by przetrwać we współczesnym konkurencyjnym świecie. Referat prezentuje praktyczne podejścia poprawiające równowagę głównych czynników rynkowych, tj. jakości, ceny i elestyczności jako głównych wskaźników konkurencyjności. Audyt...
Anxiety, Stress & Coping: An International Journal

Czasopisma

ISSN: 1061-5806 , eISSN: 1477-2205
Auditory Brainstem Responses recorded employing Audio ABR device
Dane Badawcze
open access
- P. Odya
- A. Czyżewski
The dataset consists of ABR measurements employing click, burst and speech stimuli. Parameters of the particular stimuli were as follows:
CLONING AND STEM CELLS

Czasopisma

ISSN: 1536-2302
Journal of Caring Sciences

Czasopisma

ISSN: 2251-9920
Music and the Moving Image

Czasopisma

ISSN: 2167-8464 , eISSN: 1940-7610
Bidirectional voting and continous voting concepts as possible use of Internet in democratic voting process
Publikacja
- J. Wachowicz
- Rok 2010
Democracies need elections for choosing their authorities and governments.This process has many factors that shape today's procedures. However, the Internet is a medium that may change possibilities and elections. The main issue is concern on how changes may influence the whole democratic process. this paper shows two possible ideas - that of bidirectional voting and continous voting., and considers possible reasons for introducing...
IEEE-ACM Transactions on Audio Speech and Language Processing

Czasopisma

ISSN: 2329-9290
Towards Audio Signal Equalization Based on Spectral Characteristics of a Listening Room and Music Content Reproduced
Publikacja
- P. Hoffmann
- B. Kostek
- Rok 2018
This study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, the concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....

Pełny tekst do pobrania w serwisie zewnętrznym
Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"
Publikacja
- Rok 2018
The purpose of this study is to involve both Convolutional Neural Networks and a typical learning algorithm in the allophone classification process. A list of words including aspirated and non-aspirated allophones pronounced by native and non-native English speakers is recorded and then edited and analyzed. Allophones extracted from English speakers’ recordings are presented in the form of two-dimensional spectrogram images and...

Pełny tekst do pobrania w serwisie zewnętrznym

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: AUDIO CODING