Wyniki wyszukiwania dla: ARCHIWIZACJA AUDIO-WIDEO

Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study

Publikacja

- Archives of Acoustics - Rok 2022

This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

Pełny tekst do pobrania w portalu

Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.

Publikacja

- Rok 2018

In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, audio signal is analyzed indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video , i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

Pełny tekst do pobrania w serwisie zewnętrznym

A study on of music features derived from audio recordings examples – a quantitative analysis

Publikacja

- Archives of Acoustics - Rok 2018

The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician, whose style evolved over time. Firstly, the origin and the background of the division of music genres were shortly presented. Then, several objective parameters of an audio signal were recalled that...

Pełny tekst do pobrania w portalu

Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology

Publikacja

- Intelligent Decision Technologies-Netherlands - Rok 2010

This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective...

Pełny tekst do pobrania w serwisie zewnętrznym

Localization of impulsive disturbances in archive audio signals using predictive matched filtering

Publikacja

- Rok 2014

The problem of elimination of impulsive disturbances from archive audio signals is considered and its new solution, called predictive matched filtering, is proposed. The new approach is based on the observation that a large percentage of noise pulses corrupting archive audio recordings have highly repetitive shapes that match several typical “patterns”, called click templates. To localize noise pulses, click templates can be correlated...

Pełny tekst do pobrania w serwisie zewnętrznym

Audio-visual surveillance system for application in bank operating room

Publikacja

- Communications in Computer and Information Science - Rok 2013

An audio-visual surveillance system able to detect, classify and to localize acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic...

Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking

Publikacja

- Rok 2011

Echo cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...

Pełny tekst do pobrania w serwisie zewnętrznym

Automatic audio signal mixing system based on one-dimensional Wave-U-Net autoencoders

Publikacja

D. Koszewski

- Rok 2023

The purpose of this dissertation is to develop an automatic song mixing system that is capable of automatically mixing a song with good quality in any music genre. This work recalls first the audio signal processing methods used in audio mixing, and it describes selected methods for automatic audio mixing. Then, a novel architecture built based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. Models...

Pełny tekst do pobrania w portalu

Production of six-degrees-of-freedom (6DoF) navigable audio using 30 Ambisonic microphones

Publikacja

B. Mróz
M. Kabaciński
T. Ciotucha
A. Rumiński
T. Żernicki

- Rok 2021

This paper describes a method for planning, recording, and post-production of six-degrees-of-freedom audio recorded with multiple 3rd order Ambisonic microphone arrays. The description is based on the example of recordings conducted in August 2020 with the Poznan Philharmonic Orchestra using 30 units of Zylia ZM-1S. A convenient way to prepare and organize such a big project is proposed – this involves details of stage planning,...

Pełny tekst do pobrania w serwisie zewnętrznym

Analysis of the Usefulness of Cheap Audio Recorders for Spectral Measurement of Environmental Noise

Publikacja

- Metrology - Rok 2023

Environmental noise pollution is nowadays one of the most serious health threats. The impact of noise on the human body depends not only on the sound level but also on its spectral distribution. Reliable measurements of the environmental noise spectrum are often hampered by the very high price of top quality measuring devices. This paper explores the possibility of using much cheaper audio recorders for the frequency analysis....

Pełny tekst do pobrania w portalu

Exploring Neural Networks for Musical Instrument Identification in Polyphonic Audio

Publikacja

M. Blaszke
G. Korvel
B. Kostek

- IEEE INTELLIGENT SYSTEMS - Rok 2024

The purpose of this paper is to introduce neural network-based methods that surpass state-of-the-art (SOTA) models, either by training faster or having simpler architecture, while maintaining comparable effectiveness in musical instrument identification in polyphonic music. Several approaches are presented, including two authors’ proposals, i.e., spiking neural networks (SNN) and a modular deep learning model named FMCNN (Fully...

Pełny tekst do pobrania w serwisie zewnętrznym

In Memoriam Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering

Publikacja

- Archives of Acoustics - Rok 2018

Biography and scientific achievements of Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering.

Pełny tekst do pobrania w portalu

Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online

Publikacja

P. Falkowski-Gilski

- Rok 2023

Radio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...

Pełny tekst do pobrania w serwisie zewnętrznym

Audio Feature Analysis for Precise Vocalic Segments Classification in English

Publikacja

- Rok 2020

An approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...

Pełny tekst do pobrania w serwisie zewnętrznym

Further developments of parameterization methods of audio stream analysis for secuirty purposes

Publikacja

- Rok 2009

The paper presents an automatic sound recognition algorithm intended for application in an audiovisual security monitoring system. A distributed character of security systems does not allow for simultaneous observation of multiple multimedia streams, thus an automatic recognition algorithm must be introduced. In the paper, a module for the parameterization and automatic detection of audio events is described. The spectral analyses...

Multimodal human-computer interfaces based on advanced video and audio analysis

Publikacja

- Rok 2013

Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart...

Pełny tekst do pobrania w serwisie zewnętrznym

Automatic audio-visual threat detection

Publikacja

- Rok 2010

The concept, practical realization and application of a system for detection and classification of hazardous situations based on multimodal sound and vision analysis are presented. The device consists of new kind multichannel miniature sound intensity sensors, digital Pan Tilt Zoom and fixed cameras and a bundle of signal processing algorithms. The simultaneous analysis of multimodal signals can significantly improve the accuracy...

Analysis of impact of lossy audio compression on the robustness of watermark embedded in the DWT domain for non-blind copyright protection

Publikacja

- Rok 2012

A methodology of non-blind watermarking of the audio content is proposed. The outline of audio copyright problem and motivation for practical applications are discussed. The algorithmic theory pertaining watermarking techniques is briefly introduced. The system architecture together with employed workflows for embedding and extracting the watermarks are described. The implemented approach is described and obtained results are reported....

Pełny tekst do pobrania w serwisie zewnętrznym

AUDIO SIGNAL EQUALIZATION BASED ON IMPULSE RESPONSE OF A LISTENING ROOM AND MUSIC CONTENT REPRODUCED

Publikacja

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2018

A research study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, a concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....

Energy Efficiency Study of Audio-video Content Consumption on Selected Android Mobile Terminals

Publikacja

- Rok 2021

Mobile devices are widely used by billions of users worldwide. Thanks to their main advantage, which is portability, they should be fully operational as long as possible, without the need to recharge or connect them to external power sources. This paper describes a study, carried out on four different mobile devices, with different hardware and software parameters, running the Android operating system. The research campaign involved...

Pełny tekst do pobrania w serwisie zewnętrznym

Towards Audio Signal Equalization Based on Spectral Characteristics of a Listening Room and Music Content Reproduced

Publikacja

- Rok 2018

This study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, the concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....

Pełny tekst do pobrania w serwisie zewnętrznym

New semi-causal and noncausal techniques for detection of impulsive disturbances in multivariate signals with audio applications

Publikacja

- IEEE TRANSACTIONS ON SIGNAL PROCESSING - Rok 2017

This paper deals with the problem of localization of impulsive disturbances in nonstationary multivariate signals. Both unidirectional and bidirectional (noncausal) detection schemes are proposed. It is shown that the strengthened pulse detection rule, which combines analysis of one-step-ahead signal prediction errors with critical evaluation of leave-one-out signal interpolation errors, allows one to noticeably improve detection results...

Pełny tekst do pobrania w portalu

Elimination of impulsive disturbances from archive audio files – comparison of three noise pulse detection schemes

Publikacja

- Rok 2014

The problem of elimination of impulsive disturbances (such as clicks, pops, ticks, crackles, and record scratches) from archive audio recordings is considered and solved using autoregressive modeling. Three classical noise pulse detection schemes are examined and compared: the approach based on open-loop multi-step-ahead signal prediction, the approach based on decision-feedback signal prediction, and the double threshold approach,...

Pełny tekst do pobrania w serwisie zewnętrznym

A Device for Measuring Auditory Brainstem Responses to Audio

Publikacja

- Rok 2018

Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...

Pełny tekst do pobrania w portalu

Multimodal Audio-Visual Recognition of Traffic Events

Publikacja

- Rok 2011

Przedstawiono demonstrator systemu wykrywania niebezpiecznych zdarzeń w ruchu drogowym oparty na jednoczesnej analizie danych wizyjnych i akustycznych. System jest częścią systemu automatycznego nadzoru bezpieczeństwa. Wykorzystuje on kamery i mikrofony jako źródła danych. Przedstawiono wykorzystane algorytmy - algorytmy rozpoznawania zdarzeń dźwiękowych oraz analizy obrazu. Zaprezentowano wyniki działania algorytmów na przykładzie...

Adaptive filter for reconstruction of stereo audio signals.

Publikacja

K. Cisowski

- Rok 2004

Artykuł poświęcony jest omówieniu metody rekonstrukcji zakłóconych impulsowo sygnałów stereofonicznych. W pracy zdefiniowano model sygnału stereofonicznego i przedstawiono zaprojektowany dla tego modelu filtr Kalmana. Przedstawiono modyfikacje filtru, w wyniku których algorytm dokonuje rekonstrukcji zakłóconego impulsowo sygnału w jednym kanale z wykorzystaniem dodatkowej informacji zawartej w niezakłóconych próbkach sygnału pochodzącego...

Intelligent algorithms for optical track audio restoration

Publikacja

- Rok 2005

W referacie przedstawiono dwa algorytmy dedykowane redukcji pasożytniczych zniekształceń dźwięku spotykanych w optycznych ścieżkach dźwiękowych. Pierwszy algorytm umożliwia redukcję szerokopasmowego szumu w nagraniach fonicznych. Wykorzystano w nim psycho-akustyczny model słuchu oparty o miarę nieprzewidywalność sygnału (ang. Unpredictability Measure). Ocena jakości redukcji szumu została wykonana z wykorzystaniem metod inteligentnych....

Manifest prysznicowy: jakiej chciałbym sztuki, wideo, wystawa Wolne pokoje: Fiks, Gdańsk 2019

Publikacja

P. Różycki

- Rok 2019

FIKS (wystawa zbiorowa) Przemiany współczesnych społeczeństw i zachwianie poczucia bezpieczeństwa mają swoje odbicie w pewnych zachowaniach jednostek. Wynika to nie rzadko z frustracji, poczucia winy lub lęku. Gdzieś tam na innym poziomie świadomości nawykowe mechanizmy obronne niekoniecznie muszą być patologiczne. Codzine rytualy, metody, różnorodność osobistych tarczy, systemów i amuletów, fazy, cykle, niekończący sie kołowrotek. Zwykły...

A commonly-accessible toolchain for live streaming music events with higher-order ambisonic audio and 4k 360 vision

Publikacja

B. Mróz
P. Odya
P. Danowski
M. Kabaciński

- Rok 2023

An immersive live stream is especially interesting in the ongoing development of telepresence tools, especially in the virtual reality (VR) or mixed reality (MR) domain. This paper explores the remote and immersive way of enabling telepresence for the audience to high-fidelity music performance using freely-available and easily-accessible tools. A functional VR live-streaming toolchain, comprising 360 vision and higher-order ambisonic...

Pełny tekst do pobrania w portalu

Audio codec employing frequency-derived tonality measure

Publikacja

- Rok 2009

A transform codec employing efficient algorithm for detection of spectral tonal components is presented. The tonality measure used in MPEG psychoacoustic model is replaced with the method providing adequate tonality estimates even if the tonal components are deeply frequency modulated. The reliability of hearing threshold estimated using psychoacoustic model with standardized tonality measure and the proposed one is investigated...

Wireless intelligent audio-video surveillance prototyping system

Publikacja

M. Kłosowski

- Przegląd Elektrotechniczny - Rok 2013

The presented system is based on the Virtex6 FPGA and several supporting devices like a fast DDR3 memory, small HD camera, microphone with A/D converter, WiFi radio communication module, etc. The system is controlled by the Linux operating system. The Linux drivers for devices implemented in the system have been prepared. The system has been successfully verified in a H.264 compression accelerator prototype in which the most demanding...

Pełny tekst do pobrania w portalu

Analysis of allophones based on audio signal recordings and parameterization

Publikacja

- Journal of the Acoustical Society of America - Rok 2017

The aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping...

Pełny tekst do pobrania w serwisie zewnętrznym

New algorithms for wow and flutter detection and compensation in audio

Publikacja

- Rok 2005

W referacie przedstawiono nowe metody dyskryminacji naturalnych efektów muzycznych i pasożytniczych zniekształceń drżenia dźwięku. Dodatkowo, opisano w nim metody wyznaczania przebiegu zniekształceń drżenia. Wśród nich znajdują się: detekcja okresowości sygnału w poszczególnych ramkach czasowych, śledzenie zmian przydźwięku sieciowego wykorzystujące modelowane AR widma sygnału, śledzenie zmian wysokoczęstotliwościowego prądu podkładu....

Applications of neural networks and perceptual masking to audio restoration

Publikacja

A. Czyżewski

- Journal of New Music Research - Rok 2002

Omówiono zastosowania algorytmów uczących się w dziedzinie rekonstruowania nagrań fonicznych. Szczególną uwagę zwrócono na zastosowanie sztucznych sieci neuronowych do usuwania zakłócających impulsów. Ponadto opisano zastosowanie inteligentnego algorytmu decyzyjnego do sterowania maskowaniem perceptualnym w celu redukowania szumu.

Wow detection and compensation employing spectral processing of audio.

Publikacja

- Rok 2004

Praca zawiera opis opracowanych algorytmów detekcji i kompensacji pasożytniczych modulacji częstotliwości wynikających z nierównomiernego przesuwu nośnika dźwięku. Proponowane metody opracowano ze szczególnym uwzględnieniem przypadkowych zniekształceń drżenia obecnych w archiwalnych filmowych ścieżkach dźwiękowych. Dodatkowo algorytmy badają wpływ zniekształceń na strukturę formantową sygnałów. Analiza zmian położenia formantów...

New algorithms for wow and flutter detection and compensation in audio

Publikacja

- Rok 2005

W referacie przedstawiono nowe metody dyskryminacji naturalnych efektów muzycznych i pasożytniczych zniekształceń drżenia dźwięku. Dodatkowo, opisano w nim metody wyznaczania przebiegu zniekształceń drżenia. Wśród nich znajdują się: detekcja okresowości sygnału w poszczególnych ramkach czasowych, śledzenie zmian przydźwięku sieciowego wykorzystujące modelowane AR widma sygnału, śledzenie zmian wysokoczęstotliwościowego prądu podkładu....

Two-stage method of impulsive noise detection for audio signals

Publikacja

K. Cisowski

- Poznan University of Technology Academic Journals. Electrical Engineering - Rok 2007

Przedstawiono nowa dwuetapową metodę detekcji zakłóceń impulsowych opartą na analizie funkcji gęstości rozkładu prawdopodobieństwa zakłóconego sygnału. Opisano algorytm określania poziomu wyzwalania detektora progowego.

Multimodal human-computer interfaces based on advanced video and audio analysis

Publikacja

- Advances in Intelligent Systems and Computing - Rok 2014

Multimodal interfaces development history is reviewed briefly in the introduction. Some applications of multimodal interfaces to education software for disabled people are presented. One of them, the LipMouse is a novel, vision-based human-computer interface that tracks user’s lip movements and detect lips gestures. A new approach to diagnosing Parkinson’s disease is also shown. The progression of the disease can be measured employing...

Pełny tekst do pobrania w serwisie zewnętrznym

Noise reduction in audio employing spectral unpredictability measure and neural net.

Publikacja

- Rok 2004

modelu psychoakustycznym zostały przedyskutowane. Uczący się algorytm decyzjny, działający w opraciu o sztuczną sieć neuronową wykorzystany został w klasyfikacji składowych na pasożytnicze i użyteczne. Przedstawiona została również nowa iteracyjna procedura obliczania progu maskowania. W pracy zawarte zostały wyniki eksperymentów, oraz konkluzje odnoszące się do przedstawionych algorytmów.

Pomiary wartości opóźnień w torze audio urządzeń z systemem Android

Publikacja

- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Rok 2018

Poniższy artykuł opisuje metody pomiarów wartości opóźnienia w torze fonicznym urządzeń pracujących na różnych wersjach systemu Android. W pierwszej części artykułu podano krótką charakterystykę środowiska Android w kontekście opóźnień w torze fonicznym. Następnie przedstawiono sposób pomiaru opóźnienia w torze fonicznym za pomocą aplikacji SuperPowered Latency oraz Dr. Rick O’Rang Loopback. W końcowej...

Pełny tekst do pobrania w portalu

Intelligent acquisition of audio signals, employing neutral networks and rough set algorithms

Publikacja

A. Czyżewski

- Rok 2003

Algorytmy oparte na sztucznych sieciach neuronowych i metodzie zbiorówprzybliżonych zostały zastosowane do lokalizacji sygnałów fonicznych obar-czonych pasożytniczym szumem i rewerberacjami. Informacja o kierunku napły-wania dźwięku była uzyskiwana na wyjściach tych algorytmów na podstawie re-prezentacji parametrycznej. Przedstawiono wyniki eksperymentalne i przepro-wadzono ich dyskusję.

Machine Learning Applied to Aspirated and Non-Aspirated Allophone Classification—An Approach Based on Audio "Fingerprinting"

Publikacja

- Rok 2018

The purpose of this study is to involve both Convolutional Neural Networks and a typical learning algorithm in the allophone classification process. A list of words including aspirated and non-aspirated allophones pronounced by native and non-native English speakers is recorded and then edited and analyzed. Allophones extracted from English speakers’ recordings are presented in the form of two-dimensional spectrogram images and...

Pełny tekst do pobrania w serwisie zewnętrznym

Evaluation of Six Degrees of Freedom 3D Audio Orchestra Recording and Playback using multi-point Ambisonic interpolation

Publikacja

T. Ciotucha
A. Rumiński
T. Żernicki
B. Mróz

- Scopus - Rok 2021

This paper describes a strategy for recording sound and enabling six-degrees-of-freedom playback, making use of multiple simultaneous and synchronized Higher Order Ambisonics (HOA) recordings. Such a strategy enables users to navigate in a simulated 3D space and listen to the six-degrees-of-freedom recordings from different perspectives. For the evaluation of the proposed approach, an Unreal Engine-based navigable 3D audiovisual...

Pełny tekst do pobrania w serwisie zewnętrznym

Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams

Publikacja

K. Łopatka

- Rok 2015

A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classication of acoustic events are introduced. The recognition engine is based onthreshold-based detection with adaptive threshold and Support Vector Machine classifcation. Spectral, temporal and mel-frequency descriptors are used as signal features. The...

Testing A Novel Gesture-Based Mixing Interface

Publikacja

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2013

With a digital audio workstation, in contrast to the traditional mouse-keyboard computer interface, hand gestures can be used to mix audio with eyes closed. Mixing with a visual representation of audio parameters during experiments led to broadening the panorama and a more intensive use of shelving equalizers. Listening tests proved that the use of hand gestures produces mixes that are aesthetically as good as those obtained using...

Pełny tekst do pobrania w portalu

Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing

Publikacja

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2020

Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Pełny tekst do pobrania w portalu

Adaptive Personal Tuning of Sound in Mobile Computers

Publikacja

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2016

An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...

Pełny tekst do pobrania w portalu

Editor's note and 2018 reviewers

Publikacja

B. Kostek

- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2018

Przedmiotem pracy jest odniesienie do prac opublikowanych w 2018 roku, jak również do serii artykułów w ramach specjalnego wydania: Special Issue on Augmented and Participatory Sound and Music Interaction Using Semantic Audio.

Pełny tekst do pobrania w serwisie zewnętrznym

KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY

Publikacja

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Rok 2016

W referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus...

EVENTS VISUALIZATION POST IN A DISTRIBUTED TELEINFORMATION SYSTEM FOR THE BORDER GUARD

Publikacja

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Rok 2017

Events Visualization Post is a part of the STRADAR project, which is dedicated to streaming real-time data in distributed dispatcher and teleinformation systems of the Border Guard. Events Visualization Post is a software designed for simultaneous visualization of data of different types. In the paper, the structure of the software is presented, the process of generation of tasks is described, and the visualization of audio, files,...

Wyszukiwarka

Filtry

Katalog

Kategoria

Rok

Opcje

Wyniki wyszukiwania dla: ARCHIWIZACJA AUDIO-WIDEO