Search results for: audio parametrization


The results shown come from an alternative search.

Total results: 576

Analysis of allophones based on audio signal recordings and parameterization
Publication
- Journal of the Acoustical Society of America - Year 2017
The aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping...

Full text available for download from an external service
Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publication
- Year 2018
This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
Further developments of parameterization methods of audio stream analysis for security purposes
Publication
- P. Żwan
- A. Czyżewski
- Year 2009
The paper presents an automatic sound recognition algorithm intended for application in an audiovisual security monitoring system. A distributed character of security systems does not allow for simultaneous observation of multiple multimedia streams, thus an automatic recognition algorithm must be introduced. In the paper, a module for the parameterization and automatic detection of audio events is described. The spectral analyses...
Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization
Publication
- B. Kostek
- M. Piotrowska
- T. Ciszewski
- A. Czyżewski
- Year 2017
An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...
Data obtained via parametrization of differently mixed audio signals
Research Data
open access
- J. Stefański
- K. Marciniuk
Dataset consists of audio samples and the results of their parametrization. The extraction of music parameters was performed using MIRToolbox. Information extracted from the samples was used as a database for master's thesis titled 'The influence of audio signal processing chain in mixing on the emotional state of a music piece'.
JOURNAL OF THE AUDIO ENGINEERING SOCIETY

Journals

ISSN: 1549-4950
Parametrization of sounds for recognizing hazardous events
Publication
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2010
Modern surveillance systems operate by automatically detecting hazardous events through analysis of camera images and microphone sound. This publication focuses on the first stage of sound event recognition, namely sound parametrization. Effective operation of the system depends on finding parameters whose variability best reflects the characteristic features of the sound...
Parametrization and Correlation Analysis Applied to Music Mood Classification
Publication
- B. Kostek
- M. Piotrowska
- International Journal of Computational Intelligence Studies - Year 2013
The paper presents a study on music mood categorization. First, a review of music mood models is presented. Then, the preparation of a set of music excerpts to be used in the experiments and music parametrization is described. Next, some listening tasks performed to obtain mood descriptors are introduced. Finally, the correlation between mood descriptors and features extracted from parameters is discussed. The paper concludes with...

Full text available for download from an external service
Application of the neural networks for developing new parametrization of the Tersoff potential for carbon
Publication
- A. C. Nwachukwu
- S. Winczewski
- TASK Quarterly - Year 2020
Penta-graphene (PG) is a 2D carbon allotrope composed of a layer of pentagons having sp2- and sp3-bonded carbon atoms. A study carried out in 2018 has shown that the parameterization of the Tersoff potential proposed in 2005 by Erhart and Albe (T05 potential) performs better than other potentials available for carbon, being able to reproduce structural and mechanical properties of the PG. In this work, we tried to improve the...

Full text available for download in the portal
On geometry parameterization for simulation-driven design closure of antenna structures
Publication
- S. Kozieł
- A. Pietrenko-Dąbrowska
- Scientific Reports - Year 2021
Full-wave electromagnetic (EM) simulation tools have become ubiquitous in antenna design, especially final tuning of geometry parameters. From the reliability standpoint, the recommended realization of EM-driven design is through rigorous numerical optimization. It is a challenging endeavor with the major issues related to the high computational cost of the process, but also the necessity of handling several objectives and constraints...

Full text available for download in the portal
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Bożena Kostek prof. dr hab. inż.

People

Laboratorium Akustyki Fonicznej
Retrospecting Polish Audio Engineering Society Membership on 20th Anniversary of the Polish Section of the Audio Engineering Society
Publication
- B. Kostek
- M. Sankiewicz
- Archives of Acoustics - Year 2011
This article presents some key events in the founding of the Polish Section of the Audio Engineering Society. It also outlines the history of the International Symposia on Sound Engineering and Mastering and briefly reviews the papers contained in this issue.

Full text available for download in the portal
Automatic audio-visual threat detection
Publication
- J. Kotus
- J. Łopatka
- K. Kopaczewski
- A. Czyżewski
- Year 2010
The concept, practical realization and application of a system for detection and classification of hazardous situations based on multimodal sound and vision analysis are presented. The device consists of a new kind of multichannel miniature sound intensity sensors, digital Pan-Tilt-Zoom and fixed cameras, and a bundle of signal processing algorithms. The simultaneous analysis of multimodal signals can significantly improve the accuracy...
Objectivization of Audio-Visual Correlation analysis
Publication
- B. Kunka
- B. Kostek
- Archives of Acoustics - Year 2012
Simultaneous perception of audio and visual stimuli often causes the concealment or misrepresentation of information actually contained in these stimuli. Such effects are called the "image proximity effect" or the "ventriloquism effect" in the literature. Until recently, most research carried out to understand their nature was based on subjective assessments. The authors of this paper propose a methodology based on both subjective...

Full text available for download in the portal
Measurement of Latency in the Android Audio Path
Publication
- Year 2018
This paper describes experimental investigations comparing the audio path characteristics of various Android versions. First, the changes introduced in each system version and the latency they cause are presented. Then, a measurement procedure employing available applications to measure latency is described, and its results are compared with those published on the Internet. Finally, a comparison...

Full text available for download from an external service
Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger
Publication
- P. Żwan
- A. Czyżewski
- Journal of Digital Forensic Practice - Year 2010
The article describes an application that automatically detects sound events such as breaking glass, gunshots, explosions, and screams. The described system consists of a parametrization block and a classifier. The article compares parameters dedicated to this application with standard MPEG-7 descriptors. Two classifiers are also compared: one based on a perceptron (neural networks) and the other based on a Support Vector Machine....

Full text available for download from an external service
Automatic system for audio-video material reconstruction and archiving
Publication
- A. Kupryjanow
- A. Czyżewski
- Year 2008
The paper proposes a model of a system for automatic archiving and reconstruction of audio-video recordings. The premise of this solution is to make the reconstruction process less dependent on human involvement, with the aim of reducing the cost of reconstructing the processed recordings. Because of the large number of archival audio-video recordings, there is a need for a system that enables automatic indexing of their content. It will help...
Journal of the Audio Engineering Society

Journals

ISSN: 0004-7554
Journal of Radio & Audio Media

Journals

ISSN: 1937-6529 , eISSN: 1937-6537
Structures for parameterization, meshing and data exchange of topologically related surfaces of a ship hull
Publication
- A. Kniat
- Year 2010
This paper presents a proposal of data structures for storage and processing of a parametric three-dimensional model of midship hull sections. The model consists of coarse surfaces such as decks, frames, girders, stiffeners, brackets, and partitions, bounded by topological relations. All workshop details are omitted as the model is intended for numeric calculations. The proposed data structures are prepared to facilitate changes in the...
Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions
Publication
- K. Kąkol
- G. Korvel
- B. Kostek
- Year 2018
The aim of the work is to analyze the Lombard speech effect in recordings and then modify the speech signal in order to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of Lombard speech, and in particular on the effect of increasing the fundamental frequency...
Objectivization of audio-video correlation assessment experiments
Publication
- B. Kunka
- B. Kostek
- Year 2010
The purpose of this paper is to present a new method of conducting an audio-visual correlation analysis employing a head-motion-free gaze tracking system. First, a review of related works in the domain of sound and vision correlation is presented. Then assumptions concerning audio-visual scene creation are briefly described. The objectivization process of carrying out correlation tests employing a gaze-tracking system is outlined....

Full text available for download from an external service
A new method of audio-visual correlation analysis
Publication
- B. Kunka
- B. Kostek
- Year 2009
This paper presents a new methodology for conducting audio-visual correlation analysis employing a gaze tracking system. The interaction between two perceptual modalities, seeing and hearing, and their mutual reinforcement in a complex relationship have been the subject of many research studies. An earlier stage of the experiments carried out at the Multimedia Systems Department (MSD) showed that there exists a relationship between...

Full text available for download from an external service
A double-talk detector using audio watermarking
Publication
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2009
A novel approach to double-talk detection in the acoustic echo canceler is proposed. A hidden signature is embedded into the arriving signal using the echo-hiding method. Next, detection of the presence of this signature in the microphone signal is performed. The results of the signature detection may be used by the acoustic echo canceler to stop or restart the adaptation process.

Full text available for download from an external service
Intelligent algorithms for optical track audio restoration
Publication
- Year 2005
The paper presents two algorithms dedicated to reducing parasitic sound distortions found in optical soundtracks. The first algorithm reduces broadband noise in audio recordings. It uses a psychoacoustic hearing model based on the Unpredictability Measure of the signal. The quality of the noise reduction was assessed using intelligent methods....
Adaptive filter for reconstruction of stereo audio signals.
Publication
- K. Cisowski
- Year 2004
The article discusses a method for reconstructing stereo signals corrupted by impulsive disturbances. A stereo signal model is defined and a Kalman filter designed for this model is presented. Modifications of the filter are described, as a result of which the algorithm reconstructs the impulsively disturbed signal in one channel using additional information contained in the undisturbed samples of the signal coming from...
Intelligent video and audio applications for learning enhancement
Publication
- A. Czyżewski
- B. Kostek
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2011
The role of computers in school education is briefly discussed. The development history of multimodal interfaces is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with facial expression, and a speech stretching audio interface representing the audio modality....

Full text available for download in the portal
Multimodal Audio-Visual Recognition of Traffic Events
Publication
- Year 2011
A demonstrator of a system for detecting dangerous road traffic events, based on simultaneous analysis of visual and acoustic data, is presented. The system is part of an automatic security surveillance system. It uses cameras and microphones as data sources. The algorithms used are presented: sound event recognition algorithms and image analysis algorithms. The results of the algorithms are shown on the example of...
Personal adaptive tuning of mobile computer audio
Publication
- Year 2015
An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....
Detection of impulsive disturbances in archive audio signals
Publication
- M. Ciołek
- M. Niedźwiecki
- Year 2017
In this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...

Full text available for download in the portal
A Device for Measuring Auditory Brainstem Responses to Audio
Publication
- Year 2018
Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...

Full text available for download in the portal
A Study on Audio Signal Processed by "Instant Mastering"
Publication
- M. Piotrowska
- S. Piotrowski
- B. Kostek
- Year 2018
An increasing amount of music produced in home- and project-studios results in development and growth of "automatic mastering services". The presented investigation explores changes introduced to audio signal by various online mastering platforms. A music set consisting of 10 songs produced in small facilities was processed by eight on-line automatic mastering services. Additionally, some laboratory-constructed signals were tested....
Simple gait parameterization and 3D animation for anonymous visual monitoring based on augmented reality
Publication
- P. Szczuko
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016
The article presents a method for video anonymization and replacing real human silhouettes with virtual 3D figures rendered on a screen. The video stream is processed to detect and track objects, whereas the anonymization stage animates avatars according to the behavior of the detected persons. Location, movement speed, direction, and person height are taken into account during animation and rendering phases. This approach requires...

Full text available for download in the portal
Audio content analysis in the urban area telemonitoring system
Publication
- Year 2010
The article presents possibilities for extending urban monitoring with automatic sound analysis. Sound parametrization methods applicable in such a system are presented and technical aspects of the implementation are discussed. The next part presents the tree-based decision system used in the system. This system recognizes dangerous sounds (gunshot, broken glass, scream) among sounds recorded...

Full text available for download from an external service
Exploiting audio-visual correlation by means of gaze tracking
Publication
- B. Kunka
- B. Kostek
- International Journal of Computer Science and Applications - Year 2010
This paper presents a novel means for increasing audio-visual correlation analysis reliability. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the history of and current research in the area of audio-visual perception analysis are briefly reviewed. Then the methodology employing gaze tracking is presented along with the...

Full text available for download in the portal
Wireless intelligent audio-video surveillance prototyping system
Publication
- M. Kłosowski
- Przegląd Elektrotechniczny - Year 2013
The presented system is based on the Virtex6 FPGA and several supporting devices like a fast DDR3 memory, small HD camera, microphone with A/D converter, WiFi radio communication module, etc. The system is controlled by the Linux operating system. The Linux drivers for devices implemented in the system have been prepared. The system has been successfully verified in a H.264 compression accelerator prototype in which the most demanding...

Full text available for download in the portal
Audio codec employing frequency-derived tonality measure
Publication
- M. Kulesza
- A. Czyżewski
- Year 2009
A transform codec employing an efficient algorithm for detection of spectral tonal components is presented. The tonality measure used in the MPEG psychoacoustic model is replaced with a method providing adequate tonality estimates even if the tonal components are deeply frequency modulated. The reliability of the hearing threshold estimated using the psychoacoustic model with the standardized tonality measure and with the proposed one is investigated...
New algorithms for wow and flutter detection and compensation in audio
Publication
- Year 2005
The paper presents new methods for discriminating between natural musical effects and parasitic wow and flutter distortions. Additionally, it describes methods for determining the course of the wow and flutter distortion. These include: detection of signal periodicity in individual time frames, tracking of changes in power-line hum using AR-modeled signal spectra, and tracking of changes in the high-frequency bias current....
Wow detection and compensation employing spectral processing of audio.
Publication
- Year 2004
The work describes the developed algorithms for detection and compensation of parasitic frequency modulations resulting from irregular transport of the sound carrier. The proposed methods were developed with particular attention to accidental wow distortions present in archival film soundtracks. Additionally, the algorithms examine the influence of the distortions on the formant structure of signals. Analysis of changes in formant positions...
Applications of neural networks and perceptual masking to audio restoration
Publication
- A. Czyżewski
- Journal of New Music Research - Year 2002
Applications of learning algorithms in the field of audio recording restoration are discussed. Particular attention is paid to the use of artificial neural networks for removing disturbing impulses. In addition, the use of an intelligent decision algorithm to control perceptual masking for noise reduction is described.
Elimination of impulsive disturbances from stereo audio recordings
Publication
- M. Niedźwiecki
- M. Ciołek
- Year 2014
This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. On-line tracking of signal model parameters is performed using the stability-preserving Whittle-Wiggins-Robinson algorithm with exponential data weighting. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples...

Full text available for download from an external service
Using concentrated spectrogram for analysis of audio acoustic signals
Publication
- K. Czarnecki
- M. Moszyński
- HYDROACOUSTICS - Year 2012
The paper presents results of time-frequency analysis of audio acoustic signals using the method of Concentrated Spectrograph, also known as the "Cross-spectral method" or "Reassignment method". The presented algorithm uses the signal's local group delay and channelized instantaneous frequency to redistribute all Short-time Fourier transform lines in the time-frequency plane. The main intention of the paper is to compare various...

Full text available for download in the portal
Digital Audio Broadcasting or Webcasting: A Network Quality Perspective
Publication
- P. Falkowski-Gilski
- J. Stefański
- Journal of Telecommunications and Information Technology - Year 2016
In recent years, many alternative technologies of delivering audio content have emerged, with different advantages and disadvantages. In this paper pros and cons of digital audio broadcasting and webcasting transmission techniques in a network quality perspective are described. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...

Full text available for download in the portal
An audio-visual corpus for multimodal automatic speech recognition
Publication
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2017
A review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings are provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Full text available for download in the portal
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING

Journals

ISSN: 1063-6676
Comptabilite Controle Audit

Journals

ISSN: 1262-2788
Audio-visual surveillance system for application in bank operating room
Publication
- J. Kotus
- K. Łopatka
- A. Czyżewski
- G. Bogdanis
- Communications in Computer and Information Science - Year 2013
An audio-visual surveillance system able to detect, classify and to localize acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic...
