Search results for: audio parametrization

The results shown come from an alternative search.

Total results: 576

Fitting the mobile device characteristics to the user's hearing preferences
Publication
- Year 2014
A method for fitting the mobile computer audio characteristics to the user's hearing preferences is proposed. The process consists of two stages: calibration and dynamics processing. During the calibration phase, the user performs a loudness scaling test, reporting the perceived loudness. The dynamics processing performed on this basis sets the loudness to the most comfortable level. The processing accounts both...

Full text available for download from an external service
Data, Information, Knowledge, Wisdom Pyramid Concept Revisited in the Context of Deep Learning
Publication
- B. Kostek
- Year 2023
In this paper, the data, information, knowledge, and wisdom (DIKW) pyramid is revisited in the context of deep learning applied to machine learning-based audio signal processing. A discussion on the DIKW schema is carried out, resulting in a proposal that may supplement the original concept. Parallels between DIKW pertaining to audio processing are presented based on examples of the case studies performed by the author and her collaborators....

Full text available for download from an external service
Comparison of perforator location in dynamic and static thermographic imaging with Doppler ultrasound in breast reconstruction surgery
Publication
- S. Kołacz
- M. Moderhak
- J. Jankau
- Year 2016
This paper compares the effectiveness of the dTnorm and t90_10 parametrizations in dynamic thermography for imaging the location of perforators in TRAM flaps in the intraoperative period. The results were compared with the location detected in a Doppler ultrasound examination. Cold and heat stimulation was used in dynamic thermography. Additionally, these results were compared with static...

Full text available for download from an external service
Real and imaginary motion classification based on rough set analysis of EEG signals for multimedia applications
Publication
- P. Szczuko
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2017
Rough set-based approach to the classification of EEG signals of real and imaginary motion is presented. The pre-processing and signal parametrization procedures are described, the rough set theory is briefly introduced, and several classification scenarios and parameters selection methods are proposed. Classification results are provided and discussed with their potential utilization for multimedia applications controlled by the...

Full text available for download in the portal
Report of the ISMIS 2011 Contest : Music Information Retrieval
Publication
- B. Kostek
- A. Kupryjanow
- P. Żwan
- W. Jiang
- Z. W. Raś
- M. Wojnarski
- J. Świetlicka
- Year 2011
This report presents an overview of the data mining contest organized in conjunction with the 19th International Symposium on Methodologies for Intelligent Systems (ISMIS 2011), between Jan 10 and Mar 21, 2011, on the TunedIT competition platform. The contest consisted of two independent tasks, both related to music information retrieval: recognition of music genres and recognition of instruments, for a given music sample represented...
Postprodukcja nagrania wideo z dzwiekiem dookolnym
Publication
- Year 2009
One of the aims of this paper is to present issues related to audio-video correlation. This is presented on the basis of a short film realization employing surround microphone techniques. First, some related works in the domain of sound and vision correlation are presented. Then assumptions concerning scene creation related to both audio and video are briefly described. Another objective is to discuss results of subjective tests...
1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
Publication
- Year 2020
A network architecture that may be employed for sensing and recognizing the type of a vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain sounds of passing vehicles are also taken into account and marked as ones containing silence....
Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy
Publication
- P. Hoffmann
- B. Kostek
- Year 2015
The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...

Full text available for download from an external service
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
Publication
- D. Koszewski
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2020
Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Full text available for download in the portal
Physics-Based Coarse-Grained Modeling in Bio- and Nanochemistry
Publication
- A. Liwo
- A. K. Sieradzan
- A. S. Karczyńska
- E. Lubecka
- S. A. Samsonov
- C. Czaplewski
- P. Krupa
- M. Mozolewska
- Year 2021
Coarse-grained approaches, in which groups of atoms are represented by single interaction sites, are very important in biological and materials sciences because they enable us to cover size and time scales several orders of magnitude larger than those available to all-atom simulations, while largely keeping the details of the systems studied. The coarse-grained approaches differ by the scheme of reduction and by the origin...

Full text available for download from an external service
Audit of the existing surfaces /pavements of sidewalks and roads in the Gdańsk-Oliwa district, with particular emphasis on the location of the "Polanki" market and its direct neighbourhood in the contexts of pavement design of the "Polanki"market; stage from 2019 year.
Research Data
open access
- J. Borucka
- W. Mazurkiewicz
The document presents a valorization of paved surfaces (sidewalks and roads) in the Gdańsk-Oliwa district, prepared on the basis of a preliminary inventory: 21 tables (one table for each street) with a description of the street and materials used, mostly supplemented with photographic material. The valorization, after the initial inventory,...
Krystian Zawadzki dr hab. inż.

People

Department of Finance

Krystian Zawadzki is a staff member of the Department of Finance at the Faculty of Management and Economics of Gdańsk University of Technology; a member of the Polish Olympic Academy at the Polish Olympic Committee (PKOl); coordinator of the Stock Exchange School (GPW) in Gdańsk; head of the postgraduate program "Inwestycje kapitałowe i zarządzanie finansami osobistymi" (Capital Investments and Personal Finance Management); and a member of the audit committee of the sports club "Sportowa Politechnika". Founder of a channel on science, finance, and sports...
Characterizing the Performance of xor Games and the Shannon Capacity of Graphs
Publication
- R. Ramanathan
- A. Kay
- G. Murta
- P. Horodecki
- PHYSICAL REVIEW LETTERS - Year 2014
In this Letter we give a set of necessary and sufficient conditions such that quantum players of a two-party xor game cannot perform any better than classical players. With any such game, we associate a graph and examine its zero-error communication capacity. This allows us to specify a broad new class of graphs for which the Shannon capacity can be calculated. The conditions also enable the parametrization of new families of games...

Full text available for download from an external service
Music genre classification applied to bass enhancement for mobile technology
Publication
- P. Hoffmann
- B. Kostek
- Elektronika : konstrukcje, technologie, zastosowania - Year 2015
The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm is related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt. The classification of music genres is automatically executed employing MPEG 7 parameters and the Principal Component Analysis method applied to reduce information...

Full text available for download from an external service
Machine learning applied to acoustic-based road traffic monitoring
Publication
- K. Marciniuk
- B. Kostek
- Procedia Computer Science - Year 2022
The motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...

Full text available for download in the portal
Machine learning applied to acoustic-based road traffic monitoring
Publication
- K. Marciniuk
- B. Kostek
- Year 2022
The motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...

Full text available for download in the portal
Impact of maintenance of floodplains of the Vistula River on high water levels on the section from Włocławek to Toruń
Publication
- D. Gąsiorowski
- M. Szydłowski
- Acta Energetica - Year 2013
This article describes the methodology of hydraulic calculations to estimate the water levels in open channels for steady gradually varied flow. The presented method has been used to analyse the water level on the Vistula River from Włocławek cross-section to Toruń cross-section. The HEC-RAS modelling system has been used for parameterization of the river channel and floodplains, as well as for flow simulation. The results obtained...

Full text available for download in the portal
Theoretical calculation of the physico-chemical properties of 1-butyl-4-methylpyridinium based ionic liquids
Publication
- A. Giełdoń
- M. Bobrowski
- A. Bielicka-Giełdoń
- C. Czaplewski
- JOURNAL OF MOLECULAR LIQUIDS - Year 2017
Ionic liquids (ILs) have attracted much attention for their unique physicochemical properties, which can be designed as needed by altering the ion combinations. Besides experimental work, numerous computational studies have been concerned with prediction of physical properties of ILs. The results of molecular dynamics simulations of ILs depend strongly on the proper force field parameterization. Classical force fields...

Full text available for download in the portal
Adaptive Personal Tuning of Sound in Mobile Computers
Publication
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2016
An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...

Full text available for download in the portal
Music Data Processing and Mining in Large Databases for Active Media
Publication
- B. Kostek
- P. Hoffmann
- Year 2014
The aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large database that included approximately 30000 audio files divided into 11 classes corresponding to music genres with different cardinalities. Every audio file was described by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...

Full text available for download from an external service
Zaawansowane Przetwarzanie Sygnału
Online Courses
- A. Szewczyk
- J. Smulko
The course presents selected signal processing methods across a very wide range of applications. It illustrates the latest achievements in this field, supported by selected publications. Classes are divided into a lecture (15 h) and a seminar (15 h). Basic concepts of digital signal processing, recommended literature. Spectral analysis: power spectral density, wavelet spectrum, polyspectra, and cross power spectral density. Effects...
Camera angle invariant shape recognition in surveillance systems
Publication
- D. Ellwart
- A. Czyżewski
- Year 2010
A method for human action recognition in surveillance systems is described. Problems within this task are discussed and a solution based on 3D object models is proposed. The idea is shown and some of its limitations are discussed. Shape description methods are introduced along with their main features. The parameterization algorithm used is presented. The classification problem, restricted to binary cases, is discussed. Support vector...
Further Developments of the Online Sound Restoration System for Digital Library Applications
Publication
- Year 2014
New signal processing algorithms were introduced to the online service for audio restoration available at the web address: www.youarchive.net. Missing or distorted audio samples are estimated using a specific implementation of the Janssen interpolation method. The algorithm is based on the autoregressive model (AR) combined with the iterative complementation of signal samples. Since the interpolation algorithm is computationally...

Full text available for download from an external service
Editor's note and 2018 reviewers
Publication
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018
This note refers to the papers published in 2018, as well as to the series of articles in the special issue: Special Issue on Augmented and Participatory Sound and Music Interaction Using Semantic Audio.

Full text available for download from an external service
Sparse autoregressive modeling
Publication
- M. Ciołek
- Year 2012
In the paper the comparison of the popular pitch determination (PD) algorithms for the purpose of elimination of clicks from archive audio signals using sparse autoregressive (SAR) modeling is presented. The SAR signal representation has been widely used in code-excited linear prediction (CELP) systems. The appropriate construction of the SAR model is required to guarantee model stability. For this reason the signal representation...
An Approach to Bass Enhancement in Portable Computers Employing Smart Virtual Bass Synthesis Algorithms
Publication
- Year 2014
The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The developed algorithms are related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt and to the type of a portable device in use. To find optimum synthesis parameters of the VBS algorithms, subjective listening tests based on a parametric procedure...

Full text available for download from an external service
Innovative method of localization airplanes in VCS (VCS-MLAT) distributed system
Publication
- S. Wiszniewski
- Year 2019
The article presents the concept and the structure of the localization module. The prototype module is part of the VCS (VCS-MLAT) distributed localization system. The device receives the audio signal transmitted in the aviation band (118 MHz – 136 MHz). Received data with timestamps are sent to the main server. Data from multiple devices are used to estimate the location of the airplane. The main aim of the project is the analysis...
Cross-domain applications of multimodal human-computer interfaces
Publication
- A. Czyżewski
- Year 2015
Multimodal interfaces developed for educational applications and for disabled people are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with mouth gestures, an audio interface for speech stretching for hearing-impaired and stuttering people, and an intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...
Subjective and Objective Comparative Study of DAB+ Broadcast System
Publication
- P. Falkowski-Gilski
- J. Stefański
- Archives of Acoustics - Year 2017
Broadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...

Full text available for download in the portal
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Publication
- D. Korzekwa
- R. Barra-Chicote
- S. Zaporowski
- G. Beringer
- J. Lorenzo-Trueba
- A. Serafinowicz
- J. Droppo
- T. Drugman
- B. Kostek
- Year 2021
This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

Full text available for download in the portal
Examining Feature Vector for Phoneme Recognition
Publication
- G. Korvel
- B. Kostek
- Year 2018
The aim of this paper is to analyze the usability of descriptors coming from music information retrieval for phoneme analysis. The case study presented consists of several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing acoustic and visual speech representations is developed. It combines audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Full text available for download from an external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing acoustic, electromagnetic articulography, and visual speech representations is developed. It combines audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Full text available for download from an external service
Sound engineering as our commitment to its creators in Poland
Publication
- B. Kostek
- A. Czyżewski
- Archives of Acoustics - Year 2019
Sound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other ones constituting multimedia technology. Moreover, machine learning methods applied to the topics...

Full text available for download from an external service
Health outcomes of road-traffic pollution among exposed roadside workers in Rawalpindi City, Pakistan
Publication
- M. Ali
- A. Rashid
- B. Yousaf
- A. Kamal
- HUMAN AND ECOLOGICAL RISK ASSESSMENT - Year 2017
Full text available for download from an external service
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Research Data
- series: MODALITY corpus
The MODALITY corpus is a multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S4
Research Data
- series: MODALITY corpus
The MODALITY corpus is a multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S2
Research Data
- series: MODALITY corpus
The MODALITY corpus is a multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S5
Research Data
- series: MODALITY corpus
The MODALITY corpus is a multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S3
Research Data
- series: MODALITY corpus
The MODALITY corpus is a multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S6
Research Data
- series: MODALITY corpus
The MODALITY corpus is a multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
EVENTS VISUALIZATION POST IN A DISTRIBUTED TELEINFORMATION SYSTEM FOR THE BORDER GUARD
Publication
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2017
Events Visualization Post is a part of the STRADAR project, which is dedicated to streaming real-time data in distributed dispatcher and teleinformation systems of the Border Guard. Events Visualization Post is software designed for simultaneous visualization of data of different types. In the paper, the structure of the software is presented, the process of generation of tasks is described, and the visualization of audio, files,...
Measurements of QoS/QoE parameters for media streaming in a PMIPv6 testbed with 802.11 b/g/n WLANs
Publication
- Metrology and Measurement Systems - Year 2012
A growing number of mobile devices and the increasing popularity of multimedia services result in a new challenge of providing mobility in access networks. The paper describes experimental research on media (audio and video) streaming in a mobile IEEE 802.11 b/g/n environment realizing network-based mobility. It is an approach to mobility that requires little or no modification of the mobile terminal. Assessment of relevant parameters...

Full text available for download in the portal
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
Publication
- D. Koszewski
- T. Görne
- G. Korvel
- B. Kostek
- EURASIP Journal on Audio Speech and Music Processing - Year 2023
The purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...

Full text available for download in the portal
Określenie parametrów modelowania geometrii krzyżownic rozjazdów zwyczajnych dla potrzeb budowy i utrzymania linii kolejowych
Publication
- P. Omieczyński
- Zeszyty Naukowo-Techniczne Stowarzyszenia Inżynierów i Techników Komunikacji w Krakowie. Seria: Materiały Konferencyjne - Year 2019
The vast majority of turnouts on railway lines in Poland are ordinary turnouts with a typical set of parameters. For this reason, the analysis of atypical cases (such as turnouts with variable curvature of the diverging track) may be difficult. A virtual geometric and structural model of a turnout, generated using analytical methods, can be a useful tool in the design, construction, and diagnostics...

Full text available for download from an external service
Data Analysis in Bridge of Data
Publication
- Year 2022
The chapter presents the data analysis aspects of the Bridge of Data project. The software framework used, Jupyter, and its configuration are presented. The solution’s architecture, including the TRYTON supercomputer as the underlying infrastructure, is described. The use case templates provided by the Stat-reducer application are presented, including data analysis related to spatial point cloud, audio, and wind research.

Full text available for download in the portal
Tonality Estimation and Frequency Tracking of Modulated Tonal Components
Publication
- M. Kulesza
- A. Czyżewski
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2009
A novel method for tonality estimation and frequency tracking of tonal components modulated in frequency and amplitude is presented. The algorithm detects the local maxima of magnitude spectra corresponding to three contiguous frames of a signal and matches them into the tonal track candidates. The magnitude-based and phase-based methods are used to estimate the frequency jumps between spectrum maxima belonging to the tonal track...

Full text available for download from an external service
System for automatic singing voice recognition
Publication
- P. Żwan
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2008
The article presents a system for automatic recognition of the quality and type of singing voice. The database and the implemented parameters are presented. The decision algorithm is an artificial neural network algorithm. The trained decision system achieves an accuracy of approx. 90% in both recognition categories. Additionally, statistical methods were used to show that the results of the system for automatic assessment of technical quality...
Expert system for automatic classification and quality assessment of singing voices
Publication
- P. Żwan
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2006

Full text available for download from an external service
DSP techniques for determining "Wow" distortions
Publication
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2007
The article describes algorithms for determining the characteristics of wow distortions in sound. These algorithms are: tracking of mains hum, tracking of the magnetic remnant of the high-frequency bias current, and adaptive analysis of the spectral centroid for a selected part of the distorted signal. The presented algorithms can be implemented in software and hardware.
