Search results for: audio


Total: 458


Testing A Novel Gesture-Based Mixing Interface
Publication
- M. Lech
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013
With a digital audio workstation, in contrast to the traditional mouse-keyboard computer interface, hand gestures can be used to mix audio with eyes closed. Mixing with a visual representation of audio parameters during experiments led to broadening the panorama and a more intensive use of shelving equalizers. Listening tests proved that the use of hand gestures produces mixes that are aesthetically as good as those obtained using...

Full text available for download in the portal
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
Publication
- D. Koszewski
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2020
Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Full text available for download in the portal
Adaptive Personal Tuning of Sound in Mobile Computers
Publication
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2016
An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...

Full text available for download in the portal
Editor's note and 2018 reviewers
Publication
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018
The note refers to the papers published in 2018, as well as to the series of articles in the special issue: Special Issue on Augmented and Participatory Sound and Music Interaction Using Semantic Audio.

Full text available for download from an external service
Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition
Publication
- G. Korvel
- P. Treigys
- G. Tamulevicus
- J. Bernataviciene
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2018
convolutional neural network (CNN) which is a class of deep, feed-forward artificial neural network. We decided to analyze audio signal feature maps, namely spectrograms, linear and Mel-scale cepstrograms, and chromagrams. The choice was made upon the fact that CNN performs well in 2D data-oriented processing contexts. Feature maps were employed in the Lithuanian word recognition task. The spectral analysis led to the highest word...
Bass Enhancement Settings in Portable Devices Based on Music Genre Recognition
Publication
- P. Hoffmann
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2015
The paper presents a novel approach to the Virtual Bass Synthesis (VBS) applied to mobile devices, called Smart VBS (SVBS). The proposed algorithm uses an intelligent, rule-based setting of bass synthesis parameters adjusted to the particular music genre. Harmonic generation is based on a nonlinear device (NLD) method with the intelligent controlling system adapting to the recognized music genre. To automatically classify music...

Full text available for download in the portal
System for automatic singing voice recognition
Publication
- P. Żwan
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2008
The article presents a system for automatic recognition of the quality and type of singing voice. The database and the implemented parameters are described. The decision algorithm is an artificial neural network. The trained decision system achieves an accuracy of approximately 90% in both recognition categories. In addition, statistical methods were used to show that the results of the automatic technical quality assessment system...
New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception
Publication
- B. Kunka
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013
The influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...

Full text available for download in the portal
Tonality Estimation and Frequency Tracking of Modulated Tonal Components
Publication
- M. Kulesza
- A. Czyżewski
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2009
A novel method for tonality estimation and frequency tracking of tonal components modulated in frequency and amplitude is presented. The algorithm detects the local maxima of magnitude spectra corresponding to three contiguous frames of a signal and matches them into the tonal track candidates. The magnitude-based and phase-based methods are used to estimate the frequency jumps between spectrum maxima belonging to the tonal track...

Full text available for download from an external service
Expert system for automatic classification and quality assessment of singing voices
Publication
- P. Żwan
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2006

Full text available for download from an external service
DSP techniques for determining ''Wow'' distortions
Publication
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2007
The article describes algorithms for determining the characteristics of wow distortion in audio. These are algorithms for: tracking power-line hum, tracking the magnetic residue of the high-frequency bias current, and adaptive analysis of the spectral centroid for a selected part of the distorted signal. The presented algorithms lend themselves to both software and hardware implementation.
Measurements and Visualization of Sound Intensity Around the Human Head in Free Field Using Acoustic Vector Sensor
Publication
- J. Kotus
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2015
This paper presents measurements and visualization of sound intensity around the human head simulator in a free field. A Cartesian robot, applied for precise positioning of the acoustic vector sensor, was used to measure sound intensity. Measurements were performed in a free field using a head and torso simulator and the setup consisting of four different loudspeaker configurations. The acoustic vector sensor was positioned around...

Full text available for download in the portal
Audiology Research

Journals

ISSN: 2039-4330, eISSN: 2039-4349
Phraseological Units in Audiovisual Translation. A Case Study of Polish Dubbing of Disney’s 'The Little Mermaid'
Publication
- P. Golda
- J. Mężyk
- Kwartalnik Neofilologiczny - Year 2021
The paper aims to discuss phraseological units as the object of audiovisual translation in the Polish dubbing of Disney’s 'The Little Mermaid', to discuss the role of phraseological translation techniques, and to present possible translation inconsistencies. A theoretical introduction presents definitions for crucial terms. It is followed by the analysis of the corpus of phraseological units in Disney’s The Little Mermaid and...

Full text available for download in the portal
Audiosfera środowiska pracy w przestrzeni biurowej na planie otwartym. Wyniki zwiadu badawczego
Publication
- P. Mizera-Pęczek
- e-mentor - Year 2021
Full text available for download from an external service
American Journal of Audiology

Journals

ISSN: 1059-0889, eISSN: 1558-9137
AUDIOLOGY AND NEURO-OTOLOGY

Journals

ISSN: 1420-3030, eISSN: 1421-9700
Audiology and Neurotology Extra

Journals

ISSN: 1664-5537
Journal of Audiology and Otology

Journals

ISSN: 2384-1621, eISSN: 2384-1710
International Journal of Audiology

Journals

ISSN: 1499-2027, eISSN: 1708-8186
Audiology and Speech Research

Journals

ISSN: 2635-5019, eISSN: 2635-5027
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
Publication
- D. Koszewski
- T. Görne
- G. Korvel
- B. Kostek
- EURASIP Journal on Audio Speech and Music Processing - Year 2023
The purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...

Full text available for download in the portal
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
Publication
- S. Raczyński
- E. Vincent
- S. Sagayama
- IEEE Transactions on Audio Speech and Language Processing - Year 2013
Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of analyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch structure. These models are formulated as linear or log-linear interpolations of up to five sub-models, each of which is...

Full text available for download from an external service
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication
- M. Kuropatwiński
- W. Kleijn
- IEEE Transactions on Audio Speech and Language Processing - Year 2006
Full text available for download from an external service
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
Publication
- S. Raczyński
- E. Vincent
- IEEE Transactions on Audio Speech and Language Processing - Year 2014
In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor process priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bigram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of n-grams with a topic model,...

Full text available for download from an external service
New approach for determining the QoS of MP3-coded voice signals in IP networks
Publication
- T. Uhl
- S. Paulsen
- K. Nowicki
- EURASIP Journal on Audio Speech and Music Processing - Year 2017
Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...

Full text available for download in the portal
Revista de Logopedia, Foniatria y Audiologia

Journals

ISSN: 0214-4603
Journal of the American Academy of Audiology

Journals

ISSN: 1050-0545, eISSN: 2157-3107
Canadian Journal of Speech-Language Pathology and Audiology

Journals

ISSN: 1913-2018
Images. The International Journal of European Film, Performing Arts and Audiovisual Communication

Journals

ISSN: 1731-450X
Bożena Kostek prof. dr hab. inż.

People

Audio Acoustics Laboratory
Piotr Szczuko dr hab. inż.

People

Department of Multimedia Systems

Dr hab. inż. Piotr Szczuko graduated in 2002 from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology with a Master of Science in Engineering degree. His diploma thesis investigated the simultaneous perception of digital images and surround sound. In 2008 he defended his doctoral dissertation entitled "Zastosowanie reguł rozmytych w komputerowej animacji postaci" (Application of fuzzy rules in computer character animation), for which he received the award of the President of the Council...
Józef Kotus dr hab. inż.

People

Department of Multimedia Systems
Michał Lech dr inż.

People

Michał Lech was born in Gdynia in 1983. In 2007 he graduated from the faculty of Electronics, Telecommunications and Informatics of Gdansk University of Technology. In June 2013, he received his Ph.D. degree. The subject of the dissertation was: “A Method and Algorithms for Controlling the Sound Mixing Processes with Hand Gestures Recognized Using Computer Vision”. The main focus of the thesis was the bias of audio perception caused...
Marek Blok dr hab. inż.

People

Marek Blok graduated in 1994 in Telecommunications from the Faculty of Electronics of Gdańsk University of Technology with a Master of Science in Engineering degree. He received his doctorate in telecommunications in 2003 at the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology. In 2017 he obtained the degree of doktor habilitowany in the discipline of telecommunications. His research interests focus on telecommunication...
Marcin Kulawiak dr hab. inż.

People

Department of Geoinformatic Systems
Piotr Odya dr inż.

People

Department of Multimedia Systems

Piotr Odya was born in Gdańsk in 1974. In 1999 he graduated with honors from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology with a Master of Science in Engineering degree. His diploma thesis addressed the problems of improving sound quality in the broadcast studios of contemporary radio stations. His interests include video and audio editing and multichannel sound systems. As part of his doctoral studies...
Grzegorz Szwoch dr hab. inż.

People

Department of Multimedia Systems

Grzegorz Szwoch was born in 1972 in Gdańsk. From 1991 to 1996 he studied at the Faculty of Electronics of Gdańsk University of Technology. In 1996 he completed his studies at the Zakład Inżynierii Dźwięku (Sound Engineering Department, now the Department of Multimedia Systems), defending a diploma thesis entitled "Modelowanie fizyczne wybranych instrumentów muzycznych" (Physical modeling of selected musical instruments). In the same year he joined the Department's research team as a participant in doctoral studies. Since January 2001...
Automatic sound recognition for security purposes
Publication
- P. Żwan
- Year 2008
In the paper an automatic sound recognition system is presented. It forms a part of a bigger security system developed in order to monitor outdoor places for non-typical audio-visual events. The analyzed audio signal is being recorded from a microphone mounted in an outdoor place thus a non stationary noise of a significant energy is present in it. In the paper an especially designed algorithm for outdoor noise reduction is presented,...
QoS/QoE in the Heterogeneous Internet of Things (IoT)
Publication
- K. Nowicki
- T. Uhl
- Year 2017
Applications provided in the Internet of Things can generally be divided into three categories: audio, video and data. This has given rise to the popular term Triple Play Services. The most important audio applications are VoIP and audio streaming. The most notable video applications are VToIP, IPTV, and video streaming, and the service WWW is the most prominent example of data-type services. This chapter elaborates on the most...
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publication
- Year 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Full text available for download from an external service
Analiza stanu nawierzchni i klas pojazdów na podstawie parametrów ekstrahowanych z sygnału fonicznego
Publication
- K. Marciniuk
- B. Kostek
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2016
The aim of the research is to find parameters of a feature vector extracted from the audio signal for automatic recognition of road surface condition and vehicle type. First, the influence of weather conditions on the spectral characteristics of the audio signal recorded near passing vehicles is presented. Next, the audio signal was parameterized and a correlation analysis was carried out in order to...

Full text available for download in the portal
Digital Transformation of Terrestrial Radio: An Analysis of Simulcasted Broadcasts in FM and DAB+ for a Smart and Successful Switchover
Publication
- P. Falkowski-Gilski
- Applied Sciences-Basel - Year 2021
The process of digitizing radio is far from over. It is an important interdisciplinary aspect, involving Big Data and AI (Artificial Intelligence) when it comes to classifying and handling content, and an organizational challenge in the Industry 4.0 concept. There exist several methods for delivering audio signals, including terrestrial broadcasting and internet streaming. Among them, the DAB+ (Digital Audio Broadcasting plus)...

Full text available for download in the portal
Examining Acoustic Emission of Engineered Ultrasound Loudspeakers
Publication
- Year 2014
Measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides the realistic reproduction of...
A concept of Signal Equalization Method Based on Music Genre and the Listener's Room Characteristics
Publication
- B. Kostek
- P. Hoffmann
- Year 2016
A research study that investigates the influence of the room acoustics environment on the frequency characteristic of the audio signal playback is presented. First, a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the frequency response of the room, a system for room acoustics compensation based on eight-band equalizer is proposed. The system settings depend on music genre. In...
Measurements and Simulations of Engineered Ultrasound Loudspeakers
Publication
- Computational Methods in Science and Technology - Year 2015
Simulation and measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides realistic reproduction...

Full text available for download from an external service
Intelligent multimedia solutions supporting special education needs.
Publication
- A. Czyżewski
- B. Kostek
- LECTURE NOTES IN COMPUTER SCIENCE - Year 2011
The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
Quality Aspects in Digital Broadcasting and Webcasting Systems: Bitrate versus Loudness
Publication
- Journal of Telecommunications and Information Technology - Year 2017
In this paper the quality aspects of bitrate and loudness in digital broadcasting and webcasting systems are examined. The authors discuss a survey concerning user preferences related with processing and managing audio content. The coding efficiency of a popular audio format is analyzed in the context of storing media. An objective study on a representative group of signal samples, as well as a subjective study of the perceived...

Full text available for download in the portal
Online sound restoration system for digital library applications
Publication
- Year 2013
Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Janssen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...

Full text available for download from an external service
