Search results for: AUDIO PROCESSING

Metody udostępniania materiałów multimedialnych w sieciach LAN i WAN. [Methods of sharing multimedia materials in LAN and WAN networks]

Publication

- Year 2004

The paper presents the possibilities of enriching educational content through the use of multimedia techniques. Supplementing educational material with audio and video files offers an entirely new quality. It describes how to create such material, what equipment is needed to produce it, and how time-consuming the process is. The conclusions and observations are based on the practical delivery of a lecture on System...

Using Physiological Signals for Emotion Recognition

Publication

W. Szwoch

- Year 2013

Recognizing users' emotions is a promising area of research in the field of human-computer interaction. It is possible to recognize emotions using facial expressions, audio signals, body poses, gestures, etc., but physiological signals are very useful in this field because they are spontaneous and not controllable. In this paper, the problem of using physiological signals for emotion recognition is presented. The kinds of physiological...

Full text available for download from an external service

Koncepcja oraz budowa modułu lokalizacyjnego w projekcie „Innowacyjna metoda lokalizowania statków powietrznych w rozproszonym systemie VCS (VCS-MLAT)” [Concept and design of the localization module in the project "Innovative method of locating aircraft in a distributed VCS system (VCS-MLAT)"]

Publication

S. Wiszniewski

- Year 2018

The article presents the concept, schematic, and description of the localization module of a technology demonstrator for an aircraft localization system in a distributed VCS system (VCS-MLAT). The device is designed to receive an audio signal transmitted in the 118 MHz – 136 MHz aviation band; the signal, together with timestamps and additional parameters, is sent to the VCS system server. Data received from multiple localization modules will make it possible to estimate...

Performance of Watermarking-based DTD Algorithm Under Time-varying Echo Path Conditions

Publication

- Year 2010

A novel double-talk detection (DTD) algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within an acoustic echo cancellation system is presented. The problem of DTD robustness to time-varying conditions of the acoustic echo path is discussed, and an explanation of why such conditions occur in practical situations is provided. The...

Robustness analysis of watermarking-based DTD algorithm under time-variable echo conditions

Publication

- Year 2010

A novel double-talk detection (DTD) algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within an acoustic echo cancellation system is presented. The problem of DTD robustness to time-varying conditions of the acoustic echo path is discussed, and an explanation of why such conditions occur in practical situations is provided. The...

Evaluation of Sound Enhancement in Mobile Device Using Virtual Bass Synthesis Algorithm

Publication

- Year 2013

An experiment conducted to validate the possibility of using a virtual bass synthesis (VBS) algorithm in a portable computer is presented. Subjective listening tests based on pairwise comparison between VBS, which relies on the so-called missing fundamental phenomenon, and the standard bass boost technique are employed. The evaluation was carried out in two types of conditions: in a professional listening room and employing an...

SUBIEKTYWNA OCENA MULTIPLEKSU RADIOFONII LOKALNEJ DAB+ DZIAŁAJĄCEJ W GDAŃSKU I WROCŁAWIU [Subjective evaluation of the local DAB+ radio multiplex operating in Gdańsk and Wrocław]

Publication

P. Falkowski-Gilski
S. Brachmański
A. Dobrucki

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2020

The DAB+ (Digital Audio Broadcasting plus) standard is the leading terrestrial digital radio system. In contrast to analog FM radio, all services, including traditional radio programs and data transmission services, are grouped into an ensemble. This work presents the reconfiguration process of the Polish multiplex, using local DAB+ broadcasting in Gdańsk and Wrocław as an example. It describes the results of subjective tests concerning...

Full text available for download from an external service

Badanie efektywności kodeków źródłowych w radiofonii cyfrowej DAB+ [A study of source codec efficiency in DAB+ digital radio]

Publication

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2015

In Poland, digital radio has been available to listeners since 2013. However, there is a lack of publicly available scientific publications or research reports justifying the bitrates adopted for audio streams. The article presents a study of the coding efficiency and subjective quality assessment of the MPEG-4 HE-AAC v2 codec used in the DAB+ standard. The tests were carried out according to the MUSHRA comparative technique on two groups,...

Full text available for download from an external service

Influence of the Delay in Monitor System on the Motor Coordination of Musicians while Performing

Publication

- Year 2019

This paper provides a description and results of measurements of the maximum acceptable value of delay tolerated by a musician, while playing an instrument, that does not cause de-synchronization and discomfort. First, methodology of measurements comprising audio recording and a fast camera is described. Then, the measurement procedure for acquiring the maximum value of delay conditioning...

Full text available for download from an external service

TRANSMISJA GŁOSOWYCH KOMUNIKATÓW DROGOWYCH W RADIOFONII CYFROWEJ DAB+ [Transmission of voice traffic announcements in DAB+ digital radio]

Publication

P. Falkowski-Gilski
S. Brachmański
A. Dobrucki

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2019

The digitization of radio is a new chapter in the history of broadcasting. Many recommendations and scientific studies point to the DAB+ (Digital Audio Broadcasting plus) standard, which is expected to replace analog FM radio in the near future. This digital system introduces many changes while offering better sound quality and a range of additional services. This work set out to examine the minimum bitrate required to transmit...

Full text available for download from an external service

Intelligent equalizer solution employing music genre and the room characteristics analysis

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2017

The paper presents an intelligent equalizer solution based on room acoustic conditions and music genre analysis. A series of acoustic characteristic measurements are performed for checking the concept proposed. White noise (reference signal) and audio excerpts belonging to six music genres are utilized as excitation signals in measurements. This results in registration of frequency responses of rooms and reverberation times. Signals...

Full text available for download from an external service

Analysis of the Usefulness of Cheap Audio Recorders for Spectral Measurement of Environmental Noise

Publication

- Metrology - Year 2023

Environmental noise pollution is nowadays one of the most serious health threats. The impact of noise on the human body depends not only on the sound level but also on its spectral distribution. Reliable measurements of the environmental noise spectrum are often hampered by the very high price of top quality measuring devices. This paper explores the possibility of using much cheaper audio recorders for the frequency analysis....

Full text available for download in the portal

Analiza jakości transmisji treści audio-wideo w symulowanym łączu telekomunikacyjnym z wykorzystaniem techniki OFDM [Quality analysis of audio-video content transmission over a simulated telecommunication link using the OFDM technique]

Publication

M. Zamłyńska
P. Falkowski-Gilski

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2022

Deploying a reliable audio-video communication system brings many benefits. Because the amount of available bandwidth is steadily shrinking, researchers are focusing on novel transmission methods. Today, the OFDM (Orthogonal Frequency Division Multiplexing) technique is widely used in both wired and wireless media. This work presents a study of the QoS (Quality of Service) of a simulated transmission link...

Full text available for download from an external service

Subiektywny pomiar jakości programów radiowych strumieniowanych w sieci metodą crowdsourcingu [Subjective quality measurement of radio programs streamed online using crowdsourcing]

Publication

P. Falkowski-Gilski

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2022

Today, listeners can access their favorite radio programs and broadcasts via the terrestrial analog FM (Frequency Modulation) standard and the digital DAB+ (Digital Audio Broadcasting plus) standard. Notably, the same material is broadcast simultaneously using several techniques (so-called simulcast), and the vast majority of stations also make their programs available online. This work presents the results of a study concerning...

Full text available for download from an external service

Towards Audio Signal Equalization Based on Spectral Characteristics of a Listening Room and Music Content Reproduced

Publication

- Year 2018

This study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, the concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....

Full text available for download from an external service

Porównanie detekcji obwiedni i detekcji synchronicznej w radioodbiornikach lotniczych VHF [Comparison of envelope detection and synchronous detection in VHF aviation radio receivers]

Publication

S. Wiszniewski

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2018

The article compares envelope detection and coherent detection for amplitude-modulated (A3E) audio signals in the VHF aviation band [118 MHz - 136 MHz]. The study aimed to compare the detection methods and to indicate which of them yields higher-quality estimates of signal times of arrival. The delays of the output signals were measured for two aviation radio stations using correlation...

Application of gaze tracking technology to quality of experience domain

Publication

- Year 2010

A new methodological approach to studying subjective assessment results by employing gaze tracking technology is shown. The notions of Human-Computer Interaction (HCI) and Quality of Experience (QoE) are briefly introduced in the context of their common application. Then, the gaze tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT) is presented. A series of audio-visual subjective...

Auto adaptation of mobile device characteristics to various acoustic conditions

Publication

- Year 2014

A methodology for automatically adapting the characteristics of a mobile device to various acoustic conditions is presented in the paper. The first goal of this study was to determine the parameters of the acoustic path of the mobile device, for both the transmitter (speaker) and the receiver (microphone). Results of the measurement of characteristics of mobile devices were presented. Information about characteristics of individual parts...

Full text available for download from an external service

Comparison of sound of organ pipes in contemporary and historical instruments

Publication

- Year 2020

The aim of this research is to examine the differences in the timbre of organ pipes’ sound between a historical and a contemporary organ instrument. The historical instrument is the Oliwa organ from Gdansk, Poland, and the contemporary one is from Kartuzy, Poland. Recordings are made of single notes played by an open labial pipe that belongs to the Principal rank. The analyses and comparison of several sound features compatible...

Full text available for download from an external service

A commonly-accessible toolchain for live streaming music events with higher-order ambisonic audio and 4k 360 vision

Publication

B. Mróz
P. Odya
P. Danowski
M. Kabaciński

- Year 2023

An immersive live stream is especially interesting in the ongoing development of telepresence tools, especially in the virtual reality (VR) or mixed reality (MR) domain. This paper explores the remote and immersive way of enabling telepresence for the audience to high-fidelity music performance using freely-available and easily-accessible tools. A functional VR live-streaming toolchain, comprising 360 vision and higher-order ambisonic...

Full text available for download in the portal

The central server of the Border Guard's distributed multimedia system for monitoring and visualisation of ongoing and archival events

Publication

- Journal of Marine Engineering and Technology - Year 2017

The paper presents the architecture and functionalities of the central server (CENTER) of the distributed system for the Polish Border Guard (BG) for monitoring maritime areas. The overall system has been extended to incorporate, apart from map data, also different multimedia elements such as video from cameras or audio from telephone connections operated by BG units. This requires new system elements: Archive Servers for storing...

Full text available for download from an external service

Multimedialny system nadzoru dla straży granicznej – projekt STRADAR [Multimedia surveillance system for the Border Guard: the STRADAR project]

Publication

- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2019

STRADAR is a surveillance system designed to support the operational activities of the maritime border guard. It enables the collection, processing, and sharing of information and data from sensors such as radars, video cameras, AIS, GPS, and still cameras, as well as from audio calls, SMS messages, files, and notes. This information can be made available live or from the archive, with or without event synchronization....

Full text available for download from an external service

Traffic Noise Analysis Applied to Automatic Vehicle Counting and Classification

Publication

- Communications in Computer and Information Science - Year 2017

Problems related to determining traffic noise characteristics are discussed in the context of automatic dynamic noise analysis based on noise level measurements and traffic prediction models. The obtained analytical results provide the second goal of the study, namely automatic vehicle counting and classification. Several traffic prediction models are presented and compared to the results of in-situ noise level measurements. Synchronized...

Metody udostępniania materiałów multimedialnych w sieciach LAN I WAN. [Methods of sharing multimedia materials in LAN and WAN networks]

Publication

- Year 2005

As broadband services become widespread, the limits on the volume of educational materials offered over LAN and WAN networks are easing. The paper presents the possibilities of enriching educational content through the use of multimedia techniques. Supplementing educational material with audio and video files offers an entirely new quality. It describes how to create such material, what equipment is needed...

Classification of Music Genres Based on Music Separation into Harmonic and Drum Components. Klasyfikacja gatunków muzycznych wykorzystująca separację instrumentów muzycznych [Music genre classification using separation of musical instruments]

Publication

A. Rosner
B. Schuller
B. Kostek

- Archives of Acoustics - Year 2014

This article presents a study on music genre classification based on music separation into harmonic and drum components. For this purpose, audio signal separation is executed to extend the overall vector of parameters by new descriptors extracted from harmonic and/or drum music content. The study is performed using the ISMIS database of music files represented by vectors of parameters containing music features. The Support Vector...

Full text available for download in the portal

Evaluation of sound event detection, classification and localization in the presence of background noise for acoustic surveillance of hazardous situations

Publication

- Year 2014

An evaluation of the sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for separating foreground events from the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the classifier...

Full text available for download from an external service

AUDIO SIGNAL EQUALIZATION BASED ON IMPULSE RESPONSE OF A LISTENING ROOM AND MUSIC CONTENT REPRODUCED

Publication

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2018

A research study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, a concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....

DAB vs DAB+ Radio Broadcasting: a Subjective Comparative Study

Publication

P. Falkowski-Gilski

- Archives of Acoustics - Year 2017

In the age of digital media, delivering high quality content to consumers is one of the most demanding tasks. There exist numerous broadcasting standards, with different pros and cons, and the DAB/DAB+ (Digital Audio Broadcasting) system is one of the most popular among them. From an engineer’s perspective, efficient resource management under limited bandwidth conditions has always been a challenge. In this paper a subjective quality...

Full text available for download in the portal

Using concentrated spectrogram for analysis of audio acoustic signals

Publication

- HYDROACOUSTICS - Year 2012

The paper presents results of time-frequency analysis of audio acoustic signals using the concentrated spectrogram method, also known as the "cross-spectral method" or "reassignment method". The presented algorithm uses the signal's local group delay and channelized instantaneous frequency to appropriately redistribute all short-time Fourier transform lines in the time-frequency plane. The main intention of the paper is to compare various...

Full text available for download in the portal

Geospatial Coverage and Signal Quality Measurements of Terrestrial DAB+ Network in Northern Poland

Publication

- Year 2020

Modern signal coverage maps are prepared based on industry-standard radio propagation models, which take into account a number of parameters, including: type of antenna, distance from the transmitter, type of terrain, etc. However, such simulations are prone to location-specific inaccuracies, and should be verified with in-situ measurements. This paper presents results of a field test of a terrestrial DAB+ (Digital Audio Broadcasting...

Full text available for download from an external service

Multi-Aspect Quality Assessment Of Mobile Image Classifiers For Companion Applications In The Publishing Sector

Publication

K. Draszawka

- Year 2021

The paper presents the problem of quality assessment of image classifiers used in mobile phones for complementary companion applications. The advantages of using this kind of application are described, and a Narrator on Demand (NoD) functionality is presented as one example, where the application plays an audio file related to a book page that is physically in front of the phone's camera. For such a NoD application,...

Full text available for download from an external service

Energy Efficiency Study of Audio-video Content Consumption on Selected Android Mobile Terminals

Publication

- Year 2021

Mobile devices are widely used by billions of users worldwide. Thanks to their main advantage, which is portability, they should be fully operational as long as possible, without the need to recharge or connect them to external power sources. This paper describes a study, carried out on four different mobile devices, with different hardware and software parameters, running the Android operating system. The research campaign involved...

Full text available for download from an external service

Audio-visual surveillance system for application in bank operating room

Publication

- Communications in Computer and Information Science - Year 2013

An audio-visual surveillance system able to detect, classify, and localize acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots, are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic...

Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking

Publication

- Year 2011

Echo cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...

Full text available for download from an external service

Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets

Publication

- Electronics - Year 2022

Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Full text available for download in the portal

Loudness Scaling Tests in Hearing Problems Detection

Publication

- Year 2015

The number of people using portable audio players has increased significantly over the recent years. This implies the rise in the number of people having hearing loss problems. Therefore, there is a need to find appropriate procedures that simplify the process of the hearing problem detection. Investigations performed show that audiometric tests may not be sufficient to assess hearing in young people. Contrarily, the obtained results...

Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016

Evaluation of sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for discerning between the events being in focus and the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the...

Full text available for download in the portal

A comparative study of English viseme recognition methods and algorithms

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018

An elementary visual unit, the viseme, is considered in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature, and a comparative study of their effectiveness. In the course of the study an optimal feature vector construction...

Full text available for download in the portal

A comparative study of English viseme recognition methods and algorithm

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018

An elementary visual unit, the viseme, is considered in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature, and a comparative study of their effectiveness. In the course of the study an optimal feature vector...

Full text available for download in the portal

System do prototypowania bezprzewodowych inteligentnych urządzeń monitoringu audio-video [A system for prototyping wireless intelligent audio-video monitoring devices]

Publication

M. Kłosowski

- Year 2013

The communication presents a system for prototyping wireless audio-video monitoring devices. The system is based on Virtex6 FPGA devices and a number of supporting components, such as fast DDR3 memory, a small HD camera, a microphone with an A/D converter, a WiFi radio module, etc. The functionality of the system is described in detail. The system has been optimized to run under the Linux operating system, ...

Testbed analysis of video and VoIP transmission performance in IEEE 802.11 b/g/n networks

Publication

- TELECOMMUNICATION SYSTEMS - Year 2011

The aim of the work is to analyze the capabilities and limitations of different implementations of IEEE 802.11 technologies (IEEE 802.11 b/g/n) used for both video streaming and VoIP calls directed to mobile devices. Our preliminary research showed that results obtained with currently popular simulation tools can be drastically different from those obtainable in a real-world environment, so, in order to correctly evaluate performance...

Full text available for download in the portal

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Publication

G. Tamulevicius
G. Korvel
A. B. Yayak
P. Treigys
J. Bernataviciene
B. Kostek

- Electronics - Year 2020

In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset of more than 10,000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Full text available for download in the portal

Ranking Speech Features for Their Usage in Singing Emotion Classification

Publication

- Year 2020

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available for download in the portal

ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU [Analysis of speech signal parameters in the context of their usefulness in automatic assessment of singing expression quality]

Publication

- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2019

The work concerns an approach to parameterization for the classification of emotions in singing and its comparison with the classification of emotions in speech. For this purpose, the RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) database of emotionally charged speech and singing was used; it contains recordings of professional actors presenting six different emotions. Next, mel-frequency cepstral coefficients (MFCC) and selected descriptors were calculated...

Full text available for download in the portal

Multimodal Surveillance Based Personal Protection System

Publication

- Year 2013

A novel, multimodal approach for automatic detection of the abduction of a protected individual, employing a dedicated personal protection device and a city monitoring system, is proposed and overviewed. The solution is based on combining four modalities (signals coming from: Bluetooth, fixed and PTZ cameras, thermal camera, acoustic sensors). The Bluetooth signal is used continuously to monitor the protected person's presence, and in case...

Exploring Neural Networks for Musical Instrument Identification in Polyphonic Audio

Publication

M. Blaszke
G. Korvel
B. Kostek

- IEEE INTELLIGENT SYSTEMS - Year 2024

The purpose of this paper is to introduce neural network-based methods that surpass state-of-the-art (SOTA) models, either by training faster or having simpler architecture, while maintaining comparable effectiveness in musical instrument identification in polyphonic music. Several approaches are presented, including two authors’ proposals, i.e., spiking neural networks (SNN) and a modular deep learning model named FMCNN (Fully...

Full text available for download from an external service

Multimodal system for diagnosis and polysensory stimulation of subjects with communication disorders

Publication

- Year 2017

An experimental multimodal system, designed for polysensory diagnosis and stimulation of persons with impaired communication skills or even non-communicative subjects is presented. The user interface includes an eye tracking device and the EEG monitoring of the subject. Furthermore, the system consists of a device for objective hearing testing and an autostereoscopic projection system designed to stimulate subjects through their...

New Applications of Multimodal Human-Computer Interfaces

Publication

A. Czyżewski

- Year 2012

Multimodal computer interfaces and examples of their applications in educational software and for disabled people are presented. The proposed interfaces include an interactive electronic whiteboard based on video image analysis, an application for controlling computers with gestures, and an audio interface for speech stretching for hearing-impaired and stuttering people. Application of the eye-gaze tracking system to awareness...

Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services

Publication

- Year 2021

Streaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that deliver on-demand audio and video content anytime and anywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...

Full text available for download from an external service

Musical Instrument Identification Using Deep Learning Approach

Publication

- SENSORS - Year 2022

The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata...

Full text available for download in the portal
