Wyniki wyszukiwania dla: AUDIO ANALYSIS

Bimodal classification of English allophones employing acoustic speech signal and facial motion capture

Publikacja

- Journal of the Acoustical Society of America - Rok 2018

A method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Pełny tekst do pobrania w serwisie zewnętrznym

Loudness Scaling Tests in Hearing Problems Detection

Publikacja

- Rok 2015

The number of people using portable audio players has increased significantly over the recent years. This implies the rise in the number of people having hearing loss problems. Therefore, there is a need to find appropriate procedures that simplify the process of the hearing problem detection. Investigations performed show that audiometric tests may not be sufficient to assess hearing in young people. Contrarily, the obtained results...

A comparative study of English viseme recognition methods and algorithms

Publikacja

- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2018

An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...

Pełny tekst do pobrania w portalu

INFLUENCE OF DATA NORMALIZATION ON THE EFFECTIVENESS OF NEURAL NETWORKS APPLIED TO CLASSIFICATION OF PAVEMENT CONDITIONS – CASE STUDY

Publikacja

- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2018

In recent years automatic classification employing machine learning seems to be in high demand for tele-informatic-based solutions. An example of such solutions are intelligent transportation systems (ITS), in which various factors are taken into account. The subject of the study presented is the impact of data pre-processing and normalization on the accuracy and training effectiveness of artificial neural networks in the case...

ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU

Publikacja

- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Rok 2019

Praca dotyczy podejścia do parametryzacji w przypadku klasyfikacji emocji w śpiewie oraz porównania z klasyfikacją emocji w mowie. Do tego celu wykorzystano bazę mowy i śpiewu nacechowanego emocjonalnie RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song), zawierającą nagrania profesjonalnych aktorów prezentujących sześć różnych emocji. Następnie obliczono współczynniki mel-cepstralne (MFCC) oraz wybrane deskryptory...

Pełny tekst do pobrania w portalu

Ranking Speech Features for Their Usage in Singing Emotion Classification

Publikacja

- Rok 2020

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Pełny tekst do pobrania w portalu

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Publikacja

G. Tamulevicius
G. Korvel
A. B. Yayak
P. Treigys
J. Bernataviciene
B. Kostek

- Electronics - Rok 2020

In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

Pełny tekst do pobrania w portalu

Multimodal Surveillance Based Personal Protection System

Publikacja

- Rok 2013

A novel, multimodal approach for automatic detection of abduction of a protected individual, employing dedicated personal protection device and a city monitoring system is proposed and overviewed. The solution is based on combining four modalities (signals coming from: Bluetooth, fixed and PTZ cameras, thermal camera, acoustic sensors). The Bluetooth signal is used continuously to monitor the protected person presence, and in case...

Subjective and Objective Quality Evaluation Study of BPL -PLC Wired Medium

Publikacja

G. Debita
P. Falkowski-Gilski
M. Habrych
B. Miedziński
B. Polnik
J. Wandzio
P. Jedlikowski

- Elektronika Ir Elektrotechnika - Rok 2020

This paper presents results of research on the effectiveness of bi-directional voice transmission in a 6 kV mine cable network using BPL-PLC (Broadband over Power Line - Power Line Communication) technology. It concerns both emergency cable state (supply outage with cable shorted at both ends) and loaded with distorted current waveforms. The narrowband (0.5 MHz–15 MHz) and broadband (two different modes, frequency range of 3 MHz–7.5...

Pełny tekst do pobrania w portalu

Rough Sets Applied to Mood of Music Recognition

Publikacja

- Rok 2016

With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered as one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt to music emotion recognition...

Bimodal Emotion Recognition Based on Vocal and Facial Features

Publikacja

- Rok 2023

Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...

Pełny tekst do pobrania w portalu

Musical Instrument Identification Using Deep Learning Approach

Publikacja

- SENSORS - Rok 2022

The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata...

Pełny tekst do pobrania w portalu

Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services

Publikacja

- Rok 2021

Streaming multimedia services have become very popular in recent years, due to the development of wireless networks. With the growing number of mobile devices worldwide, service providers offer dedicated applications that allow to deliver on-demand audio and video content anytime and everywhere. The aim of this study was to compare different streaming services and investigate their impact on the CPU and RAM resources, with respect...

Pełny tekst do pobrania w serwisie zewnętrznym

A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors

Publikacja

- SENSORS - Rok 2020

In recent years, emotion recognition algorithms have achieved high efficiency, allowing the development of various affective and affect-aware applications. This advancement has taken place mainly in the environment of personal computers offering the appropriate hardware and sufficient power to process complex data from video, audio, and other channels. However, the increase in computing and communication capabilities of smartphones,...

Pełny tekst do pobrania w portalu

Speech Analytics Based on Machine Learning

Publikacja

- Rok 2019

In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Pełny tekst do pobrania w serwisie zewnętrznym

Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification

Publikacja

- Rok 2014

The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm employed automatic music genre recognition to determine the optimum parameters for the synthesis of additional frequencies. The synthesis was carried out using the non-linear device (NLD) and phase vocoder (PV) methods depending on the music excerpt genre. Classification of musical...

Creating a Remote Choir Performance Recording Based on an Ambisonic Approach

Publikacja

- Applied Sciences-Basel - Rok 2022

The aim of this paper is three-fold. First, the basics of binaural and ambisonic techniques are briefly presented. Then, details related to audio-visual recordings of a remote performance of the Academic Choir of the Gdańsk University of Technology are shown. Due to the COVID-19 pandemic, artists had a choice, namely, to stay at home and not perform or stay at home and perform. In fact, staying at home brought in the possibility...

Pełny tekst do pobrania w portalu

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

Publikacja

M. Piotrowska
A. Czyżewski
T. Ciszewski
G. Korvel
A. Kurowski
B. Kostek

- Journal of the Acoustical Society of America - Rok 2021

The approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...

Pełny tekst do pobrania w portalu

Akustyczna analiza natężenia ruchu drogowego dla systemów zarządzania ruchem

Publikacja

K. Marciniuk

- Rok 2019

W pracy przybliżono wybrane zagadnienia z dziedziny zarządzania transportem drogowym w Polsce i na świecie. W tym kontekście pzredstawiono potrzeby rynkowe, wymagania jak i możliwości w zakresie pozyskiwania informacji o aktualnym stanie sieci drogowych. Zaproponowano akustyczną metodę nadzorowania ruchu drogowego i jej możliwości w kontekście systemów zarządzania ruchem. Przedstawiono schemat akwizycji sygnału wraz z danymi odniesienia....

Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling

Publikacja

S. Raczyński
E. Vincent
S. Sagayama

- IEEE Transactions on Audio Speech and Language Processing - Rok 2013

Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...

Pełny tekst do pobrania w serwisie zewnętrznym

Buzz-based honeybee colony fingerprint

Publikacja

- COMPUTERS AND ELECTRONICS IN AGRICULTURE - Rok 2021

Non-intrusive remote monitoring has its applications in a variety of areas. For industrial surveillance case, devices are capable of detecting anomalies that may threaten machine operation. Similarly, agricultural monitoring devices are used to supervise livestock or provide higher yields. Modern IoT devices are often coupled with Machine Learning models, which provide valuable insights into device operation. However, the data...

Pełny tekst do pobrania w portalu

Fully Automated AI-powered Contactless Cough Detection based on Pixel Value Dynamics Occurring within Facial Regions

Publikacja

M. Szankin
A. Kwaśniewska
N. Kowalczyk
J. Rumiński
R. Nicolas
D. Gamba

- Rok 2021

Increased interest in non-contact evaluation of the health state has led to higher expectations for delivering automated and reliable solutions that can be conveniently used during daily activities. Although some solutions for cough detection exist, they suffer from a series of limitations. Some of them rely on gesture or body pose recognition, which might not be possible in cases of occlusions, closer camera distances or impediments...

Pełny tekst do pobrania w serwisie zewnętrznym

Detection of the Incoming Sound Direction Employing MEMS Microphones and the DSP

Publikacja

- Rok 2017

A 3D acoustic vector sensor based on MEMS microphones and its application to road traffic monitoring is presented in the paper. The sensor is constructed from three pairs of digital MEMS microphones, mounted on the orthogonal axes. Signals obtained from the microphones are used to compute sound intensity vectors in each direction. With this data, it is possible to compute the horizontal and vertical angle of an incoming sound....

Pełny tekst do pobrania w serwisie zewnętrznym

ZINTEGROWANY SYSTEM DOMOWEGO MONITORINGU PARAMETRÓW MEDYCZNYCH OSÓB STARSZYCH I CHORYCH

Publikacja

- Rok 2019

Proponowane rozwiązania mają na celu wspomaganie osób starszych i chorych, tak by mogły jak najdłużej mieszkać i żyć samodzielnie ze zwiększonym poczuciem bezpieczeństwa, iż są nadzorowane i w razie nagłego zagrożenia życia nie pozostaną bez pomocy. System jednocześnie nie narusza poczucia zachowania prywatności i intymności, gdyż nie są używane do monitoringu kamery wizyjne czy też stały nasłuch audio. Dodatkowo gromadzone informacje...

Filtry

Katalog

Kategoria

Rok

Opcje

Bimodal classification of English allophones employing acoustic speech signal and facial motion capture

Loudness Scaling Tests in Hearing Problems Detection

A comparative study of English viseme recognition methods and algorithms

INFLUENCE OF DATA NORMALIZATION ON THE EFFECTIVENESS OF NEURAL NETWORKS APPLIED TO CLASSIFICATION OF PAVEMENT CONDITIONS – CASE STUDY

ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU

Ranking Speech Features for Their Usage in Singing Emotion Classification

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Multimodal Surveillance Based Personal Protection System

Subjective and Objective Quality Evaluation Study of BPL -PLC Wired Medium

Rough Sets Applied to Mood of Music Recognition

Bimodal Emotion Recognition Based on Vocal and Facial Features

Musical Instrument Identification Using Deep Learning Approach

Study on CPU and RAM Resource Consumption of Mobile Devices using Streaming Services

A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors

Speech Analytics Based on Machine Learning

Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification

Creating a Remote Choir Performance Recording Based on an Ambisonic Approach

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

Akustyczna analiza natężenia ruchu drogowego dla systemów zarządzania ruchem

Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling

Buzz-based honeybee colony fingerprint

Fully Automated AI-powered Contactless Cough Detection based on Pixel Value Dynamics Occurring within Facial Regions

Detection of the Incoming Sound Direction Employing MEMS Microphones and the DSP

ZINTEGROWANY SYSTEM DOMOWEGO MONITORINGU PARAMETRÓW MEDYCZNYCH OSÓB STARSZYCH I CHORYCH

Wyszukiwarka

Filtry

Katalog

Kategoria

Rok

Opcje

Wyniki wyszukiwania dla: AUDIO ANALYSIS