Katedra Systemów Multimedialnych

A comparative study of English viseme recognition methods and algorithms

Publikacja

- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2018

An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...

Pełny tekst do pobrania w portalu

Application of Vector Sensors to Acoustic Surveillance of a Public Interior Space

Publikacja

- Archives of Acoustics - Rok 2011

Przedstawiono metodę precyzyjnej detekcji i lokalizacji źródeł dźwięku w pomieszczeniach. Wykorzystano wektorowe czujniki akustyczne, dostarczające sygnałów ciśnienia akustycznego i prędkości cząsteczek powietrza. Zaprezentowano metodę lokalizacji źródeł dźwięku na widowni wydarzenia publicznego. Przedstawiono demonstracyjny system zainstalowany w sali wykładowej. System poddano ocenie dokładności na podstawie przeprowadzonych...

Pełny tekst do pobrania w portalu

Automatic assessment of the motor state of the Parkinson's disease patient --a case study

Publikacja

B. Kostek
K. Kaszuba-Miotke
P. Żwan
P. Robowski
J. Sławek

- Diagnostic Pathology - Rok 2012

This paper presents a novel methodology in which the Unified Parkinson's Disease Rating Scale (UPDRS) data processed with a rule-based decision algorithm is used to predict the state of the Parkinson's Disease patients. The research was carried out to investigate whether the advancement of the Parkinson's Disease can be automatically assessed. For this purpose, past and current UPDRS data from 47 subjects were examined. The results...

Pełny tekst do pobrania w portalu

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

Publikacja

M. Piotrowska
A. Czyżewski
T. Ciszewski
G. Korvel
A. Kurowski
B. Kostek

- Journal of the Acoustical Society of America - Rok 2021

The approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...

Pełny tekst do pobrania w portalu

Analysis of results of large-scale multimodal biometric identity verification experiment

Publikacja

- IET Biometrics - Rok 2018

An analysis of a large set of biometric data obtained during the enrolment and the verification phase in an experimental biometric system installed in bank branches is presented. Subjective opinions of bank clients and of bank tellers were also surveyed concerning the studied biometric methods in order to discover and to explore relations emerging from the obtained multimodal dataset. First, data acquisition and identity verification...

Pełny tekst do pobrania w portalu

3D Acoustic Field Intensity Probe Design and Measurements

Publikacja

- Archives of Acoustics - Rok 2016

The aim of this paper is two-fold. First, some basic notions on acoustic field intensity and its measurement are shortly recalled. Then, the equipment and the measurement procedure used in the sound intensity in the performed research study are described. The second goal is to present details of the design of the engineered 3D intensity probe, as well as the algorithms developed and applied for that purpose. Results of the intensity...

Pełny tekst do pobrania w portalu

Creating Dynamic Maps of Noise Threat Using PL-Grid Infrastructure

Publikacja

- Archives of Acoustics - Rok 2013

The paper presents functionality and operation results of a system for creating dynamic maps of acoustic noise employing the PL-Grid infrastructure extended with a distributed sensor network. The work presented provides a demonstration of the services being prepared within the PLGrid Plus project for measuring, modeling and rendering data related to noise level distribution in city agglomerations. Specific computational environments,...

Pełny tekst do pobrania w portalu

Online urban acoustic noise monitoring system

Publikacja

- NOISE CONTROL ENGINEERING JOURNAL - Rok 2012

Concepts and implementation of the Online Urban Noise Monitoring System are presented. Principles of proposed solution used for dynamic acoustical maps creating are discussed. The architecture of the system and the data acquisition scheme are described. The concept of noise mapping, based on noise source model and propagation simulations, was developed and employed in the system. Dynamic estimation of noise source parameters utilized...

Pełny tekst do pobrania w serwisie zewnętrznym

Vehicle classification based on soft computing algorithms

Publikacja

- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2010

Experiments and results regarding vehicle type classification are presented. Three classes of vehicles are recognized: sedans, vans and trucks. The system uses a non-calibrated traffic camera, therefore no direct vehicle dimensions are used. Various vehicle descriptors are tested, including those based on vehicle mask only and those based on vehicle images. The latter ones employ Speeded Up Robust Features (SURF) and gradient images...

Pełny tekst do pobrania w serwisie zewnętrznym

System for monitoring road slippery based on CCTV cameras and convolutional neural networks

Publikacja

D. Grabowski
A. Czyżewski

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2020

The slipperiness of the surface is essential for road safety. The growing number of CCTV cameras opens the possibility of using them to automatically detect the slippery surface and inform road users about it. This paper presents a system of developed intelligent road signs, including a detector based on convolutional neural networks (CNNs) and the transferlearning method employed to the processing of images acquired with video...

Pełny tekst do pobrania w portalu

Acceleration of decision making in sound event recognition employing supercomputing cluster

Publikacja

- INFORMATION SCIENCES - Rok 2014

Parallel processing of audio data streams is introduced to shorten the decision making time in hazardous sound event recognition. A supercomputing cluster environment with a framework dedicated to processing multimedia data streams in real time is used. The sound event recognition algorithms employed are based on detecting foreground events, calculating their features in short time frames, and classifying the events with Support...

Pełny tekst do pobrania w serwisie zewnętrznym

An Automated Method for Biometric Handwritten Signature Authentication Employing Neural Networks

Publikacja

- Electronics - Rok 2021

Handwriting biometrics applications in e-Security and e-Health are addressed in the course of the conducted research. An automated graphomotor analysis method for the dynamic electronic representation of the handwritten signature authentication was researched. The developed algorithms are based on dynamic analysis of electronically handwritten signatures employing neural networks. The signatures were acquired with the use of the...

Pełny tekst do pobrania w portalu

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

Publikacja

D. Korzekwa
R. Barra-Chicote
B. Kostek
T. Drugman
M. Łajszczak

- Rok 2019

We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

Pełny tekst do pobrania w portalu

Fluctuation-enhanced scent sensing using a single gas sensor

Publikacja

- SENSORS AND ACTUATORS B-CHEMICAL - Rok 2011

Scent or aroma sensing during aromatherapy can be carried out by applying only a single resistance gas sensor (TGS - Taguchi Gas Sensors). This paper considers the efficiency of detection of essential oils by DC resistance and its fluctuations observed in TGS sensors. A detailed study has been conducted for scents emitted by five popular essential oils using three sensor types (TGS 2600, TGS 2602, TGS 823). The research was focused...

Pełny tekst do pobrania w serwisie zewnętrznym

UPDRS tests for diagnosis of Parkinson's disease employing virtual-touchpad

Publikacja

- Rok 2010

This paper presents a new approach to diagnosing Parkinson's disease. The progression of the disease can be measured by the UPDRS (Unified Parkinson Disease Rating Scale) scale which is used to evaluate motor and behavioral symptoms of Parkinson's disease. Hitherto the evaluation of the advancement of the disease in the UPDRS scale was made by a specialist through medical observation. The authors suggest a partial automation of...

Fluctuation-enhanced scent sensing using a single gas sensor

Publikacja

- SENSORS AND ACTUATORS B-CHEMICAL - Rok 2011

Wykrywanie zapachów podczas aromaterapii może być przeprowadzone za pomocą pojedynczego sensora gazów. W pracy rozważono efektywność detekcji zapachów olejków eterycznych za pomocą rezystancji DC oraz zjawisk fluktuacyjnych w tych sensorach, typu TGS2600,TGS2602,TGS823. Badania koncentrowały się na praktycznym zastosowaniu w aromaterapii do określania intensywności emitowanego zapachu. Opisano szczegółowo system do emisji zapachów.

Pełny tekst do pobrania w portalu

MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES

Publikacja

M. Piotrowska
G. Korvel
B. Kostek
T. Ciszewski
A. Czyżewski

- International Journal of Applied Mathematics and Computer Science - Rok 2019

Automatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and selforganizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...

Pełny tekst do pobrania w portalu

Musical Instrument Identification Using Deep Learning Approach

Publikacja

- SENSORS - Rok 2022

The work aims to propose a novel approach for automatically identifying all instruments present in an audio excerpt using sets of individual convolutional neural networks (CNNs) per tested instrument. The paper starts with a review of tasks related to musical instrument identification. It focuses on tasks performed, input type, algorithms employed, and metrics used. The paper starts with the background presentation, i.e., metadata...

Pełny tekst do pobrania w portalu

Music Mood Visualization Using Self-Organizing Maps

Publikacja

- Archives of Acoustics - Rok 2015

Due to an increasing amount of music being made available in digital form in the Internet, an automatic organization of music is sought. The paper presents an approach to graphical representation of mood of songs based on Self-Organizing Maps. Parameters describing mood of music are proposed and calculated and then analyzed employing correlation with mood dimensions based on the Multidimensional Scaling. A map is created in which...

Pełny tekst do pobrania w portalu

Visual Lip Contour Detection for the Purpose of Speech Recognition

Publikacja

- Rok 2014

A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...

Publikacje

Filtry

Kategoria

Rok

Opcje

A comparative study of English viseme recognition methods and algorithms

Application of Vector Sensors to Acoustic Surveillance of a Public Interior Space

Automatic assessment of the motor state of the Parkinson's disease patient --a case study

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

Analysis of results of large-scale multimodal biometric identity verification experiment

3D Acoustic Field Intensity Probe Design and Measurements

Creating Dynamic Maps of Noise Threat Using PL-Grid Infrastructure

Online urban acoustic noise monitoring system

Vehicle classification based on soft computing algorithms

System for monitoring road slippery based on CCTV cameras and convolutional neural networks

Acceleration of decision making in sound event recognition employing supercomputing cluster

An Automated Method for Biometric Handwritten Signature Authentication Employing Neural Networks

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

Fluctuation-enhanced scent sensing using a single gas sensor

UPDRS tests for diagnosis of Parkinson's disease employing virtual-touchpad

Fluctuation-enhanced scent sensing using a single gas sensor

MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES

Musical Instrument Identification Using Deep Learning Approach

Music Mood Visualization Using Self-Organizing Maps

Visual Lip Contour Detection for the Purpose of Speech Recognition

Wyszukiwarka