Katedra Systemów Multimedialnych

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

Publikacja

D. Korzekwa
R. Barra-Chicote
B. Kostek
T. Drugman
M. Łajszczak

- Rok 2019

We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

Pełny tekst do pobrania w portalu

Acceleration of decision making in sound event recognition employing supercomputing cluster

Publikacja

- INFORMATION SCIENCES - Rok 2014

Parallel processing of audio data streams is introduced to shorten the decision making time in hazardous sound event recognition. A supercomputing cluster environment with a framework dedicated to processing multimedia data streams in real time is used. The sound event recognition algorithms employed are based on detecting foreground events, calculating their features in short time frames, and classifying the events with Support...

Pełny tekst do pobrania w serwisie zewnętrznym

Vehicle classification based on soft computing algorithms

Publikacja

- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2010

Experiments and results regarding vehicle type classification are presented. Three classes of vehicles are recognized: sedans, vans and trucks. The system uses a non-calibrated traffic camera, therefore no direct vehicle dimensions are used. Various vehicle descriptors are tested, including those based on vehicle mask only and those based on vehicle images. The latter ones employ Speeded Up Robust Features (SURF) and gradient images...

Pełny tekst do pobrania w serwisie zewnętrznym

An Automated Method for Biometric Handwritten Signature Authentication Employing Neural Networks

Publikacja

- Electronics - Rok 2021

Handwriting biometrics applications in e-Security and e-Health are addressed in the course of the conducted research. An automated graphomotor analysis method for the dynamic electronic representation of the handwritten signature authentication was researched. The developed algorithms are based on dynamic analysis of electronically handwritten signatures employing neural networks. The signatures were acquired with the use of the...

Pełny tekst do pobrania w portalu

Online urban acoustic noise monitoring system

Publikacja

- NOISE CONTROL ENGINEERING JOURNAL - Rok 2012

Concepts and implementation of the Online Urban Noise Monitoring System are presented. Principles of proposed solution used for dynamic acoustical maps creating are discussed. The architecture of the system and the data acquisition scheme are described. The concept of noise mapping, based on noise source model and propagation simulations, was developed and employed in the system. Dynamic estimation of noise source parameters utilized...

Pełny tekst do pobrania w serwisie zewnętrznym

Creating Dynamic Maps of Noise Threat Using PL-Grid Infrastructure

Publikacja

- Archives of Acoustics - Rok 2013

The paper presents functionality and operation results of a system for creating dynamic maps of acoustic noise employing the PL-Grid infrastructure extended with a distributed sensor network. The work presented provides a demonstration of the services being prepared within the PLGrid Plus project for measuring, modeling and rendering data related to noise level distribution in city agglomerations. Specific computational environments,...

Pełny tekst do pobrania w portalu

Application of Vector Sensors to Acoustic Surveillance of a Public Interior Space

Publikacja

- Archives of Acoustics - Rok 2011

Przedstawiono metodę precyzyjnej detekcji i lokalizacji źródeł dźwięku w pomieszczeniach. Wykorzystano wektorowe czujniki akustyczne, dostarczające sygnałów ciśnienia akustycznego i prędkości cząsteczek powietrza. Zaprezentowano metodę lokalizacji źródeł dźwięku na widowni wydarzenia publicznego. Przedstawiono demonstracyjny system zainstalowany w sali wykładowej. System poddano ocenie dokładności na podstawie przeprowadzonych...

Pełny tekst do pobrania w portalu

3D Acoustic Field Intensity Probe Design and Measurements

Publikacja

- Archives of Acoustics - Rok 2016

The aim of this paper is two-fold. First, some basic notions on acoustic field intensity and its measurement are shortly recalled. Then, the equipment and the measurement procedure used in the sound intensity in the performed research study are described. The second goal is to present details of the design of the engineered 3D intensity probe, as well as the algorithms developed and applied for that purpose. Results of the intensity...

Pełny tekst do pobrania w portalu

Ship Resistance Prediction with Artificial Neural Networks

Publikacja

K. Grabowska
P. Szczuko

- Rok 2015

The paper is dedicated to a new method of ship’s resistance prediction using Artificial Neural Network (ANN). In the initial stage selected ships parameters are prepared to be used as a training and validation sets. Next step is to verify several network structures and to determine parameters with the highest influence on the result resistance. Finally, other parameters expected to impact the resistance are proposed. The research utilizes...

Pełny tekst do pobrania w portalu

Automatic assessment of the motor state of the Parkinson's disease patient --a case study

Publikacja

B. Kostek
K. Kaszuba-Miotke
P. Żwan
P. Robowski
J. Sławek

- Diagnostic Pathology - Rok 2012

This paper presents a novel methodology in which the Unified Parkinson's Disease Rating Scale (UPDRS) data processed with a rule-based decision algorithm is used to predict the state of the Parkinson's Disease patients. The research was carried out to investigate whether the advancement of the Parkinson's Disease can be automatically assessed. For this purpose, past and current UPDRS data from 47 subjects were examined. The results...

Pełny tekst do pobrania w portalu

A comparative study of English viseme recognition methods and algorithms

Publikacja

- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2018

An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...

Pełny tekst do pobrania w portalu

Analysis of results of large-scale multimodal biometric identity verification experiment

Publikacja

- IET Biometrics - Rok 2018

An analysis of a large set of biometric data obtained during the enrolment and the verification phase in an experimental biometric system installed in bank branches is presented. Subjective opinions of bank clients and of bank tellers were also surveyed concerning the studied biometric methods in order to discover and to explore relations emerging from the obtained multimodal dataset. First, data acquisition and identity verification...

Pełny tekst do pobrania w portalu

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

Publikacja

M. Piotrowska
A. Czyżewski
T. Ciszewski
G. Korvel
A. Kurowski
B. Kostek

- Journal of the Acoustical Society of America - Rok 2021

The approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...

Pełny tekst do pobrania w portalu

System for monitoring road slippery based on CCTV cameras and convolutional neural networks

Publikacja

D. Grabowski
A. Czyżewski

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2020

The slipperiness of the surface is essential for road safety. The growing number of CCTV cameras opens the possibility of using them to automatically detect the slippery surface and inform road users about it. This paper presents a system of developed intelligent road signs, including a detector based on convolutional neural networks (CNNs) and the transferlearning method employed to the processing of images acquired with video...

Pełny tekst do pobrania w portalu

Human Computer Interface for Tracking Eye Movements Improves Assessment and Diagnosis of Patients With Acquired Brain Injuries

Publikacja

- Frontiers in Neurology - Rok 2019

One of the first clinical signs differentiating the minimally conscious state from the vegetative state is the presence of smooth pursuit eye movements occurring in direct response to moving salient stimuli. Glasgow Coma Scale (GCS) is one of the most commonly used diagnostic tools for acute phase assessment of the level of consciousness, together with a neurological examination. These classic measures are limited to qualitative...

Pełny tekst do pobrania w portalu

A comparative study of English viseme recognition methods and algorithm

Publikacja

- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2018

An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...

Pełny tekst do pobrania w portalu

Music Information Retrieval in Music Repositories

Publikacja

B. Kostek

- Rok 2013

This chapter reviews the key concepts associated with automated Music Information Retrieval (MIR). First, current research trends and system solutions in terms of music retrieval and music recommendation are discussed. Next, experiments performed on a constructed music database are presented. A proposal for music retrieval and annotation aided by gaze tracking is also discussed.

Pełny tekst do pobrania w serwisie zewnętrznym

Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling

Publikacja

D. Korzekwa
J. Lorenzo-trueba
S. Zaporowski
S. Calamaro
T. Drugman
B. Kostek

- Rok 2021

A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

Pełny tekst do pobrania w serwisie zewnętrznym

Employing Subjective Tests and Deep Learning for Discovering the Relationship between Personality Types and Preferred Music Genres

Publikacja

- Electronics - Rok 2020

The purpose of this research is two-fold: (a) to explore the relationship between the listeners’ personality trait, i.e., extraverts and introverts and their preferred music genres, and (b) to predict the personality trait of potential listeners on the basis of a musical excerpt by employing several classification algorithms. We assume that this may help match songs according to the listener’s personality in social music networks....

Pełny tekst do pobrania w portalu

Instrument detection and pose estimation with rigid part mixtures model in video-assisted surgeries

Publikacja

- MEDICAL IMAGE ANALYSIS - Rok 2018

Localizing instrument parts in video-assisted surgeries is an attractive and open computer vision problem. A working algorithm would immediately find applications in computer-aided interventions in the operating theater. Knowing the location of tool parts could help virtually augment visual faculty of surgeons, assess skills of novice surgeons, and increase autonomy of surgical robots. A surgical tool varies in appearance due to...

Pełny tekst do pobrania w serwisie zewnętrznym

Publikacje

Filtry

Kategoria

Rok

Opcje

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

Acceleration of decision making in sound event recognition employing supercomputing cluster

Vehicle classification based on soft computing algorithms

An Automated Method for Biometric Handwritten Signature Authentication Employing Neural Networks

Online urban acoustic noise monitoring system

Creating Dynamic Maps of Noise Threat Using PL-Grid Infrastructure

Application of Vector Sensors to Acoustic Surveillance of a Public Interior Space

3D Acoustic Field Intensity Probe Design and Measurements

Ship Resistance Prediction with Artificial Neural Networks

Automatic assessment of the motor state of the Parkinson's disease patient --a case study

A comparative study of English viseme recognition methods and algorithms

Analysis of results of large-scale multimodal biometric identity verification experiment

Evaluation of aspiration problems in L2 English pronunciation employing machine learning

System for monitoring road slippery based on CCTV cameras and convolutional neural networks

Human Computer Interface for Tracking Eye Movements Improves Assessment and Diagnosis of Patients With Acquired Brain Injuries

A comparative study of English viseme recognition methods and algorithm

Music Information Retrieval in Music Repositories

Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling

Employing Subjective Tests and Deep Learning for Discovering the Relationship between Personality Types and Preferred Music Genres

Instrument detection and pose estimation with rigid part mixtures model in video-assisted surgeries

Wyszukiwarka