Search results for: VISUAL SPEECH RECOGNITION

Voice command recognition using hybrid genetic algorithm

Publication

- TASK Quarterly - Year 2010

Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...

Full text available for download in the portal

Noise profiling for speech enhancement employing machine learning models

Publication

K. Kąkol
G. Korvel
B. Kostek

- Journal of the Acoustical Society of America - Year 2022

This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

Full text available for download in the portal

Automatic Emotion Recognition in Children with Autism: A Systematic Literature Review

Publication

A. Landowska
A. Karpus
T. Zawadzka
B. Robins
D. Erol Barkana
H. Kose
T. Zorcec
N. Cummins

- SENSORS - Year 2022

The automatic emotion recognition domain brings new methods and technologies that might be used to enhance therapy of children with autism. The paper aims at the exploration of methods and tools used to recognize emotions in children. It presents a literature review study that was performed using a systematic approach and PRISMA methodology for reporting quantitative and qualitative results. Diverse observation channels and modalities...

Full text available for download in the portal

Examining Feature Vector for Phoneme Recognition / Analiza parametrów w kontekście automatycznej klasyfikacji fonemów

Publication

- Year 2017

The aim of this paper is to analyze usability of descriptors coming from music information retrieval to the phoneme analysis. The case study presented consists in several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...

From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition

Publication

P. Rościszewski

- Computer Science - Year 2017

Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Full text available for download in the portal

PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS

Publication

- Year 2015

The quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...

Marking the Allophones Boundaries Based on the DTW Algorithm

Publication

J. Rafałko

- Year 2018

The paper presents an approach to marking the boundaries of allophones in the speech signal based on the Dynamic Time Warping (DTW) algorithm. Setting and marking of allophones boundaries in continuous speech is a difficult issue due to the mutual influence of adjacent phonemes on each other. It is this neighborhood on the one hand that creates variants of phonemes that is allophones, and on the other hand it affects that the border...

Automatic Watercraft Recognition and Identification on Water Areas Covered by Video Monitoring as Extension for Sea and River Traffic Supervision Systems

Publication

N. Wawrzyniak
A. Stateczny

- Polish Maritime Research - Year 2018

The article presents the watercraft recognition and identification system as an extension for the presently used visual water area monitoring systems, such as VTS (Vessel Traffic Service) or RIS (River Information Service). The watercraft identification systems (AIS - Automatic Identification Systems) which are presently used in both sea and inland navigation require purchase and installation of relatively expensive transceivers...

Full text available for download from an external service

Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling

Publication

D. Korzekwa
J. Lorenzo-Trueba
S. Zaporowski
S. Calamaro
T. Drugman
B. Kostek

- Year 2021

A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

Full text available for download from an external service

Selection of Features for Multimodal Vocalic Segments Classification

Publication

- Year 2018

English speech recognition experiments are presented employing both audio signal and Facial Motion Capture (FMC) recordings. The principal aim of the study was to evaluate the influence of feature vector dimension reduction on the accuracy of vocalic segments classification employing neural networks. Several parameter reduction strategies were adopted, namely: Extremely Randomized Trees, Principal Component Analysis and Recursive...

Full text available for download from an external service

Remote Estimation of Video-Based Vital Signs in Emotion Invocation Studies

Publication

- Year 2018

The goal of this study is to examine the influence of various imitated and video invoked emotions on the vital signs (respiratory and pulse rates). We also perform an analysis of the possibility to extract signals from sequences acquired with cost-effective cameras. The preliminary results show that the respiratory rate allows for better separation of some emotions than the pulse rate, yet this relation highly depends...

Full text available for download in the portal

Robust Object Detection with Multi-input Multi-output Faster R-CNN

Publication

- Year 2022

Recent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work,...

Full text available for download in the portal

Full text available for download from an external service

Using Eye-tracking to get information on the skills acquisition by the radiology residents

Publication

- Year 2019

This paper describes the possibility of monitoring the progress of knowledge and skills acquisition by students of radiology. It is achieved by an analysis of visual attention distribution patterns during image-based task solving. The concept is to use the eye-tracking data to recognize how radiographic images are read by recognized experts, radiography residents involved in the training program, and untrained...

Full text available for download from an external service

Video Semantic Analysis Framework based on Run-time Production Rules - Towards Cognitive Vision

Publication

E. Szczerbicki
C. Toro
C. Sanin

- JOURNAL OF UNIVERSAL COMPUTER SCIENCE - Year 2015

This paper proposes a service-oriented architecture for video analysis which separates object detection from event recognition. Our aim is to introduce new tools to be considered in the pathway towards Cognitive Vision as a support for classical Computer Vision techniques that have been broadly used by the scientific community. In the article, we particularly focus on solving some of the reported scalability issues found in current...

Full text available for download in the portal

Modeling and Simulation for Exploring Power/Time Trade-off of Parallel Deep Neural Network Training

Publication

P. Rościszewski

- Procedia Computer Science - Year 2017

In the paper we tackle the bi-objective execution time and power consumption optimization problem concerning execution of parallel applications. We propose using a discrete-event simulation environment for exploring this power/time trade-off in the form of a Pareto front. The solution is verified by a case study based on a real deep neural network training application for automatic speech recognition. A simulation lasting over 2 hours...

Full text available for download in the portal

Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization

Publication

- Year 2017

An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...

Minimizing Distribution and Data Loading Overheads in Parallel Training of DNN Acoustic Models with Frequent Parameter Averaging

Publication

- Year 2017

In the paper we investigate the performance of parallel deep neural network training with parameter averaging for acoustic modeling in Kaldi, a popular automatic speech recognition toolkit. We describe experiments based on training a recurrent neural network with 4 layers of 800 LSTM hidden states on a 100-hour corpus of annotated Polish speech data. We propose an MPI-based modification of the training program which minimizes the...

Full text available for download from an external service

Cross-domain applications of multimodal human-computer interfaces

Publication

A. Czyżewski

- Year 2015

Developed multimodal interfaces for education applications and for disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and audio interface for speech stretching for hearing impaired and stuttering people and intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...

ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU

Publication

- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2019

The work concerns an approach to parameterization for the classification of emotions in singing and a comparison with the classification of emotions in speech. For this purpose, the RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) database of emotionally charged speech and song was used, containing recordings of professional actors presenting six different emotions. Next, mel-frequency cepstral coefficients (MFCC) and selected descriptors were calculated...

Full text available for download in the portal

Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016

Evaluation of sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for discerning between the events being in focus and the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the...

Full text available for download in the portal

Performance Analysis of the OpenCL Environment on Mobile Platforms

Publication

- Year 2022

Today’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...

Full text available for download from an external service

Separability Assessment of Selected Types of Vehicle-Associated Noise

Publication

- Advances in Intelligent Systems and Computing - Year 2016

The Music Information Retrieval (MIR) area as well as the development of speech and environmental information recognition techniques brought various tools intended for recognizing low-level features of acoustic signals based on a set of calculated parameters. In this study, the MIRtoolbox MATLAB tool, designed for music parameter extraction, is used to obtain a vector of parameters to check whether they are suitable for separation of...

Full text available for download from an external service

From Knowledge based Vision Systems to Cognitive Vision Systems: A Review

Publication

T. Souza
C. De
C. Sanin
E. Szczerbicki

- Year 2018

Computer vision research and applications have their origins in the 1960s. Limitations in computational resources inherent to that time, among other reasons, caused research to move away from artificial intelligence and generic recognition goals to accomplish simple tasks for constrained scenarios. In the past decades, the development in machine learning techniques has contributed to noteworthy progress in vision systems. However,...

Full text available for download in the portal

The data exchange between smart glasses and healthcare information systems using the HL7 FHIR standard

Publication

J. Rumiński
A. Bujnowski
T. Kocejko
A. Andrushevich
M. Biallas
R. Kistler

- Year 2016

In this study we evaluated system architecture for the use of smart glasses as a viewer of information, as a source of medical data (vital sign measurements: temperature, pulse rate, and respiration rate), and as a filter of healthcare information. All activities were based on patient/device identification procedures using graphical markers or features based on visual appearance. The architecture and particular use cases were implemented...

Full text available for download from an external service

Towards More Realistic Probabilistic Models for Data Structures: The External Path Length in Tries under the Markov Model

Publication

K. Leckey
R. Neininger
W. Szpankowski

- Year 2013

Tries are among the most versatile and widely used data structures on words. They are pertinent to the (internal) structure of (stored) words and several splitting procedures used in diverse contexts ranging from document taxonomy to IP addresses lookup, from data compression (i.e., Lempel-Ziv'77 scheme) to dynamic hashing, from partial-match queries to speech recognition, from leader election algorithms to distributed hashing...

Multimodal human-computer interfaces based on advanced video and audio analysis

Publication

- Year 2013

Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart...

Full text available for download from an external service

Video content analysis in the urban area telemonitoring system

Publication

- Year 2010

The task of constant monitoring of video streams from a large number of cameras and reviewing the recordings in order to find a specified event requires a considerable amount of time and effort from the system operators and it is prone to errors. A solution to this problem is an automatic system for constant analysis of camera images being able to raise an alarm if a predefined event is detected. The chapter presents various aspects...

Full text available for download from an external service

Computer-assisted pronunciation training—Speech synthesis is almost all you need

Publication

D. Korzekwa
J. Lorenzo-Trueba
T. Drugman
B. Kostek

- SPEECH COMMUNICATION - Year 2022

The research community has long studied computer-assisted pronunciation training (CAPT) methods in non-native speech. Researchers focused on studying various model architectures, such as Bayesian networks and deep learning methods, as well as on the analysis of different representations of the speech signal. Despite significant progress in recent years, existing CAPT methods are not able to detect pronunciation errors with high...

Full text available for download in the portal

Network oscillations modulate interictal epileptiform spike rate during human memory

Publication

J. Matsumoto
M. Stead
M. T. Kucewicz
A. Matsumoto
P. Peters
B. Brinkmann
J. C. Danstrom
S. Goerss
W. Marsh
F. Meyer
G. Worrell

- Brain: A Journal of Neurology - Year 2013

Eleven patients being evaluated with intracranial electroencephalography for medically resistant temporal lobe epilepsy participated in a visual recognition memory task. Interictal epileptiform spikes were manually marked and their rate of occurrence compared between baseline and three 2 s periods spanning a 6 s viewing period. During successful, but not unsuccessful, encoding of the images there was a significant reduction in...

Full text available for download from an external service

Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement

Publication

G. Korvel
K. Kąkol
O. Kurasova
B. Kostek

- IEEE Access - Year 2020

The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech...

Full text available for download in the portal

Convolutional Neural Networks for C. Elegans Muscle Age Classification Using Only Self-Learned Features

Publication

- Journal of Telecommunications and Information Technology - Year 2022

Nematodes Caenorhabditis elegans (C. elegans) have been used as model organisms in a wide variety of biological studies, especially those intended to obtain a better understanding of aging and age-associated diseases. This paper focuses on automating the analysis of C. elegans imagery to classify the muscle age of nematodes based on the known and well established IICBU dataset. Unlike many modern classification methods, the proposed...

Full text available for download in the portal

“Shadow” vs. “Phase 3D” method within endoscopic examinations of marine engines

Publication

- Combustion Engines - Year 2013

A visual investigation of the surfaces forming the internal working spaces of marine combustion engines by means of specialized view-finders, so-called endoscopes, is at present almost a basic method of technical diagnostics. The surface structure of the constructional material is visible during investigations as if through a magnifying glass (usually with a precisely determined magnification), which makes possible a detection, recognition...

Full text available for download in the portal

Speech Intelligibility Measurements in Auditorium

Publication

K. Leo

- ACTA PHYSICA POLONICA A - Year 2010

Speech intelligibility was measured in Auditorium Novum at the Technical University of Gdansk (seating capacity 408, volume 3300 m³). Articulation tests were conducted; STI and Early Decay Time (EDT) coefficients were measured. The negative contribution of noise to speech intelligibility was taken into account. Subjective measurements and objective tests reveal high speech intelligibility at most seats in the auditorium. Correlation was found between...

Full text available for download in the portal

Transient detection for speech coding applications

Publication

- International Journal of Computer Science and Network Security - Year 2006

Signal quality in speech codecs may be improved by selecting transients from speech signal and encoding them using a suitable method. This paper presents an algorithm for transient detection in speech signal. This algorithm operates in several frequency bands. Transient detection functions are calculated from energy measured in short frames of the signal. The final selection of transient frames is based on results of detection...

Full text available for download from an external service

Visual Dimensions of Modeling Languages in Interdisciplinary Perspective

Publication

A. Bobkowska

- Year 2013

The usability of visual modeling languages depends on the notation. A notation can be viewed as a set of visual components that affect the human eye and the human brain in a particular way. The paper presents an interdisciplinary analysis carried out to better understand the visual dimensions of modeling languages. The visual dimensions derive from theories describing visual perception, data visualization, and cognitive representations...

Full text available for download from an external service

Improving the quality of speech in the conditions of noise and interference

Publication

B. Kostek
K. Kąkol

- Journal of the Acoustical Society of America - Year 2018

The aim of the work is to present a method of intelligent modification of the speech signal with speech features expressed in noise, based on the Lombard effect. The recordings utilized sets of words and sentences as well as disturbing signals, i.e., pink noise and the so-called babble speech. Noise signal, calibrated to various levels at the speaker's ears, was played over two loudspeakers located 2 m away from the speaker. In...

Full text available for download from an external service

Applying the Lombard Effect to Speech-in-Noise Communication

Publication

G. Korvel
K. Kąkol
P. Treigys
B. Kostek

- Electronics - Year 2023

This study explored how the Lombard effect, a natural or artificial increase in speech loudness in noisy environments, can improve speech-in-noise communication. This study consisted of several experiments that measured the impact of different types of noise on synthesizing the Lombard effect. The main steps were as follows: first, a dataset of speech samples with and without the Lombard effect was collected in a controlled setting;...

Full text available for download in the portal

Constructing a Dataset of Speech Recordings with Lombard Effect

Publication

D. Weber
S. Zaporowski
D. Korzekwa

- Year 2020

The purpose of the recordings was to create a speech corpus based on the ISLE dataset, extended with video and Lombard speech. Selected from a set of 165 sentences, 10, evaluated as having the highest possibility to occur in the context of the Lombard effect, were repeated in the presence of the so-called babble speech to obtain Lombard speech features. Altogether, 15 speakers were recorded, and speech parameters were...

Improved method for real-time speech stretching

Publication

- Year 2012

An algorithm for real-time speech stretching is presented. It was designed to modify the input signal depending on its content and on its relation to the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, and stuttering detection, and a SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal...

Full text available for download from an external service

Objectivization of Audio-Visual Correlation analysis

Publication

- Archives of Acoustics - Year 2012

Simultaneous perception of audio and visual stimuli often causes the concealment or misrepresentation of information actually contained in these stimuli. Such effects are called the "image proximity effect" or the "ventriloquism effect" in the literature. Until recently, most research carried out to understand their nature was based on subjective assessments. The authors of this paper propose a methodology based on both subjective...

Full text available for download in the portal

Real-time speech-rate modification experiments

Publication

- Year 2010

An algorithm designed for real-time speech time scale modification (stretching) is proposed, combining a typical synchronous overlap-and-add based time scale modification algorithm with signal redundancy detection algorithms that allow parts of the speech signal to be removed and replaced with stretched speech signal fragments. The effectiveness of the signal processing algorithms is examined experimentally together...

Full text available for download from an external service

Improving Objective Speech Quality Indicators in Noise Conditions

Publication

K. Kąkol
G. Korvel
B. Kostek

- Year 2020

This work aims at modifying speech signal samples and testing them with objective speech quality indicators after mixing the original signals with noise or with an interfering signal. Modifications that are applied to the signal are related to the Lombard speech characteristics, i.e., pitch shifting, utterance duration changes, vocal tract scaling, manipulation of formants. A set of words and sentences in Polish, recorded in silence,...

Full text available for download from an external service

Detecting Lombard Speech Using Deep Learning Approach

Publication

K. Kąkol
G. Korvel
G. Tamulevicius
B. Kostek

- SENSORS - Year 2023

Robust detection of Lombard speech in noise is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the background concerning the Lombard effect. Then, assumptions of the work performed for Lombard speech detection are outlined. The framework proposed combines convolutional neural networks...

Full text available for download in the portal

Speech synthesis controlled by eye gazing

Publication

- Year 2010

A method of communication based on eye gaze control is presented. Investigations of gaze tracking have been carried out in various application contexts. The solution proposed in the paper could be referred to as "talking by eyes", providing an innovative approach in the domain of speech synthesis. The application proposed is dedicated to disabled people, especially to persons with the so-called locked-in syndrome who cannot...

Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions

Publication

- SENSORS - Year 2021

The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...

Full text available for download in the portal

Time-domain prosodic modifications for text-to-speech synthesizer

Publication

- Year 2010

An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.

A Method of Real-Time Non-uniform Speech Stretching

Publication

- Year 2012

A method developed for real-time non-uniform speech stretching is presented. The proposed solution is based on the well-known SOLA algorithm (Synchronous Overlap and Add). Non-uniform time-scale modification is achieved by the adjustment of time scaling factor values in accordance with the signal content. Depending on the speech unit (vowels/consonants), instantaneous rate of speech (ROS), and speech signal presence, values of the scaling factor...

Full text available for download from an external service

Visual TreeCmp : Comprehensive Comparison of Phylogenetic Trees on the Web

Publication

T. Goluch
D. Bogdanowicz
K. Giaro

- Methods in Ecology and Evolution - Year 2020

1. We present Visual TreeCmp—a package of applications for comparing phylogenetic tree sets. 2. Visual TreeCmp includes a graphical web interface allowing the visualization of compared trees and command line application extended by comparison methods recently proposed in the literature. 3. The phylogenetic tree similarity analysis in Visual TreeCmp can be performed using eighteen metrics, of which 11 are dedicated to rooted trees...

Full text available for download in the portal

A new method of audio-visual correlation analysis

Publication

- Year 2009

This paper presents a new methodology for conducting audio-visual correlation analysis employing a gaze tracking system. The interaction between two perceptual modalities, seeing and hearing, and their mutual reinforcement in a complex relationship have been the subject of many research studies. An earlier stage of the experiments carried out at the Multimedia Systems Department (MSD) showed that there exists a relationship between...

Full text available for download from an external service
