Wyniki wyszukiwania dla: viseme · parameterization of mouth region · support vector machine · hidden markov model · pattern recognition · audiovisual speech recognition - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: viseme · parameterization of mouth region · support vector machine · hidden markov model · pattern recognition · audiovisual speech recognition

Wyniki wyszukiwania dla: viseme · parameterization of mouth region · support vector machine · hidden markov model · pattern recognition · audiovisual speech recognition

  • Music Genre Recognition in the Rough Set-Based Environment

    Publikacja

    - Rok 2015

    The aim of this paper is to investigate music genre recognition in the rough set-based environment. Experiments involve a parameterized music data-base containing 1100 music excerpts. The database is divided into 11 classes cor-responding to music genres. Tests are conducted using the Rough Set Exploration System (RSES), a toolset for analyzing data with the use of methods based on the rough set theory. Classification effectiveness...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Bożena Kostek prof. dr hab. inż.

  • An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics

    Publikacja

    - Rok 2019

    The speech with the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create a mathematical model that allows for retaining the Lombard effect. These models could be used as a basis of a formant speech synthesizer. The proposed models are based on dividing...

    Pełny tekst do pobrania w portalu

  • Limitations of Emotion Recognition from Facial Expressions in e-Learning Context

    Publikacja

    The paper concerns technology of automatic emotion recognition applied in e-learning environment. During a study of e-learning process the authors applied facial expressions observation via multiple video cameras. Preliminary analysis of the facial expressions using automatic emotion recognition tools revealed several unexpected results, including unavailability of recognition due to face coverage and significant inconsistency...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Topology recognition and leader election in colored networks

    Publikacja

    Topology recognition and leader election are fundamental tasks in distributed computing in networks. The first of them requires each node to find a labeled isomorphic copy of the network, while the result of the second one consists in a single node adopting the label 1 (leader), with all other nodes adopting the label 0 and learning a path to the leader. We consider both these problems in networks whose nodes are equipped with...

    Pełny tekst do pobrania w portalu

  • Molecular Recognition in Complexes of TRF Proteins with Telomeric DNA

    Publikacja

    Telomeres are specialized nucleoprotein assemblies that protect the ends of linear chromosomes. In humans and many other species, telomeres consist of tandem TTAGGG repeats bound by a protein complex known as shelterin that remodels telomeric DNA into a protective loop structure and regulates telomere homeostasis. Shelterin recognizes telomeric repeats through its two major components known as Telomere Repeat-Binding Factors, TRF1...

    Pełny tekst do pobrania w portalu

  • Hidden Markov Models for Visual Processing of Marketing Leaflets

    Publikacja

    - Rok 2021

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Hand gesture recognition supported by fuzzy rules and Kalman filters

    The paper presents a system based on camera and multimediaprojector enabling a user to control computer applications by dynamic hand gestures. Gesture recognition methodology based on representing hand movement trajectory by motion vectors analysed using fuzzy rule-based inference is first given. For effective hand position tracking Kalman filters are employed. The system engineered is developed using J2SE and C++/OpenCV technology....

  • Automatic recognition of therapy progress among children with autism

    Publikacja

    - Scientific Reports - Rok 2017

    The article presents a research study on recognizing therapy progress among children with autism spectrum disorder. The progress is recognized on the basis of behavioural data gathered via five specially designed tablet games. Over 180 distinct parameters are calculated on the basis of raw data delivered via the game flow and tablet sensors - i.e. touch screen, accelerometer and gyroscope. The results obtained confirm the possibility...

    Pełny tekst do pobrania w portalu

  • Emotion Recognition

    Dane Badawcze
    open access - seria: Person A

    The films presented here were recorded using so-called high-speed camera Phantom Miro. To play the movie  You need the special software which can be downloaded from the web site https://www.phantomhighspeed.com/resourcesandsupport/phantomresources/pccsoftware the details of the movie are available after starting the movie in the viewer in the description...

  • Emotion Recognition

    Dane Badawcze
    open access - seria: Person A

    The films presented here were recorded using so-called high-speed camera Phantom Miro. To play the movie  You need the special software which can be downloaded from the web site https://www.phantomhighspeed.com/resourcesandsupport/phantomresources/pccsoftware the details of the movie are available after starting the movie in the viewer in the description...

  • Andrzej Czyżewski prof. dr hab. inż.

    Prof. zw. dr hab. inż. Andrzej Czyżewski jest absolwentem Wydziału Elektroniki PG (studia magisterskie ukończył w 1982 r.). Pracę doktorską na temat związany z dźwiękiem cyfrowym obronił z wyróżnieniem na Wydziale Elektroniki PG w roku 1987. W 1992 r. przedstawił rozprawę habilitacyjną pt.: „Cyfrowe operacje na sygnałach fonicznych”. Jego kolokwium habilitacyjne zostało przyjęte jednomyślnie w czerwcu 1992 r. w Akademii Górniczo-Hutniczej...

  • Contextual Knowledge to Enhance Workplace Hazard Recognition and Interpretation in a Cognitive Vision Platform

    Publikacja

    - Rok 2018

    The combination of vision and sensor data together with the resulting necessity for formal representations builds a central component of an autonomous Cyber Physical System for detection and tracking of laborers in workplaces environments. This system must be adaptable and perceive the environment as automatically as possible, performing in a variety of plants and scenes without the necessity of recoding the application for each...

    Pełny tekst do pobrania w portalu

  • Introduction to the special issue on machine learning in acoustics

    Publikacja
    • Z. Michalopoulou
    • P. Gerstoft
    • B. Kostek
    • M. A. Roch

    - Journal of the Acoustical Society of America - Rok 2021

    When we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...

    Pełny tekst do pobrania w portalu

  • Gesture recognition framework for multimedia content viewer controlling

    Publikacja

    In the paper a system for controlling a multimedia content viewer by hand gestures is presented. First, selected methods used for gesture recognition are described. Two different application cases of the system, i.e. for multimedia presentation purposes and for multimedia content viewing are outlined. Moreover, a proposal of improvement of the system combining these approaches is also given. The system work cycle is reviewed. The...

  • A semi-Markov model of fuel combustion process in a Diesel engine

    Publikacja

    W artykule przedstawiono czterostanowy model procesu spalania w przestrzeniach roboczych (cylindrach) silników o zapłonie samoczynnym w formie procesu semimarkowskiego, dyskretnego w stanach i ciągłego w czasie. Wartościami tego procesu są stany odpowiadające powszechnie akceptowanym rodzajom spalania w tego rodzaju silnikach a mianowicie takie stany procesu jak: spalanie pełne (całkowite i zupełne), spalanie niezupełne, spalanie...

    Pełny tekst do pobrania w portalu

  • Automatic singing quality recognition employing artificial neural networks

    Publikacja

    Celem artykułu jest udowodnienie możliwości automatycznej oceny jakości technicznej głosów śpiewaczych. Pokrótce zaprezentowano w nim stworzoną bazę danych głosów śpiewaczych oraz zaimplementowane parametry. Przy pomocy sztucznych sieci neuronowych zaprojektowano system decyzyjny, który oceniono w pięciostopniowej skali jakość techniczną głosu. Przy pomocy metod statystycznych udowodniono, że wyniki generowane przez ten system...

    Pełny tekst do pobrania w portalu

  • Vocalic Segments Classification Assisted by Mouth Motion Capture

    Visual features convey important information for automatic speech recognition (ASR), especially in noisy environment. The purpose of this study is to evaluate to what extent visual data (i.e. lip reading) can enhance recognition accuracy in the multi-modal approach. For that purpose motion capture markers were placed on speakers' faces to obtain lips tracking data during speaking. Different parameterizations strategies were tested...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Real-time working gas recognition system based on the array of semiconductor gas sensors and portable computer Raspberry PI

    The gas-analyzing systems based on the array of partially selective gas sensors and pattern-recognition techniques are potentially fast and low-cost alternative for other devices, like gas analysers. They give the possibility of recognition the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of gas recognition system, in which the signals from an...

  • Comparison of edge detection algorithms for electric wire recognition

    Publikacja

    Edge detection is the preliminary step in image processing for object detection and recognition procedure. It allows to remove useless information and reduce amount of data before further analysis. The paper contains the comparison of edge detection algorithms optimized for detection of horizontal edges. For comparison purposes the algorithms were implemented in the developed application dedicated to detection of electric line...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions

    Publikacja

    Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Optical recognition elements: macrocyclic imidazole chromoionophores entrapped in silica xerogel

    Publikacja

    Materials containing new chromoionophores consisting of crown residue and azole moiety as partsof macrocycles were encapsulated by the sol-gel procedure in silica xerogel matrices and proposed aschemical recognition elements especially for such metal ions as Li+, Cs+ and Cu2+. Action of these recognition elements is in principle based on changes of reflectance. The recognition elements containing 21-membered chromogenic...

  • Methodology and technology for the polymodal allophonic speech transcription

    A method for automatic audiovisual transcription of speech employing: acoustic and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Methodology and technology for the polymodal allophonic speech transcription

    A method for automatic audiovisual transcription of speech employing: acoustic, electromagnetical articulography and visual speech representations is developed. It adopts a combining of audio and visual modalities, which provide a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Digits Recognition with Quadrant Photodiode and Convolutional Neural Network

    Publikacja

    - Rok 2018

    In this paper we have investigated the capabilities of a quadrant photodiode based gesture sensor in the recognition of digits drawn in the air. The sensor consisting of 4 active elements, 4 LEDs and a pinhole was considered as input interface for both discrete and continuous gestures. Index finger and a round pointer were used as navigating mediums for the sensor. Experiments performed with 5 volunteers...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Bridging challenges of clinical decision support systems with a semantic approach. A case study on breast cancer

    Publikacja

    - PATTERN RECOGNITION LETTERS - Rok 2013

    The integration of Clinical Decision Support Systems (CDSS) in nowadays clinical environments has not been fully achieved yet. Although numerous approaches and technologies have been proposed since 1960, there are still open gaps that need to be bridged. In this work we present advances from the established state of the art, overcoming some of the most notorious reported difficulties in: (i) automating CDSS, (ii) clinical workflow...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors

    Publikacja

    In recent years, emotion recognition algorithms have achieved high efficiency, allowing the development of various affective and affect-aware applications. This advancement has taken place mainly in the environment of personal computers offering the appropriate hardware and sufficient power to process complex data from video, audio, and other channels. However, the increase in computing and communication capabilities of smartphones,...

    Pełny tekst do pobrania w portalu

  • System for automatic singing voice recognition

    W artykule przedstawiono system automatycznego rozpoznawania jakości i typu głosu śpiewaczego. Przedstawiono bazę danych oraz zaimplementowane parametry. Algorytmem decyzyjnym jest algorytm sztucznych sieci neuronowych. Wytrenowany system decyzyjny osiąga skuteczność ok. 90% w obydwu kategoriach rozpoznawania. Dodatkowo wykazano przy pomocy metod statystycznych, że wyniki działania systemu automatycznej oceny jakości technicznej...

  • Deep Learning: A Case Study for Image Recognition Using Transfer Learning

    Publikacja

    - Rok 2021

    Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • The Influence of Selecting Regions from Endoscopic Video Frames on The Efficiency of Large Bowel Disease Recognition Algorithms

    The article presents our research in the field of the automatic diagnosis of large intestine diseases on endoscopic video. It focuses on the methods of selecting regions of interest from endoscopic video frames for further analysis by specialized disease recognition algorithms. Four methods of selecting regions of interest have been discussed: a. trivial, b. with the deletion of characteristic, endoscope specific additions to the...

  • Acquisition and indexing of RGB-D recordings for facial expressions and emotion recognition

    Publikacja

    In this paper KinectRecorder comprehensive tool is described which provides for convenient and fast acquisition, indexing and storing of RGB-D video streams from Microsoft Kinect sensor. The application is especially useful as a supporting tool for creation of fully indexed databases of facial expressions and emotions that can be further used for learning and testing of emotion recognition algorithms for affect-aware applications....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • On practical application of Shannon theory to character recognition and more

    Publikacja

    - Rok 2014

    Let us consider an optical character recognition system, which in particular can be used for identifying objects that were assigned strings of some length. The system is not perfect, for example, it sometimes recognizes wrongly the characters "Y" and "V". What is the largest set of strings of given length for the system under consideration, which can be mutually correctly recognized, and the corresponding objects correctly identified?...

  • Rotor-Flux Vector based Observer of Interior Permanent Synchronous Machine

    The sensorless control system of the interior permanent magnet machine is considered in this paper. The control system is based on classical linear controllers. In the machine, there occurs non-sinusoidal distribution of rotor flux together with the slot harmonics, which are treated as the control system disturbances. In this case, the classical observer structure in the (d-q) is unstable for the low range of rotor speed resulting...

    Pełny tekst do pobrania w portalu

  • Parameters optimization in medicine supporting image recognition algorithms

    Publikacja

    - Rok 2011

    In this paper, a procedure of automatic set up of image recognition algorithms' parameters is proposed, for the purpose of reducing the time needed for algorithms' development. The procedure is presented on two medicine supporting algorithms, performing bleeding detection in endoscopic images. Since the algorithms contain multiple parameters which must be specified, empirical testing is usually required to optimise the algorithm's...

  • Towards More Realistic Probabilistic Models for Data Structures: The External Path Length in Tries under the Markov Model

    Publikacja

    - Rok 2013

    Tries are among the most versatile and widely used data structures on words. They are pertinent to the (internal) structure of (stored) words and several splitting procedures used in diverse contexts ranging from document taxonomy to IP addresses lookup, from data compression (i.e., Lempel- Ziv'77 scheme) to dynamic hashing, from partial-match queries to speech recognition, from leader election algorithms to distributed hashing...

  • Accelerometer-based Human Activity Recognition and the Impact of the Sample Size

    Publikacja

    The presented study focused on the recognition of eight user activities (e.g. walking, lying, climbing stairs) basing on the measurements from an accelerometer embedded in a mobile device. It is assumed that the device is carried in a specific location of the user’s clothing. Three types of classifiers were tested on different sizes of the samples. The influence of the time window (the duration of a single trial) on selected activities...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Graph Representation Integrating Signals for Emotion Recognition and Analysis

    Data reusability is an important feature of current research, just in every field of science. Modern research in Affective Computing, often rely on datasets containing experiments-originated data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...

    Pełny tekst do pobrania w portalu

  • Comparison of selected off-the-shelf solutions for emotion recognition based on facial expressions

    The paper concerns accuracy of emotion recognition from facial expressions. As there are a couple of ready off-the-shelf solutions available in the market today, this study aims at practical evaluation of selected solutions in order to provide some insight into what potential buyers might expect. Two solutions were compared: FaceReader by Noldus and Xpress Engine by QuantumLab. The performed evaluation revealed that the recognition...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Automatic Singing Voice Recognition EmployingNeural Networks and Rough Sets

    Publikacja

    Celem badań jest automatyczne rozpoznawanie głosów śpiewaczych w kategorii rodzaju i jakości technicznej śpiewu. W artykule opisano stworzoną bazę danych głosów, która zawiera próbki głosu śpiewaków profesjonalnych i amatorskich. W dalszej części opisano parametry zdefiniowane w oparciu o zjawiska biomechaniczne w narządzie głosu podczas śpiewania. W oparciu o stworzone macierze parametrów wytrenowano i porównano automatyczne klasyfikatory...

  • International Journal of Signal Processing, Image Processing and Pattern Recognition

    Czasopisma

    ISSN: 2005-4254

  • Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions

    Publikacja

    - Rok 2018

    With the technology advancements in smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to involve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for speaker recognition task, which are...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Feasibility Study for Food Intake Tasks Recognition Based on Smart Glasses

    Publikacja

    - Journal of Medical Imaging and Health Informatics - Rok 2015

    In this exploratory study 13 adult test subjects have performed different food intake tasks while wearing a three axis accelerometer mounted at a temple of glasses. Two different algorithms for task recognition have been applied and compared. The retrospective data processing leads to better task recognition results when the frequency range of 50 Hz to 100 Hz is analysed within accelerometer signal recordings. A straightforward...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition

    Publikacja

    Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

    Pełny tekst do pobrania w portalu

  • Fuzzy rule-based dynamic gesture recognition employing camera & multimedia projector

    Publikacja

    - Rok 2010

    In the paper the system based on camera and multimedia projector enabling a user to control computer applications by dynamic hand gestures is presented. The main objective is to present the gesture recognition methodology which bases on representing hand movement trajectory by motion vectors analyzed using fuzzy rule-based inference. The approach was engineered in the system developed with J2SE and C++ / OpenCV technology. OpenCV...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor

    Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Micro-cracking pattern recognition of hybrid CNTs/GNPs cement pastes under three-point bending loading using acoustic emission technique

    Publikacja

    - Rok 2021

    The generation of microcracks has an important influence on the behaviour of concrete structures. In this study, the acoustic emission (AE) technique was used to investigate the fracture phenomena and micro-cracking behavior of hybrid carbon nanotubes (CNTs, the 1-D allotrope of carbon atoms) and graphene nanoplatelets (GNPs, 2D monolayer of sp2-hybridized carbon atoms), cement composites under three-point bending loading. In...

  • JOURNAL OF MOLECULAR RECOGNITION

    Czasopisma

    ISSN: 0952-3499 , eISSN: 1099-1352

  • Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

    Publikacja
    • D. Korzekwa
    • R. Barra-Chicote
    • B. Kostek
    • T. Drugman
    • M. Łajszczak

    - Rok 2019

    We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

    Pełny tekst do pobrania w portalu

  • Hidden Champions of Poland

    Publikacja
    • G. Leśniak- Łebkowska
    • M. Popowska
    • M. Godlewska
    • M. Łukasiewicz

    - Rok 2021

    Poland is a country with a strong entrepreneurial spirit. The hidden champions’ performance has been closely related to political and economic changes: the fall of communism in 1989 and the transition from a centrally planned economy to a market economy. This led to the emergence of small private businesses focused on the domestic market. The accession to the European Union in 2004 created new perspectives for the hidden champions’...

    Pełny tekst do pobrania w portalu

  • Role of cholesterol in substrate recognition by -secretase

    -Secretase is an enzyme known to cleave multiple substrates within their transmembrane domains, with the amyloid precursor protein of Alzheimer’s Disease among the most prominent examples. The activity of -secretase strictly depends on the membrane cholesterol content, yet the mechanistic role of cholesterol in the substrate binding and cleavage remains unclear. In this work, we used all-atom molecular dynamics simulations to examine...

    Pełny tekst do pobrania w portalu