Search results for: SPEECH RECOGNITION SYSTEMS - MOST Wiedzy

  • Communication Platform for Evaluation of Transmitted Speech Quality

    The design and implementation of a voice communication system are described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmission of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessment by listening tests or objective evaluation employing...

    Full text available for download on the portal

  • Transfer learning in imagined speech EEG-based BCIs

    Publication

    - Biomedical Signal Processing and Control - Year 2019

    Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems whose aim is to provide a communication channel between any person and a computer. They were initially proposed to aid people with disabilities, but wider applications have now been proposed. These devices make it possible to send messages or to control devices using brain signals. There are different neuro-paradigms which evoke brain signals of interest...

    Full text available for download on the portal

  • Scoreboard Architectural Pattern and Integration of Emotion Recognition Results

    Publication

    This paper proposes a new design pattern, named Scoreboard, dedicated to applications solving complex, multi-stage, non-deterministic problems. The pattern provides a computational framework for the design and implementation of systems that integrate a large number of diverse specialized modules that may vary in accuracy, solution level, and modality. The Scoreboard is an extension of the Blackboard design pattern and comes under...

    Full text available for download on the portal

  • Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency

    Publication

    - Year 2007

    In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). A new iterative algorithm for analysis of the ICF of a speech signal is presented. The results obtained are compared with commonly used methods to demonstrate its accuracy and the connection between ICF and pitch, particularly for narrowband-filtered speech signals.

  • Automated detection of pronunciation errors in non-native English speech employing deep learning

    Publication

    - Year 2023

    Despite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...

    Full text available for download on the portal

  • Emotion Recognition Using Physiological Signals

    Publication

    - Year 2015

    In this paper the problem of emotion recognition using physiological signals is presented. First, the problems with the acquisition of physiological signals related to specific human emotions are described. It is not a trivial problem to elicit real emotions and to choose stimuli that always, and for all people, elicit the same emotion. Different kinds of physiological signals for emotion recognition are also considered. A set of...

    Full text available for download from an external service

  • Facial emotion recognition using depth data

    Publication

    - Year 2015

    In this paper an original approach is presented for facial expression and emotion recognition based only on the depth channel from the Microsoft Kinect sensor. The emotional user model contains nine emotions including the neutral one. The proposed recognition algorithm uses local movement detection within the face area in order to recognize the actual facial expression. This approach has been validated on Facial Expressions and Emotions Database...

    Full text available for download from an external service

  • Emotion recognition and its application in software engineering

    In this paper a novel application of multimodal emotion recognition algorithms in software engineering is described. Several application scenarios are proposed concerning program usability testing and software process improvement. Also a set of emotional states relevant in that application area is identified. The multimodal emotion recognition method that integrates video and depth channels, physiological signals and input devices...

    Full text available for download from an external service

  • Dependable Integration of Medical Image Recognition Components

    Computer-driven medical image recognition may support medical doctors in the diagnosis process, but requires high dependability considering potential consequences of incorrect results. The paper presents a system that improves dependability of medical image recognition by integration of results from redundant components. The components implement alternative recognition algorithms of diseases in the field of gastrointestinal endoscopy....

  • Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech

    Publication
    • D. Korzekwa
    • J. Lorenzo-Trueba
    • T. Drugman
    • S. Calamaro
    • B. Kostek

    - Year 2021

    We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

    Full text available for download on the portal

  • Local Texture Pattern Selection for Efficient Face Recognition and Tracking

    This paper describes research aimed at finding the optimal configuration of a face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of a face tracking system developed for an interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...

    Full text available for download from an external service

  • Automatic Singing Voice Recognition Employing Neural Networks and Rough Sets

    Publication

    The aim of the research is the automatic recognition of singing voices in terms of voice type and technical quality of singing. The article describes the voice database that was created, which contains voice samples of professional and amateur singers. Parameters defined on the basis of biomechanical phenomena in the vocal organ during singing are then described. Based on the resulting parameter matrices, automatic classifiers were trained and compared...

  • Michał Tomasz Kucewicz, PhD

    Michał Kucewicz was born in 1986 in Gdańsk. In 2005 he completed the International Baccalaureate programme in Topolowka (III High School in Gdańsk). Thanks to the G. D. Fahrenheit scholarship, he moved to the United Kingdom to study neuroscience. He received his Bachelor’s and Master’s degrees from Cambridge University, and his doctoral degree from the University of Bristol, specializing in electrophysiology of memory and cognitive...

  • Guido: a musical score recognition system

    Publication

    - Year 2007

    This paper presents Guido, an optical music recognition system that can automatically recognize the main musical symbols of music scores that were scanned or photographed with a digital camera. The application is based on an object model of musical notation and uses a linguistic approach for symbol interpretation and error correction. The system offers a music editor with partially automatic error correction.

  • Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich [Hate speech and the liability of Internet service providers in the case law of European courts]

    Publication

    - Year 2015

    The article analyses the phenomenon of hate speech on the Internet, contrasted with the problem of the responsibility of Internet Service Providers for such abuses of freedom of expression. The text provides an analysis of the jurisprudence of two European courts. On the one hand it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...

  • Objectivization of phonological evaluation of speech elements by means of audio parametrization

    This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...

  • Anion recognition by N,N'-diarylalkanediamides

    Publication

    The preparation of N,N'-diarylalkanediamides from the respective aliphatic dicarboxylic acids and 4-nitroaniline via microwave-promoted reactions is presented. The most positive effect of microwave irradiation was observed for N,N'-bis(4-nitrophenyl)butanediamide. Anion binding studies on the obtained diamides were carried out in DMSO and acetonitrile using UV-vis and 1H NMR spectroscopy. A mechanism for selective fluoride recognition...

    Full text available for download from an external service

  • Robust and Efficient Machine Learning Algorithms for Visual Recognition

    Publication

    - Year 2022

    In visual recognition, the task is to identify and localize all objects of interest in the input image. With the ubiquitous presence of visual data in modern days, the role of object recognition algorithms is becoming more significant than ever and ranges from autonomous driving to computer-aided diagnosis in medicine. Current models for visual recognition are dominated by models based on Convolutional Neural Networks (CNNs), which...

    Full text available for download on the portal

  • Piotr Szczuko, DSc, PhD, Eng.

    Piotr Szczuko, DSc, PhD, Eng., graduated in 2002 from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology (Politechnika Gdańska) with a Master of Science in engineering degree. His diploma thesis concerned the study of phenomena in the simultaneous perception of digital images and surround sound. In 2008 he defended his doctoral dissertation entitled "Zastosowanie reguł rozmytych w komputerowej animacji postaci" ("Application of fuzzy rules in computer character animation"), for which he received the award of the President of the Council...

  • TELECOMMUNICATION SYSTEMS

    Journals

    ISSN: 1018-4864, eISSN: 1572-9451

  • Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System

    The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM)) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

    Full text available for download from an external service

  • AN ALGORITHM FOR PORTAL HYPERTENSIVE GASTROPATHY RECOGNITION ON THE ENDOSCOPIC RECORDINGS

    Publication

    Symptoms of portal hypertensive gastropathy (PHG) can be recognized by analysing endoscopic recordings, but manual analysis by a physician may take a long time. This increases the probability of missing some symptoms, and automated methods may be applied to prevent that. In this paper a novel hybrid algorithm for recognition of the early stage of portal hypertensive gastropathy is proposed. First, image preprocessing is described....

  • Human-computer interactions in speech therapy using a blowing interface

    Publication

    In this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy children will find it easier to play attractive therapeutic games than to perform repetitive and tedious,...

    Full text available for download from an external service

  • Accelerometer signal pre-processing influence on human activity recognition

    A study of the influence of data pre-processing on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy.

  • Speech and Drama

    Journals

    ISSN: 0038-7142

  • LANGUAGE AND SPEECH

    Journals

    ISSN: 0023-8309, eISSN: 1756-6053

  • A review of emotion recognition methods based on keystroke dynamics and mouse movements

    Publication

    - Year 2013

    The paper describes an approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented, focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover, the advantages and examples of combining standard...

    Full text available for download from an external service

  • Bimodal Emotion Recognition Based on Vocal and Facial Features

    Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...

    Full text available for download on the portal

  • ALOFON corpus

    The ALOFON corpus is one of the multimodal databases of word recordings in English, available at http://www.modality-corpus.org/. The ALOFON corpus is oriented towards the recording of the speech equivalence variants. For this purpose, a total of 7 people who are or speak English with native speaker fluency and a variety of Standard Southern British...

  • Music Genre Recognition in the Rough Set-Based Environment

    Publication

    - Year 2015

    The aim of this paper is to investigate music genre recognition in the rough set-based environment. Experiments involve a parameterized music database containing 1100 music excerpts. The database is divided into 11 classes corresponding to music genres. Tests are conducted using the Rough Set Exploration System (RSES), a toolset for analyzing data with the use of methods based on the rough set theory. Classification effectiveness...

    Full text available for download from an external service

  • Emotion Recognition from Physiological Channels Using Graph Neural Network

    In recent years, a number of new research papers have emerged on the application of neural networks in affective computing. One of the newest trends observed is the utilization of graph neural networks (GNNs) to recognize emotions. The study presented in the paper follows this trend. Within the work, GraphSleepNet (a GNN for classifying the stages of sleep) was adjusted for emotion recognition and validated for this purpose. The...

    Full text available for download on the portal

  • Limitations of Emotion Recognition from Facial Expressions in e-Learning Context

    Publication

    The paper concerns automatic emotion recognition technology applied in an e-learning environment. During a study of the e-learning process the authors observed facial expressions via multiple video cameras. Preliminary analysis of the facial expressions using automatic emotion recognition tools revealed several unexpected results, including unavailability of recognition due to face coverage and significant inconsistency...

    Full text available for download from an external service

  • Estimation of the short-term predictor parameters of speech under noisy conditions

    Publication

    Full text available for download from an external service

  • Emotion Recognition Based on Facial Expressions of Gamers

    Publication

    This article presents an approach to emotion recognition based on facial expressions of gamers. Using certain methods, crucial features of the analysed face, such as eyebrow shape and the width and height of the eyes and mouth, were extracted. Afterwards, a group of artificial intelligence methods was applied to classify a given feature set as one of the following emotions: happiness, sadness, anger and fear. The approach presented in this...

  • Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning

    Publication
    • K. Kąkol

    - Year 2023

    The Lombard effect is a phenomenon that results in improved speech intelligibility when speech is produced in noise. The many distinctive features of Lombard speech are recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real time, as measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...

    Full text available for download on the portal

  • Adversarial attack algorithm for traffic sign recognition

    Publication

    - MULTIMEDIA TOOLS AND APPLICATIONS - Year 2022

    Deep learning suffers from the threat of adversarial attacks, and defense methods against them have become a research hotspot. Among all applications of deep learning, intelligent driving is an important and promising one, yet it also faces a serious threat from adversarial attacks. To address adversarial attacks, this paper takes traffic sign recognition as a typical object, since it is a core function of intelligent driving. Considering...

    Full text available for download from an external service

  • Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding

    Publication

    - Year 2001

    Full text available for download from an external service

  • Emotion Recognition

    Research data
    open access - series: Person A

    The films presented here were recorded using a Phantom Miro high-speed camera. To play a movie, you need special software, which can be downloaded from the website https://www.phantomhighspeed.com/resourcesandsupport/phantomresources/pccsoftware. The details of the movie are available after starting the movie in the viewer, in the description...

  • Contextual Knowledge to Enhance Workplace Hazard Recognition and Interpretation in a Cognitive Vision Platform

    Publication

    - Year 2018

    The combination of vision and sensor data, together with the resulting necessity for formal representations, forms a central component of an autonomous Cyber Physical System for detection and tracking of laborers in workplace environments. This system must be adaptable and perceive the environment as automatically as possible, performing in a variety of plants and scenes without the necessity of recoding the application for each...

    Full text available for download on the portal

  • Topology recognition and leader election in colored networks

    Publication

    Topology recognition and leader election are fundamental tasks in distributed computing in networks. The first of them requires each node to find a labeled isomorphic copy of the network, while the result of the second one consists in a single node adopting the label 1 (leader), with all other nodes adopting the label 0 and learning a path to the leader. We consider both these problems in networks whose nodes are equipped with...

    Full text available for download on the portal

  • Real-time working gas recognition system based on the array of semiconductor gas sensors and portable computer Raspberry PI

    Gas-analyzing systems based on an array of partially selective gas sensors and pattern-recognition techniques are a potentially fast and low-cost alternative to other devices, such as gas analysers. They make it possible to recognize the type and concentration of measured volatile compounds in their working environment. In this work we present the implementation of a gas recognition system, in which the signals from an...

  • Gesture recognition framework for multimedia content viewer controlling

    Publication

    In the paper a system for controlling a multimedia content viewer by hand gestures is presented. First, selected methods used for gesture recognition are described. Two different application cases of the system, i.e. for multimedia presentation purposes and for multimedia content viewing, are outlined. Moreover, a proposal for improving the system by combining these approaches is also given. The system work cycle is reviewed. The...

  • From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition

    Publication

    Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

    Full text available for download on the portal

  • Deep Learning: A Case Study for Image Recognition Using Transfer Learning

    Publication

    - Year 2021

    Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNNs are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

    Full text available for download from an external service

  • Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System

    Publication
    • P. Falkowski-Gilski
    • G. Debita
    • M. Habrych
    • B. Miedziński
    • P. Jedlikowski
    • B. Polnik
    • J. Wandzio
    • X. Wang

    - Year 2020

    The broadband over power line – power line communication (BPL-PLC) cable is resistant to electricity stoppage and partial damage of phase conductors. It maintains continuity of transmission in case of an emergency. These features make it an ideal solution for delivering data, e.g. in an underground mine environment, especially clear and easily understandable voice messages. This paper describes a subjective quality evaluation of...

    Full text available for download from an external service

  • Comparison of edge detection algorithms for electric wire recognition

    Publication

    Edge detection is the preliminary step in image processing for object detection and recognition procedures. It makes it possible to remove useless information and reduce the amount of data before further analysis. The paper contains a comparison of edge detection algorithms optimized for detection of horizontal edges. For comparison purposes the algorithms were implemented in the developed application dedicated to detection of electric line...

    Full text available for download from an external service

  • Optical recognition elements: macrocyclic imidazole chromoionophores entrapped in silica xerogel

    Publication

    Materials containing new chromoionophores consisting of crown residue and azole moiety as parts of macrocycles were encapsulated by the sol-gel procedure in silica xerogel matrices and proposed as chemical recognition elements, especially for such metal ions as Li+, Cs+ and Cu2+. The action of these recognition elements is in principle based on changes of reflectance. The recognition elements containing 21-membered chromogenic...