Wyniki wyszukiwania dla: SPEECH RECOGNITION SYSTEMS

Wyniki wyszukiwania dla: SPEECH RECOGNITION SYSTEMS

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 7127

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

Communication Platform for Evaluation of Transmitted Speech Quality
Publikacja
- A. Ciarkowski
- A. Czyżewski
- Journal of Telecommunications and Information Technology - Rok 2011
A voice communication system designed and implemented is described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmitting of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessments by listening tests or, objective evaluation employing...

Pełny tekst do pobrania w portalu
Transfer learning in imagined speech EEG-based BCIs
Publikacja
- J. S. Garcia Salinas
- L. Villaseñor-Pineda
- C. A. Reyes-Garćia
- A. A. Torres-García
- Biomedical Signal Processing and Control - Rok 2019
The Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems which aim is to provide a communication channel to any person with a computer, initially it was proposed to aid people with disabilities, but actually wider applications have been proposed. These devices allow to send messages or to control devices using the brain signals. There are different neuro-paradigms which evoke brain signals of interest...

Pełny tekst do pobrania w portalu
Scoreboard Architectural Pattern and Integration of Emotion Recognition Results
Publikacja
- A. Landowska
- G. Brodny
- IEEE Access - Rok 2019
This paper proposes a new design pattern, named Scoreboard , dedicated for applications solving complex, multi-stage, non-deterministic problems. The pattern provides a computational framework for the design and implementation of systems that integrate a large number of diverse specialized modules that may vary in accuracy, solution level, and modality. The Scoreboard is an extension of Blackboard design pattern and comes under...

Pełny tekst do pobrania w portalu
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publikacja
- Elektronika : konstrukcje, technologie, zastosowania - Rok 2008
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publikacja
- T. Bandurski
- Ł. Hamerski
- M. Papaj
- A. Paruzel
- K. Świder
- Rok 2007
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Automated detection of pronunciation errors in non-native English speech employing deep learning
Publikacja
- D. Korzekwa
- Rok 2023
Despite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...

Pełny tekst do pobrania w portalu
Emotion Recognition Using Physiological Signals
Publikacja
- W. Szwoch
- Rok 2015
In this paper the problem of emotion recognition using physiological signals is presented. Firstly the problems with acquisition of physiological signals related to specific human emotions are described. It is not a trivial problem to elicit real emotions and to choose stimuli that always, and for all people, elicit the same emotion. Also different kinds of physiological signals for emotion recognition are considered. A set of...

Pełny tekst do pobrania w serwisie zewnętrznym
Facial emotion recognition using depth data
Publikacja
- M. Szwoch
- P. Pieniazek
- Rok 2015
In this paper an original approach is presented for facial expression and emotion recognition based only on depth channel from Microsoft Kinect sensor. The emotional user model contains nine emotions including the neutral one. The proposed recognition algorithm uses local movements detection within the face area in order to recognize actual facial expression. This approach has been validated on Facial Expressions and Emotions Database...

Pełny tekst do pobrania w serwisie zewnętrznym
Emotion recognition and its application in software engineering
Publikacja
- Rok 2013
In this paper a novel application of multimodal emotion recognition algorithms in software engineering is described. Several application scenarios are proposed concerning program usability testing and software process improvement. Also a set of emotional states relevant in that application area is identified. The multimodal emotion recognition method that integrates video and depth channels, physiological signals and input devices...

Pełny tekst do pobrania w serwisie zewnętrznym
Dependable Integration of Medical Image Recognition Components
Publikacja
- Rok 2012
Computer driven medical image recognition may support medical doctors in the diagnosis process, but requires high dependability considering potential consequences of incorrect results. The paper presentsa system that improves dependability of medical image recognition by integration of results from redundant components. The components implement alternative recognition algorithms of diseases in thefield of gastrointestinal endoscopy....
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publikacja
- D. Korzekwa
- J. Lorenzo-trueba
- T. Drugman
- S. Calamaro
- B. Kostek
- Rok 2021
We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

Pełny tekst do pobrania w portalu
Local Texture Pattern Selection for Efficient Face Recognition and Tracking
Publikacja
- M. Smiatacz
- J. Rumiński
- Advances in Intelligent Systems and Computing - Rok 2015
This paper describes the research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...

Pełny tekst do pobrania w serwisie zewnętrznym
Automatic Singing Voice Recognition EmployingNeural Networks and Rough Sets
Publikacja
- Rok 2008
Celem badań jest automatyczne rozpoznawanie głosów śpiewaczych w kategorii rodzaju i jakości technicznej śpiewu. W artykule opisano stworzoną bazę danych głosów, która zawiera próbki głosu śpiewaków profesjonalnych i amatorskich. W dalszej części opisano parametry zdefiniowane w oparciu o zjawiska biomechaniczne w narządzie głosu podczas śpiewania. W oparciu o stworzone macierze parametrów wytrenowano i porównano automatyczne klasyfikatory...
Michał Tomasz Kucewicz dr

Osoby

Katedra Systemów Multimedialnych

Michal Kucewicz was born in 1986 in Gdansk. In 2005 he completed International Baccalaureate programme in Topolowka (III High School in Gdańsk). Thanks to the G. D. Fahrenheit scholarship, he moved to the United Kingdom to study neuroscience. He received his Bachelor’s and Master’s degree from the Cambridge University, and his doctoral degree from the University of Bristol specializing in electrophysiology of memory and cognitive...
Guido: a musical score recognition system
Publikacja
- M. Szwoch
- Rok 2007
This paper presents an optical music recognition system Guido that can automatically recognize the main musical symbols of music scores that were scanned or taken by a digital camera. The application is based on object model of musical notation and uses linguistic approach for symbol interpretation and error correction. The system offers musical editor with a partially automatic error correction.
Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich
Publikacja
- K. Kowalik-Bańczyk
- Rok 2015
The article analyses the phenomenon of hate speech in the Internet contrasted with the problem of responsability of Internet Service Providers for cases of such abuses of freedom of expression. The text provides an analysis of jurisprudence of two European Courts. On the one hand it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...
Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publikacja
- Rok 2018
This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
Anion recognition by n,n'-diarylalkanediamides
Publikacja
- E. Wagner-Wysiecka
- N. Łukasik
- Rok 2012
The preparation of N,N'-diarylalkanediamides from respective aliphatic dicarboxylic acidesand 4-nitroaniline via microwave-promoted reactions is presented. The most positive effect of microwave irradiation was observed for N,N'-bis(4-nitrophenyl)butanediamide. Anion binding studies on the obtained diamides were carried out in DMSO and acetonitrile using UV-vis and 1H NMR spectroscopy. A mechanism for selective fluoride recognition...

Pełny tekst do pobrania w serwisie zewnętrznym
Robust and Efficient Machine Learning Algorithms for Visual Recognition
Publikacja
- S. Cygert
- Rok 2022
In visual recognition, the task is to identify and localize all objects of interest in the input image. With the ubiquitous presence of visual data in modern days, the role of object recognition algorithms is becoming more significant than ever and ranges from autonomous driving to computer-aided diagnosis in medicine. Current models for visual recognition are dominated by models based on Convolutional Neural Networks (CNNs), which...

Pełny tekst do pobrania w portalu
Piotr Szczuko dr hab. inż.

Osoby

Katedra Systemów Multimedialnych

Dr hab. inż. Piotr Szczuko w 2002 roku ukończył studia na Wydziale Elektroniki, Telekomunikacji i Informatyki Politechniki Gdańskiej zdobywając tytuł magistra inżyniera. Tematem pracy dyplomowej było badanie zjawisk jednoczesnej percepcji obrazu cyfrowego i dźwięku dookólnego. W roku 2008 obronił rozprawę doktorską zatytułowaną "Zastosowanie reguł rozmytych w komputerowej animacji postaci", za którą otrzymał nagrodę Prezesa Rady...
TELECOMMUNICATION SYSTEMS

Czasopisma

ISSN: 1018-4864 , eISSN: 1572-9451
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
Publikacja
- Advances in Intelligent Systems and Computing - Rok 2013
The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

Pełny tekst do pobrania w serwisie zewnętrznym
AN ALGORITHM FOR PORTAL HYPERTENSIVE GASTROPATHY RECOGNITION ON THE ENDOSCOPIC RECORDINGS
Publikacja
- Rok 2014
Symptoms recognition of portal hypertensive gastropathy (PHG) can be done by analysing endoscopic recordings, but manual analysis done by physician may take a long time. This increases probability of missing some symptoms and automated methods may be applied to prevent that. In this paper a novel hybrid algorithm for recognition of early stage of portal hypertensive gastropathy is proposed. First image preprocessing is described....
Human-computer interactions in speech therapy using a blowing interface
Publikacja
- Rok 2014
In this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy children will find easier to play attractive therapeutic games than to perform repetitive and tedious,...

Pełny tekst do pobrania w serwisie zewnętrznym
Accelerometer signal pre-processing influence on human activity recognition
Publikacja
- Rok 2009
A study of data pre-processing influence on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter-out the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy.
Speech and Drama

Czasopisma

ISSN: 0038-7142
LANGUAGE AND SPEECH

Czasopisma

ISSN: 0023-8309 , eISSN: 1756-6053
A review of emotion recognition methods based on keystroke dynamics and mouse movements
Publikacja
- A. Kołakowska
- Rok 2013
The paper describes the approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover the advantages and examples of combining standard...

Pełny tekst do pobrania w serwisie zewnętrznym
Bimodal Emotion Recognition Based on Vocal and Facial Features
Publikacja
- Rok 2023
Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...

Pełny tekst do pobrania w portalu
ALOFON corpus
Dane Badawcze
The ALOFON corpus is one of the multimodal database of word recordings in English, available at http://www.modality-corpus.org/. The ALOFON corpus is oriented towards the recording of the speech equivalence variants. For this purpose, a total of 7 people who are or speak English with native speaker fluency and a variety of Standard Southern British...
Music Genre Recognition in the Rough Set-Based Environment
Publikacja
- P. Hoffmann
- B. Kostek
- Rok 2015
The aim of this paper is to investigate music genre recognition in the rough set-based environment. Experiments involve a parameterized music data-base containing 1100 music excerpts. The database is divided into 11 classes cor-responding to music genres. Tests are conducted using the Rough Set Exploration System (RSES), a toolset for analyzing data with the use of methods based on the rough set theory. Classification effectiveness...

Pełny tekst do pobrania w serwisie zewnętrznym
Emotion Recognition from Physiological Channels Using Graph Neural Network
Publikacja
- SENSORS - Rok 2022
In recent years, a number of new research papers have emerged on the application of neural networks in affective computing. One of the newest trends observed is the utilization of graph neural networks (GNNs) to recognize emotions. The study presented in the paper follows this trend. Within the work, GraphSleepNet (a GNN for classifying the stages of sleep) was adjusted for emotion recognition and validated for this purpose. The...

Pełny tekst do pobrania w portalu
Limitations of Emotion Recognition from Facial Expressions in e-Learning Context
Publikacja
- Rok 2017
The paper concerns technology of automatic emotion recognition applied in e-learning environment. During a study of e-learning process the authors applied facial expressions observation via multiple video cameras. Preliminary analysis of the facial expressions using automatic emotion recognition tools revealed several unexpected results, including unavailability of recognition due to face coverage and significant inconsistency...

Pełny tekst do pobrania w serwisie zewnętrznym
Estimation of the short-term predictor parameters of speech under noisy conditions
Publikacja
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- IEEE Transactions on Audio Speech and Language Processing - Rok 2006
Pełny tekst do pobrania w serwisie zewnętrznym
Emotion Recognition Based on Facial Expressions of Gamers
Publikacja
- Rok 2012
This article presents an approach to emotion recognition based on facial expressions of gamers. With application of certain methods crucial features of an analysed face like eyebrows' shape, eyes and mouth width, height were extracted. Afterwards a group of artificial intelligence methods was applied to classify a given feature set as one of the following emotions: happiness, sadness, anger and fear.The approach presented in this...
Emotion Recognition Based on Facial Expressions of Gamers
Publikacja
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2014
This article presents an approach to emotion recognition based on facial expressions of gamers. With application of certain methods crucial features of an analyzed face like eyebrows' shape, eyes and mouth width, height were extracted. Afterwards a group of artificial intelligence methods was applied to classify a given feature set as one of the following emotions: happiness, sadness, anger and fear. The approach presented in this...
Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning
Publikacja
- K. Kąkol
- Rok 2023
The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...

Pełny tekst do pobrania w portalu
Adversarial attack algorithm for traffic sign recognition
Publikacja
- J. Wang
- L. Shi
- Y. Zhao
- H. Zhang
- E. Szczerbicki
- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2022
Deep learning suffers from the threat of adversarial attacks, and its defense methods have become a research hotspot. In all applications of deep learning, intelligent driving is an important and promising one, facing serious threat of adversarial attack in the meanwhile. To address the adversarial attack, this paper takes the traffic sign recognition as a typical object, for it is the core function of intelligent driving. Considering...

Pełny tekst do pobrania w serwisie zewnętrznym
Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding
Publikacja
- M. Kuropatwinski
- W. Kleijn
- M. Kuropatwiński
- Rok 2001
Pełny tekst do pobrania w serwisie zewnętrznym
Emotion Recognition
Dane Badawcze
open access
- M. Przyborski
- K. Bobkowska
- seria: Person A
The films presented here were recorded using so-called high-speed camera Phantom Miro. To play the movie You need the special software which can be downloaded from the web site https://www.phantomhighspeed.com/resourcesandsupport/phantomresources/pccsoftware the details of the movie are available after starting the movie in the viewer in the description...
Emotion Recognition
Dane Badawcze
open access
- M. Przyborski
- K. Bobkowska
- seria: Person A
The films presented here were recorded using so-called high-speed camera Phantom Miro. To play the movie You need the special software which can be downloaded from the web site https://www.phantomhighspeed.com/resourcesandsupport/phantomresources/pccsoftware the details of the movie are available after starting the movie in the viewer in the description...
Contextual Knowledge to Enhance Workplace Hazard Recognition and Interpretation in a Cognitive Vision Platform
Publikacja
- C. De
- C. Sanin
- E. Szczerbicki
- Rok 2018
The combination of vision and sensor data together with the resulting necessity for formal representations builds a central component of an autonomous Cyber Physical System for detection and tracking of laborers in workplaces environments. This system must be adaptable and perceive the environment as automatically as possible, performing in a variety of plants and scenes without the necessity of recoding the application for each...

Pełny tekst do pobrania w portalu
Topology recognition and leader election in colored networks
Publikacja
- D. Dereniowski
- A. Pelc
- THEORETICAL COMPUTER SCIENCE - Rok 2016
Topology recognition and leader election are fundamental tasks in distributed computing in networks. The first of them requires each node to find a labeled isomorphic copy of the network, while the result of the second one consists in a single node adopting the label 1 (leader), with all other nodes adopting the label 0 and learning a path to the leader. We consider both these problems in networks whose nodes are equipped with...

Pełny tekst do pobrania w portalu
Real-time working gas recognition system based on the array of semiconductor gas sensors and portable computer Raspberry PI
Publikacja
- Rok 2013
The gas-analyzing systems based on the array of partially selective gas sensors and pattern-recognition techniques are potentially fast and low-cost alternative for other devices, like gas analysers. They give the possibility of recognition the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of gas recognition system, in which the signals from an...
Gesture recognition framework for multimedia content viewer controlling
Publikacja
- Rok 2009
In the paper a system for controlling a multimedia content viewer by hand gestures is presented. First, selected methods used for gesture recognition are described. Two different application cases of the system, i.e. for multimedia presentation purposes and for multimedia content viewing are outlined. Moreover, a proposal of improvement of the system combining these approaches is also given. The system work cycle is reviewed. The...
From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition
Publikacja
- P. Rościszewski
- Computer Science - Rok 2017
Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Pełny tekst do pobrania w portalu
Deep Learning: A Case Study for Image Recognition Using Transfer Learning
Publikacja
- S. Erpolat Tasabat
- O. Aydin
- Rok 2021
Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Pełny tekst do pobrania w serwisie zewnętrznym
Subjective Quality Evaluation of Speech Signals Transmitted via BPL-PLC Wired System
Publikacja
- P. Falkowski-Gilski
- G. Debita
- M. Habrych
- B. Miedziński
- P. Jedlikowski
- B. Polnik
- J. Wandzio
- X. Wang
- Rok 2020
The broadband over power line – power line communication (BPL-PLC) cable is resistant to electricity stoppage and partial damage of phase conductors. It maintains continuity of transmission in case of an emergency. These features make it an ideal solution for delivering data, e.g. in an underground mine environment, especially clear and easily understandable voice messages. This paper describes a subjective quality evaluation of...

Pełny tekst do pobrania w serwisie zewnętrznym
Comparison of edge detection algorithms for electric wire recognition
Publikacja
- Rok 2018
Edge detection is the preliminary step in image processing for object detection and recognition procedure. It allows to remove useless information and reduce amount of data before further analysis. The paper contains the comparison of edge detection algorithms optimized for detection of horizontal edges. For comparison purposes the algorithms were implemented in the developed application dedicated to detection of electric line...

Pełny tekst do pobrania w serwisie zewnętrznym
Optical recognition elements: macrocyclic imidazole chromoionophores entrapped in silica xerogel
Publikacja
- M. Jamrógiewicz
- K. Kledzik
- M. Gwiazda
- E. Wagner-Wysiecka
- J. Jezierska
- J. Biernat
- A. Kłonkowski
- MATERIALS SCIENCE-POLAND - Rok 2007
Materials containing new chromoionophores consisting of crown residue and azole moiety as partsof macrocycles were encapsulated by the sol-gel procedure in silica xerogel matrices and proposed aschemical recognition elements especially for such metal ions as Li+, Cs+ and Cu2+. Action of these recognition elements is in principle based on changes of reflectance. The recognition elements containing 21-membered chromogenic...

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: SPEECH RECOGNITION SYSTEMS

Michał Tomasz Kucewicz dr

Piotr Szczuko dr hab. inż.