Wyniki wyszukiwania dla: VISUAL SPEECH RECOGNITION

Wyniki wyszukiwania dla: VISUAL SPEECH RECOGNITION

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 1419

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

Semantic Integration of Heterogeneous Recognition Systems
Publikacja
- P. Kaczmarek
- P. Raszkowski
- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2011
Computer perception of real-life situations is performed using a variety of recognition techniques, including video-based computer vision, biometric systems, RFID devices and others. The proliferation of recognition modules enables development of complex systems by integration of existing components, analogously to the Service Oriented Architecture technology. In the paper, we propose a method that enables integration of information...
Using Physiological Signals for Emotion Recognition
Publikacja
- W. Szwoch
- Rok 2013
Recognizing user’s emotions is the promising area of research in a field of human-computer interaction. It is possible to recognize emotions using facial expression, audio signals, body poses, gestures etc. but physiological signals are very useful in this field because they are spontaneous and not controllable. In this paper a problem of using physiological signals for emotion recognition is presented. The kinds of physiological...

Pełny tekst do pobrania w serwisie zewnętrznym
Emotions in polish speech recordings
Dane Badawcze
open access
- M. Mięsikowska
- D. Świsulski
The data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...
Communication Platform for Evaluation of Transmitted Speech Quality
Publikacja
- A. Ciarkowski
- A. Czyżewski
- Journal of Telecommunications and Information Technology - Rok 2011
A voice communication system designed and implemented is described. The purpose of the presented platform was to enable a series of experiments related to the quality assessment of algorithms used in the coding and transmitting of speech. The system is equipped with tools for recording signals at each stage of processing, making it possible to subject them to subjective assessments by listening tests or, objective evaluation employing...

Pełny tekst do pobrania w portalu
Emotion Recognition for Affect Aware Video Games
Publikacja
- M. Szwoch
- W. Szwoch
- Advances in Intelligent Systems and Computing - Rok 2015
In this paper the idea of affect aware video games is presented. A brief review of automatic multimodal affect recognition of facial expressions and emotions is given. The first result of emotions recognition using depth data as well as prototype affect aware video game are presented

Pełny tekst do pobrania w serwisie zewnętrznym
Emotion Recognition and Its Applications
Publikacja
- Advances in Intelligent Systems and Computing - Rok 2014
The paper proposes a set of research scenarios to be applied in four domains: software engineering, website customization, education and gaming. The goal of applying the scenarios is to assess the possibility of using emotion recognition methods in these areas. It also points out the problems of defining sets of emotions to be recognized in different applications, representing the defined emotional states, gathering the data and...

Pełny tekst do pobrania w serwisie zewnętrznym
Visual perception of vowels from static and dynamic cues
Publikacja
- Journal of the Acoustical Society of America - Rok 2018
The purpose of the study was to analyse human identification of Polish vowels from static and dynamic durationally slowed visual cues. A total of 152 participants identified 6 Polish vowels produced by 4 speakers from static (still images) and dynamic (videos) cues. The results show that 59% of static vowels and 63% of dynamic vowels were successfully identified. There was a strong confusion between vowels within front, central,...

Pełny tekst do pobrania w serwisie zewnętrznym
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publikacja
- Elektronika : konstrukcje, technologie, zastosowania - Rok 2008
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Pitch estimation of narrowband-filtered speech signal using instantaneous complex frequency
Publikacja
- T. Bandurski
- Ł. Hamerski
- M. Papaj
- A. Paruzel
- K. Świder
- Rok 2007
In this paper we propose a novel method of pitch estimation, based on instantaneous complex frequency (ICF). New iterative algorithm for analysis of ICF of speech signal in presented. Obtained results are compared with commonly used methods to prove its accuracy and connection between ICF and pitch, particularly for narrowband-filtered speech signal.
Results of tests on speech intelligibility in reverberant conditions
Dane Badawcze
open access
The dataset contains the results of tests that aimed to provide a relationship between the rate of speech (RoS) and reverberation conditions characterized by the Speech Transmission Index (STI).
Rough Sets Applied to Mood of Music Recognition
Publikacja
- B. Kostek
- M. Piotrowska
- Rok 2016
With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered as one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt to music emotion recognition...
Automated detection of pronunciation errors in non-native English speech employing deep learning
Publikacja
- D. Korzekwa
- Rok 2023
Despite significant advances in recent years, the existing Computer-Assisted Pronunciation Training (CAPT) methods detect pronunciation errors with a relatively low accuracy (precision of 60% at 40%-80% recall). This Ph.D. work proposes novel deep learning methods for detecting pronunciation errors in non-native (L2) English speech, outperforming the state-of-the-art method in AUC metric (Area under the Curve) by 41%, i.e., from...

Pełny tekst do pobrania w portalu
Emotion Recognition Using Physiological Signals
Publikacja
- W. Szwoch
- Rok 2015
In this paper the problem of emotion recognition using physiological signals is presented. Firstly the problems with acquisition of physiological signals related to specific human emotions are described. It is not a trivial problem to elicit real emotions and to choose stimuli that always, and for all people, elicit the same emotion. Also different kinds of physiological signals for emotion recognition are considered. A set of...

Pełny tekst do pobrania w serwisie zewnętrznym
Visual Content Representation for Cognitive Systems: Towards Augmented Intelligence
Publikacja
- C. S. d. Oliveira
- C. Sanin
- E. Szczerbicki
- Rok 2020
Cognitive Vision Systems have gained significant attention from academia and industry during the past few decades. One of the main reasons behind this interest is the potential of such technologies to revolutionize human life since they intend to work robustly under complex visual scenes (which environmental conditions may vary), adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination...

Pełny tekst do pobrania w serwisie zewnętrznym
Facial emotion recognition using depth data
Publikacja
- M. Szwoch
- P. Pieniazek
- Rok 2015
In this paper an original approach is presented for facial expression and emotion recognition based only on depth channel from Microsoft Kinect sensor. The emotional user model contains nine emotions including the neutral one. The proposed recognition algorithm uses local movements detection within the face area in order to recognize actual facial expression. This approach has been validated on Facial Expressions and Emotions Database...

Pełny tekst do pobrania w serwisie zewnętrznym
Emotion recognition and its application in software engineering
Publikacja
- Rok 2013
In this paper a novel application of multimodal emotion recognition algorithms in software engineering is described. Several application scenarios are proposed concerning program usability testing and software process improvement. Also a set of emotional states relevant in that application area is identified. The multimodal emotion recognition method that integrates video and depth channels, physiological signals and input devices...

Pełny tekst do pobrania w serwisie zewnętrznym
A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times
Publikacja
- SENSORS - Rok 2022
Objective assessment of speech intelligibility is a complex task that requires taking into account a number of factors such as different perception of each speech sub-bands by the human hearing sense or different physical properties of each frequency band of a speech signal. Currently, the state-of-the-art method used for assessing the quality of speech transmission is the speech transmission index (STI). It is a standardized way...

Pełny tekst do pobrania w portalu
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publikacja
- D. Korzekwa
- J. Lorenzo-trueba
- T. Drugman
- S. Calamaro
- B. Kostek
- Rok 2021
We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

Pełny tekst do pobrania w portalu
Dependable Integration of Medical Image Recognition Components
Publikacja
- Rok 2012
Computer driven medical image recognition may support medical doctors in the diagnosis process, but requires high dependability considering potential consequences of incorrect results. The paper presentsa system that improves dependability of medical image recognition by integration of results from redundant components. The components implement alternative recognition algorithms of diseases in thefield of gastrointestinal endoscopy....
Feature extraction in detection and recognition of graphical objects
Publikacja
- J. Dembski
- Rok 2022
Detection and recognition of graphic objects in images are of great and growing importance in many areas, such as medical and industrial diagnostics, control systems in automation and robotics, or various types of security systems, including biometric security systems related to the recognition of the face or iris of the eye. In addition, there are all systems that facilitate the personal life of the blind people, visually impaired...
Mining inconsistent emotion recognition results with the multidimensional model
Publikacja
- A. Landowska
- T. Zawadzka
- M. Zawadzki
- IEEE Access - Rok 2021
The paper deals with the challenge of inconsistency in multichannel emotion recognition. The focus of the paper is to explore factors that might influence the inconsistency. The paper reports an experiment that used multi-camera facial expression analysis with multiple recognition systems. The data were analyzed using a multidimensional approach and data mining techniques. The study allowed us to explore camera location, occlusions...

Pełny tekst do pobrania w portalu
Guido: a musical score recognition system
Publikacja
- M. Szwoch
- Rok 2007
This paper presents an optical music recognition system Guido that can automatically recognize the main musical symbols of music scores that were scanned or taken by a digital camera. The application is based on object model of musical notation and uses linguistic approach for symbol interpretation and error correction. The system offers musical editor with a partially automatic error correction.
Mowa nienawiści (hate speech) a odpowiedzialność dostawców usług internetowych w orzecznictwie sądów europejskich
Publikacja
- K. Kowalik-Bańczyk
- Rok 2015
The article analyses the phenomenon of hate speech in the Internet contrasted with the problem of responsability of Internet Service Providers for cases of such abuses of freedom of expression. The text provides an analysis of jurisprudence of two European Courts. On the one hand it presents the position of the European Court of Human Rights on the problem of hate speech: its definition and the liability for it as an exception...
Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study
Publikacja
- B. Mróz
- B. Kostek
- Archives of Acoustics - Rok 2022
This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

Pełny tekst do pobrania w portalu
Objectivization of phonological evaluation of speech elements by means of audio parametrization
Publikacja
- Rok 2018
This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
Selected aspects of customization of cognitive dimensions for evaluation of visual modeling languages.
Publikacja
- A. Bobkowska
- Rok 2004
For the successful application of diagrams in software engineering, high quality visual modelling languages (VML) are required. There is a need for new effective methodologies of VML evaluation. This paper discusses selected aspects of applying cognitive dimensions as a basis of the evaluation. Then, it briefly presents CD-VML methodology which integrates the cognitive dimensions with a theory of visual modelling languages. Finally,...
Multiclass AdaBoost Classifier Parameter Adaptation for Pattern Recognition
Publikacja
- J. Dembski
- Advances in Intelligent Systems and Computing - Rok 2017
The article presents the problem of parameter value selection of the multiclass ``one against all'' approach of an AdaBoost algorithm in tasks of object recognition based on two-dimensional graphical images. AdaBoost classifier with Haar features is still used in mobile devices due to the processing speed in contrast to other methods like deep learning or SVM but its main drawback is the need to assembly the results of binary...

Pełny tekst do pobrania w serwisie zewnętrznym
A Visual Method of Measuring Railway-Track Weed Infestation Level
Publikacja
- J. Skibicki
- R. Licow
- Metrology - Rok 2022
This paper concerns the assessment of railway track surface conditions in relation to the degree of weed infestation. The paper conceptually describes the proposed method using a visual system to analyse weed infestation level. The use of image analysis software for weed detection is also proposed. This new measurement method allows for a mobile assessment of the track’s weed infestation status. Validation of the assessment method...

Pełny tekst do pobrania w portalu
Anion recognition by n,n'-diarylalkanediamides
Publikacja
- E. Wagner-Wysiecka
- N. Łukasik
- Rok 2012
The preparation of N,N'-diarylalkanediamides from respective aliphatic dicarboxylic acidesand 4-nitroaniline via microwave-promoted reactions is presented. The most positive effect of microwave irradiation was observed for N,N'-bis(4-nitrophenyl)butanediamide. Anion binding studies on the obtained diamides were carried out in DMSO and acetonitrile using UV-vis and 1H NMR spectroscopy. A mechanism for selective fluoride recognition...

Pełny tekst do pobrania w serwisie zewnętrznym
Elimination of clicks from archive speech signals using sparse autoregressive modeling
Publikacja
- M. Niedźwiecki
- M. Ciołek
- Rok 2012
This paper presents a new approach to elimination of impulsivedisturbances from archive speech signals. The proposedsparse autoregressive (SAR) signal representation is given ina factorized form - the model is a cascade of the so-called formantfilter and pitch filter. Such a technique has been widelyused in code-excited linear prediction (CELP) systems, as itguarantees model stability. After detection of noise pulses usinglinear...

Pełny tekst do pobrania w serwisie zewnętrznym
AN ALGORITHM FOR PORTAL HYPERTENSIVE GASTROPATHY RECOGNITION ON THE ENDOSCOPIC RECORDINGS
Publikacja
- Rok 2014
Symptoms recognition of portal hypertensive gastropathy (PHG) can be done by analysing endoscopic recordings, but manual analysis done by physician may take a long time. This increases probability of missing some symptoms and automated methods may be applied to prevent that. In this paper a novel hybrid algorithm for recognition of early stage of portal hypertensive gastropathy is proposed. First image preprocessing is described....
Human-computer interactions in speech therapy using a blowing interface
Publikacja
- Rok 2014
In this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy children will find easier to play attractive therapeutic games than to perform repetitive and tedious,...

Pełny tekst do pobrania w serwisie zewnętrznym
Limitations of Emotion Recognition in Software User Experience Evaluation Context
Publikacja
- A. Landowska
- J. Miler
- Annals of Computer Science and Information Systems - Rok 2016
This paper concerns how an affective-behavioural- cognitive approach applies to the evaluation of the software user experience. Although it may seem that affect recognition solutions are accurate in determining the user experience, there are several challenges in practice. This paper aims to explore the limitations of the automatic affect recognition applied in the usability context as well as...

Pełny tekst do pobrania w portalu
Accelerometer signal pre-processing influence on human activity recognition
Publikacja
- Rok 2009
A study of data pre-processing influence on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter-out the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy.
Audio-visual surveillance system for application in bank operating room
Publikacja
- J. Kotus
- K. Łopatka
- A. Czyżewski
- G. Bogdanis
- Communications in Computer and Information Science - Rok 2013
An audio-visual surveillance system able to detect, classify and to localize acoustic events in a bank operating room is presented. Algorithms for detection and classification of abnormal acoustic events, such as screams or gunshots are introduced. Two types of detectors are employed to detect impulsive sounds and vocal activity. A Support Vector Machine (SVM) classifier is used to discern between the different classes of acoustic...
Speech and Drama

Czasopisma

ISSN: 0038-7142
LANGUAGE AND SPEECH

Czasopisma

ISSN: 0023-8309 , eISSN: 1756-6053
Impact of Visual Image Quality on Lymphocyte Detection Using YOLOv5 and RetinaNet Algorithms
Publikacja
- Rok 2024
Lymphocytes, a type of leukocytes, play a vital role in the immune system. The precise quantification, spatial arrangement and phenotypic characterization of lymphocytes within haematological or histopathological images can serve as a diagnostic indicator of a particular lesion. Artificial neural networks, employed for the detection of lymphocytes, not only can provide support to the work of histopathologists but also enable better...

Pełny tekst do pobrania w serwisie zewnętrznym
Bimodal Emotion Recognition Based on Vocal and Facial Features
Publikacja
- Rok 2023
Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...

Pełny tekst do pobrania w portalu
Visual Capacity Assessment of the Open Landscape in Terms of Protection and Shaping: Case Study of a Village in Poland
Publikacja
- A. Górka
- Sustainability - Rok 2020
This article describes the methodology and results of research on landscape visual capacity. The aim of the project was to develop a tool that would support planning and design decisions at the level of communal management in rural areas in Poland through systematic application of visual criteria. Their importance in the protection, management and shaping of space is underlined by the document produced at the European Landscape...

Pełny tekst do pobrania w portalu
Simple gait parameterization and 3D animation for anonymous visual monitoring based on augmented reality
Publikacja
- P. Szczuko
- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2016
The article presents a method for video anonymization and replacing real human silhouettes with virtual 3D figures rendered on a screen. Video stream is processed to detect and to track objects, whereas anonymization stage employs animating avatars accordingly to behavior of detected persons. Location, movement speed, direction, and person height are taken into account during animation and rendering phases. This approach requires...

Pełny tekst do pobrania w portalu
Towards Precise Visual Navigation and Direct Georeferencing for MAV Using ORB-SLAM2
Publikacja
- P. Burdziakowski
- Rok 2017
A low accuracy of positioning using Global Navigation Satellite System (GNSS) are not meet geodetic requirements for direct images georeferencing for Unmanned Aerial Vehicle (UAV) photogrammetry. A majority of UAVs are equipped with a monocular or stereo non-metric cameras for either visual data gathering or live video feed for operator. A cheap positioning techniques used on board commercial UAVs are not that precise as geodetic...

Pełny tekst do pobrania w portalu
Music Genre Recognition in the Rough Set-Based Environment
Publikacja
- P. Hoffmann
- B. Kostek
- Rok 2015
The aim of this paper is to investigate music genre recognition in the rough set-based environment. Experiments involve a parameterized music data-base containing 1100 music excerpts. The database is divided into 11 classes cor-responding to music genres. Tests are conducted using the Rough Set Exploration System (RSES), a toolset for analyzing data with the use of methods based on the rough set theory. Classification effectiveness...

Pełny tekst do pobrania w serwisie zewnętrznym
Emotion Recognition from Physiological Channels Using Graph Neural Network
Publikacja
- SENSORS - Rok 2022
In recent years, a number of new research papers have emerged on the application of neural networks in affective computing. One of the newest trends observed is the utilization of graph neural networks (GNNs) to recognize emotions. The study presented in the paper follows this trend. Within the work, GraphSleepNet (a GNN for classifying the stages of sleep) was adjusted for emotion recognition and validated for this purpose. The...

Pełny tekst do pobrania w portalu
Limitations of Emotion Recognition from Facial Expressions in e-Learning Context
Publikacja
- Rok 2017
The paper concerns technology of automatic emotion recognition applied in e-learning environment. During a study of e-learning process the authors applied facial expressions observation via multiple video cameras. Preliminary analysis of the facial expressions using automatic emotion recognition tools revealed several unexpected results, including unavailability of recognition due to face coverage and significant inconsistency...

Pełny tekst do pobrania w serwisie zewnętrznym
UAV Design and Construction for Real Time Photogrammetry and Visual Navigation
Publikacja
- P. Burdziakowski
- Rok 2018
A unmanned aerial vehicles applications in photogrammetry have increased rapidly last years. A fast data gathering and processing in real time in some cases become crucial and desired in some application. In the paper, a real time solution is proposed. A real time photogrammetry from UAV is proposed, where image data are gathered and processed on board UAV and finally reconstructed 3D model and measurements are delivered. The paper...

Pełny tekst do pobrania w portalu
Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology
Publikacja
- Intelligent Decision Technologies-Netherlands - Rok 2010
This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective...

Pełny tekst do pobrania w serwisie zewnętrznym
Database of speech and facial expressions recorded with optimized face motion capture settings
Publikacja
- A. Czyżewski
- M. Kawaler
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2019
The broad objective of the present research is the analysis of spoken English employing a multiplicity of modalities. An important stage of this process, discussed in the paper, is creating a database of speech accompanied with facial expressions. Recordings of speakers were made using an advanced system for capturing facial muscle motion. A brief historical outline, current applications, limitations and the ways of capturing face...

Pełny tekst do pobrania w portalu
Study on Speech Transmission under Varying QoS Parameters in a OFDM Communication System
Publikacja
- M. Zamłyńska
- P. Falkowski-Gilski
- G. Debita
- B. Miedziński
- Rok 2021
Although there has been an outbreak of multiple multimedia platforms worldwide, speech communication is still the most essential and important type of service. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission, based most often on multi-valued modulations, multiple...

Pełny tekst do pobrania w serwisie zewnętrznym
Hand gesture recognition supported by fuzzy rules and Kalman filters
Publikacja
- M. Lech
- B. Kostek
- International Journal of Intelligent Information and Database Systems - Rok 2012
The paper presents a system based on camera and multimediaprojector enabling a user to control computer applications by dynamic hand gestures. Gesture recognition methodology based on representing hand movement trajectory by motion vectors analysed using fuzzy rule-based inference is first given. For effective hand position tracking Kalman filters are employed. The system engineered is developed using J2SE and C++/OpenCV technology....

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: VISUAL SPEECH RECOGNITION