Wyniki wyszukiwania dla: audiovisual speech recognition

Wyniki wyszukiwania dla: audiovisual speech recognition

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 1086

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

SpringerBriefs in Speech Technology

Czasopisma

ISSN: 2191-737X , eISSN: 2191-7388
Audiology and Speech Research

Czasopisma

ISSN: 2635-5019 , eISSN: 2635-5027
Voice and Speech Review

Czasopisma

ISSN: 2326-8263 , eISSN: 2326-8271
Application of Binary Image Quality Assessment Methods to Predict the Quality of Optical Character Recognition Results
Publikacja
- Applied Sciences-Basel - Rok 2024
One of the continuous challenges related to the growing popularity of mobile devices and embedded systems with limited memory and computational power is the development of relatively fast methods for real-time image and video analysis. One such example is Optical Character Recognition (OCR), which is usually too complex for such devices. Considering that images captured by cameras integrated into mobile devices may be acquired...

Pełny tekst do pobrania w serwisie zewnętrznym
Theory of recognition in a historical perspective. Axel Honneth's Anerkennung: Eine europäische Ideengeschichte
Publikacja
- A. Karalus
- Archiwum Historii Filozofii i Myśli Społecznej - Rok 2019
The article discusses Honneth excursion into the realm of the history of ideas. This time Honneth decides to laser it on the notion of "recognition" in three different cultural areas and three different traditions: French, English, and German. The article discusses Honneth's persepctive and attempts at finding the common thread that would link three aforementioned traditions.

Pełny tekst do pobrania w portalu
A review of emotion recognition methods based on keystroke dynamics and mouse movements
Publikacja
- A. Kołakowska
- Rok 2013
The paper describes the approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover the advantages and examples of combining standard...

Pełny tekst do pobrania w serwisie zewnętrznym
Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions
Publikacja
- M. Wang
- T. Sirlapu
- A. Kwaśniewska
- M. Szankin
- M. Bartscherer
- R. Nicolas
- Rok 2018
With the technology advancements in smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to involve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for speaker recognition task, which are...

Pełny tekst do pobrania w serwisie zewnętrznym
Journal of Pattern Recognition Research

Czasopisma

ISSN: 1558-884X
Pattern Recognition and Image Analysis

Czasopisma

ISSN: 1054-6618
Graph Representation Integrating Signals for Emotion Recognition and Analysis
Publikacja
- SENSORS - Rok 2021
Data reusability is an important feature of current research, just in every field of science. Modern research in Affective Computing, often rely on datasets containing experiments-originated data such as biosignals, video clips, or images. Moreover, conducting experiments with a vast number of participants to build datasets for Affective Computing research is time-consuming and expensive. Therefore, it is extremely important to...

Pełny tekst do pobrania w portalu
Examining Classifiers Applied to Static Hand Gesture Recognition in Novel Sound Mixing System
Publikacja
- Advances in Intelligent Systems and Computing - Rok 2013
The main objective of the chapter is to present the methodology and results of examining various classifiers (Nearest Neighbor-like algorithm with non-nested generalization (NNge), Naive Bayes, C4.5 (J48), Random Tree, Random Forests, Artificial Neural Networks (Multilayer Perceptron), Support Vector Machine (SVM) used for static gesture recognition. A problem of effective gesture recognition is outlined in the context of the system...

Pełny tekst do pobrania w serwisie zewnętrznym
A method supporting fault-tolerant optical text recognition from video sequences recorded with handheld cameras
Publikacja
- K. P. Okarma
- P. Lech
- Engineering Applications of Artificial Intelligence - Rok 2023
In the paper a method supporting the optical character recognition from video sequences recorded with cameras without good stabilization is proposed. Due to the presence of various distortions, such as motion blur, shadows, lossy compression artifacts, auto-focusing errors, etc., the quality of individual video frames, e.g., recorded by a smartphone camera, differs noticeably, influencing the results of text recognition, causing...

Pełny tekst do pobrania w serwisie zewnętrznym
A Review of Emotion Recognition Methods Based on Data Acquired via Smartphone Sensors
Publikacja
- SENSORS - Rok 2020
In recent years, emotion recognition algorithms have achieved high efficiency, allowing the development of various affective and affect-aware applications. This advancement has taken place mainly in the environment of personal computers offering the appropriate hardware and sufficient power to process complex data from video, audio, and other channels. However, the increase in computing and communication capabilities of smartphones,...

Pełny tekst do pobrania w portalu
Real-Time Sensor-Based Human Activity Recognition for eFitness and eHealth Platforms
Publikacja
- Ł. Czekaj
- M. Kowalewski
- J. Domaszewicz
- R. Kitłowski
- M. Szwoch
- W. Duch
- SENSORS - Rok 2024
Human Activity Recognition (HAR) plays an important role in the automation of various tasks related to activity tracking in such areas as healthcare and eldercare (telerehabilitation, telemonitoring), security, ergonomics, entertainment (fitness, sports promotion, human–computer interaction, video games), and intelligent environments. This paper tackles the problem of real-time recognition and repetition counting of 12 types of...

Pełny tekst do pobrania w serwisie zewnętrznym
Applicability of Emotion Recognition and Induction Methods to Study the Behavior of Programmers
Publikacja
- M. Wróbel
- Applied Sciences-Basel - Rok 2018
Recent studies in the field of software engineering have shown that positive emotions can increase and negative emotions decrease the productivity of programmers. In the field of affective computing, many methods and tools to recognize the emotions of computer users were proposed. However, it has not been verified yet which of them can be used to monitor the emotional states of software developers. The paper describes a study carried...

Pełny tekst do pobrania w portalu
Estimation of time-frequency complex phase-based speech attributes using narrow band filter banks
Publikacja
- K. Abratkiewicz
- K. Czarnecki
- D. Fourer
- F. Auger
- Rok 2017
In this paper, we present nonlinear estimators of nonstationary and multicomponent signal attributes (parameters, properties) which are instantaneous frequency, spectral (or group) delay, and chirp-rate (also known as instantaneous frequency slope). We estimate all of these distributions in the time-frequency domain using both finite and infinite impulse response (FIR and IIR) narrow band filers for speech analysis. Then, we present...

Pełny tekst do pobrania w portalu
Recognition of environmentally important ions
Publikacja
- N. Łukasik
- E. Wagner-Wysiecka
- V. Hubscher-Bruder
- M. Bocheńska
- S. Michel
- Logistyka - Rok 2013
..
Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition
Publikacja
- Rok 2019
Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

Pełny tekst do pobrania w portalu
Deep Learning: A Case Study for Image Recognition Using Transfer Learning
Publikacja
- S. Erpolat Tasabat
- O. Aydin
- Rok 2021
Deep learning (DL) is a rising star of machine learning (ML) and artificial intelligence (AI) domains. Until 2006, many researchers had attempted to build deep neural networks (DNN), but most of them failed. In 2006, it was proven that deep neural networks are one of the most crucial inventions for the 21st century. Nowadays, DNN are being used as a key technology for many different domains: self-driven vehicles, smart cities,...

Pełny tekst do pobrania w serwisie zewnętrznym
From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition
Publikacja
- P. Rościszewski
- Computer Science - Rok 2017
Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

Pełny tekst do pobrania w portalu
New generation speech aid for stuttering people
Publikacja
- P. Odya
- A. Czyżewski
- Rok 2008
Współczesne Cyfrowe Procesory Sygnałowe (ang. DSP) mają niewielkie wymiary, ale są w stanie re-alizować złożone algorytmy. Ich dodatkową zaletą jest łatwość wymiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. Wykorzystując możliwości procesów stało się możliwe budowanie miniaturowych protez słuchu i mowy. W referacie skupiono się na zagadnieniach związanych z projekto-wanie i implementacją algorytmów...

Pełny tekst do pobrania w portalu
New generation speech aid for stuttering people
Publikacja
- P. Odya
- A. Czyżewski
- Archives of Acoustics - Rok 2008
Współczesne Cyfrowe Procesory Sygnałowe (ang. DSP) mają niewielkie wymiary, ale są w stanie re-alizować złożone algorytmy. Ich dodatkową zaletą jest łatwość wymiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. Wykorzystując możliwości procesów stało się możliwe budowanie miniaturowych protez słuchu i mowy. W referacie skupiono się na zagadnieniach związanych z projekto-wanie i implementacją algorytmów...

Pełny tekst do pobrania w portalu
Influence of modulation detection threshold on speech intelligibility
Publikacja
- K. Leo
- ACTA PHYSICA POLONICA A - Rok 2011
Pełny tekst do pobrania w portalu
Transient detection algorithms for speech coding applications
Publikacja
- G. Szwoch
- M. Kulesza
- A. Czyzewski
- Journal of the Acoustical Society of America - Rok 2006
Pełny tekst do pobrania w serwisie zewnętrznym
Comprehensive Evaluation of Statistical Speech Waveform Synthesis
Publikacja
- T. Merritt
- B. Putrycz
- A. Nadolski
- T. Ye
- D. Korzekwa
- W. Dolecki
- T. Drugman
- V. Klimkov
- A. Moinet
- A. Breen... i 3 innych
- Rok 2018
Pełny tekst do pobrania w serwisie zewnętrznym
Automatic singing quality recognition employing artificial neural networks
Publikacja
- P. Żwan
- Archives of Acoustics - Rok 2008
Celem artykułu jest udowodnienie możliwości automatycznej oceny jakości technicznej głosów śpiewaczych. Pokrótce zaprezentowano w nim stworzoną bazę danych głosów śpiewaczych oraz zaimplementowane parametry. Przy pomocy sztucznych sieci neuronowych zaprojektowano system decyzyjny, który oceniono w pięciostopniowej skali jakość techniczną głosu. Przy pomocy metod statystycznych udowodniono, że wyniki generowane przez ten system...

Pełny tekst do pobrania w portalu
Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes
Publikacja
- H. Michalak
- K. P. Okarma
- ENTROPY - Rok 2019
Automatic text recognition from the natural images acquired in uncontrolled lighting conditions is a challenging task due to the presence of shadows hindering the shape analysis and classification of individual characters. Since the optical character recognition methods require prior image binarization, the application of classical global thresholding methods in such case makes it impossible to preserve the visibility of all...

Pełny tekst do pobrania w serwisie zewnętrznym
A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention
Publikacja
- H. Zhang
- Z. Xiao
- J. Wang
- F. Li
- E. Szczerbicki
- IEEE Internet of Things Journal - Rok 2019
Together with fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in daily activity tracking and analyzing of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...

Pełny tekst do pobrania w portalu
Real-time working gas recognition system based on the array of semiconductor gas sensors and portable computer Raspberry PI
Publikacja
- Rok 2013
The gas-analyzing systems based on the array of partially selective gas sensors and pattern-recognition techniques are potentially fast and low-cost alternative for other devices, like gas analysers. They give the possibility of recognition the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of gas recognition system, in which the signals from an...
Unraveling the Interplay between DNA and Proteins: A Computational Exploration of Sequence and Structure-Specific Recognition Mechanisms
Publikacja
- K. A. Hossain
- Rok 2023
My PhD dissertation focused on DNA-protein interactions and the recognition of specific DNA sequences and structures. I discovered that acidic amino acid residues (Asp/Glu) play a crucial role by exhibiting a preference for cytosine. Their contribution to binding affinity depends on nearby cytosines, balancing electrostatic repulsion with specific interactions. Acidic residues act as negative selectors, discouraging non-cytosine...

Pełny tekst do pobrania w portalu
Improving Traffic Light Recognition Methods using Shifting Time-Windows
Publikacja
- A. Blokus
- H. Krawczyk
- Rok 2018
We propose a novel method of improving algorithms recognizing traffic lights in video sequences. Our focus is on algorithms for applications which notify the driver of a light in sight. Many existing methods process images in the recording separately. Our method bases on the observation that real-life videos depict underlying continuous processes. We named our method FSA (Frame Sequence Analyzed). It is applicable for any underlying...

Pełny tekst do pobrania w serwisie zewnętrznym
Adaptive system for recognition of sounds indicating threats to security of people and property employing parallel processing of audio data streams
Publikacja
- K. Łopatka
- Rok 2015
A system for recognition of threatening acoustic events employing parallel processing on a supercomputing cluster is featured. The methods for detection, parameterization and classication of acoustic events are introduced. The recognition engine is based onthreshold-based detection with adaptive threshold and Support Vector Machine classifcation. Spectral, temporal and mel-frequency descriptors are used as signal features. The...
Lucyna Nyka prof. dr hab. inż. arch.

Osoby

Politechnika Gdańska, Katedra Architektury Miejskiej i Przestrzeni Nadwodnych

Lucyna Nyka (prof. dr hab. inż. arch.) jest architektem i profesorem na Wydziale Architektury Politechniki Gdańskiej. W latach 2008-2016 pełniła funkcję prodziekana ds. nauki, a od 2016 jest dziekanem Wydziału Architektury. Zainteresowania badawcze prof. Lucyny Nyki skoncentrowane są wokół kwestii powiązań architektury i wody, przekształceń terenów nadwodnych oraz urbanistycznych krajobrazów. Jest autorem i współautorem wielu...
Viruses, cancer and non-self recognition
Publikacja
- M. Padariya
- U. Kalathiya
- S. Mikac
- K. Dziubek
- M. Tovar
- E. Sroka
- R. Fahraeus
- A. Sznarkowska
- Open Biology - Rok 2021
Pełny tekst do pobrania w serwisie zewnętrznym
Role of cholesterol in substrate recognition by -secretase
Publikacja
- Scientific Reports - Rok 2021
-Secretase is an enzyme known to cleave multiple substrates within their transmembrane domains, with the amyloid precursor protein of Alzheimer’s Disease among the most prominent examples. The activity of -secretase strictly depends on the membrane cholesterol content, yet the mechanistic role of cholesterol in the substrate binding and cleavage remains unclear. In this work, we used all-atom molecular dynamics simulations to examine...

Pełny tekst do pobrania w portalu
Face Recognition: Shape versus Texture
Publikacja
- M. Smiatacz
- Rok 2015
This paper describes experiments related to the application of well-known techniques of the texture feature extraction (Local Binary Patterns and Gabor filtering) to the problem of automatic face verification. Results of the tests show that simple image normalization strategy based on the eye center detection and a regular grid of fiducial points outperforms the more complicated approach, employing active models that are able to...

Pełny tekst do pobrania w serwisie zewnętrznym
Balance recognition on the basis of EEG measurement.
Publikacja
- Annals of Computer Science and Information Systems - Rok 2016
Although electroencephalography (EEG) is not typically used for verifying the sense of balance, it can be used for analysing cortical signals responsible for this phenomenon. Simple balance tasks can be proposed as a good indicator of whether the sense of balance is acting more or less actively. This article presents preliminary results for the potential of using EEG to balance sensing....

Pełny tekst do pobrania w portalu
Computer-based detection of depression and dementia in spontaneous speech
Publikacja
- K. Chlasta
- P. Holas
- K. Wolk
- EUROPEAN PSYCHIATRY - Rok 2021
Pełny tekst do pobrania w serwisie zewnętrznym
Investigations of speech signal parameters with regard to articulation influences
Publikacja
- A. Kaczmarek
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2008
W pracy zostało podjęte zagadnienie parametryzacji sygnału mowy w kontekście ekstrakcji cech biometrycznych. Analizowane parametry to parametry cepstralne (cepstrum liniowe i mel-cepstrum, czyli MFCC), parametry liniowej predykcji (LPC) oraz momenty widmowe i parametr F0. Zastosowano analize w krótkich stałych segmentach sygnału z zastosowaniem dużego zakładkowania, tzw. ''implicite segmentation''. Umożliwiło to zaobserwowanie...
Evaluation and Irony in Text in the Light of Speech Act Theory
Publikacja
- K. Kukowicz-Zarska
- Forum Filologiczne Ateneum - Rok 2020
Pełny tekst do pobrania w serwisie zewnętrznym
System of speech signal processing and visualisation for linguistic purposes
Publikacja
- K. Wojan
- Archives of Acoustics - Rok 2005
Digital analysis of ethnic speech – extraction of information code
Publikacja
- K. Wojan
- Archives of Acoustics - Rok 2003
On the EM algorithm for the estimation of speech AR parameters in noise
Publikacja
- M. Kuropatwinski
- B. Kleijn
- M. Kuropatwiński
- Rok 2014
Pełny tekst do pobrania w serwisie zewnętrznym
Detection of dialogue in movie soundtrack for speech intelligibility enhancement
Publikacja
- K. Łopatka
- Rok 2014
A method for detecting dialogue in 5.1 movie soundtrack based on interchannel spectral disparity is presented. The front channel signals (left, right, center) are analyzed in the frequency domain. The selected partials in the center channel signal, which yield high disparity with left and right channels, are detected as dialogue. Subsequently, the dialogue frequency components are boosted to achieve increased dialogue intelligibility....

Pełny tekst do pobrania w serwisie zewnętrznym
New approach to localization of clicks in archive speech signals.
Publikacja
- M. Niedźwiecki
- A. Sobociński
- Rok 2004
Przedstawiono problem lokalizacji zniekształceń impulsowych w archiwalnych sygnałach mowy. Pokazano, że detekcja oparta na dwuzakresowym modelu autoregresyjnym i przetwarzanie dwukierunkowe pozwala uzyskać znaczącą poprawę działania w stosunku do istniejących metod lokalizacji zniekształceń.
Advanced speech archiving and restoration system for aviation applications
Publikacja
- A. Czyżewski
- J. Kotus
- A. Kaczmarek
- A. Rypulak
- A. Pawlik
- Rok 2005
W referacie przedstawiono opracowany System Rejestracji I Rekonstrukcji Mowy dla potrzeb lotnictwa. System ten umożliwia jednoczesny zapis, archiwizację i poprawę zrozumiałości sygnału mowy pochodzącego z wielu różnych kanałów komunikacji radiowej. Głównym celem systemu jest rejestracja i rekonstrukcja komunikatów słownych wymienianych drogą radiową pomiędzy pilotem samolotu a stacją kontroli lotów - jest to niezwykle istotne w...
Application of hybrid signals processors to speech and hearing aids
Publikacja
- P. Odya
- A. Czyżewski
- Rok 2005
Dzięki postępowi w technice Cyfrowych Procesorów Sygnałowych (ang. DSP) stało się możliwe budowanie miniaturowych protez słuchu i mowy. Mimo niewielkich wymiarów procesory te są w stanie wykonywać złożone algorytmy. Ich dodatkową zaletą jest łatwość zmiany oprogramowania, a co za tym idzie łatwość zmiany dziedziny zastosowań. W pracy skupiono się na zagadnieniach związanych z projektowanie i implementacją algorytmów mających zastosowanie...
Automatic recognition of males and females among web browser users based on behavioural patterns of peripherals usage
Publikacja
- A. Kołakowska
- A. Landowska
- P. Jarmolkowicz
- M. Jarmolkowicz
- K. Sobota
- Internet Research - Rok 2016
Purpose The purpose of this paper is to answer the question whether it is possible to recognise the gender of a web browser user on the basis of keystroke dynamics and mouse movements. Design/methodology/approach An experiment was organised in order to track mouse and keyboard usage using a special web browser plug-in. After collecting the data, a number of parameters describing the users’ keystrokes, mouse movements and clicks...

Pełny tekst do pobrania w serwisie zewnętrznym
The Influence of Selecting Regions from Endoscopic Video Frames on The Efficiency of Large Bowel Disease Recognition Algorithms
Publikacja
- Rok 2012
The article presents our research in the field of the automatic diagnosis of large intestine diseases on endoscopic video. It focuses on the methods of selecting regions of interest from endoscopic video frames for further analysis by specialized disease recognition algorithms. Four methods of selecting regions of interest have been discussed: a. trivial, b. with the deletion of characteristic, endoscope specific additions to the...
International Journal of Speech Technology

Czasopisma

ISSN: 1381-2416 , eISSN: 1572-8110

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: audiovisual speech recognition

Lucyna Nyka prof. dr hab. inż. arch.