Wyniki wyszukiwania dla: automatic speech recognition

Wyniki wyszukiwania dla: automatic speech recognition

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 1540

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

On practical application of Shannon theory to character recognition and more
Publikacja
- M. Jurkiewicz
- Rok 2014
Let us consider an optical character recognition system, which in particular can be used for identifying objects that were assigned strings of some length. The system is not perfect, for example, it sometimes recognizes wrongly the characters "Y" and "V". What is the largest set of strings of given length for the system under consideration, which can be mutually correctly recognized, and the corresponding objects correctly identified?...
Acquisition and indexing of RGB-D recordings for facial expressions and emotion recognition
Publikacja
- M. Szwoch
- Studia Informatica Pomerania - Rok 2015
In this paper KinectRecorder comprehensive tool is described which provides for convenient and fast acquisition, indexing and storing of RGB-D video streams from Microsoft Kinect sensor. The application is especially useful as a supporting tool for creation of fully indexed databases of facial expressions and emotions that can be further used for learning and testing of emotion recognition algorithms for affect-aware applications....

Pełny tekst do pobrania w serwisie zewnętrznym
Molecular Recognition in Complexes of TRF Proteins with Telomeric DNA
Publikacja
- M. Wieczór
- A. Tobiszewski
- P. Wityk
- B. Tomiczek
- J. Czub
- PLOS ONE - Rok 2014
Telomeres are specialized nucleoprotein assemblies that protect the ends of linear chromosomes. In humans and many other species, telomeres consist of tandem TTAGGG repeats bound by a protein complex known as shelterin that remodels telomeric DNA into a protective loop structure and regulates telomere homeostasis. Shelterin recognizes telomeric repeats through its two major components known as Telomere Repeat-Binding Factors, TRF1...

Pełny tekst do pobrania w portalu
Accelerometer-based Human Activity Recognition and the Impact of the Sample Size
Publikacja
- Rok 2014
The presented study focused on the recognition of eight user activities (e.g. walking, lying, climbing stairs) basing on the measurements from an accelerometer embedded in a mobile device. It is assumed that the device is carried in a specific location of the user’s clothing. Three types of classifiers were tested on different sizes of the samples. The influence of the time window (the duration of a single trial) on selected activities...

Pełny tekst do pobrania w serwisie zewnętrznym
Comparison of selected off-the-shelf solutions for emotion recognition based on facial expressions
Publikacja
- Rok 2016
The paper concerns accuracy of emotion recognition from facial expressions. As there are a couple of ready off-the-shelf solutions available in the market today, this study aims at practical evaluation of selected solutions in order to provide some insight into what potential buyers might expect. Two solutions were compared: FaceReader by Noldus and Xpress Engine by QuantumLab. The performed evaluation revealed that the recognition...

Pełny tekst do pobrania w serwisie zewnętrznym
Automatic analysis of the aggressive behavior of laboratory animals using thermal video processing
Publikacja
- M. Mazur-Milecka
- J. Rumiński
- Rok 2017
The bite detection is very important but difficult element of the social interaction analysis. Standard observation methods like human observer or a camcorder of visible light frequencies fail in this case. However, it is possible to discern cooler spots on the rodent's body that appear after body contact with another individual, and vanish after short time. These spots are assumed to be a saliva trace left on fur after bite. In...

Pełny tekst do pobrania w serwisie zewnętrznym
Next generation automatic IP configuration deployment issues
Publikacja
- T. Mrugalski
- K. Nowicki
- K. Wnuk
- Rok 2008
Although Dynamic Host Configuration Protocol for IPv6 (DHCPv6) protocol was defined in 2003, it was designed as a framework rather than a complete solution to the automatic configuration in IPv6 networks. There are still some unsolved problems and new options yet to be defined. One example of such case is Fully Qualified Domain Name (FQDN) option, which final version has been published in late 2007. It describes DHCPv6 client...
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
Publikacja
- D. Piotrowski
- R. Korzeniowski
- A. Falai
- S. Cygert
- K. Pokora
- G. Tinchev
- Z. Zhang
- K. Yanagisawa
- Rok 2023
In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...

Pełny tekst do pobrania w serwisie zewnętrznym
Human-Computer Interface Based on Visual Lip Movement and Gesture Recognition
Publikacja
- P. Dalka
- A. Czyżewski
- International Journal of Computing Science and Mathematics - Rok 2010
The multimodal human-computer interface (HCI) called LipMouse is presented, allowing a user to work on a computer using movements and gestures made with his/her mouth only. Algorithms for lip movement tracking and lip gesture recognition are presented in details. User face images are captured with a standard webcam. Face detection is based on a cascade of boosted classifiers using Haar-like features. A mouth region is located in...

Pełny tekst do pobrania w serwisie zewnętrznym
Systematic Literature Review for Emotion Recognition from EEG Signals
Publikacja
- P. A. Leszczełowska
- N. Dawidowska
- Rok 2022
Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Pełny tekst do pobrania w serwisie zewnętrznym
Systematic Literature Review for Emotion Recognition from EEG Signals
Publikacja
- P. A. Leszczełowska
- N. Dawidowska
- Rok 2022
Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Pełny tekst do pobrania w portalu
Local Texture Pattern Selection for Efficient Face Recognition and Tracking
Publikacja
- M. Smiatacz
- J. Rumiński
- Advances in Intelligent Systems and Computing - Rok 2015
This paper describes the research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...

Pełny tekst do pobrania w serwisie zewnętrznym
Resolving conflicts in object tracking for automatic detection of events in video
Publikacja
- Rok 2010
W referacie przedstawiono algorytm rozwiązywania konfliktów w śledzeniu obiektów ruchomych. Proponowana metoda wykorzystuje predykcję stanu obiektu obliczaną przez filtry Kalmana oraz dopasowuje wykryte obiekty do struktur śledzących ich ruch na podstawie deskryptorów koloru i tekstury. Omówiono specyficzne sytuacje powodujące konflikty, takie jak rozdzielanie obiektów. Przedstawiono wyniki testów. Algorytm może być zastosowany...
Automatic Clustering of EEG-Based Data Associated with Brain Activity
Publikacja
- Rok 2018
The aim of this paper is to present a system for automatic assigning electroencephalographic (EEG) signals to appropriate classes associated with brain activity. The EEG signals are acquired from a headset consisting of 14 electrodes placed on skull. Data gathered are first processed by the Independent Component Analysis algorithm to obtain estimates of signals generated by primary sources reflecting the activity of the brain....

Pełny tekst do pobrania w serwisie zewnętrznym
Crowdsourcing-Based Evaluation of Automatic References Between WordNet and Wikipedia
Publikacja
- J. Szymański
- T. M. Boiński
- INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING - Rok 2019
The paper presents an approach to build references (also called mappings) between WordNet and Wikipedia. We propose four algorithms used for automatic construction of the references. Then, based on an aggregation algorithm, we produce an initial set of mappings that has been evaluated in a cooperative way. For that purpose, we implement a system for the distribution of evaluation tasks, that have been solved by the user community....

Pełny tekst do pobrania w portalu
Feasibility Study for Food Intake Tasks Recognition Based on Smart Glasses
Publikacja
- M. Biallas
- A. Andrushevich
- R. Kistler
- A. Klapproth
- K. Czuszyński
- A. Bujnowski
- Journal of Medical Imaging and Health Informatics - Rok 2015
In this exploratory study 13 adult test subjects have performed different food intake tasks while wearing a three axis accelerometer mounted at a temple of glasses. Two different algorithms for task recognition have been applied and compared. The retrospective data processing leads to better task recognition results when the frequency range of 50 Hz to 100 Hz is analysed within accelerometer signal recordings. A straightforward...

Pełny tekst do pobrania w serwisie zewnętrznym
Fuzzy rule-based dynamic gesture recognition employing camera & multimedia projector
Publikacja
- M. Lech
- B. Kostek
- Rok 2010
In the paper the system based on camera and multimedia projector enabling a user to control computer applications by dynamic hand gestures is presented. The main objective is to present the gesture recognition methodology which bases on representing hand movement trajectory by motion vectors analyzed using fuzzy rule-based inference. The approach was engineered in the system developed with J2SE and C++ / OpenCV technology. OpenCV...

Pełny tekst do pobrania w serwisie zewnętrznym
Specification-Oriented Automatic Design of Topologically Agnostic Antenna Structure
Publikacja
- A. Bekasiewicz
- M. Dzwonkowski
- T. Dhaene
- I. Couckuyt
- Rok 2024
Design of antennas for modern applications is a challenging task that combines cognition-driven development of topology intertwined with tuning of its parameters using rigorous numerical optimization. However, the process can be streamlined by neglecting the engineering insight in favor of automatic de-termination of structure geometry. In this work, a specification-oriented design of topologically agnostic antenna is considered....

Pełny tekst do pobrania w serwisie zewnętrznym
Automatic Marking of Allophone Boundaries in Isolated English spoken Words
Publikacja
- J. Rafałko
- A. Czyżewski
- Rok 2020
The work presents a method that allows delimiting the borders of allophones in isolated English words. The described method is based on the DTW algorithm combining two signals, a reference signal and an analyzed one. As the reference signal, recordings from the MODALITY database were used, from which the words were extracted. This database was also used for tests, which were described. Test results show that the automatic determination...

Pełny tekst do pobrania w portalu
Vapor correction of FTIR spectra – A simple automatic least squares approach
Publikacja
- P. Bruździak
- SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY - Rok 2019
FTIR spectroscopy is one of the best techniques to study intermolecular interactions. However, such an application requires high quality spectra with as little noise as possible, which are often difficult to obtain. One of the main sources of unwanted interference is water vapor. Here a robust method is proposed for automatic, fast and reliable vapor correction of FTIR spectra. The presented least squares approach of vapor subtraction...

Pełny tekst do pobrania w portalu
Automatic detection and correction of detuned singing system for use with query-by-humming applications
Publikacja
- M. Lech
- Rok 2008
The aim of the paper is to present an idea of using the automatic detection and correction of detuned singing as a subsystem in query-by-humming (QBH) applications. The common approach to searching for a requested song basing on the melody retrieved from hummed pattern usually employs the so-called Parsons code or melody contour. In such a case information about sound pitch is discarded. It was thought out that an additional module...
Integration of speech enhancement and coding techniques
Publikacja
- M. Kuropatwinski
- D. Leckschat
- K. Kroschel
- A. Czyzewski
- M. Kuropatwiński
- Rok 1999
Pełny tekst do pobrania w serwisie zewnętrznym
Novel approaches to wideband speech coding
Publikacja
- M. Kulesza
- A. Czyżewski
- Rok 2008
Dwie metoda kodowania szerokopasmowego mowy zostały zaprezentowane. W pierwszej metodzie wykorzystano algorytm kompresji i ekspansji czasowej sygnału mowy, pozwalający na kodowanie szerokopasmowe sygnału mowy z wykorzystaniem ustandaryzowanych kodeków. Metoda ta jest przewidziana do zastosowania w adaptacyjnych algorytmach kodowania mowy. Drugie z proponowanych rozwiazan dotyczy nowej metody estymacji obwiedni widma sygnalu mowy...

Pełny tekst do pobrania w serwisie zewnętrznym
Broadband interference in speech reinforcement systems
Publikacja
- H. Lasota
- R. Mazurek
- Rok 2008
Artykuł podejmuje niedoceniany problem wpływu liczby i rozkładu głośników w systemach nagłośnienia, na jakość przekazu głosowego, czyli na zrozumiałość mowy w audytoriach. Superpozycji przesuniętych w czasie szerokopasmowych sygnałów o tym samym kształcie i lekko różnych wielkościach, które docierają do słuchacza z licznych spójnych źródeł, towarzyszy zjawisko interferencji prowadzące do głębokiej modyfikacji odbieranych sygnałów...
Multitask Noisy Speech Enhancement System
Publikacja
- A. Czyżewski
- J. Kotus
- G. Szwoch
- M. Dziubiński
- A. Rypulak
- A. Pawlik
- Rok 2005
W referacie opisano Wielozadaniowy System Poprawy Jakości Sygnału Mowy. Jest to wyspecjalizowany pakiet oprogramowania przeznaczony do rejestrowania sygnału mowy i do poprawy jego jakości oraz zrozumiałości mowy, przy użyciu zaawansowanych procedur cyfrowego przetwarzania sygnału. Pakiet oprogramowania składa się z programów: Rejestrator, Przeglądarka oraz Rekonstruktor. Oprogramowanie to może być użyte w przypadkach, gdy zrozumiałość...
A system for multitask noisy speech enhancement.
Publikacja
- A. Czyżewski
- A. Kaczmarek
- J. Kotus
- A. Pawlik
- A. Rypulak
- P. Żwan
- Rok 2004
W artykule przedstawiono ogolną charakterystyke opracowanego systemu rejestracji i rekonstrukcji mowy. Artykuł zawiera opis składników systemu, ktory jest oprogramowaniem zawierającym zaawansowane narzędzia służące poprawie zrozumiałości mowy. Zaimplementowane narzędzia systemu umożliwiają wyszukiwanie nagrań dźwiękowych i ich obróbkę przy pomocy zaimplementowanych pluginów. W artykule przedstawione wykorzystane w systemie algorytmy...
Comparison of the effectiveness of automatic EEG signal class separation algorithms
Publikacja
- JOURNAL OF INTELLIGENT & FUZZY SYSTEMS - Rok 2019
In this paper, an algorithm for automatic brain activity class identification of EEG (electroencephalographic) signals is presented. EEG signals are gathered from seventeen subjects performing one of the three tasks: resting, watching a music video and playing a simple logic game. The methodology applied consists of several steps, namely: signal acquisition, signal processing utilizing z-score normalization, parametrization and...

Pełny tekst do pobrania w portalu
Dangerous sound event recognition using Support Vector Machine classifiers
Publikacja
- Rok 2010
A method of recognizing events connected to danger based on their acoustic representation through Support Vector Machine classification is presented. The method proposed is particularly useful in an automatic surveillance system. The set of 28 parameters used in the classifier consists of dedicated parameters and MPEG-7 features. Methods for parameter calculation are presented, as well as a design of SVM model used for classification....
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
Publikacja
- P. Falkowski-Gilski
- Rok 2021
The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...

Pełny tekst do pobrania w serwisie zewnętrznym
Contextual Knowledge to Enhance Workplace Hazard Recognition and Interpretation in a Cognitive Vision Platform
Publikacja
- C. De
- C. Sanin
- E. Szczerbicki
- Rok 2018
The combination of vision and sensor data together with the resulting necessity for formal representations builds a central component of an autonomous Cyber Physical System for detection and tracking of laborers in workplaces environments. This system must be adaptable and perceive the environment as automatically as possible, performing in a variety of plants and scenes without the necessity of recoding the application for each...

Pełny tekst do pobrania w portalu
Automatic detection and correction of detuned singing system for use with query-by-humming applications
Publikacja
- M. Lech
- Archives of Acoustics - Rok 2008
The aim of the paper is to present an idea of using the automatic detection and correction of detuned singing as a subsystem in query-by-humming (QBH) applications. The common approach to searching for a requested song basing on the melody retrieved from hummed pattern usually employs the so-called Parsons code or melody contour. In such a case information about sound pitch is discarded. It was thought out that an additional module...

Pełny tekst do pobrania w serwisie zewnętrznym
Automatic Tracking with PTZ Cameras
Publikacja
- P. Dalka
- G. Szwoch
- Rok 2009
Automatic tagging of musical files
Publikacja
- B. Kostek
- A. Sitek
- Rok 2011
Celem niniejszej pracy jest zbadanie możliwości automatycznego tagowania utworów muzycznych z wykorzystaniem systemu śledzenia punktu fiksacji wzroku użytkownika. Badania przeprowadzono z udziałem dwudziestu osób o różnym doświadczeniu muzycznym. Zadaniem badanej osoby było wskazanie odpowiedzi na pytania zawarte w ankiecie internetowej, która pozwala na określenie cech utworów muzycznych, takich jak: tempo, dynamika, gatunek....
Automatic classification and mapping of the seabed using airborne LiDAR bathymetry
Publikacja
- Ł. Janowski
- P. Tysiąc
- R. Wróblewski
- M. Rucińska
- A. Kubowicz- Grajewska
- ENGINEERING GEOLOGY - Rok 2022
Shallow coastal areas are among the most inhabited areas and are valuable for biodiversity, recreation and the economy. Due to climate change and sea level rise, sustainable management of coastal areas involves extensive exploration, monitoring, and protection. Current high-resolution remote sensing methods for monitoring these areas include bathymetric LiDAR. Therefore, this study presents a novel methodological approach to assess...

Pełny tekst do pobrania w portalu
An automatic system for identification of random telegraph signal (RTS) noise in noise signals
Publikacja
- Metrology and Measurement Systems - Rok 2007
In the paper the automatic and universal system for identification of Random Telegraph Signal (RTS) noise as a non-Gaussian component of the inherent noise signal of semiconductor devices is presented. The system for data acquisition and processing is described. Histograms of the instantaneous values of the noise signals are calculated as the basis for analysis of the noise signal to determine the number of local maxima of histograms...

Pełny tekst do pobrania w portalu
Theory of recognition in a historical perspective. Axel Honneth's Anerkennung: Eine europäische Ideengeschichte
Publikacja
- A. Karalus
- Archiwum Historii Filozofii i Myśli Społecznej - Rok 2019
The article discusses Honneth excursion into the realm of the history of ideas. This time Honneth decides to laser it on the notion of "recognition" in three different cultural areas and three different traditions: French, English, and German. The article discusses Honneth's persepctive and attempts at finding the common thread that would link three aforementioned traditions.

Pełny tekst do pobrania w portalu
Automatic audio signal mixing system based on one-dimensional Wave-U-Net autoencoders
Publikacja
- D. Koszewski
- Rok 2023
The purpose of this dissertation is to develop an automatic song mixing system that is capable of automatically mixing a song with good quality in any music genre. This work recalls first the audio signal processing methods used in audio mixing, and it describes selected methods for automatic audio mixing. Then, a novel architecture built based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. Models...

Pełny tekst do pobrania w portalu
JOURNAL OF MOLECULAR RECOGNITION

Czasopisma

ISSN: 0952-3499 , eISSN: 1099-1352
A review of emotion recognition methods based on keystroke dynamics and mouse movements
Publikacja
- A. Kołakowska
- Rok 2013
The paper describes the approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover the advantages and examples of combining standard...

Pełny tekst do pobrania w serwisie zewnętrznym
Investigation of the road noise source employing an automatic noise monitoring station
Publikacja
- Archives of Acoustics - Rok 2008
The paper presents a pilot investigation of noise source models in two selected localizations in the context of future dynamic noise map creation. The experiments were carried out using the automatic noise monitoring station engineered at the Multimedia Systems Departmentof the Gda´nsk University of Technology. The results of the noise measurements employing monitoring stations and its comparison to the reference values are depicted....

Pełny tekst do pobrania w portalu
Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions
Publikacja
- M. Wang
- T. Sirlapu
- A. Kwaśniewska
- M. Szankin
- M. Bartscherer
- R. Nicolas
- Rok 2018
With the technology advancements in smart home sector, voice control and automation are key components that can make a real difference in people's lives. The voice recognition technology market continues to involve rapidly as almost all smart home devices are providing speaker recognition capability today. However, most of them provide cloud-based solutions or use very deep Neural Networks for speaker recognition task, which are...

Pełny tekst do pobrania w serwisie zewnętrznym
COMPUTER SPEECH AND LANGUAGE

Czasopisma

ISSN: 0885-2308 , eISSN: 1095-8363
SEMINARS IN SPEECH AND LANGUAGE

Czasopisma

ISSN: 0734-0478 , eISSN: 1098-9056
Speech and Language Technology

Czasopisma

ISSN: 1895-0434
Speech Language and Hearing

Czasopisma

ISSN: 1361-3286 , eISSN: 2050-5728
Quarterly Journal of Speech

Czasopisma

ISSN: 0033-5630 , eISSN: 1479-5779
SpringerBriefs in Speech Technology

Czasopisma

ISSN: 2191-737X , eISSN: 2191-7388
Audiology and Speech Research

Czasopisma

ISSN: 2635-5019 , eISSN: 2635-5027
Voice and Speech Review

Czasopisma

ISSN: 2326-8263 , eISSN: 2326-8271
Mathematical Models of Control Systems of Angular Speed of Steam Turbines for Diagnostic Tests of Automatic and Mechatronic Devices
Publikacja
- G. Redlarski
- J. Piechocki
- M. A. Dąbkowski
- Solid State Phenomena - Rok 2013
Accurate modeling of physical processes of many automatics and mechatronics systems is often necessity. In power system such a process is control of angular velocity of power objects during connection to operation in parallel. This process is extremely dynamic. For this reason response of control system depends from changes of many physical parameters (temperature, pressure and flow of the medium, etc.). Precision modeling influences...

Pełny tekst do pobrania w serwisie zewnętrznym

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: automatic speech recognition