Wyniki wyszukiwania dla: speech recognition systems

Wyniki wyszukiwania dla: speech recognition systems

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 7136

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

Acceleration of decision making in sound event recognition employing supercomputing cluster
Publikacja
- K. Łopatka
- A. Czyżewski
- INFORMATION SCIENCES - Rok 2014
Parallel processing of audio data streams is introduced to shorten the decision making time in hazardous sound event recognition. A supercomputing cluster environment with a framework dedicated to processing multimedia data streams in real time is used. The sound event recognition algorithms employed are based on detecting foreground events, calculating their features in short time frames, and classifying the events with Support...

Pełny tekst do pobrania w serwisie zewnętrznym
Gesture Recognition With the Linear Optical Sensor and Recurrent Neural Networks
Publikacja
- IEEE SENSORS JOURNAL - Rok 2018
In this paper, the optical linear sensor, a representative of low-resolution sensors, was investigated in the multiclass recognition of near-field hand gestures. The recurrent neural network (RNN) with a gated recurrent unit (GRU) memory cell was utilized as a gestures classifier. A set of 27 gestures was collected from a group of volunteers. The 27 000 sequences obtained were divided into training, validation, and test subsets....

Pełny tekst do pobrania w portalu
Digits Recognition with Quadrant Photodiode and Convolutional Neural Network
Publikacja
- J. Kamil
- K. Czuszyński
- J. Rumiński
- Rok 2018
In this paper we have investigated the capabilities of a quadrant photodiode based gesture sensor in the recognition of digits drawn in the air. The sensor consisting of 4 active elements, 4 LEDs and a pinhole was considered as input interface for both discrete and continuous gestures. Index finger and a round pointer were used as navigating mediums for the sensor. Experiments performed with 5 volunteers...

Pełny tekst do pobrania w serwisie zewnętrznym
Balance recognition on the basis of EEG measurement.
Publikacja
- Annals of Computer Science and Information Systems - Rok 2016
Although electroencephalography (EEG) is not typically used for verifying the sense of balance, it can be used for analysing cortical signals responsible for this phenomenon. Simple balance tasks can be proposed as a good indicator of whether the sense of balance is acting more or less actively. This article presents preliminary results for the potential of using EEG to balance sensing....

Pełny tekst do pobrania w portalu
From Knowledge based Vision Systems to Cognitive Vision Systems: A Review
Publikacja
- T. Souza
- C. De
- C. Sanin
- E. Szczerbicki
- Rok 2018
Computer vision research and applications have their origins in 1960s. Limitations in computational resources inherent of that time, among other reasons, caused research to move away from artificial intelligence and generic recognition goals to accomplish simple tasks for constrained scenarios. In the past decades, the development in machine learning techniques has contributed to noteworthy progress in vision systems. However,...

Pełny tekst do pobrania w portalu
Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine
Publikacja
- P. Falkowski-Gilski
- G. Debita
- Archives of Acoustics - Rok 2023
In order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...

Pełny tekst do pobrania w portalu
Karolina Zielińska-Dąbkowska dr inż. arch.

Osoby

Katedra Architektury Miejskiej i Przestrzeni Nadwodnych

Karolina M. Zielinska-Dabkowska (dr inż. arch., Dipl.-Ing. Arch.[FH]) jest adiunktem na Wydziale Architektury Politechniki Gdańskiej. W roku 2002 ukończyła studia na Wydziale Architektury i Urbanistyki Politechniki Gdańskiej a w 2004 inżynierii architektonicznej na HAWK Hochschule für angewandte Wissenschaft und Kunst Hildesheim w Niemczech. Po studiach pracowała dla kilku firm o światowej renomie w Berlinie, Londynie, Nowym Jorku...
A Novel IoT-Perceptive Human Activity Recognition (HAR) Approach Using Multi-Head Convolutional Attention
Publikacja
- H. Zhang
- Z. Xiao
- J. Wang
- F. Li
- E. Szczerbicki
- IEEE Internet of Things Journal - Rok 2019
Together with fast advancement of the Internet of Things (IoT), smart healthcare applications and systems are equipped with increasingly more wearable sensors and mobile devices. These sensors are used not only to collect data, but also, and more importantly, to assist in daily activity tracking and analyzing of their users. Various human activity recognition (HAR) approaches are used to enhance such tracking. Most of the existing...

Pełny tekst do pobrania w portalu
System for automatic singing voice recognition
Publikacja
- P. Żwan
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Rok 2008
W artykule przedstawiono system automatycznego rozpoznawania jakości i typu głosu śpiewaczego. Przedstawiono bazę danych oraz zaimplementowane parametry. Algorytmem decyzyjnym jest algorytm sztucznych sieci neuronowych. Wytrenowany system decyzyjny osiąga skuteczność ok. 90% w obydwu kategoriach rozpoznawania. Dodatkowo wykazano przy pomocy metod statystycznych, że wyniki działania systemu automatycznej oceny jakości technicznej...
Tomasz Zubowicz dr inż.

Osoby

Katedra Inteligentnych Systemów Sterowania i Wspomagania Decyzji

Tomasz Zubowicz has received his M.Sc. Eng. degree in Control Engineering from the Faculty of Electrical and Control Engineering at the Gda{\'n}sk University of Technology (GUT) in $2008$. He received his Ph.D. Eng. (Hons.) in the field of Control Engineering from the same faculty in $2019$. In $2012$ he became a permanent staff member at the Department of Intelligent Control and Decision Support Systems at GUT and a member of...
Rafał Leszczyna dr hab. inż.

Osoby

Katedra Informatyki w Zarządzaniu

Dr hab. inż. Rafał Leszczyna jest profesorem uczelni na Wydziale Zarządzania i Ekonomii Politechniki Gdańskiej. W lipcu 2020 r., na podstawie osiągnięcia naukowego w obszarze zarządzania cyberbezpieczeństwem infrastruktur krytycznych w sektorze elektroenergetycznym, uzyskał stopień doktora habilitowanego w dziedzinie nauk inżynieryjno-technicznych, dyscyplina informatyka techniczna i telekomunikacja. W latach 2004–2008 pracował...
Influence of accelerometer signal pre-processing and classification method on human activity recognition
Publikacja
- Elektronika : konstrukcje, technologie, zastosowania - Rok 2010
A study of data pre-processing influence on accelerometer-based human activity recognition algorithms is presented. The frequency band used to filter-out the accelerometer signals and the number of accelerometers involved were considered in terms of their influence on the recognition accuracy. In the test four methods of classification were used: support vector machine, decision trees, neural network, k-nearest neighbor.

Pełny tekst do pobrania w serwisie zewnętrznym
Pose classification in the gesture recognition using the linear optical sensor
Publikacja
- Rok 2017
Gesture sensors for mobile devices, which have a capability of distinguishing hand poses, require efficient and accurate classifiers in order to recognize gestures based on the sequences of primitives. Two methods of poses recognition for the optical linear sensor were proposed and validated. The Gaussian distribution fitting and Artificial Neural Network based methods represent two kinds of classification approaches. Three types...

Pełny tekst do pobrania w serwisie zewnętrznym
On practical application of Shannon theory to character recognition and more
Publikacja
- M. Jurkiewicz
- Rok 2014
Let us consider an optical character recognition system, which in particular can be used for identifying objects that were assigned strings of some length. The system is not perfect, for example, it sometimes recognizes wrongly the characters "Y" and "V". What is the largest set of strings of given length for the system under consideration, which can be mutually correctly recognized, and the corresponding objects correctly identified?...
Acquisition and indexing of RGB-D recordings for facial expressions and emotion recognition
Publikacja
- M. Szwoch
- Studia Informatica Pomerania - Rok 2015
In this paper KinectRecorder comprehensive tool is described which provides for convenient and fast acquisition, indexing and storing of RGB-D video streams from Microsoft Kinect sensor. The application is especially useful as a supporting tool for creation of fully indexed databases of facial expressions and emotions that can be further used for learning and testing of emotion recognition algorithms for affect-aware applications....

Pełny tekst do pobrania w serwisie zewnętrznym
Molecular Recognition in Complexes of TRF Proteins with Telomeric DNA
Publikacja
- M. Wieczór
- A. Tobiszewski
- P. Wityk
- B. Tomiczek
- J. Czub
- PLOS ONE - Rok 2014
Telomeres are specialized nucleoprotein assemblies that protect the ends of linear chromosomes. In humans and many other species, telomeres consist of tandem TTAGGG repeats bound by a protein complex known as shelterin that remodels telomeric DNA into a protective loop structure and regulates telomere homeostasis. Shelterin recognizes telomeric repeats through its two major components known as Telomere Repeat-Binding Factors, TRF1...

Pełny tekst do pobrania w portalu
Parameters optimization in medicine supporting image recognition algorithms
Publikacja
- A. Brzeski
- Rok 2011
In this paper, a procedure of automatic set up of image recognition algorithms' parameters is proposed, for the purpose of reducing the time needed for algorithms' development. The procedure is presented on two medicine supporting algorithms, performing bleeding detection in endoscopic images. Since the algorithms contain multiple parameters which must be specified, empirical testing is usually required to optimise the algorithm's...
Accelerometer-based Human Activity Recognition and the Impact of the Sample Size
Publikacja
- Rok 2014
The presented study focused on the recognition of eight user activities (e.g. walking, lying, climbing stairs) basing on the measurements from an accelerometer embedded in a mobile device. It is assumed that the device is carried in a specific location of the user’s clothing. Three types of classifiers were tested on different sizes of the samples. The influence of the time window (the duration of a single trial) on selected activities...

Pełny tekst do pobrania w serwisie zewnętrznym
Comparison of selected off-the-shelf solutions for emotion recognition based on facial expressions
Publikacja
- Rok 2016
The paper concerns accuracy of emotion recognition from facial expressions. As there are a couple of ready off-the-shelf solutions available in the market today, this study aims at practical evaluation of selected solutions in order to provide some insight into what potential buyers might expect. Two solutions were compared: FaceReader by Noldus and Xpress Engine by QuantumLab. The performed evaluation revealed that the recognition...

Pełny tekst do pobrania w serwisie zewnętrznym
Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition
Publikacja
- Rok 2019
Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

Pełny tekst do pobrania w portalu
Edyta Gołąb-Andrzejak dr hab.

Osoby

Katedra Marketingu
Magdalena Gajewska prof. dr hab. inż.

Osoby

Katedra Technologii w Inżynierii Środowiska

Magdalena Gajewska (ur. 1.06.1968 r. w Gdańsku) ukończyła studia w 1993 roku na Wydziale Hydrotechniki Politechniki Gdańskiej. Jest adiunktem w Katedrze Technologii Wody i Ścieków na Wydziale Inżynierii Lądowej i Środowiska Politechniki Gdańskiej. Doktorat (2001) i habilitacja (2013) w dyscyplinie inżynierii środowiska. W kadencji 2016–2020 pełni funkcję prodziekana ds. nauki. Specjalizuję się w technologiach związanych z ekoinżynierią:...
Adam Dąbrowski dr inż.

Osoby

Adam Dąbrowski uzyskał stopień doktora nauk inżynieryjno-technicznych w dyscyplinie inżynieria mechaniczna na Politechnice Gdańskiej oraz ukończył studia II stopnia na kierunku mechatronika na Technische Universität Hamburg a także double degree Engineering and Management of Space Systems na Hochschule Bremen. Posiada doświadczenie przemysłowe (Instytut Lotnictwa w Warszawie, SICK AG w Hamburgu, Blue Dot Solutions w Gdańsku, Niemieckie...
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
Publikacja
- D. Piotrowski
- R. Korzeniowski
- A. Falai
- S. Cygert
- K. Pokora
- G. Tinchev
- Z. Zhang
- K. Yanagisawa
- Rok 2023
In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...

Pełny tekst do pobrania w serwisie zewnętrznym
Human-Computer Interface Based on Visual Lip Movement and Gesture Recognition
Publikacja
- P. Dalka
- A. Czyżewski
- International Journal of Computing Science and Mathematics - Rok 2010
The multimodal human-computer interface (HCI) called LipMouse is presented, allowing a user to work on a computer using movements and gestures made with his/her mouth only. Algorithms for lip movement tracking and lip gesture recognition are presented in details. User face images are captured with a standard webcam. Face detection is based on a cascade of boosted classifiers using Haar-like features. A mouth region is located in...

Pełny tekst do pobrania w serwisie zewnętrznym
Systematic Literature Review for Emotion Recognition from EEG Signals
Publikacja
- P. A. Leszczełowska
- N. Dawidowska
- Rok 2022
Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Pełny tekst do pobrania w portalu
Systematic Literature Review for Emotion Recognition from EEG Signals
Publikacja
- P. A. Leszczełowska
- N. Dawidowska
- Rok 2022
Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

Pełny tekst do pobrania w serwisie zewnętrznym
Automatic recognition of therapy progress among children with autism
Publikacja
- A. Kołakowska
- A. Landowska
- A. Anzulewicz
- K. Sobota
- Scientific Reports - Rok 2017
The article presents a research study on recognizing therapy progress among children with autism spectrum disorder. The progress is recognized on the basis of behavioural data gathered via five specially designed tablet games. Over 180 distinct parameters are calculated on the basis of raw data delivered via the game flow and tablet sensors - i.e. touch screen, accelerometer and gyroscope. The results obtained confirm the possibility...

Pełny tekst do pobrania w portalu
Feasibility Study for Food Intake Tasks Recognition Based on Smart Glasses
Publikacja
- M. Biallas
- A. Andrushevich
- R. Kistler
- A. Klapproth
- K. Czuszyński
- A. Bujnowski
- Journal of Medical Imaging and Health Informatics - Rok 2015
In this exploratory study 13 adult test subjects have performed different food intake tasks while wearing a three axis accelerometer mounted at a temple of glasses. Two different algorithms for task recognition have been applied and compared. The retrospective data processing leads to better task recognition results when the frequency range of 50 Hz to 100 Hz is analysed within accelerometer signal recordings. A straightforward...

Pełny tekst do pobrania w serwisie zewnętrznym
SYSTEMS SCIENCE

Czasopisma

ISSN: 0137-1223
Fuzzy rule-based dynamic gesture recognition employing camera & multimedia projector
Publikacja
- M. Lech
- B. Kostek
- Rok 2010
In the paper the system based on camera and multimedia projector enabling a user to control computer applications by dynamic hand gestures is presented. The main objective is to present the gesture recognition methodology which bases on representing hand movement trajectory by motion vectors analyzed using fuzzy rule-based inference. The approach was engineered in the system developed with J2SE and C++ / OpenCV technology. OpenCV...

Pełny tekst do pobrania w serwisie zewnętrznym
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
Publikacja
- Rok 2018
This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...

Pełny tekst do pobrania w serwisie zewnętrznym
Integration of speech enhancement and coding techniques
Publikacja
- M. Kuropatwinski
- D. Leckschat
- K. Kroschel
- A. Czyzewski
- M. Kuropatwiński
- Rok 1999
Pełny tekst do pobrania w serwisie zewnętrznym
Novel approaches to wideband speech coding
Publikacja
- M. Kulesza
- A. Czyżewski
- Rok 2008
Dwie metoda kodowania szerokopasmowego mowy zostały zaprezentowane. W pierwszej metodzie wykorzystano algorytm kompresji i ekspansji czasowej sygnału mowy, pozwalający na kodowanie szerokopasmowe sygnału mowy z wykorzystaniem ustandaryzowanych kodeków. Metoda ta jest przewidziana do zastosowania w adaptacyjnych algorytmach kodowania mowy. Drugie z proponowanych rozwiazan dotyczy nowej metody estymacji obwiedni widma sygnalu mowy...

Pełny tekst do pobrania w serwisie zewnętrznym
A system for multitask noisy speech enhancement.
Publikacja
- A. Czyżewski
- A. Kaczmarek
- J. Kotus
- A. Pawlik
- A. Rypulak
- P. Żwan
- Rok 2004
W artykule przedstawiono ogolną charakterystyke opracowanego systemu rejestracji i rekonstrukcji mowy. Artykuł zawiera opis składników systemu, ktory jest oprogramowaniem zawierającym zaawansowane narzędzia służące poprawie zrozumiałości mowy. Zaimplementowane narzędzia systemu umożliwiają wyszukiwanie nagrań dźwiękowych i ich obróbkę przy pomocy zaimplementowanych pluginów. W artykule przedstawione wykorzystane w systemie algorytmy...
Multitask Noisy Speech Enhancement System
Publikacja
- A. Czyżewski
- J. Kotus
- G. Szwoch
- M. Dziubiński
- A. Rypulak
- A. Pawlik
- Rok 2005
W referacie opisano Wielozadaniowy System Poprawy Jakości Sygnału Mowy. Jest to wyspecjalizowany pakiet oprogramowania przeznaczony do rejestrowania sygnału mowy i do poprawy jego jakości oraz zrozumiałości mowy, przy użyciu zaawansowanych procedur cyfrowego przetwarzania sygnału. Pakiet oprogramowania składa się z programów: Rejestrator, Przeglądarka oraz Rekonstruktor. Oprogramowanie to może być użyte w przypadkach, gdy zrozumiałość...
Multimedia i Interfejsy 2022
Kursy Online
- J. Daciuk
- W. Szwoch
- M. Szwoch
{mlang pl} Celem kursu jest zapoznanie studentów z: rodzajami danych multimedialnych oraz metodami ich pozyskiwania formatami i standardami danych multimedialnych metodami kompresji danych multimedialnych podstawami przetwarzania danych multimedialnych oraz ich rozpoznawania programowaniem aplikacji multimedialnych, w tym gier wideo rodzajami interfejsów użytkownika w systemach komputerowych metodami opisu oraz zasadami...
Multimedia i Interfejsy 2023
Kursy Online
- J. Daciuk
- W. Szwoch
- M. Szwoch
{mlang pl} Celem kursu jest zapoznanie studentów z: rodzajami danych multimedialnych oraz metodami ich pozyskiwania formatami i standardami danych multimedialnych metodami kompresji danych multimedialnych podstawami przetwarzania danych multimedialnych oraz ich rozpoznawania programowaniem aplikacji multimedialnych, w tym gier wideo rodzajami interfejsów użytkownika w systemach komputerowych metodami opisu oraz zasadami...
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
Publikacja
- P. Falkowski-Gilski
- Rok 2021
The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...

Pełny tekst do pobrania w serwisie zewnętrznym
Piotr Rajchowski dr inż.

Osoby

Katedra Systemów i Sieci Radiokomunikacyjnych

Piotr Rajchowski (Member, IEEE) was born in Poland, in 1989. He received the E.Eng., M.Sc., and Ph.D. degrees in radio communication from the Gdańsk University of Technology (Gdańsk Tech), Poland, in 2012, 2013, and 2017, respectively. Since 2013, he has been working at the Department of Radiocommunication Systems and Networks, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, as a IT...
JOURNAL OF MOLECULAR RECOGNITION

Czasopisma

ISSN: 0952-3499 , eISSN: 1099-1352
Theory of recognition in a historical perspective. Axel Honneth's Anerkennung: Eine europäische Ideengeschichte
Publikacja
- A. Karalus
- Archiwum Historii Filozofii i Myśli Społecznej - Rok 2019
The article discusses Honneth excursion into the realm of the history of ideas. This time Honneth decides to laser it on the notion of "recognition" in three different cultural areas and three different traditions: French, English, and German. The article discusses Honneth's persepctive and attempts at finding the common thread that would link three aforementioned traditions.

Pełny tekst do pobrania w portalu
COMPUTER SPEECH AND LANGUAGE

Czasopisma

ISSN: 0885-2308 , eISSN: 1095-8363
SEMINARS IN SPEECH AND LANGUAGE

Czasopisma

ISSN: 0734-0478 , eISSN: 1098-9056
Speech and Language Technology

Czasopisma

ISSN: 1895-0434
Speech Language and Hearing

Czasopisma

ISSN: 1361-3286 , eISSN: 2050-5728
Quarterly Journal of Speech

Czasopisma

ISSN: 0033-5630 , eISSN: 1479-5779
SpringerBriefs in Speech Technology

Czasopisma

ISSN: 2191-737X , eISSN: 2191-7388
Audiology and Speech Research

Czasopisma

ISSN: 2635-5019 , eISSN: 2635-5027
Voice and Speech Review

Czasopisma

ISSN: 2326-8263 , eISSN: 2326-8271

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: speech recognition systems

Karolina Zielińska-Dąbkowska dr inż. arch.

Tomasz Zubowicz dr inż.

Rafał Leszczyna dr hab. inż.

Edyta Gołąb-Andrzejak dr hab.

Magdalena Gajewska prof. dr hab. inż.

Adam Dąbrowski dr inż.

Piotr Rajchowski dr inż.