Search results for: GESTURE%20RECOGNITION

Search results for: GESTURE%20RECOGNITION

results on page:
embed this view on your website

Displayed results came from alternative search method.

Filters

total: 1742

clear all filters disabled

displaying 1000 best results Help

ISCA Tutorial and Research Workshop Automatic Speech Recognition

Conferences
International Conference on Advances in Pattern Recognition and Digital Techniques

Conferences
The Hough transform in the classification process of inland ships
Publication
- K. Bobkowska
- N. Wawrzyniak
- Zeszyty Naukowe Akademii Morskiej w Szczecinie - Year 2019
This article presents an analysis of the possibilities of using image processing methods for feature extraction that allows kNN classification based on a ship’s image delivered from an on-water video surveillance system. The subject of the analysis is the Hough transform which enables the detection of straight lines in an image. The recognized straight lines and the information about them serve as features in the classification...

Full text available to download
Fully Automated AI-powered Contactless Cough Detection based on Pixel Value Dynamics Occurring within Facial Regions
Publication
- M. Szankin
- A. Kwaśniewska
- N. Kowalczyk
- J. Rumiński
- R. Nicolas
- D. Gamba
- Year 2021
Increased interest in non-contact evaluation of the health state has led to higher expectations for delivering automated and reliable solutions that can be conveniently used during daily activities. Although some solutions for cough detection exist, they suffer from a series of limitations. Some of them rely on gesture or body pose recognition, which might not be possible in cases of occlusions, closer camera distances or impediments...

Full text to download in external service
Pracujący w czasie rzeczywistym system detekcji gazów wykorzystujący przenośny komputer Raspberry PI oraz matrycę półprzewodnikowych czujników gazu
Publication
- Elektronika : konstrukcje, technologie, zastosowania - Year 2014
The gas-analyzing systems based on the array of partially selective gas sensors and pattern-recognition techniques are potentially fast and lowcost alternative for other devices, like gas‑analysers. They give the possibility of recognition the type and the concentration of measured volatile compounds in their working environment. In this work we present the implementation of gas recognition system, in which the signals from an...

Full text to download in external service
Intracranial hemorrhage detection in 3D computed tomography images using a bi-directional long short-term memory network-based modified genetic algorithm
Publication
- J. Sengupta
- R. Alzbutas
- P. Falkowski-Gilski
- B. Falkowska-Gilska
- Frontiers in Neuroscience - Year 2023
Introduction: Intracranial hemorrhage detection in 3D Computed Tomography (CT) brain images has gained more attention in the research community. The major issue to deal with the 3D CT brain images is scarce and hard to obtain the labelled data with better recognition results. Methods: To overcome the aforementioned problem, a new model has been implemented in this research manuscript. After acquiring the images from the Radiological...

Full text available to download
Video Classification Technology in a Knowledge-Vision-Integration Platform for Personal Protective Equipment Detection: An Evaluation
Publication
- C. De
- C. Sanin
- E. Szczerbicki
- Year 2018
This work is part of an effort for the development of a Knowledge-Vision Integration Platform for Hazard Control (KVIP-HC) in industrial workplaces, adaptable to a wide range of industrial environments. This paper focuses on hazards resulted from the non-use of personal protective equipment (PPE), and examines a few supervised learning techniques to compose the proposed system for the purpose of recognition of three protective...

Full text to download in external service
Międzynarodowa konferencja NANOSMAT

Events

11-09-2018 00:00 - 14-09-2018 23:59

Konferencja NANOSMAT jest poświęcona inżynierii materiałowej (Nanoscience, Engineering and Nanotechnology and Beyond NANO); www.nanosmat-conference.com
Virtual touchpad - video-based multimodal interface
Publication
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Year 2010
A new computer interface named Virtual-Touchpad (VTP) is presented. The Virtual-Touchpad provides a multimodal interface which enables controlling computer applications by hand gestures captured with a typical webcam. The video stream is processed in the software layer of the interface. Hitherto existing video-based interfaces analyzing frames of hand gestures are presented. Then, the hardware configuration and software features...
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
Publication
- Year 2016
Automatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...

Full text to download in external service
Interactions with recognized patients using smart glasses
Publication
- J. Rumiński
- M. Smiatacz
- A. Bujnowski
- A. Andrushevich
- M. Biallas
- R. Kistler
- Year 2015
Recently, different smart glasses solutions have been proposed on the market. The rapid development of this wearable technology has led to several research projects related to applications of smart glasses in healthcare. In this paper we propose a general architecture of the system enabling data integration for the recognized person. In the proposed system smart glasses integrates data obtained for the recognized patient from health...

Full text to download in external service
Michał Bernard Pietrzak dr hab.

People

Katedra Statystyki i Ekonometrii

Michal Pietrzak is head of the Department of Statistics and Econometrics at the Faculty of Economics and Management, Gdańsk University of Technology, and Deputy Editor-in-Chief for Statistical Reviewing of the journals: Oeconomia Copernicana and Equilibrium. Quarterly Journal of Economics and Economic Policy. Until October 2021, he worked as an associate professor at the Faculty of Economic Sciences and Management, Nicolaus...
Investigation of educational processes with affective computing methods
Publication
- A. Landowska
- G. Brodny
- e-mentor - Year 2017
This paper concerns the monitoring of educational processes with the use of new technologies for the recognition of human emotions. This paper summarizes results from three experiments, aimed at the validation of applying emotion recognition to e-learning. An analysis of the experiments’ executions provides an evaluation of the emotion elicitation methods used to monitor learners. The comparison of affect recognition algorithms...

Full text available to download
Memory and Imagination: Artwork as a Form of Testimony
Publication
- J. Kabrońska
- Year 2016
As the generation of witnesses is passing away and the testimonies of Shoah survivors are replaced by an inherited, culturally constructed memory - post-memory – art becomes a meaningful landmark in the landscape of memory. Although the narration of history - being the researchers’ discourse rather than an objective view - has been considered to a certain extent fictitious, the difference between real and fictional, between remembering...
Analysis of odour interactions in model gas mixtures using electronic nose and fuzzy logic
Publication
- CHEMICAL ENGINEERING TRANSACTIONS - Year 2018
Measurement and monitoring of air quality in terms of odour nuisance is an important problem. Although the source of these nuisances is different (e.g. wastewater treatment plants, municipal landfills), their common feature is that they are a complex mixture of odorants with different odour thresholds. An additional problem is occurrence of the odour interactions between mixture components. From a practical point of view, it would...

Full text available to download
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
Publication
- D. Koszewski
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2020
Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....

Full text available to download
Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification
Publication
- Year 2014
The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm employed automatic music genre recognition to determine the optimum parameters for the synthesis of additional frequencies. The synthesis was carried out using the non-linear device (NLD) and phase vocoder (PV) methods depending on the music excerpt genre. Classification of musical...
SYNAT_PCA_48
Open Research Data
open access
There is a series of datasets containing feature vectors derived from music tracks. The dataset contains 51582 music tracks (22 music genres) and feature vector after Principal Component Analysis (PCA) performing, so there are 48-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier...
Statistics II - lecture
e-Learning Courses
- M. Kuc-Czarnecka
Corporate Finance (lecture)
e-Learning Courses
- J. Kartasova
- K. Kubiszewska
International Finance (Lecture)
e-Learning Courses
- K. Kubiszewska
Inorganic Chemistry Lecture
e-Learning Courses
- A. Pladzyk
- A. Brillowska-Dąbrowska
Aim of the course is to give a general knowledge of the chemistry of the elements and the inorganic compounds, puting the attention on the relationships between structure, properties and reactivity.
Anna Rzeczycka dr hab.

People

Anna Rzeczycka is the deputy head of the Department of Finance at the Faculty of Economics and Management of the Gdańsk University of Technology. Publications are situated in the field of social sciences in the discipline of economics and finance. They include books, monographs, articles, publications and scientific editions of monographs and scientific journals. In terms of numbers, it includes the following items: 12 monographs...
Automatic music set organizatio based on mood of music / Automatyczna organizacja bazy muzycznej na podstawie nastroju muzyki
Publication
- M. Piotrowska
- Przegląd Telekomunikacyjny + Wiadomości Telekomunikacyjne - Year 2017
This work is focused on an approach based on the emotional content of music and its automatic recognition. A vector of features describing emotional content of music was proposed. Additionally, a graphical model dedicated to the subjective evaluation of mood of music was created. A series of listening tests was carried out, and results were compared with automatic mood recognition employing SOM (Self Organizing Maps) and ANN (Artificial...

Full text to download in external service
A new multi-process collaborative architecture for time series classification
Publication
- Z. Xiao
- X. Xu
- H. Zhang
- E. Szczerbicki
- KNOWLEDGE-BASED SYSTEMS - Year 2021
Time series classification (TSC) is the problem of categorizing time series data by using machine learning techniques. Its applications vary from cybersecurity and health care to remote sensing and human activity recognition. In this paper, we propose a novel multi-process collaborative architecture for TSC. The propositioned method amalgamates multi-head convolutional neural networks and capsule mechanism. In addition to the discovery...

Full text available to download
Visual Features for Improving Endoscopic Bleeding Detection Using Convolutional Neural Networks
Publication
- SENSORS - Year 2023
The presented paper investigates the problem of endoscopic bleeding detection in endoscopic videos in the form of a binary image classification task. A set of definitions of high-level visual features of endoscopic bleeding is introduced, which incorporates domain knowledge from the field. The high-level features are coupled with respective feature descriptors, enabling automatic capture of the features using image processing methods....

Full text available to download
Krzysztof Goczyła prof. dr hab. inż.

People

Department of Software Engineering

Krzysztof Goczyła, full professor of Gdańsk University of Technology, computer scientist, a specialist in software engineering, knowledge engineering and databases. He graduated from the Faculty of Electronics Technical University of Gdansk in 1976 with a degree in electronic engineering, specializing in automation. Since then he has been working at Gdańsk University of Technology. In 1982 he obtained a doctorate in computer science...
Introduction to the special issue on machine learning in acoustics
Publication
- Z. Michalopoulou
- P. Gerstoft
- B. Kostek
- M. A. Roch
- Journal of the Acoustical Society of America - Year 2021
When we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...

Full text available to download
Endoscopic Video Classification with the Consideration of Temporal Patterns
Publication
- Year 2012
The article describes a novel approach to automatic recognition and classification of diseases in endoscopic videos. Current directions of research in this field are discussed. Most presented methods focus on processing single frames and do not take into consideration the temporal relationship between continuous classifications. Existing approaches that consider the temporal structure of an incoming frame sequence are focused on...
Jan Daciuk dr hab. inż.

People

Faculty of Electronics, Telecommunications and Informatics, Department of Intelligent Interactive Systems

Jan Daciuk received his M.Sc. from the Faculty of Electronics of Gdansk University of Technology in 1986, and his Ph.D. from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology in 1999. He has been working at the Faculty from 1988. His research interests include finite state methods in natural language processing and computational linguistics including speech processing. Dr. Daciuk...
Wojciech Wyrzykowski dr hab.

People

Department of Economic Analysis and Finance

Wojciech Wyrzykowski is an employee of the Department of Finance at the Faculty of Management and Economics of the Gdańsk University of Technology. He is the author of 70 scientific publications, including 5 monographs, and co-author of 7 monographs. The most important of them reflecting the author's scientific interests include: Tax conditions for the development of entrepreneurship in Poland, Taxes in Poland - outline of the...
Wykorzystanie sztucznych sieci neuronowych do wykrywania i rozpoznawania tablic rejestracyjnych na zdjęciach pojazdów
Publication
- M. Huzarek
- T. A. Rutkowski
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2015
W artykule przedstawiono koncepcję algorytmu wykrywania i rozpoznawania tablic rejestracyjnych (AWiRTR) na obrazach cyfrowych pojazdów. Detekcja i lokalizacja tablic rejestracyjnych oraz wyodrębnienie z obrazu tablicy rejestracyjnej poszczególnych znaków odbywa się z wykorzystaniem podstawowych technik przetwarzania obrazu (przekształcenia morfologiczne, wykrywanie krawędzi) jak i podstawowych danych statystycznych obiektów wykrytych...

Full text available to download
FEEDB: A multimodal database of facial expressions and emotions
Publication
- M. Szwoch
- Year 2013
In this paper a first version of a multimodal FEEDB database of facial expressions and emotions is presented. The database contains labeled RGB-D recordings of people expressing a specific set of expressions that have been recorded using Microsoft Kinect sensor. Such a database can be used for classifier training and testing in face recognition as well as in recognition of facial expressions and human emotions. Also initial experiences...

Full text to download in external service
A video monitoring system using ontology-driven identification of threats
Publication
- P. Kaczmarek
- P. Zielonka
- Year 2009
In this paper, we present a video monitoring systemthat leverages image recognition and ontological reasoningabout threats. In the solution, an image processing subsystemuses video recording of a monitored area and recognizesknown concepts in scenes. Then, a reasoning subsystem uses anontological description of security conditions and informationfrom image recognition to check if a violation of a conditionhas occurred. If a threat...

Full text to download in external service
Towards New Mappings between Emotion Representation Models
Publication
- A. Landowska
- Applied Sciences-Basel - Year 2018
There are several models for representing emotions in affect-aware applications, and available emotion recognition solutions provide results using diverse emotion models. As multimodal fusion is beneficial in terms of both accuracy and reliability of emotion recognition, one of the challenges is mapping between the models of affect representation. This paper addresses this issue by: proposing a procedure to elaborate new mappings,...

Full text available to download
Creating a Realible Music Discovery and Recomendation System
Publication
- Year 2014
The aim of this paper is to show problems related to creating a reliable music dis-covery system. The SYNAT database that contains audio files is used for the purpose of experiments. The files are divided into 22 classes corresponding to music genres with different cardinality. Of utmost importance for a reliable music recommendation system are the assignment of audio files to their appropriate gen-res and optimum parameterization...

Full text to download in external service
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publication
- International Journal of Image Processing and Visual Communication - Year 2013
In this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...

Full text to download in external service
SYNAT Music Genre Parameters PCA 19
Open Research Data
open access
The dataset contains feature vector after Principal Component Analysis (PCA) performing, so there are 11 music genres and 19-element vector derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of 52532 music excerpts described...
SYNAT_PCA_11
Open Research Data
open access
The dataset contains 51582 music tracks (22 music genres) and feature vector after Principal Component Analysis (PCA) performing, so there are 11-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of more than...
Music Recommendation System
Publication
- Journal of Telecommunications and Information Technology - Year 2014
The paper focuses on optimization vector content feature for the music recommendation system. For the purpose of experiments a database is created consisting of excerpts of music les. They are assigned to 22 classes corresponding to dierent music genres. Various feature vectors based on low-level signal descriptors are tested and then optimized using correlation analysis and Principal Component Analysis (PCA). Results of the experiments...

Full text available to download
An electronic nose for quantitative determination of gas concentrations
Publication
- Year 2016
The practical application of human nose for fragrance recognition is severely limited by the fact that our sense of smell is subjective and gets tired easily. Consequen tly, there is considerable need for an instrument that can be a substitution of the human sense of smell. Electronic nose devices from the mid 1980s are used in growing number of applications. They comprise an array of several electrochemical gas sensors...

Full text to download in external service
Towards Emotion Acquisition in IT Usability Evaluation Context
Publication
- A. Landowska
- Year 2015
The paper concerns extension of IT usability studies with automatic analysis of the emotional state of a user. Affect recognition methods and emotion representation models are reviewed and evaluated for applicability in usability testing procedures. Accuracy of emotion recognition, susceptibility to disturbances, independence on human will and interference with usability testing procedures are...

Full text to download in external service
Ecotoxicology - lecture 2022/2023
e-Learning Courses
- M. Pawłowska
Inorganic chemistry 2sem-Lecture
e-Learning Courses
- A. Pladzyk
CCS-lecture-2023-2024
e-Learning Courses
- P. Raczyński
materiały wspierające wykład na studiach II stopnia na kierunku ACR pod tytułem komputerowe systemy automatyki 1. Computer system – controlled plant interfacing technique; simple interfacing and with both side acknowledgement; ideas, algorithms, acknowledge passing. 2. Methods of acknowledgement passing: software checking and passing, using interrupt techniques, using readiness checking (ready – wait lines). The best solution optimization...
Crystal structures of aminotransferases Aro8 and Aro9 from Candida albicans and structural insights into their properties
Publication
- A. Kiliszek
- W. Rypniewski
- K. Rząd
- S. Milewski
- I. Gabriel
- JOURNAL OF STRUCTURAL BIOLOGY - Year 2019
Aminotransferases catalyze reversibly the transamination reaction by a ping-pong bi-bi mechanism with pyridoxal 5′-phosphate (PLP) as a cofactor. Various aminotransferases acting on a range of substrates have been reported. Aromatic transaminases are able to catalyze the transamination reaction with both aromatic and acidic substrates. Two aminotransferases from C. albicans, Aro8p and Aro9p, have been identified recently, exhibiting...

Full text available to download
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
Publication
- Year 2015
Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...

Full text to download in external service
Extending touch-less interaction with smart glasses by implementing EMG module
Publication
- Year 2017
In this paper we propose to use temporal muscle contraction to perform certain actions. Method: The set of muscle contractions corresponding to one of three actions including “single-click”, “double-click” “click-n-hold” and “non-action” were recorded. After recording certain amount of signals, the set of five parameters was calculated. These parameters served as an input matrix for the neural network. Two-layer feedforward neural...

Full text to download in external service
Video Semantic Analysis Framework based on Run-time Production Rules - Towards Cognitive Vision
Publication
- E. Szczerbicki
- C. Toro
- C. Sanin
- JOURNAL OF UNIVERSAL COMPUTER SCIENCE - Year 2015
This paper proposes a service-oriented architecture for video analysis which separates object detection from event recognition. Our aim is to introduce new tools to be considered in the pathway towards Cognitive Vision as a support for classical Computer Vision techniques that have been broadly used by the scientific community. In the article, we particularly focus in solving some of the reported scalability issues found in current...

Full text available to download
Online sound restoration system for digital library applications
Publication
- Year 2013
Audio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...

Full text to download in external service

Search

Filters

Catalog

Search results for: GESTURE%20RECOGNITION

Michał Bernard Pietrzak dr hab.

Anna Rzeczycka dr hab.

Krzysztof Goczyła prof. dr hab. inż.

Jan Daciuk dr hab. inż.

Wojciech Wyrzykowski dr hab.