Wyniki wyszukiwania dla: AUTOMATIC SPEECH RECOGNITION, WHISPER, MEDICAL LANGUAGE RECOGNITION, SPEECH PROCESSING - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: AUTOMATIC SPEECH RECOGNITION, WHISPER, MEDICAL LANGUAGE RECOGNITION, SPEECH PROCESSING

Wyniki wyszukiwania dla: AUTOMATIC SPEECH RECOGNITION, WHISPER, MEDICAL LANGUAGE RECOGNITION, SPEECH PROCESSING

  • Sensors integration in the smart home environment - a proposal to solve the problem with user identification

    In this preliminary study we, investigate the possibility of user recognition techniques suitable on smart home devices like chairs, beds, aiming for low–power, high accuracy and quick response time. We propose the two well know technique: voice speaker recognition and accelerometer signal from device mounted on the chair, and the third one optical system basing on IR LED transmitter/receiver circuit. The preliminary results proved...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Engineering Challenges in the Design of Cochlear Implants

    Publikacja

    - Rok 2021

    Hearing aids such as cochlear implants have been used by both adults and children for a long time. In addition, cochlear implants are used by patients who have severe hearing loss either by birth or after an accident. This paper aims to investigate the engineering challenges bounding the design of cochlear implants and present its possible solution...

  • Analysis-by-synthesis paradigm evolved into a new concept

    This work aims at showing how the well-known analysis-by-synthesis paradigm has recently been evolved into a new concept. However, in contrast to the original idea stating that the created sound should not fail to pass the foolproof synthesis test, the recent development is a consequence of the need to create new data. Deep learning models are greedy algorithms requiring a vast amount of data that, in addition, should be correctly...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Visual Detection of People Movement Rules Violation in Crowded Indoor Scenes

    Publikacja

    The paper presents a camera-independent framework for detecting violations of two typical people movement rules that are in force in many public transit terminals: moving in the wrong direction or across designated lanes. Low-level image processing is based on object detection with Gaussian Mixture Models and employs Kalman filters with conflict resolving extensions for the object tracking. In order to allow an effective event...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Modernization of historic healthcare buildings

    The practice of transforming and adapting the existing healthcare facilities to meet the growing demands of modern medicine applies not only to buildings of historical value. Of course, one can set a time point from which hospitals, erected mostly with industrialized technologies, undergo upgrades for better or worse effect. Existing healthcare buildings or facilities, including historic ones, have to be refurbished and adapted...

    Pełny tekst do pobrania w portalu

  • Semantic OLAP with FluentEditor and Ontorion Semantic Excel Toolchain

    Publikacja

    - Rok 2015

    Semantic technologies appear as a step on the way to creating systems capable of representing the physical world as real time computational processes. In this context, the paper presents a toolchain for an ontology based knowledge management system. It consists of the ontology editor, FluentEditor and the distributed knowledge representation system, Ontorion. FluentEditor is a comprehensive tool for editing and manipulating complex...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Interactions with recognized objects

    Publikacja

    - Rok 2014

    Implicit interaction combined with object recognition techniques opens a new possibility for gathering data and analyzing user behavior for activity and context recognition. The electronic eyewear platform, eGlasses, is being developed, as an integrated and autonomous system to provide interactions with smart environment. In this paper we present a method for the interactions with the recognized objects that can be used for electronic...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Identification of volatile compounds based on the electrocatalytic gas sensor responses

    Publikacja

    Measured response in case of electrocatalytic gas sensors is in form of a voltamperometric characteristic. Current-voltage (I-V) response shape depends on the gas type and its concentration. Such response contains significantly more information comparing with typical electrochemical sensors, but is quite difficult to analyze. When I-V curve contains current peaks, position of such peaks can be used...

  • Ontology of the Design Pattern Language for Smart Cities Systems

    Publikacja

    The paper presents the definition of the design pattern language of Smart Cities in the form of an ontology. Since the implementation of a Smart City system is difficult, expensive and closely linked with the problems concerning a given city, the knowledge acquired during a single implementation is extremely valuable. The language we defined supports the management of such knowledge as it allows for the expression of a solution...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • MACHINE LEARNING APPLICATIONS IN RECOGNIZING HUMAN EMOTIONS BASED ON THE EEG

    Publikacja
    • A. Kastrau
    • M. Koronowski
    • M. Liksza
    • P. Jasik

    - Rok 2021

    This study examined the machine learning-based approach allowing the recognition of human emotional states with the use of EEG signals. After a short introduction to the fundamentals of electroencephalography and neural oscillations, the two-dimensional valence-arousal Russell’s model of emotion was described. Next, we present the assumptions of the performed EEG experiment. Detail aspects of the data sanitization including preprocessing,...

  • Affect aware video games

    Publikacja

    - Rok 2022

    In this chapter a problem of affect aware video games is described, including such issue as: emotional model of the player, design, development and UX testing of affect-aware video games, multimodal emotion recognition and a featured review of affect-aware video games.

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Marcin Sikorski prof. dr hab. inż.

    Marcin Sikorski jest profesorem w Katedrze Informatyki w Zarządzaniu na Wydziale Zarządzania i Ekonomii Politechniki Gdańskiej. Wcześniej odbył liczne pobyty w instytucjach naukowych, m.in. w Niemczech (Uniwersytety w Bonn i w Heidelbergu), Szwajcarii (ETH Zurich), Holandii (TU Eindhoven) i USA (Harvard University). Prof. Sikorski jest przedstawicielem Polski w komitecie TC13 Human-Computer-Interaction w międzynarodowej organizacji...

  • Video recordings of static hand gestures for gesture based interaction

    Dane Badawcze
    open access

    This data set contains video recording of selected simple hand gestures related to sign language. The purpose of the data set is to evaluate different computer algorithms design for hand gesture detection as well as for hand features and hand pose detection and identification. The data set contains 5 video recordings in mp4 format.  Each recording is...

  • Multimodal human-computer interfaces based on advanced video and audio analysis

    Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • MODERNIST, 1920S AND 1930S INDUSTRIAL ARCHITECTURE OF THE PORT OF GDYNIA - IN SEARCH OF AN AESTHETIC LANGUAGE FOR UTILITARIAN BUILDINGS OF THE POLISH GATEWAY TO THE WORLD

    Publikacja

    - Rok 2016

    The purpose of the article is to present the results of the research on the aspects of the Port of Gdynia modernist architecture aesthetics. Its construction was one of the two major projects carried out in the interwar period in Poland. In the course of analyses it has been attempted to answer the question whether an individual aesthetic language has been created in the 1920s and 1930s for the industrial architecture of the Polish...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • International Conference on Advanced Language Processing and Web Information Technology

    Konferencje

  • Marek Sylwester Tatara dr inż.

    Marek Tatara w 2014 roku uzyskał tytuł magistra inżyniera z zakresu Automatyki i Robotyki w specjalności Intelligent Decision-making Systems na Wydziale Elektroniki, Telekomunikacji i Informatyki Politechniki Gdańskiej, wcześniej w tym roku uzyskał tytuł inżyniera Fizyki Technicznej w specjalności Nanotechnologia. W tym samym roku rozpoczął pracę jako wykładowca w Katedrze Systemów Decyzyjnych i Robotyki. Interesuje się przetwarzaniem...

  • Affective Learning Manifesto – 10 Years Later

    Publikacja

    - Rok 2014

    In 2004 a group of affective computing researchers proclaimed a manifesto of affective learning that outlined the prospects and white spots of research at that time. Ten years passed by and affective computing developed many methods and tools for tracking human emotional states as well as models for affective systems construction. There are multiple examples of affective methods applications in Intelligent Tutoring Systems (ITS)....

  • Study Analysis of Transmission Efficiency in DAB+ Broadcasting System

    Publikacja

    - Rok 2018

    DAB+ is a very innovative and universal multimedia broadcasting system. Thanks to its updated multimedia technologies and metadata options, digital radio keeps pace with changing consumer expectations and the impact of media convergence. Broadcasting analog and digital radio services does vary, concerning devices on both transmitting and receiving side, as well as content processing mechanisms. However, the biggest difference is...

    Pełny tekst do pobrania w portalu

  • Zastosowanie metod eksploracji danych do analizy odpowiedzi czujników gazu

    Publikacja

    - Rok 2018

    Zagadnienia poruszane w niniejszej rozprawie dotyczą zastosowania metod eksploracji danych do analizy odpowiedzi czujników gazu, umożliwiających poprawną identyfikację składu mieszaniny gazowej w elektronicznych systemach rozpoznawania gazu. Elektroniczne systemy rozpoznawania gazu to urządzenia wykorzystujące czujniki gazu oraz odpowiednio dobrane metody analizy danych pomiarowych, zdolne do określenia składu mierzonej mieszaniny...

    Pełny tekst do pobrania w portalu

  • A Model-Driven Solution for Development of Multimedia Stream Processing Applications

    Publikacja

    This paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Detection of Face Position and Orientation Using Depth Data

    Publikacja

    In this paper an original approach is presented for real-time detection of user's face position and orientation based only on depth channel from a Microsoft Kinect sensor which can be used in facial analysis on scenes with poor lighting conditions where traditional algorithms based on optical channel may have failed. Thus the proposed approach can support, or even replace, algorithms based on optical channel or based on skeleton...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • DEEP CONVOLUTIONAL NEURAL NETWORKS AS A DECISION SUPPORT TOOL IN MEDICAL PROBLEMS – MALIGNANT MELANOMA CASE STUDY

    The paper presents utilization of one of the latest tool from the group of Machine learning techniques, namely Deep Convolutional Neural Networks (CNN), in process of decision making in selected medical problems. After the survey of the most successful applications of CNN in solving medical problems, the paper focuses on the very difficult problem of automatic analyses of the skin lesions. The authors propose the CNN structure...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • EMBOA - affective loop in Socially Assistive Robotics as an intervention tool for children with autism

    Kursy Online
    • M. Wróbel
    • A. Landowska

    The aim of the training course "Intensive programmes for higher education learner" within the EMBOA project is to familiarise participants with the use of social robots as an intervention tool for children with autism, emotion recognition and the combination of both methods. Students will be informed about the guidelines and results of the project.

  • Surface EMG-based signal acquisition for decoding hand movements

    Dane Badawcze
    open access

    Biosignal processing plays a crucial role in modern hand prosthetics. The challenge is to restore functionality of a lost limb based on the signals acquired from the surface of the stump. The number of sensors (emg channels) used for signal acquisition influence the quality of a prosthetic hand. Modern algorithms (including neural networks) can significantly...

  • ''Computing with words'' concept applied to musical instrument recognition. W: [CD-ROM] International Symposium of Musical Acoustics. ISMA MEXICO CITY. Mexico City, 9-13 December 2002. Mexico City: Escuela Nacional de Musica UNAM**2002, 8 s. 3 rys. 3 tab. bibliogr. 25 poz. Automatyczne rozpoznawanie klas instrumentów muzycznych w oparciu o wyraże- nia opisujące barwę dźwięku.

    Publikacja

    - Rok 2002

    W referacie przedstawiono nowy sposób automatycznego przetwarzania danychmuzycznych w oparciu o paradygmat zaproponowany przez L. Zadeha. Pozwala tona automatyczne rozpoznawanie klas instrumentów muzycznych wykorzystując o-pis słowny barwy dźwięku. Przedstawiono system realizujący automatyczną kla-syfikację instrumentów muzycznych oparty o metodę zbiorów przybliżonych ilogikę rozmytą.

  • Eye Blink Based Detection of Liveness in Biometric Authentication Systems Using Conditional Random Fields

    Publikacja

    - Rok 2012

    The goal of this paper was to verify whether the conditional random fields are suitable and enough efficient for eye blink detection in user authentication systems based on face recognition with a standard web camera. To evaluate this approach several experiments were carried on using a specially developed test application and video database.

  • Knowledge Base Suitable for Answering Questions in Natural Language

    This paper presents three knowledge bases widely used by researchers coping with natural language processing: OpenCyc, DBpedia and YAGO. They are characterized from the point of view of questions answering system. In this paper a short description of the aforementioned system implementation is also presented.

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online

    Publikacja

    - Rok 2023

    Radio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • DBpedia and YAGO as Knowledge Base for Natural Language Based Question Answering—The Evaluation

    The idea of automatic question answering system has a very long history. Despite constant improvement of the systems asking questions in the natural language requires very complex solutions. In this paper the DBpedia and YAGO are evaluated as a knowledge bases for simple class 1 and 2 question answering system. For this purpose a question answering system was designed and implemented. The proposed solution and the knowledge bases...

    Pełny tekst do pobrania w portalu

  • A new library for construction of automata

    Publikacja

    - Rok 2017

    We present a new library of functions that construct minimal, acyclic, deterministic, finite-state automata in the same format as the author's fsa package, and also accepted by the author's fadd library of functions that use finite-state automata as dictionaries in natural language processing.

  • Adrian Kastrau mgr inż.

    Osoby

  • Personal adaptive tuning of mobile computer audio

    An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....

  • Towards Facts Extraction From Texts in Polish Language

    The Polish language differs from English in many ways. It has more complicated conjugation and declination. Because of that automatic facts extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed facts schemes that are searched for in the analyzed text....

    Pełny tekst do pobrania w portalu

  • Robust unsupervised georeferencing algorithm for aerial and satellite imagery

    Publikacja

    In order to eliminate a human factor and fully automate the process of embedding the spatial localization information in a remote sensed image the integrated georeferencing method was proposed. The paper presents this unsupervised and robust approach which is comprised of pattern recognition, using SIFT-based detector, and RANSAC based outlier removal with matching algorithm.

  • Thermal Image Processing for Respiratory Estimation from Cubical Data with Expandable Depth

    Publikacja

    - Journal of Imaging - Rok 2023

    As healthcare costs continue to rise, finding affordable and non-invasive ways to monitor vital signs is increasingly important. One of the key metrics for assessing overall health and identifying potential issues early on is respiratory rate (RR). Most of the existing methods require multiple steps that consist of image and signal processing. This might be difficult to deploy on edge devices that often do not have specialized...

    Pełny tekst do pobrania w portalu

  • Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification

    Publikacja

    The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm employed automatic music genre recognition to determine the optimum parameters for the synthesis of additional frequencies. The synthesis was carried out using the non-linear device (NLD) and phase vocoder (PV) methods depending on the music excerpt genre. Classification of musical...

  • Automation of the Road Gate Operations Process at the Container Terminal—A Case Study of DCT Gdańsk SA

    The future increased terminal capacity will result in more container movement through the road complex and rail siding, which are one of the most critical areas (potential bottlenecks) in the container terminal. Truck turnaround time is one of the major factors that customers take into account while deciding how many container volumes they will handle through the container terminal. To enable to optimize increased traffic with...

    Pełny tekst do pobrania w portalu

  • Virtual Whiteboard: A gesture-controlled pen-free tool emulating school whiteboard

    Publikacja

    In the paper the so-called Virtual Whiteboard is presented which may be an alternative solution for modern electronic whiteboards based on electronic pens and sensors. The presented tool enables the user to write, draw and handle whiteboard contents using his/her hands only. An additional equipment such as infrared diodes, infrared cameras or cyber gloves is not needed. The user's interaction with the Virtual Whiteboard computer...

  • Adaptive Personal Tuning of Sound in Mobile Computers

    An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...

    Pełny tekst do pobrania w portalu

  • State of the art electronic nose technology and future trends

    Publikacja

    - Rok 2010

    This chapter briefly reviews the progress in field of artificial olfaction and demonstrates future trends in electronic nose technology. The discussion about e-nose concern also a big challenge for the pattern recognition (PARC) systems due to several particular problems they involve. Finally, the application of e-nose in different areas of life is given.

  • On Facial Expressions and Emotions RGB-D Database

    Publikacja

    - Rok 2014

    The goal of this paper is to present the idea of creating reference database of RGB-D video recordings for recognition of facial expressions and emotions. Two different formats of the recordings used for creation of two versions of the database are described and compared using different criteria. Examples of first applications using databases are also presented to evaluate their usefulness.

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Secured wired BPL voice transmission system

    Publikacja

    - Scientific Journal of the Military University of Land Forces - Rok 2020

    Designing a secured voice transmission system is not a trivial task. Wired media, thanks to their reliability and resistance to mechanical damage, seem an ideal solution. The BPL (Broadband over Power Line) cable is resistant to electricity stoppage and partial damage of phase conductors, ensuring continuity of transmission in case of an emergency. It seems an appropriate tool for delivering critical data, mostly clear and understandable...

    Pełny tekst do pobrania w portalu

  • A Device for Measuring Auditory Brainstem Responses to Audio

    Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...

    Pełny tekst do pobrania w portalu

  • Cancer immune escape: the role of antigen presentation machinery

    The mechanisms of antigen processing and presentation play a crucial role in the recognition and targeting of cancer cells by the immune system. Cancer cells can evade the immune system by downregulating or losing the expression of the proteins recognized by the immune cells as antigens, creating an immunosuppressive microenvironment, and altering their ability to process and present antigens. This review focuses on the mechanisms...

    Pełny tekst do pobrania w portalu

  • Automatic evaluation of information credibility in Semantic Web and Knowledge Grid

    Publikacja

    - Rok 2008

    This article presents a novel algorithm for automatic estimation of information credibility. It concerns information collected in Knowledge Grid and Semantic Web. Possibilities to evaluate the credibility of information in such structures are much greater than those available for WWW sites which use natural language. The rating system presented in this paper estimates credibility automatically on the basis of the following metrics:...

  • Influence of e-beam irradiation on poly(aliphatic/aromatic-ester) multiblock copolymers used as biomaterials

    Publikacja

    - KGK-Kautschuk Gummi Kunststoffe - Rok 2009

    The use of polymers became more relevant in medical applications for the versatility in processing and enhanced material properties of polymers. In order to be suitable as a biomaterial, it is of medical importance that these polymers are biocompatible and do not elicit negative responses from the body after implantation. Furthermore during the course of usage under load-bearing conditions, these biomaterials must sustain dynamical...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Using Different Information Channels for Affect-Aware Video Games - A Case Study

    Publikacja

    - Rok 2018

    This paper presents the problem of creating affect-aware video games that use different information channels, such as image, video, physiological signals, input devices, and player’s behaviour, for emotion recognition. Presented case studies of three affect-aware games show certain conditions and limitations for using specific signals to recognize emotions and lead to interesting conclusions.

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Detecting Apples in the Wild: Potential for Harvest Quantity Estimation

    Publikacja
    • A. Janowski
    • R. Kaźmierczak
    • C. Kowalczyk
    • J. Szulwic

    - Sustainability - Rok 2021

    Knowing the exact number of fruits and trees helps farmers to make better decisions in their orchard production management. The current practice of crop estimation practice often involves manual counting of fruits (before harvesting), which is an extremely time-consuming and costly process. Additionally, this is not practicable for large orchards. Thanks to the changes that have taken place in recent years in the field of image...

    Pełny tekst do pobrania w portalu

  • Jan Cudzik dr inż. arch.

    Jan Cudzik (dr inż. arch.) jest adiunktem w Katedrze Architektury Miejskiej i Przestrzeni Nadwodnych na Wydziale Architektury Politechniki Gdańskiej oraz kierownikiem Laboratorium Cyfrowych Technologii i Materiałów Przyszłości. Prowadzi badania nad architekturą kinetyczną, technikami cyfrowymi w projektowaniu architektonicznym, fabrykacją cyfrową oraz formami sztucznej inteligencji w architekturze i sztuce. Jego badania nad automatyzacją...