Search results for: speech recognition, allophone, phonology, foreign language, audio features - Bridge of Knowledge

Search

Search results for: speech recognition, allophone, phonology, foreign language, audio features

Search results for: speech recognition, allophone, phonology, foreign language, audio features

  • Instructor Presence in Video Lectures: Preliminary Findings From an Online Experiment

    Publication

    - IEEE Access - Year 2021

    Motivation. Despite the widespread use of video lectures in online and blended learning environments, there is still debate whether the presence of an instructor in the video helps or hinders learning. According to social agency theory, seeing the instructor makes learners believe that s/he is personally teaching them, which leads to deeper cognitive processing and, in turn, better learning outcomes. Conversely, according to cognitive...

    Full text available to download

  • ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU

    Praca dotyczy podejścia do parametryzacji w przypadku klasyfikacji emocji w śpiewie oraz porównania z klasyfikacją emocji w mowie. Do tego celu wykorzystano bazę mowy i śpiewu nacechowanego emocjonalnie RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song), zawierającą nagrania profesjonalnych aktorów prezentujących sześć różnych emocji. Następnie obliczono współczynniki mel-cepstralne (MFCC) oraz wybrane deskryptory...

    Full text available to download

  • Video recordings of static hand gestures for gesture based interaction

    Open Research Data
    open access

    This data set contains video recording of selected simple hand gestures related to sign language. The purpose of the data set is to evaluate different computer algorithms design for hand gesture detection as well as for hand features and hand pose detection and identification. The data set contains 5 video recordings in mp4 format.  Each recording is...

  • Robot-Based Intervention for Children With Autism Spectrum Disorder: A Systematic Literature Review

    Publication
    • K. D. Bartl-Pokorny
    • P. Uluer
    • D. E. Barkana
    • A. Baird
    • H. Kose
    • T. Zorcec
    • B. Robins
    • B. Schuller
    • A. Landowska
    • M. Pykała

    - IEEE Access - Year 2021

    Children with autism spectrum disorder (ASD) have deficits in the socio-communicative domain and frequently face severe difficulties in the recognition and expression of emotions. Existing literature suggested that children with ASD benefit from robot-based interventions. However, studies varied considerably in participant characteristics, applied robots, and trained skills. Here, we reviewed robot-based interventions targeting...

    Full text available to download

  • Non-Contact Temperature Measurements Dataset

    Publication

    - Year 2022

    The dataset titled The influence of the distance of the pyrometer from the surface of the radiating object on the accuracy of measurements contains temperature measurements using a selection of four commercially available pyrometers (CHY 314P, TM-F03B, TFA 31.1125 and AB-8855) as a function of the measuring distance. The dataset allows a comparison of the accuracy and measuring precision of the devices, which are very important...

    Full text available to download

  • Multimodal system for diagnosis and polysensory stimulation of subjects with communication disorders

    An experimental multimodal system, designed for polysensory diagnosis and stimulation of persons with impaired communication skills or even non-communicative subjects is presented. The user interface includes an eye tracking device and the EEG monitoring of the subject. Furthermore, the system consists of a device for objective hearing testing and an autostereoscopic projection system designed to stimulate subjects through their...

  • Searching of the buried objects into the sea bottom by means of nonlinear acouctic methods

    Publication

    The main goal of this paper is to introduce the methodology of preparing the area for investigations that will be carried out at the sea. As the first step there is recognition of the basic method both in the theory as well as experimental investigation. There were taken into account the nonlinear methods. These ones are very promising methods that have very interesting features, very convenient for examinations of the seabed structure....

  • Features extraction from the electrocatalytic gas sensor responses

    One of the types of gas sensors used for detection and identification of toxic-air pollutant is an electrocatalytic gas sensor. The electrocatalytic sensors are working in cyclic voltammetry mode, enable detection of various gases. Their response are in the form of I-V curves which contain information about the type and the concentration of measured volatile compound. However,...

    Full text to download in external service

  • Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition

    Publication

    Human-system interactions frequently require a retrieval of the key context information about the user and the environment. Image processing techniques have been widely applied in this area, providing details about recognized objects, people and actions. Considering remote diagnostics solutions, e.g. non-contact vital signs estimation and smart home monitoring systems that utilize person’s identity, security is a very important factor....

    Full text available to download

  • Emotions in polish speech recordings

    Open Research Data
    open access

    The data set presents emotions recorded in sound files that are expressions of Polish speech. Statements were made by people aged 21-23, young voices of 5 men. Each person said the following words / nie – no, oddaj - give back, podaj – pass, stop - stop, tak - yes, trzymaj -hold / five times representing a specific emotion - one of three - anger (a),...

  • Camera angle invariant shape recognition in surveillance systems

    Publication

    A method for human action recognition in surveillance systems is described. Problems within this task are discussed and a solution based on 3D object models is proposed. The idea is shown and some of its limitations are talked over. Shape description methods are introduced along with their main features. Utilized parameterization algorithm is presented. Classification problem, restricted to bi-nary cases is discussed. Support vector...

  • A review of emotion recognition methods based on keystroke dynamics and mouse movements

    Publication

    - Year 2013

    The paper describes the approach based on using standard input devices, such as keyboard and mouse, as sources of data for the recognition of users’ emotional states. A number of systems applying this idea have been presented focusing on three categories of research problems, i.e. collecting and labeling training data, extracting features and training classifiers of emotions. Moreover the advantages and examples of combining standard...

    Full text to download in external service

  • Emotion Recognition Based on Facial Expressions of Gamers

    Publication

    This article presents an approach to emotion recognition based on facial expressions of gamers. With application of certain methods crucial features of an analysed face like eyebrows' shape, eyes and mouth width, height were extracted. Afterwards a group of artificial intelligence methods was applied to classify a given feature set as one of the following emotions: happiness, sadness, anger and fear.The approach presented in this...

  • Thermal imaging in automatic rodent’s social behaviour analysis

    Publication

    - Year 2016

    Laboratory rodent social behaviour analysis is an extremely important task for biological, medical and pharmacological researches. In this work thermal images features that facilitate analysis are presented. Methods to distinguish objects on the basis of thermal distribution are tested. Actions of grooming or biting one rodent by another - important social behaviour incidents - are clearly visible...

    Full text to download in external service

  • Emotion Recognition Based on Facial Expressions of Gamers

    This article presents an approach to emotion recognition based on facial expressions of gamers. With application of certain methods crucial features of an analyzed face like eyebrows' shape, eyes and mouth width, height were extracted. Afterwards a group of artificial intelligence methods was applied to classify a given feature set as one of the following emotions: happiness, sadness, anger and fear. The approach presented in this...

  • Classification of Music Genres Based on Music Separation into Harmonic and Drum Components . Klasyfikacja gatunków muzycznych wykorzystująca separację instrumentów muzycznych

    Publication

    - Archives of Acoustics - Year 2014

    This article presents a study on music genre classification based on music separation into harmonic and drum components. For this purpose, audio signal separation is executed to extend the overall vector of parameters by new descriptors extracted from harmonic and/or drum music content. The study is performed using the ISMIS database of music files represented by vectors of parameters containing music features. The Support Vector...

    Full text available to download

  • Evaluation of sound event detection, classification and localization in the presence of background noise for acoustic surveillance of hazardous situations

    Publication

    An evaluation of the sound event detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. The methods for separating foreground events from the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the classifier...

    Full text to download in external service

  • Recognizing emotions on the basis of keystroke dynamics

    Publication

    - Year 2015

    The article describes a research on recognizing emotional states on the basis of keystroke dynamics. An overview of various studies and applications of emotion recognition based on data coming from keyboard is presented. Then, the idea of an experiment is presented, i.e. the way of collecting and labeling training data, extracting features and finally training classifiers. Different classification approaches are proposed to be...

    Full text to download in external service

  • Scoreboard Architectural Pattern and Integration of Emotion Recognition Results

    Publication

    This paper proposes a new design pattern, named Scoreboard , dedicated for applications solving complex, multi-stage, non-deterministic problems. The pattern provides a computational framework for the design and implementation of systems that integrate a large number of diverse specialized modules that may vary in accuracy, solution level, and modality. The Scoreboard is an extension of Blackboard design pattern and comes under...

    Full text available to download

  • Pose classification in the gesture recognition using the linear optical sensor

    Publication

    Gesture sensors for mobile devices, which have a capability of distinguishing hand poses, require efficient and accurate classifiers in order to recognize gestures based on the sequences of primitives. Two methods of poses recognition for the optical linear sensor were proposed and validated. The Gaussian distribution fitting and Artificial Neural Network based methods represent two kinds of classification approaches. Three types...

    Full text to download in external service

  • Ordinal pattern statistics for the assessment of heart rate variability

    Publication

    - The European Physical Journal-Special Topics - Year 2013

    The recognition of all main features of a healthy heart rhythm (the so-called sinus rhythm) is still one of the biggest challenges in contemporary cardiology. Recently the interesting physiological phenomenon of heart rate asymmetry has been observed. This phenomenon is related to unbalanced contributions of heart rate decelerations and accelerations to heart rate variability. In this paper we apply methods based on the concept...

    Full text available to download

  • Piotr Dominiak prof. dr hab.

    People

    He was born in Radom on June 29, 1948. He graduated in Economy at the University of Warsaw (1971), where he also obtained his doctorate (1976) and his habilitation (1989). He received the title of Professor in 2005. He has been working at GUT since 1971. Between 1991 and1993 he was a director of the Institute of Economic Sciences and Humanities at GUT. He was the Dean of the Faculty of Management and Economics in the years 1993-1999...

  • A simplified behavioral MOSFET model based on parameters extraction for circuit simulations.

    Publication

    The paper presents results on behavior modeling of general purpose Metal-Oxide Semiconductor Field-Effect Transistor (MOSFET) for simulation of power electronics systems requiring accuracy both in steady-state and in switching conditions. Methods of parameters extraction including nonlinearity of parasitic capacitances and steady-state characteristics are based on manufacturer data sheet and externally measurable characteristics....

    Full text to download in external service

  • Time window based features extraction from temperature modulated gas sensors for prediction of ammonia concentration

    Electronic gas recognition systems, in literature commonly referred as electronic noses, enable the recognition of a type and a concentration of various volatile compounds. Typical electronic gas-analyzing device consists of four main elements, namely, gas delivery subsystem, an array of gas sensors, data acquisition and power supply circuits and data analysis software. The commercially available metal-oxide TGS sensors are widely...

    Full text to download in external service

  • Study Analysis of Transmission Efficiency in DAB+ Broadcasting System

    Publication

    - Year 2018

    DAB+ is a very innovative and universal multimedia broadcasting system. Thanks to its updated multimedia technologies and metadata options, digital radio keeps pace with changing consumer expectations and the impact of media convergence. Broadcasting analog and digital radio services does vary, concerning devices on both transmitting and receiving side, as well as content processing mechanisms. However, the biggest difference is...

    Full text available to download

  • Modeling the Customer’s Contextual Expectations Based on Latent Semantic Analysis Algorithms

    Publication

    Nowadays, in the age of Internet, access to open data detects the huge possibilities for information retrieval. More and more often we hear about the concept of open data which is unrestricted access, in addition to reuse and analysis by external institutions, organizations and people. It’s such information that can be freely processed, add another data (so-called remix) and then published. More and more data are available in text...

    Full text available to download

  • Human carnosinases: A brief history, medicinal relevance, and in silico analyses

    Publication

    - DRUG DISCOVERY TODAY - Year 2024

    Carnosine, an endogenous dipeptide, has been found to have a plethora of medicinal properties, such as antioxidant, antiageing, and chelating effects, but with one downside: a short half-life. Carnosinases and two hydrolytic enzymes, which remain enigmatic, are responsible for these features. Hence, here we emphasize why research is valuable for better understanding crucial concepts like ageing, neurodegradation, and cancerogenesis,...

    Full text available to download

  • Improving the Accuracy in Sentiment Classification in the Light of Modelling the Latent Semantic Relations

    Publication

    - Information - Year 2018

    The research presents the methodology of improving the accuracy in sentiment classification in the light of modelling the latent semantic relations (LSR). The objective of this methodology is to find ways of eliminating the limitations of the discriminant and probabilistic methods for LSR revealing and customizing the sentiment classification process (SCP) to the more accurate recognition of text tonality. This objective was achieved...

    Full text available to download

  • Contactless hearing aid designed for infants

    It is a well known fact that language development through home intervention for a hearing-impaired infant should start in the early months of a newborn baby's life. The aim of this paper is to present a concept of a contactless digital hearing aid designed especially for infants. In contrast to all typical wearable hearing aid solutions (ITC, ITE, BTE), the proposed device is mounted in the infant's bed with any parts of its set-up...

    Full text available to download

  • Loudness Scaling Tests in Hearing Problems Detection

    Publication

    The number of people using portable audio players has increased significantly over the recent years. This implies the rise in the number of people having hearing loss problems. Therefore, there is a need to find appropriate procedures that simplify the process of the hearing problem detection. Investigations performed show that audiometric tests may not be sufficient to assess hearing in young people. Contrarily, the obtained results...

  • Interactions using passive optical proximity detector

    Publication

    - Year 2015

    In this paper we evaluated the possible application of a passive, optical sensor as an interface for human-smart glasses interactions. The designed proximity sensor is composed of set of photodiodes and the appropriate hardware and software components. First, experiments were performed for the estimations of such parameters as distance to an object, its width and velocity. Achieved results were satisfactory. Therefore, next, a...

    Full text to download in external service

  • Smart Virtual Bass Synthesis Algorithm Based on Music Genre Classification

    Publication

    The aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm employed automatic music genre recognition to determine the optimum parameters for the synthesis of additional frequencies. The synthesis was carried out using the non-linear device (NLD) and phase vocoder (PV) methods depending on the music excerpt genre. Classification of musical...

  • Multimodal human-computer interfaces based on advanced video and audio analysis

    Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart...

    Full text to download in external service

  • Conditions of Iranian international trade in terms of lifting the sanctions: A case study of Polish-Iranian trade perspectives

    The main aim of the paper is to identify the areas and determine the feasibility of trade between Polish and Iranian companies in relation to historical and cultural conditions. The authors describe the complexity of the issue of foreign trade, the determinants of relations between Poland and the Islamic Republic of Iran, and the Iranian economy – its features, strengths and weaknesses, and consequences of the recent UN, EU, and...

    Full text available to download

  • MODERNIST, 1920S AND 1930S INDUSTRIAL ARCHITECTURE OF THE PORT OF GDYNIA - IN SEARCH OF AN AESTHETIC LANGUAGE FOR UTILITARIAN BUILDINGS OF THE POLISH GATEWAY TO THE WORLD

    Publication

    - Year 2016

    The purpose of the article is to present the results of the research on the aspects of the Port of Gdynia modernist architecture aesthetics. Its construction was one of the two major projects carried out in the interwar period in Poland. In the course of analyses it has been attempted to answer the question whether an individual aesthetic language has been created in the 1920s and 1930s for the industrial architecture of the Polish...

    Full text to download in external service

  • Towards More Realistic Probabilistic Models for Data Structures: The External Path Length in Tries under the Markov Model

    Publication

    - Year 2013

    Tries are among the most versatile and widely used data structures on words. They are pertinent to the (internal) structure of (stored) words and several splitting procedures used in diverse contexts ranging from document taxonomy to IP addresses lookup, from data compression (i.e., Lempel- Ziv'77 scheme) to dynamic hashing, from partial-match queries to speech recognition, from leader election algorithms to distributed hashing...

  • ALOFON corpus

    The ALOFON corpus is one of the multimodal database of word recordings in English, available at http://www.modality-corpus.org/.  The ALOFON corpus is oriented towards the recording of the speech equivalence variants. For this purpose, a total of 7 people who are or speak English with native speaker fluency and a variety of Standard Southern British...

  • Modern trends in solid phase extraction: New sorbent media

    Based on the recently published literature, this review provides an update of the most important features and application of formats and devices employed in solid phase extraction (SPE). Special attention was paid on new trapping media proposed in SPE prior the chromatography analysis, based on the use of nanostructured materials, including carbon nanomaterials, electrospun nanofibers, dendrimes and magnetic nanoparticles, molecular...

    Full text to download in external service

  • Semi complex navigation with an active optical gesture sensor

    This paper presents the methods of diversified touchless interactions between a user and a mobile platform utilizing the optical gesture sensor. The sensor uses 8 photodiodes to measure the reflected light in the active mode (using embedded LEDs) or it measures shadows caused by fingers in the passive mode. Several algorithms were implemented: automatic mode switching, adaptive illumination level compensation, resolution improvements...

    Full text to download in external service

  • From Linear Classifier to Convolutional Neural Network for Hand Pose Recognition

    Publication

    Recently gathered image datasets and the new capabilities of high-performance computing systems have allowed developing new artificial neural network models and training algorithms. Using the new machine learning models, computer vision tasks can be accomplished based on the raw values of image pixels instead of specific features. The principle of operation of deep neural networks resembles more and more what we believe to be happening...

    Full text available to download

  • Poszukiwanie dobrych odpowiedzi na źle postawione pytania, rzecz o przedsiębiorczości kobiet

    Women entrepreneurship has been the subject of research for many years and its results are often compared against the men entrepreneurship results. The conducted research has often been aimed at identification/recognition of the differences between the entrepreneurs of opposite gender. The lack of satisfactory proof for the existence of the differences has been attributed by some determined researchers to inappropriate research...

    Full text available to download

  • Systematic Literature Review for Emotion Recognition from EEG Signals

    Publication

    Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

    Full text to download in external service

  • Systematic Literature Review for Emotion Recognition from EEG Signals

    Researchers have recently become increasingly interested in recognizing emotions from electroencephalogram (EEG) signals and many studies utilizing different approaches have been conducted in this field. For the purposes of this work, we performed a systematic literature review including over 40 articles in order to identify the best set of methods for the emotion recognition problem. Our work collects information about the most...

    Full text available to download

  • Architecture Design of a Networked Music Performance Platform for a Chamber Choir

    This paper describes an architecture design process for Networked Music Performance (NMP) platform for medium-sized conducted music ensembles, based on remote rehearsals of Academic Choir of Gdańsk University of Technology. The issues of real-time remote communication, in-person music performance, and NMP are described. Three iterative steps defining and extending the architecture of the NMP platform with additional features to...

    Full text to download in external service

  • Controlling computer by lip gestures employing neural network

    Publication

    - Year 2010

    Results of experiments regarding lip gesture recognition with an artificial neural network are discussed. The neural network module forms the core element of a multimodal human-computer interface called LipMouse. This solution allows a user to work on a computer using lip movements and gestures. A user face is detected in a video stream from a standard web camera using a cascade of boosted classifiers working with Haar-like features....

    Full text to download in external service

  • Elgold partial: Automotive blogs

    Open Research Data

    The dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...

  • Elgold partial: Movie reviews

    Open Research Data

    The dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold partial: Job offers

    Open Research Data

    The dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...

  • Elgold partial: Scientific papers' abstracts

    Open Research Data

    The dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.

  • Elgold partial: Amazon product reviews

    Open Research Data

    The dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.