total: 569
Search results for: SPEECH RECOGNITION, SPEECH ANALYSIS, PHONEME, ALLOPHONE.
-
MODALITY corpus - SPEAKER 32 - COMMANDS C5
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 35 - COMMANDS C5
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - COMMANDS C4
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - SEQUENCE S3
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - COMMANDS C3
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - SEQUENCE S5
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - SEQUENCE S2
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
Quality of graphical markers for the needs of eyewear devices
Publication: In this paper we propose to cast the problem of identifying people, objects or places as an application for smart glasses that decodes information from graphical markers. We focus on analyzing factors that can influence the automatic recognition of information from a code. The research we present reviews recognition performance as a function of: size of a marker, distance from/to...
-
Gesture Recognition With the Linear Optical Sensor and Recurrent Neural Networks
Publication: In this paper, the optical linear sensor, a representative of low-resolution sensors, was investigated in the multiclass recognition of near-field hand gestures. A recurrent neural network (RNN) with a gated recurrent unit (GRU) memory cell was used as the gesture classifier. A set of 27 gestures was collected from a group of volunteers. The 27 000 sequences obtained were divided into training, validation, and test subsets....
-
Knowledge representation of motor activity of patients with Parkinson’s disease
Publication: This paper proposes an approach to extracting a knowledge representation from the analysis of biomedical signals concerning the motor activity of Parkinson’s disease patients. It uses accelerometers attached to the patients’ bodies as well as video images of their hand movements. Experiments employ artificial neural networks and support vector machines for the recognition of characteristic motor activity...
-
Determination of toxic gases based on the responses of a single electrocatalytic sensor and pattern recognition techniques
Publication: A response from an electrocatalytic gas sensor contains fingerprint information about the type of gas and its concentration. As a result, a single gas sensor can be used for the determination of different gases. However, information about the type of gas and its concentration is hidden in the unique shape of the current–voltage response and is quite difficult to extract. One of the ways to get precise information about the measured...
-
Visual Detection of People Movement Rules Violation in Crowded Indoor Scenes
Publication: The paper presents a camera-independent framework for detecting violations of two typical people movement rules that are in force in many public transit terminals: moving in the wrong direction or across designated lanes. Low-level image processing is based on object detection with Gaussian Mixture Models and employs Kalman filters with conflict-resolving extensions for object tracking. In order to allow an effective event...
-
Video content analysis in the urban area telemonitoring system
Publication: Constantly monitoring video streams from a large number of cameras and reviewing the recordings to find a specified event requires considerable time and effort from system operators and is prone to errors. A solution to this problem is an automatic system that constantly analyzes camera images and can raise an alarm if a predefined event is detected. The chapter presents various aspects...
-
Rapid Evaluation of Poultry Meat Shelf Life Using PTR-MS
Publication: The use of proton transfer reaction mass spectrometry (PTR-MS) for freshness classification of chicken and turkey meat samples was investigated. A number of volatile organic compounds (VOCs) were selected based on the correlation (> 95%) of their concentration during storage at 4 °C over a period of 5 days with the results of the microbial analysis. In order to verify if the selected compounds are not sample-specific, a number...
-
Ordinal pattern statistics for the assessment of heart rate variability
Publication: The recognition of all main features of a healthy heart rhythm (the so-called sinus rhythm) is still one of the biggest challenges in contemporary cardiology. Recently, the interesting physiological phenomenon of heart rate asymmetry has been observed. This phenomenon is related to unbalanced contributions of heart rate decelerations and accelerations to heart rate variability. In this paper we apply methods based on the concept...
-
Analysis of human behavioral patterns
Publication: Widespread use of the Internet and mobile devices has entailed growing security requirements, which in turn brought about the development of biometric methods. However, a specially designed biometric system may infer more about users than just verifying their identity. Proper analysis of users’ characteristics may also tell much about their skills, preferences, feelings. This chapter presents biometric methods applied in several...
-
Application of fuzzy logic to determine the odour intensity of model gas mixtures using electronic nose
Publication: The paper presents the possibility of applying fuzzy logic to determine the odour intensity of model ternary gas mixtures (α-pinene, toluene and triethylamine) using an electronic nose prototype. The results obtained using fuzzy logic algorithms were compared with the values obtained using a multiple linear regression (MLR) model and sensory analysis. As a result of the studies, it was found that the electronic nose prototype along...
-
IMAGE CORRELATION AS A TOLL FOR TRACKING FACIAL CHANGES CAUSING BY EXTERNAL STIMULI
Publication: Expressions of the human face carry a lot of information, which is a valuable resource in the areas of computer vision, remote sensing and affective computing. For years, by analyzing the movement of the skin and facial muscles, scientists have been trying to create the perfect tool, based on image analysis, for recognizing human emotional states. To create a reliable algorithm, it is necessary to explore and examine...
-
Time window based features extraction from temperature modulated gas sensors for prediction of ammonia concentration
Publication: Electronic gas recognition systems, commonly referred to in the literature as electronic noses, enable the recognition of the type and concentration of various volatile compounds. A typical electronic gas-analyzing device consists of four main elements, namely, a gas delivery subsystem, an array of gas sensors, data acquisition and power supply circuits, and data analysis software. The commercially available metal-oxide TGS sensors are widely...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S4
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S2
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S5
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S3
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S6
Research Data: The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
Automatic recognition of males and females among web browser users based on behavioural patterns of peripherals usage
Publication: Purpose: The purpose of this paper is to answer the question whether it is possible to recognise the gender of a web browser user on the basis of keystroke dynamics and mouse movements. Design/methodology/approach: An experiment was organised in order to track mouse and keyboard usage using a special web browser plug-in. After collecting the data, a number of parameters describing the users’ keystrokes, mouse movements and clicks...
-
Multimodal learning application with interactive animated character. [Multimodalna aplikacja edukacyjna wykorzystująca interaktywną animowaną postać]
Publication: The aim of this study is to design a computer application that may assist teachers and therapists in a multimodal manner in their work with impaired or disabled children. The application can be operated in many different ways, giving a child with special educational needs the opportunity to learn and train many skills or treat speech disorders. The main emphasis in this research is on the creation of an animated character that will serve...
-
Parallelization of video stream algorithms in kaskada platform
Publication: The purpose of this work is to present different techniques for parallelizing video stream algorithms provided by the Kaskada platform, a novel system working in a supercomputer environment and intended for multimedia stream processing. The parallelization methods considered include frame-level concurrency, multithreading and pipeline processing. Execution performance was measured on four time-consuming image recognition algorithms,...
-
Comparison of Tracking Methods in Respect of Automation of Animal Behavioral Test
Publication: Automation in experiments carried out on animals is becoming more and more important in research. Computers take over laborious and time-consuming activities like recording and analysing images of the experiment scene. The first step in image analysis is finding and distinguishing between the observed animals, and then tracking all objects during the experiment. In this paper four tracking methods are presented. Quantitative and...
-
Human carnosinases: A brief history, medicinal relevance, and in silico analyses
Publication: Carnosine, an endogenous dipeptide, has been found to have a plethora of medicinal properties, such as antioxidant, antiageing, and chelating effects, but with one downside: a short half-life. Carnosinases and two hydrolytic enzymes, which remain enigmatic, are responsible for these features. Hence, here we emphasize why research is valuable for better understanding crucial concepts like ageing, neurodegradation, and cancerogenesis,...
-
Automatic Singing Voice Recognition Employing Neural Networks and Rough Sets
Publication: The aim of the research is the automatic recognition of singing voices in terms of voice type and the technical quality of singing. The article describes the voice database that was created, which contains voice samples of professional and amateur singers. It then describes parameters defined on the basis of biomechanical phenomena in the vocal organ during singing. Based on the resulting parameter matrices, automatic classifiers were trained and compared...
-
Growth centres in the peripheral areas of regions. In search of territorial capital. [Ośrodki wzrostu na obszarach peryferyjnych regionów. W poszukiwaniu kapitału terytorialnego]
Publication: The subjects of the study were urban and urban-rural municipalities in the peripheral areas of voivodeships, that is, outside the functional areas of the voivodeship capital cities and the Silesian agglomerations. The aims of the study were: (1) to identify the most economically developed municipalities in the study area; (2) to identify how development factors and their combinations, which may form territorial capital, are perceived and used in city strategies....
-
Application of gamma densitometry and statistical signal analysis to gas phase velocity measurements in pipeline hydrotransport
Publication: The work presents selected methods of signal analysis used in the processing of data obtained from radiometric probes. The data came from an example study of a two-phase liquid-gas flow in a laboratory installation. In such rigs many possible transport types may be observed, i.e. slug, plug and bubble flow, and each of them gives a different signal-to-noise ratio in the recorded data. Therefore, available radiometric methods...
-
ADT in mammography
Publication: We discuss limitations of the known methods of IR imaging in the diagnostics of breast cancer. In conclusion we show that, for practical reasons, new approaches are required because the known methods based on simple observation of external temperature distribution are not fully effective. Even advanced pattern recognition does not help much with static images. We ask the question: may active dynamic thermography, known in nondestructive...
-
The Hough transform in the classification process of inland ships
Publication: This article presents an analysis of the possibilities of using image processing methods for feature extraction that allows kNN classification based on a ship’s image delivered from an on-water video surveillance system. The subject of the analysis is the Hough transform, which enables the detection of straight lines in an image. The recognized straight lines and the information about them serve as features in the classification...
-
Music Mood Visualization Using Self-Organizing Maps
Publication: Due to the increasing amount of music being made available in digital form on the Internet, an automatic organization of music is sought. The paper presents an approach to the graphical representation of the mood of songs based on Self-Organizing Maps. Parameters describing the mood of music are proposed and calculated, and then analyzed employing correlation with mood dimensions based on Multidimensional Scaling. A map is created in which...
-
Remote Estimation of Video-Based Vital Signs in Emotion Invocation Studies
Publication: The goal of this study is to examine the influence of various imitated and video-invoked emotions on the vital signs (respiratory and pulse rates). We also analyze the possibility of extracting signals from sequences acquired with cost-effective cameras. The preliminary results show that the respiratory rate allows for better separation of some emotions than the pulse rate, yet this relation highly depends...
-
Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience
Publication: Significant progress has been made in linguistic-based text analytics, particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...
-
Use of Neural Networks in Diagnostics of Rolling-Element Bearing of the Induction Motor
Publication: A bearing defect is statistically the most frequent cause of an induction motor fault. The research described in the paper utilized the phenomenon of the current change in an induction motor with a bearing defect. Methods based on the analysis of the supply current are particularly useful when it is impossible to install diagnostic devices directly on the motor. The presented method of rolling-element bearing diagnostics used indirect...
-
Time-frequency analysis in optical coherence tomography for technical objects examination
Publication: Optical coherence tomography (OCT) is one of the most advanced optical measurement techniques for complex structure visualization. The advantages of OCT have been used for surface and subsurface defect detection in composite materials, polymers, ceramics, non-metallic protective coatings, and many more. Our research activity has been focused on time-frequency spectroscopic analysis in OCT. It is based on time-resolved spectral analysis...
-
Problems of modelling toxic compounds emitted by a marine internal combustion engine for the evaluation of its structure parameters
Publication: The paper presents the possibility of using an analytical study of the engine exhaust ignition to evaluate the technical condition of the selected components. Software tools available for the analysis of experimental data commonly use a multiple regression model that allows the study of the effects and iterations between model input quantities and one output variable. The use of multi-equation models gives a lot of freedom in the...
-
Modern trends in solid phase extraction: New sorbent media
Publication: Based on recently published literature, this review provides an update on the most important features and applications of the formats and devices employed in solid phase extraction (SPE). Special attention was paid to new trapping media proposed for SPE prior to chromatographic analysis, based on the use of nanostructured materials, including carbon nanomaterials, electrospun nanofibers, dendrimers and magnetic nanoparticles, molecular...
-
Performance Analysis of Interaction between Smart Glasses and Smart Objects Using Image-Based Object Identification
Publication: We propose the use of smart glasses to collaborate with smart objects in the Internet of Things environment. In particular, we focus on new interaction methods and the analysis of acceptable reaction times in the process of object recognition using smart glasses. We evaluated the proposed method using user studies and experiments with three different smart glasses: Google Glass, Epson Moverio, and the developed eGlasses platform....
-
Using Eye-tracking to get information on the skills acquisition by the radiology residents
Publication: This paper describes the possibility of monitoring the progress of knowledge and skills acquisition by students of radiology. This is achieved by analyzing visual attention distribution patterns during image-based task solving. The concept is to use eye-tracking data to recognize how radiographic images are read by recognized experts, radiography residents involved in the training program, and untrained...
-
Analysis of odour interactions in model gas mixtures using electronic nose and fuzzy logic
Publication: Measurement and monitoring of air quality in terms of odour nuisance is an important problem. Although the sources of these nuisances differ (e.g. wastewater treatment plants, municipal landfills), their common feature is that they are a complex mixture of odorants with different odour thresholds. An additional problem is the occurrence of odour interactions between mixture components. From a practical point of view, it would...
-
MODERNIST, 1920S AND 1930S INDUSTRIAL ARCHITECTURE OF THE PORT OF GDYNIA - IN SEARCH OF AN AESTHETIC LANGUAGE FOR UTILITARIAN BUILDINGS OF THE POLISH GATEWAY TO THE WORLD
Publication: The purpose of the article is to present the results of research on the aesthetics of the modernist architecture of the Port of Gdynia. Its construction was one of the two major projects carried out in the interwar period in Poland. The analyses attempt to answer the question whether an individual aesthetic language was created in the 1920s and 1930s for the industrial architecture of the Polish...
-
Multi-Criteria Approach in Multifunctional Building Design Process
Publication: The paper presents a new approach to the multifunctional building design process. The publication defines problems related to the design of complex multifunctional buildings. Contemporary urban areas are characterized by very intensive use of space. Today, buildings are being built bigger and contain more diverse functions to meet the needs of a large number of users in one capacity. The trends show the need for recognition of...
-
The preparation and evaluation of core-shell magnetic dummy-template molecularly imprinted polymers for preliminary recognition of the low-mass polybrominated diphenyl ethers from aqueous solutions
Publication: The design, preparation process, binding abilities, morphological characteristics and prospective field of application of a dummy-template magnetic molecularly imprinted polymer (DMMIP) for preliminary recognition of selected low-mass polybrominated diphenyl ethers (PBDE-47 and PBDE-99) from the aquatic environment were investigated. The surface of iron oxide (Fe3O4) nanopowder (50-100 nm particle size) was modified with tetraethoxysilane...
-
Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations
Publication: An evaluation of the detection, classification and localization of hazardous acoustic events in the presence of background noise of different types and changing intensities is presented. Methods for discerning between the events in focus and the acoustic background are introduced. The classifier, based on a Support Vector Machine algorithm, is described. The set of features and samples used for the training of the...