Filtry
wszystkich: 1427
wybranych: 994
Wyniki wyszukiwania dla: VISUAL SPEECH RECOGNITION
-
Report no WOiO/II/67/2015 - Expertise about load capacity of twistlock foundation and sliding foundation
PublikacjaCustomer delivered to Laboratory 17 elements - twist lock foundations and sliding lock foundations (ship equipment for container lashing). Elements were selected from production series. Each element was loaded for braking load. After test visual inspection had been performed. The expertise contains: description of tested elements, test assumptions, test stand, results and conclusions
-
Affective Learning Manifesto – 10 Years Later
PublikacjaIn 2004 a group of affective computing researchers proclaimed a manifesto of affective learning that outlined the prospects and white spots of research at that time. Ten years passed by and affective computing developed many methods and tools for tracking human emotional states as well as models for affective systems construction. There are multiple examples of affective methods applications in Intelligent Tutoring Systems (ITS)....
-
Zastosowanie metod eksploracji danych do analizy odpowiedzi czujników gazu
PublikacjaZagadnienia poruszane w niniejszej rozprawie dotyczą zastosowania metod eksploracji danych do analizy odpowiedzi czujników gazu, umożliwiających poprawną identyfikację składu mieszaniny gazowej w elektronicznych systemach rozpoznawania gazu. Elektroniczne systemy rozpoznawania gazu to urządzenia wykorzystujące czujniki gazu oraz odpowiednio dobrane metody analizy danych pomiarowych, zdolne do określenia składu mierzonej mieszaniny...
-
Context-Aware Indexing and Retrieval for Cognitive Systems Using SOEKS and DDNA
PublikacjaVisual content searching, browsing and retrieval tools have been a focus area of interest as they are required by systems from many different domains. Context-based, Content-Based, and Semantic-based are different approaches utilized for indexing/retrieving, but have their drawbacks when applied to systems that aim to mimic the human capabilities. Such systems, also known as Cognitive Systems, are still limited in terms of processing...
-
Features extraction from the electrocatalytic gas sensor responses
PublikacjaOne of the types of gas sensors used for detection and identification of toxic-air pollutant is an electrocatalytic gas sensor. The electrocatalytic sensors are working in cyclic voltammetry mode, enable detection of various gases. Their response are in the form of I-V curves which contain information about the type and the concentration of measured volatile compound. However,...
-
Robot Eye Perspective in Perceiving Facial Expressions in Interaction with Children with Autism
PublikacjaThe paper concerns automatic facial expression analysis applied in a study of natural “in the wild” interaction between children with autism and a social robot. The paper reports a study that analyzed the recordings captured via a camera located in the eye of a robot. Children with autism exhibit a diverse level of deficits, including ones in social interaction and emotional expression. The aim of the study was to explore the possibility...
-
Trustworthy Applications of ML Algorithms in Medicine - Discussion and Preliminary Results for a Problem of Small Vessels Disease Diagnosis.
PublikacjaML algorithms are very effective tools for medical data analyzing, especially at image recognition. Although they cannot be considered as a stand-alone diagnostic tool, because it is a black-box, it can certainly be a medical support that minimize negative effect of human-factors. In high-risk domains, not only the correct diagnosis is important, but also the reasoning behind it. Therefore, it is important to focus on trustworthiness...
-
Digital document life cycle development
PublikacjaPrzedstawiono model DDLC wytwarzania interaktywnych dokumentów cyfrowych z ich pierwowzorów papierowych. Model DDLC opracowany w ramach 5 PR UE IST-2002-33441 MEMORIAL wyróżnia 6 faz i odpowiednie grupy funkcjonalności narzędzi do ich realizacji. Cykl wytwarzanie realizuje politykę całkowitej kontroli jakości, wykorzystującej specjalnie opracowaną metodę Visual GQM.
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
PublikacjaIn this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic model,...
-
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
PublikacjaThis paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...
-
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
PublikacjaIn this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...
-
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
PublikacjaSymbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...
-
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
PublikacjaThe purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...
-
''Computing with words'' concept applied to musical instrument recognition. W: [CD-ROM] International Symposium of Musical Acoustics. ISMA MEXICO CITY. Mexico City, 9-13 December 2002. Mexico City: Escuela Nacional de Musica UNAM**2002, 8 s. 3 rys. 3 tab. bibliogr. 25 poz. Automatyczne rozpoznawanie klas instrumentów muzycznych w oparciu o wyraże- nia opisujące barwę dźwięku.
PublikacjaW referacie przedstawiono nowy sposób automatycznego przetwarzania danychmuzycznych w oparciu o paradygmat zaproponowany przez L. Zadeha. Pozwala tona automatyczne rozpoznawanie klas instrumentów muzycznych wykorzystując o-pis słowny barwy dźwięku. Przedstawiono system realizujący automatyczną kla-syfikację instrumentów muzycznych oparty o metodę zbiorów przybliżonych ilogikę rozmytą.
-
Eye Blink Based Detection of Liveness in Biometric Authentication Systems Using Conditional Random Fields
PublikacjaThe goal of this paper was to verify whether the conditional random fields are suitable and enough efficient for eye blink detection in user authentication systems based on face recognition with a standard web camera. To evaluate this approach several experiments were carried on using a specially developed test application and video database.
-
Integration of thermographic data with the 3D object model
PublikacjaThe aim of the paper is to present new method for merging the 3D model data of the measured object with thermograms. Our technique is based on the combination of visual 3D imaging technique and thermal imaging technique, which maps the 2D thermograms on to 3D anatomical mesh model. The combination of these imaging modalities allows the generation of combined 3D and thermal data from which thermal signatures can be verified and...
-
Robot-Based Intervention for Children With Autism Spectrum Disorder: A Systematic Literature Review
PublikacjaChildren with autism spectrum disorder (ASD) have deficits in the socio-communicative domain and frequently face severe difficulties in the recognition and expression of emotions. Existing literature suggested that children with ASD benefit from robot-based interventions. However, studies varied considerably in participant characteristics, applied robots, and trained skills. Here, we reviewed robot-based interventions targeting...
-
Two-photon microperimetry with picosecond pulses
PublikacjaTwo-photon vision is a phenomenon associated with the perception of short pulsesof near-infrared radiation (900-1200 nm) as a visible light. It is caused by the nonlinear processof two-photon absorption by visual pigments. Here we present results showing the influence ofpulse duration and repetition rate of short pulsed lasers on the visual threshold. We comparedtwo-photon sensitivity maps of the retina obtained for subjects with...
-
Independent dynamics of slow, intermediate, and fast intracranial EEG spectral activities during human memory formation
PublikacjaA wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various low and high frequencies are spatiotemporally coordinated across the human brain during memory processing is inconclusive. They can either be coordinated together across a wide range of the frequency spectrum or induced in specific bands. We used a large dataset of human intracranial electroencephalography...
-
Robust unsupervised georeferencing algorithm for aerial and satellite imagery
PublikacjaIn order to eliminate a human factor and fully automate the process of embedding the spatial localization information in a remote sensed image the integrated georeferencing method was proposed. The paper presents this unsupervised and robust approach which is comprised of pattern recognition, using SIFT-based detector, and RANSAC based outlier removal with matching algorithm.
-
Computer-Supported Polysensory Integration Technology for Educationally Handicapped Pupils
PublikacjaIn this paper, a multimedia system providing technology for hearing and visual attention stimulation is shortly presented. The system aims to support the development of educationally handicapped pupils. The system has been presented in the context of its configuration, architecture, and therapeutic exercise implementation issues. Results of pupils’ improvements after 8 weeks of training with the system are also provided. Training...
-
Analiza stateczności walcowego pionowego zbiornika dwupłaszczowego posadowionego na fundamencie gruntowym
PublikacjaOmówiono wpływ nierównomiernego osiadania płaszcza wewnętrznego i zewnętrznego na nośność stalowego zbiornika dwupłaszczowego posadowionego za pośrednictwem podsypki piaskowej na uwarstwionym podłożu gruntowym. Analizę obliczeniową wykonano numerycznie przy zastosowaniu systemu komputerowego MSC Visual Nastran for Windows, version 2001. Stwierdzono, że przy zastosowaniu w obliczeniach podłoża typu Winklera otrzymuje się zbyt duże...
-
Zintegrowane środowiska projektowania aplikacji internetowych.
PublikacjaZintegrowane środowiska, umożliwiające analizę, projektowanie i implementację aplikacji, stanowią wymarzone narzędzie pracy każdego inżyniera oprogramowania. Opisano próby dostarczenia takiego środowiska w postaci Borland Delphi 5.0 oraz w postaci Rational XDE - środowiska projektowania w UML przeznaczonego do integracji z istniejącymi środowiskami implementacji, takimi jak Microsoft Visual Studio.NET i IBM Web Sphere...
-
Virtual Whiteboard: A gesture-controlled pen-free tool emulating school whiteboard
PublikacjaIn the paper the so-called Virtual Whiteboard is presented which may be an alternative solution for modern electronic whiteboards based on electronic pens and sensors. The presented tool enables the user to write, draw and handle whiteboard contents using his/her hands only. An additional equipment such as infrared diodes, infrared cameras or cyber gloves is not needed. The user's interaction with the Virtual Whiteboard computer...
-
Potential and Use of the Googlenet Ann for the Purposes of Inland Water Ships Classification
PublikacjaThis article presents an analysis of the possibilities of using the pre-degraded GoogLeNet artificial neural network to classify inland vessels. Inland water authorities monitor the intensity of the vessels via CCTV. Such classification seems to be an improvement in their statutory tasks. The automatic classification of the inland vessels from video recording is a one of the main objectives of the Automatic Ship Recognition and...
-
Automation of the Road Gate Operations Process at the Container Terminal—A Case Study of DCT Gdańsk SA
PublikacjaThe future increased terminal capacity will result in more container movement through the road complex and rail siding, which are one of the most critical areas (potential bottlenecks) in the container terminal. Truck turnaround time is one of the major factors that customers take into account while deciding how many container volumes they will handle through the container terminal. To enable to optimize increased traffic with...
-
State of the art electronic nose technology and future trends
PublikacjaThis chapter briefly reviews the progress in field of artificial olfaction and demonstrates future trends in electronic nose technology. The discussion about e-nose concern also a big challenge for the pattern recognition (PARC) systems due to several particular problems they involve. Finally, the application of e-nose in different areas of life is given.
-
On Facial Expressions and Emotions RGB-D Database
PublikacjaThe goal of this paper is to present the idea of creating reference database of RGB-D video recordings for recognition of facial expressions and emotions. Two different formats of the recordings used for creation of two versions of the database are described and compared using different criteria. Examples of first applications using databases are also presented to evaluate their usefulness.
-
A Device for Measuring Auditory Brainstem Responses to Audio
PublikacjaStandard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...
-
Secured wired BPL voice transmission system
PublikacjaDesigning a secured voice transmission system is not a trivial task. Wired media, thanks to their reliability and resistance to mechanical damage, seem an ideal solution. The BPL (Broadband over Power Line) cable is resistant to electricity stoppage and partial damage of phase conductors, ensuring continuity of transmission in case of an emergency. It seems an appropriate tool for delivering critical data, mostly clear and understandable...
-
The relationship between architectural detail and light in contemporary architecture
PublikacjaThe paper deals with the influence of modern artificial and natural lighting technology on contemporary architecture, especially in relation to architectural detail. Advanced complex lighting systems have an increasing importance in contemporary design solutions. Light itself, and the effect of its actions, and characteristic parts of the sophisticated lighting systems, play an essential role as independent architectural elements,...
-
Using Different Information Channels for Affect-Aware Video Games - A Case Study
PublikacjaThis paper presents the problem of creating affect-aware video games that use different information channels, such as image, video, physiological signals, input devices, and player’s behaviour, for emotion recognition. Presented case studies of three affect-aware games show certain conditions and limitations for using specific signals to recognize emotions and lead to interesting conclusions.
-
Perceptual and Motor Effects of Muscle Co-activation in a Force Production Task
PublikacjaWe tested several predictions of the theory of motor control with spatial referent coordinates related to effects of muscle coactivation on force production and perception. In particular, we predicted that subjects would produce unintentional force increase by finger flexors and be unaware of this force increase. Healthy subjects performed steady force production task in isometric conditions with visual feedback on the force level....
-
Awareness evaluation of patients in vegetative state employing eye-gaze tracking system
PublikacjaApplication of eye-gaze tracking system to awareness evaluation is demonstrated. Hitherto awareness evaluation methods are presented. The assumptions of proposed method based on analysis of visual activity of patients in vegetative state are demonstrated. The eye-gaze tracking system ''Cyber-Eye'' developed at the Multimedia Systems Department employed to conducted experiments is presented. Research described in the paper indicates...
-
Gesture-controlled Sound Mixing System With a Sonified Interface
PublikacjaIn this paper the Authors present a novel approach to sound mixing. It is materialized in a system that enables to mix sound with hand gestures recognized in a video stream. The system has been developed in such a way that mixing operations can be performed both with or without visual support. To check the hypothesis that the mixing process needs only an auditory display, the influence of audio information visualization on sound...
-
Guitar String Sound Retrieved from Moving Pixels
PublikacjaThe aim of this study was to develop a method of visual recording and analyzing the vibrations of guitar strings using high-speed cameras and dedicated video processing algorithms. The recording of a plucked string reveals the way in which the deformations propagate, composing the standing and travelling wave. The paper compares the results for a few selected models of classical and acoustic guitars, and it involves processing...
-
An electronic nose based on the semiconducting and electrochemical gas sensors
PublikacjaThe practical application of human nose for fragrance recognition is severely limited by the fact that our sense of smell is subjective and gets tired easily. Consequently, there is a significant need for an instrument that can be a substitution of the human sense of smell. Development of an electronic nose devices is an active area of research starting from pioneering research of Dodd and Persuad in the mid-1980s. Such systems...
-
Further developments of parameterization methods of audio stream analysis for secuirty purposes
PublikacjaThe paper presents an automatic sound recognition algorithm intended for application in an audiovisual security monitoring system. A distributed character of security systems does not allow for simultaneous observation of multiple multimedia streams, thus an automatic recognition algorithm must be introduced. In the paper, a module for the parameterization and automatic detection of audio events is described. The spectral analyses...
-
Comparison of Lithuanian and Polish Consonant Phonemes Based on Acoustic Analysis – Preliminary Results
PublikacjaThe goal of this research is to find a set of acoustic parameters that are related to differences between Polish and Lithuanian language consonants. In order to identify these differences, an acoustic analysis is performed, and the phoneme sounds are described as the vectors of acoustic parameters. Parameters known from the speech domain as well as those from the music information retrieval area are employed. These parameters are...
-
Multimedia industrial and medical applications supported by machine learning
PublikacjaThis article outlines a keynote paper presented at the Intelligent DecisionTechnologies conference providing a part of the KES Multi-theme Conference “Smart Digital Futures” organized in Rome on June 14–16, 2023. It briefly discusses projects related to traffic control using developed intelligent traffic signs and diagnosing the health of wind turbine mechanisms and multimodal biometric authentication for banking branches to provide...
-
Sound Art and Architecture: New Horizons for Architecture and Urbanism
PublikacjaThe article discusses the crossroad between art and architecture. It sketches out the theoretical and practical aspects of involving art into architecture and multisensory dimensions of space. The analysis is based on examples of innovative experimental activities for architecture: educational projects such as workshops, seminars and courses, combining art and architecture, with special emphasis on sound art, and the consequences...
-
Flock behavior and control
PublikacjaIn this paper we present the results of the Flock Behaviour and Control workshop cluster during “Shapes of Logic Conference 2015”. During the event, students got familiar with the techniques of both visual and sound real-time data processing. The second topic presented for students was behaviourbased approach of design process, mainly based on the mathematical rules set up by Craig Raynolds on the swarm behaviour. The aim of the...
-
Architektura medialna : kierunki rozwoju w kontekście sztuki i technologii
PublikacjaW artykule zaprezentowane jest zagadnienie architektury medialnej w obrębie której następuje integracja sztuki i technologii. Jest to architektura, która staje się nośnikiem zmiennych treści wizualnych w oparciu o technologie informacyjne i oświetleniowe. Zagadnienie to rozpatrywane jest w kontekście przestrzeni komercyjnych, w których architektura medialna staję się nowym sposobem ich promocji. Zarysowany zostanie kontekst rozwoju...
-
Independent dynamics of low, intermediate, and high frequency spectral intracranial EEG activities during human memory formation
PublikacjaA wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various frequency ranges are coordinated across the space of the human cortex and time of memory processing is inconclusive. They can either be coordinated together across the frequency spectrum at the same cortical site and time or induced independently in particular bands. We used a large dataset of human intracranial...
-
Nowa metoda diagnostyki stanu technicznego nakładek stykowych
PublikacjaThe current collection system, which consists of the overhead contact line and a current collector, is particularly important in electric rail vehicles, where their reliability is concerned. Faultless current collection is conditioned not only by suitable construction of these elements but also by their proper maintenance. Retaining permanent electrical contact is essential in DC systems, where current demand is relatively high. In...
-
Chirp Rate and Instantaneous Frequency Estimation: Application to Recursive Vertical Synchrosqueezing
PublikacjaThis letter introduces new chirp rate and instantaneous frequency estimators designed for frequency-modulated signals. These estimators are first investigated from a deterministic point of view, then compared together in terms of statistical efficiency. They are also used to design new recursive versions of the vertically synchrosqueezed short-time Fourier transform, using a previously published method (D. Fourer, F. Auger, and...
-
Distributed Representations Based on Geometric Algebra: the Continuous Model
PublikacjaAuthors revise the concept of a distributed representation of data as well as two previously developed models: Holographic Reduced Representation (HRR) and Binary Spatter Codes (BSC). A Geometric Analogue (GAc - ''c'' stands for continuous as opposed to its discrete version) of HRR is introduced - it employs role-filler binding based on geometric products. Atomic objects are real-valued vectors in n-dimensional Euclidean space...
-
Improving the Accuracy in Sentiment Classification in the Light of Modelling the Latent Semantic Relations
PublikacjaThe research presents the methodology of improving the accuracy in sentiment classification in the light of modelling the latent semantic relations (LSR). The objective of this methodology is to find ways of eliminating the limitations of the discriminant and probabilistic methods for LSR revealing and customizing the sentiment classification process (SCP) to the more accurate recognition of text tonality. This objective was achieved...
-
Validating data acquired with experimental multimodal biometric system installed in bank branches
PublikacjaAn experimental system was engineered and implemented in 100 copies inside a real banking environment comprising: dynamic handwritten signature verification, face recognition, bank client voice recognition and hand vein distribution verification. The main purpose of the presented research was to analyze questionnaire responses reflecting user opinions on: comfort, ergonomics, intuitiveness and other aspects of the biometric enrollment...
-
Comparison of Methods for Real and Imaginary Motion Classification from EEG Signals
PublikacjaA method for feature extraction and results of classification of EEG signals obtained from performed and imagined motion are presented. A set of 615 features was obtained to serve for the recognition of type and laterality of motion using 8 different classifications approaches. A comparison of achieved classifiers accuracy is presented in the paper, and then conclusions and discussion are provided. Among applied algorithms the...