Filters
total: 1859
filtered: 1380
-
Catalog
Chosen catalog filters
displaying 1000 best results Help
Search results for: audio-visual interaction
-
Human System Interaction in Review: Advancing the Artificial Intelligence Transformation
PublicationThe industrial advancement of human society has been fundamentally driven by diverse ‘systems’ that facilitate ‘human interaction’ within physical, digital, virtual, social and artificial environments, and upon the hyper-connected layers of system-system interactions across these environments. The research and practice of Human System Interaction (HSI) has undergone exponential development due to the enhanced capabilities, increased...
-
Visual method for detecting critical damage in railway contact strips
PublicationEnsuring an uninterrupted supply of power in the electric traction is vital for the safety of this important transport system. For this purpose, monitoring and diagnostics of the technical condition of the vehicle's power supply elements are becoming increasingly common. This paper presents a new visual method for detecting contact strip damage, based on measurement and analysis of the movement of the overhead contact line (OCL)...
-
FE analysis of support-specimen interaction of compressive experimental test
PublicationThe objective of this work is to investigate the support-specimen interaction during the compressive experimental testing of stiffened plates. The interaction is analyzed employing the nonlinear Finite Element Method using the commercial software ANSYS. The connection between the stiffened plate and testing supports is modelled with the use of contact elements, where several possible interaction scenarios are investigated, and...
-
Interaction between acoustic and non-acoustic mode in bubbly liquid
PublicationThe nonlinear interaction of acoustic and entropy modes in a bubbly liquid is the subject of investigation. Thedynamic equation governing an excess density of the entropy mode is derived. Nonlinearity and dispersion are the reasons forexcitation of the entropy mode. The nonlinear interaction of modes as a reason for bubble to grow due to sound, is discovered.Some numerical examples of the modes interactions are made.
-
Experimental verification of visual method for measuring displacements of contact line elements
PublicationThe increase of rail vehicles speed, as well as the increase of their power, puts high demands on the power delivery system for traction vehicles The most critical point in the vehicle's power supply circuit is the contact between the current collector and contact wires. Ensuring a reliable co-operation of the current collector and contact line, requires technical development...
-
Analysis of impact of lossy audio compression on the robustness of watermark embedded in the DWT domain for non-blind copyright protection
PublicationA methodology of non-blind watermarking of the audio content is proposed. The outline of audio copyright problem and motivation for practical applications are discussed. The algorithmic theory pertaining watermarking techniques is briefly introduced. The system architecture together with employed workflows for embedding and extracting the watermarks are described. The implemented approach is described and obtained results are reported....
-
AUDIO SIGNAL EQUALIZATION BASED ON IMPULSE RESPONSE OF A LISTENING ROOM AND MUSIC CONTENT REPRODUCED
PublicationA research study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, a concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....
-
Multimodal human-computer interfaces based on advanced video and audio analysis
PublicationMultimodal interfaces development history is reviewed briefly in the introduction. Some applications of multimodal interfaces to education software for disabled people are presented. One of them, the LipMouse is a novel, vision-based human-computer interface that tracks user’s lip movements and detect lips gestures. A new approach to diagnosing Parkinson’s disease is also shown. The progression of the disease can be measured employing...
-
Child-Robot Interaction Studies During COVID-19 Pandemic
PublicationThe coronavirus disease (COVID-19) pandemic affected our lives deeply, just like everyone else, the children also suffered from the restrictions due to COVID-19 affecting their education and social interactions with others, being restricted from play areas and schools for a long time. Although social robots provide a promising solution to support children in their education, healthcare, and social interaction with others, the precautions...
-
Multiple Cues-Based Robust Visual Object Tracking Method
PublicationVisual object tracking is still considered a challenging task in computer vision research society. The object of interest undergoes significant appearance changes because of illumination variation, deformation, motion blur, background clutter, and occlusion. Kernelized correlation filter- (KCF) based tracking schemes have shown good performance in recent years. The accuracy and robustness of these trackers can be further enhanced...
-
Selection of Visual Descriptors for the Purpose of Multi-camera Object Re-identification
PublicationA comparative analysis of various visual descriptors is presented in this chapter. The descriptors utilize many aspects of image data: colour, texture, gradient, and statistical moments. The descriptor list is supplemented with local features calculated in close vicinity of key points found automatically in the image. The goal of the analysis is to find descriptors that are best suited for particular task, i.e. re-identification...
-
SZTUKA WIZUALNA W OBIEKTACH MEDYCZNYCH = VISUAL ARTS IN MEDICAL FACILITIES
PublicationWspółczesna architektura obiektów służby zdrowia podlega dynamicznym przeobrażeniom formalnym wynikającym zarówno z rozwoju technologii medycznych, zmian zachodzących w podejściu wobec pacjenta. Narastający w naukach medycznych kierunek holistyczny ustawia pacjenta jako użytkownika w trzech wymiarach: biologicznym, społecznym i psychologicznym. Stąd pojawiające się w procesie projektowym dotyczącym szpitali czy przychodni nowe...
-
Dynamic Semantic Visual Information Management
PublicationDominant Internet search engines use keywords and therefore are not suited for exploration of new domains of knowledge, when the user does not know specific vocabulary. Browsing through articles in a large encyclopedia, each presenting a small fragment of knowledge, it is hard to map the whole domain, see relevant concepts and their relations. In Wikipedia for example some highly relevant articles are not linked with each other....
-
Visual impairment and traits of autism in children
Publication -
A methodology of visual modeling language evaluation
PublicationMetody oceny jakości metod modelowania są istotnym elementem inżynierii języków modelowania wizualnego. W referacie zaproponowano metodę oceny języków modelowania wizualnego na podstawie wymiarów poznawczych. Zaprezentowano metodologiczną dyskusję zastosowania nauk psychologicznych do oceny metod modelowania, metodologię CD-VML, powiązaną z nią metodę CD-VML-UC do oceny przypadków użycia oraz weryfikację metodologii.
-
A new approach to visual system testing
PublicationOpisano budowę laboratoryjnego stanowiska prac bawczych nad perymetrią obiektywną. Przedstawiono zasadę działania algorytmu VEPDA oraz wyniki działania VEPDA na danych eksperymentalnych.
-
Application of the fluid–structure interaction technique for the analysis of hydrodynamic lubrication problems.
PublicationFluid–structure interaction technique seems to be one of the most promising possibilities for theoretical analysis of lubrication problems. It allows coupling of different physical fields in one computational task, taking into account the interaction between them. In this article, two sets of fluid–structure interaction analyses focusing on the bearing performance evaluation are presented. One analysis was applied to a water-lubricated...
-
Robot Eye Perspective in Perceiving Facial Expressions in Interaction with Children with Autism
PublicationThe paper concerns automatic facial expression analysis applied in a study of natural “in the wild” interaction between children with autism and a social robot. The paper reports a study that analyzed the recordings captured via a camera located in the eye of a robot. Children with autism exhibit a diverse level of deficits, including ones in social interaction and emotional expression. The aim of the study was to explore the possibility...
-
Examining Government-Citizen Interactions on Twitter using Visual and Sentiment Analysis
PublicationThe goal of this paper is to propose a methodology comprising a range of visualization techniques to analyze the interactions between government and citizens on the issues of public concern taking place on Twitter, mainly through the official government or ministry accounts. The methodology addresses: 1) the level of government activity in different countries and sectors; 2) the topics that are addressed through such activities;...
-
Towards Audio Signal Equalization Based on Spectral Characteristics of a Listening Room and Music Content Reproduced
PublicationThis study presents investigations of the influence of the room acoustics on the frequency characteristic of the audio signal playback. First, the concept of a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on an equalizer designed is proposed. The system settings depend on music genre recognized automatically....
-
EMG and gaze based interaction with graphic interface of smart glasses application
PublicationIn this paper we investigate the effectiveness of the interaction using eye tracking and electromyography. Smart glasses requires reliable interfaces for controlling the graphic content displayed directly in front of the user's eye. Presented research is related with the eGlasses project, which is focused on the development of an open platform in the form of multisensory electronic glasses and related interaction methods. One of...
-
New semi-causal and noncausal techniques for detection of impulsive disturbances in multivariate signals with audio applications
PublicationThis paper deals with the problem of localization of impulsive disturbances in nonstationary multivariate signals. Both unidirectional and bidirectional (noncausal) detection schemes are proposed. It is shown that the strengthened pulse detection rule, which combines analysis of one-step-ahead signal prediction errors with critical evaluation of leave-one-out signal interpolation errors, allows one to noticeably improve detection results...
-
Text Categorization Improvement via User Interaction
PublicationIn this paper, we propose an approach to improvement of text categorization using interaction with the user. The quality of categorization has been defined in terms of a distribution of objects related to the classes and projected on the self-organizing maps. For the experiments, we use the articles and categories from the subset of Simple Wikipedia. We test three different approaches for text representation. As a baseline we use...
-
Visual Features for Improving Endoscopic Bleeding Detection Using Convolutional Neural Networks
PublicationThe presented paper investigates the problem of endoscopic bleeding detection in endoscopic videos in the form of a binary image classification task. A set of definitions of high-level visual features of endoscopic bleeding is introduced, which incorporates domain knowledge from the field. The high-level features are coupled with respective feature descriptors, enabling automatic capture of the features using image processing methods....
-
FEM modelling of screw displacement pile interaction with subsoil
PublicationPredicting the-settlement characteristics of piles is an important element in the designing of pile foundations. The most reliable method in evaluating pile-soil interaction is the static load test, preferably performed with instrumentation for measuring shaft and pile base resistances. This, however, is a mostly post-implementation test. In the design phase, prediction methods are needed, in which numerical simulations play an...
-
Elimination of impulsive disturbances from archive audio files – comparison of three noise pulse detection schemes
PublicationThe problem of elimination of impulsive disturbances (such as clicks, pops, ticks, crackles, and record scratches) from archive audio recordings is considered and solved using autoregressive modeling. Three classical noise pulse detection schemes are examined and compared: the approach based on open-loop multi-step-ahead signal prediction, the approach based on decision-feedback signal prediction, and the double threshold approach,...
-
A Device for Measuring Auditory Brainstem Responses to Audio
PublicationStandard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...
-
Adaptive filter for reconstruction of stereo audio signals.
PublicationArtykuł poświęcony jest omówieniu metody rekonstrukcji zakłóconych impulsowo sygnałów stereofonicznych. W pracy zdefiniowano model sygnału stereofonicznego i przedstawiono zaprojektowany dla tego modelu filtr Kalmana. Przedstawiono modyfikacje filtru, w wyniku których algorytm dokonuje rekonstrukcji zakłóconego impulsowo sygnału w jednym kanale z wykorzystaniem dodatkowej informacji zawartej w niezakłóconych próbkach sygnału pochodzącego...
-
Intelligent algorithms for optical track audio restoration
PublicationW referacie przedstawiono dwa algorytmy dedykowane redukcji pasożytniczych zniekształceń dźwięku spotykanych w optycznych ścieżkach dźwiękowych. Pierwszy algorytm umożliwia redukcję szerokopasmowego szumu w nagraniach fonicznych. Wykorzystano w nim psycho-akustyczny model słuchu oparty o miarę nieprzewidywalność sygnału (ang. Unpredictability Measure). Ocena jakości redukcji szumu została wykonana z wykorzystaniem metod inteligentnych....
-
Visual Attention Distribution Based Assessment of User's Skill in Electronic Medical Record Navigation
PublicationCurrently, the most precise way of reflecting the skills level is an expert’s subjective assessment. In this paper we investigate the possibility of the use of eye tracking data for scalar quantitative and objective assessment of medical staff competency in EMR system navigation. According to the experiment conducted by Yarbus the observation process of particular features is associated with thinking. Moreover, eye tracking is...
-
Nonlinear Interaction of Modes in a Planar Flow of a Gas with Viscous and Thermal Attenuation
PublicationThe nonlinear interaction of wave and non-wave modes in a gas planar flow are considered. Attention is mainly paid to the case when one sound mode is dominant and excites the counter-propagating sound mode and the entropy mode. The modes are determined by links between perturbations of pressure, density, and fluid velocity. This definition follows from the linear conservation equations in the differential form and thermodynamic...
-
Visual Traffic Noise Monitoring in Urban Areas
PublicationThe paper presents an advanced system for railway and road traffic noise monitoring in metropolitan areas. This system is a functional part of a more complex solution designed for environmental monitoring in cities utilizing analyses of sound, vision and air pollution, based on a ubiquitous computing approach. The system consists of many autonomous, universal measuring units and a multimedia server, which gathers, processes and...
-
Visual impact assessment of river regulation structures
Publication -
Vowel recognition based on acoustic and visual features
PublicationW artykule zaprezentowano metodę, która może ułatwić naukę mowy dla osób z wadami słuchu. Opracowany system rozpoznawania samogłosek wykorzystuje łączną analizę parametrów akustycznych i wizualnych sygnału mowy. Parametry akustyczne bazują na współczynnikach mel-cepstralnych. Do wyznaczenia parametrów wizualnych z kształtu i ruchu ust zastosowano Active Shape Models. Jako klasyfikator użyto sztuczną sieć neuronową. Działanie systemu...
-
Modeling pragmatics for visual modeling language evaluation
PublicationPodczas oceny użyteczności języków modelowania wizualnego istnieje potrzeba uwzględnienia ich pragmatyki. Języki modelowania wizualnego mogą być stosowane w różnym kontekście, co powoduje różnice w wymaganiach, które są im stawiane. Jawny opis kontekstu użycia ułatwia precyzyjną ocenę. Pragmatyka składa się ze zbioru profili, które opisują konkretne konteksty użycia. W referacie podjęto próbę zastosowania modeli zadań do opisu...
-
Genetic Programming for Interaction Efficient Supporting in Volunteer Computing Systems
PublicationVolunteer computing systems provide a middleware for interaction between project owners and great number volunteers. In this chapter, a genetic programming paradigm has been proposed to a multi-objective scheduler design for efficient using some resources of volunteer computers via the web. In a studied problem, genetic scheduler can optimize both a workload of a bottleneck computer and cost of system. Genetic programming has been...
-
Analiza jakości transmisji treści audio-wideo w symulowanym łączu telekomunikacyjnym z wykorzystaniem techniki OFDM
PublicationWdrożenie niezawodnego systemu komunikacji audio-wideo przynosi wiele korzyści. Z uwagi na fakt, że ilość dostępnego pasma stale się kurczy, badacze koncentrują się na nowatorskich metodach transmisji. Obecnie technika OFDM (Orthogonal Frequency Division Multiplexing) jest szeroko stosowana zarówno w mediach przewodowych, jak i bezprzewodowych. W pracy przedstawiono badania jakości QoS (Quality of Service) symulowanego łącza transmisji...
-
A commonly-accessible toolchain for live streaming music events with higher-order ambisonic audio and 4k 360 vision
PublicationAn immersive live stream is especially interesting in the ongoing development of telepresence tools, especially in the virtual reality (VR) or mixed reality (MR) domain. This paper explores the remote and immersive way of enabling telepresence for the audience to high-fidelity music performance using freely-available and easily-accessible tools. A functional VR live-streaming toolchain, comprising 360 vision and higher-order ambisonic...
-
Visual and auditory attention stimulator for assisting pedagogical therapy . Stymulator uwagi wzrokowej i słuchowej do wspomagania terapii pedagogicznej
PublicationVisual and auditory attention stimulator provides a system developed in order to improve reading skills using simultaneous presentation of text in its visual form and in transformed auditory form accompanied by related movie material. The described research employed 40 children at the age of 8 13 years having difficulties in learning of reading, who were diagnosed as having developmental dyslexia. It was shown that application...
-
Calculation methods of interaction of electromagnetic waves with objects of complex geometries
PublicationModeling of the electromagnetic interaction with different homogeneous or inhomo-geneous objects is a fundamental and important problem. It is relatively easy to solve Maxwellequations analytically when the scattering object is spherical or cylindrical, for example. How-ever, when it loses these properties all that is left for us is to useapproximation models, to ac-quire the solution we need. Modeling of complex, non-spherical,...
-
Nonlinear Interaction of Magnetoacoustic Modes in a Quasi-Isentropic Plasma Flow
PublicationThe nonlinear interaction of magnetoacoustic waves in a plasma is analytically studied. A plasma is an open system. It is affected by the straight constant equilibrium magnetic flux density forming constant angle with the wave vector which varies from 0 till . The nonlinear instantaneous equation which describes excitation of secondary wave modes in the field of intense magnetoacoustic perturbations is derived by use of projecting....
-
Wireless intelligent audio-video surveillance prototyping system
PublicationThe presented system is based on the Virtex6 FPGA and several supporting devices like a fast DDR3 memory, small HD camera, microphone with A/D converter, WiFi radio communication module, etc. The system is controlled by the Linux operating system. The Linux drivers for devices implemented in the system have been prepared. The system has been successfully verified in a H.264 compression accelerator prototype in which the most demanding...
-
Analysis of allophones based on audio signal recordings and parameterization
PublicationThe aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping...
-
Audio codec employing frequency-derived tonality measure
PublicationA transform codec employing efficient algorithm for detection of spectral tonal components is presented. The tonality measure used in MPEG psychoacoustic model is replaced with the method providing adequate tonality estimates even if the tonal components are deeply frequency modulated. The reliability of hearing threshold estimated using psychoacoustic model with standardized tonality measure and the proposed one is investigated...
-
Applications of neural networks and perceptual masking to audio restoration
PublicationOmówiono zastosowania algorytmów uczących się w dziedzinie rekonstruowania nagrań fonicznych. Szczególną uwagę zwrócono na zastosowanie sztucznych sieci neuronowych do usuwania zakłócających impulsów. Ponadto opisano zastosowanie inteligentnego algorytmu decyzyjnego do sterowania maskowaniem perceptualnym w celu redukowania szumu.
-
Wow detection and compensation employing spectral processing of audio.
PublicationPraca zawiera opis opracowanych algorytmów detekcji i kompensacji pasożytniczych modulacji częstotliwości wynikających z nierównomiernego przesuwu nośnika dźwięku. Proponowane metody opracowano ze szczególnym uwzględnieniem przypadkowych zniekształceń drżenia obecnych w archiwalnych filmowych ścieżkach dźwiękowych. Dodatkowo algorytmy badają wpływ zniekształceń na strukturę formantową sygnałów. Analiza zmian położenia formantów...
-
New algorithms for wow and flutter detection and compensation in audio
PublicationW referacie przedstawiono nowe metody dyskryminacji naturalnych efektów muzycznych i pasożytniczych zniekształceń drżenia dźwięku. Dodatkowo, opisano w nim metody wyznaczania przebiegu zniekształceń drżenia. Wśród nich znajdują się: detekcja okresowości sygnału w poszczególnych ramkach czasowych, śledzenie zmian przydźwięku sieciowego wykorzystujące modelowane AR widma sygnału, śledzenie zmian wysokoczęstotliwościowego prądu podkładu....
-
New algorithms for wow and flutter detection and compensation in audio
PublicationW referacie przedstawiono nowe metody dyskryminacji naturalnych efektów muzycznych i pasożytniczych zniekształceń drżenia dźwięku. Dodatkowo, opisano w nim metody wyznaczania przebiegu zniekształceń drżenia. Wśród nich znajdują się: detekcja okresowości sygnału w poszczególnych ramkach czasowych, śledzenie zmian przydźwięku sieciowego wykorzystujące modelowane AR widma sygnału, śledzenie zmian wysokoczęstotliwościowego prądu podkładu....
-
Eye-tracking everywhere - software supporting disabled people in interaction with computers
PublicationIn this paper we present comprehensive system for communication with computer by gaze. One of the main assumptions behind this work was to provide solution that can be used with standard RGB webcam. The proposed comprehensive system included the eye tracking module and user interface for convenient gaze interaction with computer. As a result a fully functional application was developed. The average accuracy of the eye tracking...
-
Numerical simulation of screw displacement pile interaction with non-cohesive soil
PublicationA trial numerical simulation of screw displacement pile interaction with non-cohesive subsoil during the transfer of compression load. The simulation was carried out in an axisymmetric system. The technological phases of pile installation in the ground were numerically modelled using equivalent processes which provided similar effects to real technical actions. The results of the numerical calculations were verified by comparing...