Search results for: VISUAL MONITORING
-
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
PublicationThe problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
-
Light formed through urban morphology and different organism groups: First findings from a systematic review
PublicationThe prevailing implementation and usage of contemporary lighting technologies and design practices in cities have created over-illuminated built environments. Recent studies indicate that exposure to electric lighting effects formed through spatial characteristics has visual, physiological, and behavioural effects on both humans and non-humans, such as wildlife. In order to gain a better understanding of the impact that electric...
-
Visual Data Encryption for Privacy Enhancement in Surveillance Systems
PublicationIn this paper a methodology for employing reversible visual encryption of data is proposed. The developed algorithms are focused on privacy enhancement in distributed surveillance architectures. First, motivation of the study performed and a short review of preexisting methods of privacy enhancement are presented. The algorithmic background, system architecture along with a solution for anonymization of sensitive regions of interest...
-
Public spaces connecting cities. Green and Blue Infrastructures potential.
PublicationCity fragmentation causes many negative effects in the urban environment, such as disconnected environmental, functional and compositional relations, a loss of urban compactness, chaotic development, visual chaos, domination of the technical landscape, and reduced security. This is why one of the main challenges for urban planners is to connect the fragmented structures by creating friendly, attractive and safe public space....
-
Objectivization of audio-video correlation assessment experiments
PublicationThe purpose of this paper is to present a new method of conducting an audio-visual correlation analysis employing a head-motion-free gaze tracking system. First, a review of related works in the domain of sound and vision correlation is presented. Then assumptions concerning audio-visual scene creation are briefly described. The objectivization process of carrying out correlation tests employing a gaze-tracking system is outlined....
-
Remote Estimation of Video-Based Vital Signs in Emotion Invocation Studies
PublicationAbstract— The goal of this study is to examine the influence of various imitated and video invoked emotions on the vital signs (respiratory and pulse rates). We also perform an analysis of the possibility to extract signals from sequences acquired with cost-effective cameras. The preliminary results show that the respiratory rate allows for better separation of some emotions than the pulse rate, yet this relation highly depends...
-
Preferences of the Facade Composition in the Context of Its Regularity and Irregularity
PublicationAbstract: The aim of this study is to determine the preferences of Polish society towards building facades depending on the degree of the composition regularity of the facade elements. The subject matter is inspired by the authors’ observations in relation to the current architectural trends. The purposefulness of the conducted research results from several issues. Firstly, the reports of psychology and neurosciences clearly indicate...
-
Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology
PublicationThis paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective...
-
Support for argument structures review and assessment
PublicationArgument structures are commonly used to develop and present cases for safety, security and for other properties of systems. Such structures tend to grow excessively, which causes problems with their review and assessment. Two issues are of particular interest: (1) systematic and explicit assessment of the compelling power of an argument, and (2) communication of the result of such an assessment to relevant recipients. The paper...
-
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
PublicationThe problem of accurately differentiating between the speaker's utterance and the noise parts in a speech signal is considered. The influence of utilizing voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...
-
New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception
PublicationThe influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...
-
An audio-visual corpus for multimodal automatic speech recognition
PublicationA review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings are provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...
-
Information management enhancement with simulation: case studies.
PublicationThe chapter discusses the role of simulation in knowledge and information management processes, pointing to its importance as a basic tool in this area. A selected simulation platform based on the Visual SLAM system is introduced.
-
Effect of Storage Conditions of Rutile Flux Cored Welding Wires on Properties of Welds
PublicationThe influence of storage locations of two grades of rutile flux cored welding wires on their surface condition and the strength of the welds made with them were studied. Wires were stored in real urban conditions (Gdańsk and Katowice) for 1 month, simultaneously recording changes in conditions: temperature and relative humidity of the environment. Visual tests of wires in the delivered and stored condition as well as visual and...
-
Classification of Landscape Physiognomies in Rural Poland: The Case of the Municipality of Cekcyn
PublicationThis article presents a methodology and the results of the classification of the rural landscapes physiognomies conducted on the study area located in the municipality of Cekcyn, Poland. The study aimed to develop a landscape identification method that would combine natural, cultural, and visual criteria with which to implement the provisions of the European Landscape Convention. The realization of the European Landscape Convention...
-
Image Representation for Cognitive Systems Using SOEKS and DDNA: A Case Study for PPE Compliance
PublicationCognitive Vision Systems have gained significant interest from academia and industry during the past few decades, and one of the main reasons behind this is the potential of such technologies to revolutionize human life as they intend to work under complex visual scenes, adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination of these properties aims to mimic the human capabilities...
-
Visual Features for Endoscopic Bleeding Detection
PublicationAims: To define a set of high-level visual features of endoscopic bleeding and evaluate their capabilities for potential use in automatic bleeding detection. Study Design: Experimental study. Place and Duration of Study: Department of Computer Architecture, Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, between March 2014 and May 2014. Methodology: The features have...
-
Reverberation divergence in VR applications
PublicationThe aim of this project was to investigate the correlation between virtual reality (VR) imagery and ambisonic sound. With the increasing popularity of VR applications, understanding how sound is perceived in virtual environments is crucial for enhancing the immersiveness of the experience. By examining the relationship between visual scenes and sound scenes, this research attempts to explore how the interaction between vision and...
-
Towards Precise Visual Navigation and Direct Georeferencing for MAV Using ORB-SLAM2
PublicationThe low accuracy of positioning using the Global Navigation Satellite System (GNSS) does not meet geodetic requirements for direct image georeferencing in Unmanned Aerial Vehicle (UAV) photogrammetry. The majority of UAVs are equipped with monocular or stereo non-metric cameras for either visual data gathering or a live video feed for the operator. The cheap positioning techniques used on board commercial UAVs are not as precise as geodetic...
-
Selected aspects of customization of cognitive dimensions for evaluation of visual modeling languages.
PublicationFor the successful application of diagrams in software engineering, high quality visual modelling languages (VML) are required. There is a need for new effective methodologies of VML evaluation. This paper discusses selected aspects of applying cognitive dimensions as a basis of the evaluation. Then, it briefly presents CD-VML methodology which integrates the cognitive dimensions with a theory of visual modelling languages. Finally,...
-
Games and play with light in architecture
PublicationThe paper deals with the issue of the influence of daylight on the creation of architecture in the view of designers` play with light in the architectural space. Using the examples of contemporary realizations of some art museums, the work demonstrates the impact of exploration and experimentation conducted by the creators of visual arts on the design styles and architectural solutions. It also reveals the historical continuity...
-
Impact of Usability Website Attributes on Users’ Trust, Satisfaction and Loyalty
PublicationThis paper presents the results of an experimental study aimed at identifying possible relationships among website usability characteristics, consumer satisfaction, trust and loyalty. These factors regard not only customer satisfaction in a transactional sense, but in the long term they may affect e-customer behavior, opinions, recommendations and attitudes toward using on-line services in general. The study was performed with...
-
Consciousness Study of Subjects with Unresponsive Wakefulness Syndrome Employing Multimodal Interfaces
PublicationThe paper presents a novel multimodal-based methodology for consciousness study of individuals with unresponsive wakefulness syndrome. Two interfaces were employed in the experiments: eye gaze tracking system – CyberEye developed at the Multimedia Systems Department, and EEG device with electrode placement in the international 10-20 standard. It was a pilot study for checking if it is possible to determine objective methods based...
-
Daylighting Education in Practice Verification of a new goal within a European knowledge investigation
PublicationTwo independent surveys were conducted in 2017 and in 2018 among architecture students across Europe to investigate their knowledge on daylighting and the impact of that knowledge on the visual perception of daylit spaces. A total of 600 responders were involved. This paper presents findings from the second survey, which was distributed in six European countries. Based on the findings from the first survey, a new goal was set for...
-
Media architecture: participation through the senses
PublicationPervasive media and interactive technologies have become inseparable not only from our everyday life but also from architecture and city spaces. However, the generic use of new technologies in the design process and material production that affects contemporary architecture, results in buildings that become mere visual objects losing their hapticity and non-visual qualities. Despite the substantial advancement in the research studies...
-
Visual Content Representation for Cognitive Systems: Towards Augmented Intelligence
PublicationCognitive Vision Systems have gained significant attention from academia and industry during the past few decades. One of the main reasons behind this interest is the potential of such technologies to revolutionize human life since they intend to work robustly under complex visual scenes (which environmental conditions may vary), adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination...
-
Report no WOiO/II/67/2015 - Expertise about load capacity of twistlock foundation and sliding foundation
PublicationThe customer delivered 17 elements to the Laboratory: twist lock foundations and sliding lock foundations (ship equipment for container lashing). The elements were selected from a production series. Each element was loaded to its breaking load, and a visual inspection was performed after the test. The expert report contains a description of the tested elements, test assumptions, the test stand, results and conclusions.
-
Context-Aware Indexing and Retrieval for Cognitive Systems Using SOEKS and DDNA
PublicationVisual content searching, browsing and retrieval tools have been a focus area of interest as they are required by systems from many different domains. Context-based, Content-Based, and Semantic-based are different approaches utilized for indexing/retrieving, but have their drawbacks when applied to systems that aim to mimic the human capabilities. Such systems, also known as Cognitive Systems, are still limited in terms of processing...
-
Digital document life cycle development
PublicationThe DDLC model for producing interactive digital documents from their paper originals is presented. The DDLC model, developed within the EU 5th Framework Programme project IST-2002-33441 MEMORIAL, distinguishes 6 phases and the corresponding groups of tool functionality for carrying them out. The production cycle implements a total quality control policy that uses the specially developed Visual GQM method.
-
Integration of thermographic data with the 3D object model
PublicationThe aim of the paper is to present a new method for merging the 3D model data of the measured object with thermograms. Our technique is based on the combination of a visual 3D imaging technique and a thermal imaging technique, which maps the 2D thermograms onto a 3D anatomical mesh model. The combination of these imaging modalities allows the generation of combined 3D and thermal data from which thermal signatures can be verified and...
-
Two-photon microperimetry with picosecond pulses
PublicationTwo-photon vision is a phenomenon associated with the perception of short pulses of near-infrared radiation (900-1200 nm) as a visible light. It is caused by the nonlinear process of two-photon absorption by visual pigments. Here we present results showing the influence of pulse duration and repetition rate of short pulsed lasers on the visual threshold. We compared two-photon sensitivity maps of the retina obtained for subjects with...
-
Robust Object Detection with Multi-input Multi-output Faster R-CNN
PublicationRecent years have seen impressive progress in visual recognition on many benchmarks, however, generalization to the out-of-distribution setting remains a significant challenge. A state-of-the-art method for robust visual recognition is model ensembling. However, recently it was shown that similarly competitive results could be achieved with a much smaller cost, by using multi-input multi-output architecture (MIMO). In this work,...
-
Independent dynamics of slow, intermediate, and fast intracranial EEG spectral activities during human memory formation
PublicationA wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various low and high frequencies are spatiotemporally coordinated across the human brain during memory processing is inconclusive. They can either be coordinated together across a wide range of the frequency spectrum or induced in specific bands. We used a large dataset of human intracranial electroencephalography...
-
Integrated environments for designing web applications (Zintegrowane środowiska projektowania aplikacji internetowych).
PublicationIntegrated environments that support application analysis, design and implementation are the dream working tool of every software engineer. Attempts to deliver such an environment are described: Borland Delphi 5.0 and Rational XDE, a UML design environment intended for integration with existing implementation environments such as Microsoft Visual Studio.NET and IBM Web Sphere...
-
Computer-Supported Polysensory Integration Technology for Educationally Handicapped Pupils
PublicationIn this paper, a multimedia system providing technology for hearing and visual attention stimulation is briefly presented. The system aims to support the development of educationally handicapped pupils. The system has been presented in the context of its configuration, architecture, and therapeutic exercise implementation issues. Results of pupils’ improvements after 8 weeks of training with the system are also provided. Training...
-
Stability analysis of a vertical cylindrical double-shell tank founded on a ground foundation (Analiza stateczności walcowego pionowego zbiornika dwupłaszczowego posadowionego na fundamencie gruntowym)
PublicationThe influence of uneven settlement of the inner and outer shells on the load-bearing capacity of a steel double-shell tank founded on a sand bed over layered subsoil is discussed. The computational analysis was carried out numerically using the MSC Visual Nastran for Windows computer system, version 2001. It was found that using a Winkler-type subsoil model in the calculations yields excessively large...
-
The relationship between architectural detail and light in contemporary architecture
PublicationThe paper deals with the influence of modern artificial and natural lighting technology on contemporary architecture, especially in relation to architectural detail. Advanced complex lighting systems have an increasing importance in contemporary design solutions. Light itself, and the effect of its actions, and characteristic parts of the sophisticated lighting systems, play an essential role as independent architectural elements,...
-
Perceptual and Motor Effects of Muscle Co-activation in a Force Production Task
PublicationWe tested several predictions of the theory of motor control with spatial referent coordinates related to effects of muscle coactivation on force production and perception. In particular, we predicted that subjects would produce unintentional force increase by finger flexors and be unaware of this force increase. Healthy subjects performed steady force production task in isometric conditions with visual feedback on the force level....
-
Language material for English audiovisual speech recognition system development. Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublicationThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Guitar String Sound Retrieved from Moving Pixels
PublicationThe aim of this study was to develop a method of visual recording and analyzing the vibrations of guitar strings using high-speed cameras and dedicated video processing algorithms. The recording of a plucked string reveals the way in which the deformations propagate, composing the standing and travelling wave. The paper compares the results for a few selected models of classical and acoustic guitars, and it involves processing...
-
Gesture-controlled Sound Mixing System With a Sonified Interface
PublicationIn this paper the Authors present a novel approach to sound mixing. It is materialized in a system that enables sound to be mixed with hand gestures recognized in a video stream. The system has been developed in such a way that mixing operations can be performed both with and without visual support. To check the hypothesis that the mixing process needs only an auditory display, the influence of audio information visualization on sound...
-
Awareness evaluation of patients in vegetative state employing eye-gaze tracking system
PublicationApplication of an eye-gaze tracking system to awareness evaluation is demonstrated. Awareness evaluation methods used hitherto are presented. The assumptions of the proposed method, based on analysis of the visual activity of patients in a vegetative state, are demonstrated. The eye-gaze tracking system ''Cyber-Eye'', developed at the Multimedia Systems Department and employed to conduct the experiments, is presented. Research described in the paper indicates...
-
Sound Art and Architecture: New Horizons for Architecture and Urbanism
PublicationThe article discusses the crossroad between art and architecture. It sketches out the theoretical and practical aspects of involving art into architecture and multisensory dimensions of space. The analysis is based on examples of innovative experimental activities for architecture: educational projects such as workshops, seminars and courses, combining art and architecture, with special emphasis on sound art, and the consequences...
-
Flock behavior and control
PublicationIn this paper we present the results of the Flock Behaviour and Control workshop cluster during “Shapes of Logic Conference 2015”. During the event, students became familiar with techniques for real-time processing of both visual and sound data. The second topic presented to students was the behaviour-based approach to the design process, mainly based on the mathematical rules set up by Craig Reynolds for swarm behaviour. The aim of the...
-
Independent dynamics of low, intermediate, and high frequency spectral intracranial EEG activities during human memory formation
PublicationA wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various frequency ranges are coordinated across the space of the human cortex and time of memory processing is inconclusive. They can either be coordinated together across the frequency spectrum at the same cortical site and time or induced independently in particular bands. We used a large dataset of human intracranial...
-
Building Knowledge for the Purpose of Lip Speech Identification
PublicationConsecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...
-
Human verbal memory encoding is hierarchically distributed in a continuous processing stream
PublicationProcessing of memory is supported by coordinated activity in a network of sensory, association, and motor brain regions. It remains a major challenge to determine where memory is encoded for later retrieval. Here we used direct intracranial brain recordings from epilepsy patients performing free recall tasks to determine the temporal pattern and anatomical distribution of verbal memory encoding across the entire human cortex. High...
-
Developing a Framework for the Implementation of Landscape and Greenspace Indicators in Sustainable Urban Planning. Waterfront Landscape Management: Case Studies in Gdańsk, Poznań and Bristol
PublicationUrban landscape (UL) management and urban greenspace (UG) delivery require effective planning tools. The aim of the study is to develop a conceptual framework for the implementation of ecological, structural and visual landscape and greenspace indicators (LGI) in spatial development of urban areas. The UL and UG management provisions in Poland are identified at various levels of urban planning (local, municipal and regional). Furthermore,...