Search results for: visual processing

total: 303

Hidden Markov Models for Visual Processing of Marketing Leaflets
Publication
- J. Grobelny
- R. Michalski
- Year 2021
Full text to download in external service
International Journal of Image Processing and Visual Communication

Journals

ISSN: 2319-1724
IEEE International Conference on Visual Communications and Image Processing

Conferences
Pan-Sydney Area Workshop on Visual Information Processing

Conferences
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publication
- International Journal of Image Processing and Visual Communication - Year 2013
In this article, methods for deinterlacing and for removing on-screen text and light flashes from endoscopic video images are discussed. The research is intended to improve the performance of disease recognition algorithms. In the article, four configurations of deinterlacing methods and another four configurations of text and flash removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...

Full text to download in external service
Marek Blok dr hab. inż.

People

Marek Blok graduated in 1994 from the Faculty of Electronics at Gdansk University of Technology, receiving his MSc in telecommunications. In 2003 he received his Ph.D. and in 2017 his D.Sc. in telecommunications from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology. His research interests are focused on the application of digital signal processing in telecommunications. He provides lectures, laboratory...
Michał Lech dr inż.

People

Michał Lech was born in Gdynia in 1983. In 2007 he graduated from the faculty of Electronics, Telecommunications and Informatics of Gdansk University of Technology. In June 2013, he received his Ph.D. degree. The subject of the dissertation was: “A Method and Algorithms for Controlling the Sound Mixing Processes by Hand Gestures Recognized Using Computer Vision”. The main focus of the thesis was the bias of audio perception caused...
Visual Features for Endoscopic Bleeding Detection
Publication
- A. Brzeski
- Current Journal of Applied Science and Technology (British Journal of Applied Science & Technology) - Year 2014
Aims: To define a set of high-level visual features of endoscopic bleeding and evaluate their capabilities for potential use in automatic bleeding detection. Study Design: Experimental study. Place and Duration of Study: Department of Computer Architecture, Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, between March 2014 and May 2014. Methodology: The features have...

Full text available to download
Piotr Szczuko dr hab. inż.

People

Department of Multimedia Systems

Piotr Szczuko received his M.Sc. degree in 2002. His thesis was dedicated to examination of correlation phenomena between perception of sound and vision for surround sound and digital image. He finished Ph.D. studies in 2007 and one year later completed a dissertation "Application of Fuzzy Rules in Computer Character Animation" that received award of Prime Minister of Poland. His interests include: processing of audio and video, computer...
Visual Data Encryption for Privacy Enhancement in Surveillance Systems
Publication
- Year 2013
In this paper a methodology for employing reversible visual encryption of data is proposed. The developed algorithms are focused on privacy enhancement in distributed surveillance architectures. First, motivation of the study performed and a short review of preexisting methods of privacy enhancement are presented. The algorithmic background, system architecture along with a solution for anonymization of sensitive regions of interest...

Full text to download in external service
Human verbal memory encoding is hierarchically distributed in a continuous processing stream
Publication
- M. T. Kucewicz
- K. Saboo
- B. M. Berry
- V. Kremen
- L. R. Miller
- F. Khadjevand
- C. S. Inman
- P. A. Wanda
- M. R. Sperling
- R. Gorniak... and 8 others
- eNeuro - Year 2019
Processing of memory is supported by coordinated activity in a network of sensory, association, and motor brain regions. It remains a major challenge to determine where memory is encoded for later retrieval. Here we used direct intracranial brain recordings from epilepsy patients performing free recall tasks to determine the temporal pattern and anatomical distribution of verbal memory encoding across the entire human cortex. High...

Full text available to download
UAV Design and Construction for Real Time Photogrammetry and Visual Navigation
Publication
- P. Burdziakowski
- Year 2018
Applications of unmanned aerial vehicles (UAVs) in photogrammetry have increased rapidly in recent years. Fast data gathering and processing in real time has become crucial in some applications. The paper proposes a real-time photogrammetry solution in which image data are gathered and processed on board the UAV, and a reconstructed 3D model and measurements are delivered. The paper...

Full text available to download
A Model-Driven Solution for Development of Multimedia Stream Processing Applications
Publication
- Year 2014
This paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...

Full text to download in external service
An audio-visual corpus for multimodal automatic speech recognition
Publication
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2017
A review of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings are provided. The new corpus, containing 31 hours of recordings, was created specifically to assist the development of audio-visual speech recognition (AVSR) systems. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...

Full text available to download
High frequency oscillations are associated with cognitive processing in human recognition memory
Publication
- M. T. Kucewicz
- J. Cymbalnik
- J. Matsumoto
- B. H. Brinkmann
- M. R. Bower
- V. Vasoli
- V. Sulc
- F. Meyer
- W. Marsh
- S. M. Stead
- G. A. Worrell
- Brain: A Journal of Neurology - Year 2014
High frequency oscillations are associated with normal brain function, but also increasingly recognized as potential biomarkers of the epileptogenic brain. Their role in human cognition has been predominantly studied in classical gamma frequencies (30-100 Hz), which reflect neuronal network coordination involved in attention, learning and memory. Invasive brain recordings in animals and humans demonstrate that physiological oscillations...

Full text available to download
Combined Single Neuron Unit Activity and Local Field Potential Oscillations in a Human Visual Recognition Memory Task
Publication
- M. T. Kucewicz
- B. M. Berry
- M. R. Bower
- J. Cymbalnik
- V. Svehlik
- S. M. Stead
- G. A. Worrell
- IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING - Year 2016
GOAL: Activities of neuronal networks range from action potential firing of individual neurons, coordinated oscillations of local neuronal assemblies, and distributed neural populations. Here, we describe recordings using hybrid electrodes, containing both micro- and clinical macroelectrodes, to simultaneously sample both large-scale network oscillations and single neuron spiking activity in the medial temporal lobe structures...

Full text to download in external service
Guitar String Sound Retrieved from Moving Pixels
Publication
- Year 2016
The aim of this study was to develop a method of visual recording and analyzing the vibrations of guitar strings using high-speed cameras and dedicated video processing algorithms. The recording of a plucked string reveals the way in which the deformations propagate, composing the standing and travelling wave. The paper compares the results for a few selected models of classical and acoustic guitars, and it involves processing...

Full text to download in external service
Visual Features for Improving Endoscopic Bleeding Detection Using Convolutional Neural Networks
Publication
- SENSORS - Year 2023
The presented paper investigates the problem of endoscopic bleeding detection in endoscopic videos in the form of a binary image classification task. A set of definitions of high-level visual features of endoscopic bleeding is introduced, which incorporates domain knowledge from the field. The high-level features are coupled with respective feature descriptors, enabling automatic capture of the features using image processing methods....

Full text available to download
Patryk Ziółkowski dr inż.

People

Department of Engineering Structures

Patryk Ziolkowski is a graduate of the Faculty of Civil and Environmental Engineering at the Gdansk University of Technology, specializing in Building and Engineering Structures. He works as an Assistant Professor at the Department of Engineering Structures. He participated in international projects, including projects for the Ministry of Transportation of the State of Alabama (2015), he is also the winner of a grant from the Kosciuszko...
Piotr Odya dr inż.

People

Department of Multimedia Systems

Piotr Odya was born in Gdansk in 1974. He received his M.Sc. in 1999 from the Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, Poland. His thesis was related to the problem of sound quality improvement in the contemporary broadcasting studio. He is interested in video editing and multichannel sound systems. Mr. Odya's Ph.D. thesis concerned methods and algorithms for correcting...
Flock behavior and control
Publication
- K. Radziszewski
- A. Krężlik
- Year 2016
In this paper we present the results of the Flock Behaviour and Control workshop cluster during “Shapes of Logic Conference 2015”. During the event, students became familiar with techniques of real-time visual and sound data processing. The second topic presented to students was a behaviour-based approach to the design process, mainly based on the mathematical rules set up by Craig Reynolds for swarm behaviour. The aim of the...
Sensors and System for Vehicle Navigation
Publication
- A. Stateczny
- W. Kazimierski
- P. Burdziakowski
- SENSORS - Year 2022
In recent years, vehicle navigation, in particular autonomous navigation, has been at the center of several major developments, both in civilian and defense applications. New technologies, such as multisensory data fusion, big data processing, or deep learning, are changing the quality of areas of applications, improving the sensors and systems used. Recently, the influence of artificial intelligence on sensor data processing and...

Full text available to download
Multi-task Video Enhancement for Dental Interventions
Publication
- E. Katsaros
- P. Kopa Ostrowski
- K. P. Włódarczak
- E. Lewandowska
- J. Rumiński
- D. Siupka-Mróz
- Ł. Lassmann
- A. Jezierska
- D. Węsierski
- Year 2022
A microcamera firmly attached to a dental handpiece allows dentists to continuously monitor the progress of conservative dental procedures. Video enhancement in video-assisted dental interventions alleviates low-light, noise, blur, and camera handshakes that collectively degrade visual comfort. To this end, we introduce a novel deep network for multi-task video enhancement that enables macro-visualization of dental scenes. In particular,...

Full text to download in external service
Multimodal English corpus for automatic speech recognition
Publication
- Year 2013
A multimodal corpus developed for research on speech recognition based on audio-visual data is presented. Besides the usual video and sound excerpts, the prepared database also contains thermovision images and depth maps. All streams were recorded simultaneously, so the corpus makes it possible to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
Context-Aware Indexing and Retrieval for Cognitive Systems Using SOEKS and DDNA
Publication
- C. De Silva Oliveira
- C. Sanin
- E. Szczerbicki
- Advances in Intelligent Systems and Computing - Year 2019
Visual content searching, browsing and retrieval tools have been a focus area of interest as they are required by systems from many different domains. Context-based, Content-Based, and Semantic-based are different approaches utilized for indexing/retrieving, but have their drawbacks when applied to systems that aim to mimic the human capabilities. Such systems, also known as Cognitive Systems, are still limited in terms of processing...

Full text available to download
Framework for Structural Health Monitoring of Steel Bridges by Computer Vision
Publication
- A. Marchewka
- P. Ziółkowski
- V. Aguilar-Vidal
- SENSORS - Year 2020
The monitoring of a structural condition of steel bridges is an important issue. Good condition of infrastructure facilities ensures the safety and economic well-being of society. At the same time, due to the continuous development, rising wealth of the society and socio-economic integration of countries, the number of infrastructural objects is growing. Therefore, there is a need to introduce an easy-to-use and relatively low-cost...

Full text available to download
Art Composition
e-Learning Courses
- K. Wróblewski
- P. Różycki
Person in charge: prof. Krzysztof Wróblewski, Department of Visual Arts Teacher: mgr Patryk Różycki, Department of Visual Arts Five Words. Society and Politics. What? By What? General assumptions. The aim of the proposed two artistic compositions is a creative processing of emotions related to the socio-political issues. In general, it is about personal views and feelings, but it must be also considered that architects are...
In vivo imaging of the human eye using a two-photon excited fluorescence scanning laser ophthalmoscope
Publication
- J. Boguslawski
- G. Palczewska
- S. Tomczewski
- J. Milkiewicz
- P. Kasprzycki
- D. Stachowiak
- K. Komar
- M. J. Marzejon
- B. L. Sikorski
- A. Hudzikowski... and 6 others
- JOURNAL OF CLINICAL INVESTIGATION - Year 2022
BACKGROUND. Noninvasive assessment of metabolic processes that sustain regeneration of human retinal visual pigments (visual cycle) is essential to improve ophthalmic diagnostics and to accelerate development of new treatments to counter retinal diseases. Fluorescent vitamin A derivatives, which are the chemical intermediates of these processes, are highly sensitive to UV light; thus, safe analyses of these processes in humans...

Full text available to download
Independent dynamics of slow, intermediate, and fast intracranial EEG spectral activities during human memory formation
Publication
- V. S. Marks
- K. V. Saboo
- C. Topcu
- T. P. Thayib
- P. Nejedly
- V. Kremen
- G. A. Worrell
- M. T. Kucewicz
- Year 2021
A wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various low and high frequencies are spatiotemporally coordinated across the human brain during memory processing is inconclusive. They can either be coordinated together across a wide range of the frequency spectrum or induced in specific bands. We used a large dataset of human intracranial electroencephalography...

Full text to download in external service
An Overview of Image Analysis Techniques in Endoscopic Bleeding Detection
Publication
- International Journal of Innovative Research in Computer and Communication Engineering - Year 2013
Authors review the existing bleeding detection methods focusing their attention on the image processing techniques utilised in the algorithms. In the article, 18 methods were analysed and their functional components were identified. The authors proposed six different groups, to which algorithms’ components were assigned: colour techniques, reflecting features of pixels as individual values, texture techniques, considering spatial...

Full text to download in external service
Automatic audio-visual threat detection
Publication
- J. Kotus
- J. Łopatka
- K. Kopaczewski
- A. Czyżewski
- Year 2010
The concept, practical realization and application of a system for detection and classification of hazardous situations based on multimodal sound and vision analysis are presented. The device consists of a new kind of multichannel miniature sound intensity sensors, digital Pan Tilt Zoom and fixed cameras, and a bundle of signal processing algorithms. The simultaneous analysis of multimodal signals can significantly improve the accuracy...
Independent dynamics of low, intermediate, and high frequency spectral intracranial EEG activities during human memory formation
Publication
- V. Marks
- K. Saboo
- Ç. Topçu
- M. Lech
- T. Thayib
- P. Nejedly
- V. Kremen
- G. A. Worrell
- M. T. Kucewicz
- NEUROIMAGE - Year 2021
A wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various frequency ranges are coordinated across the space of the human cortex and time of memory processing is inconclusive. They can either be coordinated together across the frequency spectrum at the same cortical site and time or induced independently in particular bands. We used a large dataset of human intracranial...

Full text available to download
Video content analysis in the urban area telemonitoring system
Publication
- Year 2010
The task of constant monitoring of video streams from a large number of cameras and reviewing the recordings in order to find a specified event requires a considerable amount of time and effort from the system operators and it is prone to errors. A solution to this problem is an automatic system for constant analysis of camera images being able to raise an alarm if a predefined event is detected. The chapter presents various aspects...

Full text to download in external service
Brain-computer interaction based on EEG signal and gaze-tracking information = Analiza interakcji mózg-komputer wykorzystująca sygnał EEG i informacje z systemu śledzenia punktu fiksacji wzroku
Publication
- K. Kaszuba-Miotke
- B. Kostek
- Elektronika : konstrukcje, technologie, zastosowania - Year 2012
The article presents an attempt to integrate EEG signal analysis with information about human visual activities, i.e. gaze fixation point. The results from gaze-tracking-based measurement were combined with the standard EEG analysis. A search for correlation between the brain activity and the region of the screen observed by the user was performed. The preliminary stage of the study consists in electrooculography (EOG) signal processing....
How Can We Identify Electrophysiological iEEG Activities Associated with Cognitive Functions?
Publication
- M. T. Kucewicz
- G. A. Worrell
- K. Saboo
- Year 2023
Electrophysiological activities of the brain are engaged in its various functions and give rise to a wide spectrum of low and high frequency oscillations in the intracranial EEG (iEEG) signals, commonly known as the brain waves. The iEEG spectral activities are distributed across networks of cortical and subcortical areas arranged into hierarchical processing streams. It remains a major challenge to identify these activities in...

Full text to download in external service
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
Publication
- Journal of the Acoustical Society of America - Year 2018
A method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Full text to download in external service
Multibeam Echosounder and LiDAR in Process of 360-Degree Numerical Map Production for Restricted Waters with HydroDron
Publication
- A. Stateczny
- M. Wlodarczyk-Sielicka
- D. Gronska
- W. Motyl
- Year 2018
In order to increase the safety of inland navigation and facilitate the monitoring of the coastal zone of restricted waters, a model of multi-sensory fusion of data from hydroacoustic and optoelectronic systems mounted on the autonomous survey vessel HydroDron will be developed. The research will use a LiDAR laser scanner and a multibeam echosounder. To increase the visual quality and map accuracy, additionally side scan...

Full text to download in external service
Detection of debonding in adhesive joints using Lamb wave propagation
Publication
- MATEC Web of Conferences - Year 2019
Adhesively bonded joints are widely used in many branches of industry. Mechanical degradation of this type of connections does not have significant symptoms that can be noticed during visual assessment, so non-destructive testing becomes a very important issue. The paper deals with experimental investigations of adhesively bonded steel plates with different defects. Five samples (an intact one and four with damages in the form...

Full text available to download
Pupil size reflects successful encoding and recall of memory in humans
Publication
- M. T. Kucewicz
- J. Dolezal
- V. Kremen
- B. M. Berry
- L. R. Miller
- A. L. Magee
- V. Fabian
- G. A. Worrell
- Scientific Reports - Year 2018
Pupil responses are known to indicate brain processes involved in perception, attention and decision-making. They can provide an accessible biomarker of human memory performance and cognitive states in general. Here we investigated changes in the pupil size during encoding and recall of word lists. Consistent patterns in the pupil response were found across and within distinct phases of the free recall task. The pupil was most...

Full text available to download
Automatic Watercraft Recognition and Identification on Water Areas Covered by Video Monitoring as Extension for Sea and River Traffic Supervision Systems
Publication
- N. Wawrzyniak
- A. Stateczny
- Polish Maritime Research - Year 2018
The article presents the watercraft recognition and identification system as an extension for the presently used visual water area monitoring systems, such as VTS (Vessel Traffic Service) or RIS (River Information Service). The watercraft identification systems (AIS - Automatic Identification Systems) which are presently used in both sea and inland navigation require purchase and installation of relatively expensive transceivers...

Full text to download in external service
Hazard Control in Industrial Environments: A Knowledge-Vision-Based Approach
Publication
- C. De
- C. Sanin
- E. Szczerbicki
- Advances in Intelligent Systems and Computing - Year 2018
This paper proposes the integration of image processing techniques (such as image segmentation, feature extraction and selection) and a knowledge representation approach in a framework for the development of an automatic system able to identify, in real time, unsafe activities in industrial environments. In this framework, the visual information (feature extraction) acquired from video-camera images and other context based gathered...

Full text to download in external service
Visual Object Tracking System Employing Fixed and PTZ Cameras
Publication
- Intelligent Decision Technologies-Netherlands - Year 2011
The paper presents a video monitoring system utilizing fixed and PTZ cameras for tracking of moving objects. The first type of camera provides images for background modelling, which is employed for foreground object localization. The estimated object locations are then used to steer the PTZ cameras, which observe targeted objects in close-up. Objects are classified into several classes, then basic event detection is being...

Full text to download in external service
Orientation-aware ship detection via a rotation feature decoupling supported deep learning approach
Publication
- X. Chen
- H. Wu
- B. Han
- W. Liu
- J. Montewka
- R. W. Liu
- ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE - Year 2023
Ship imaging position plays an important role in visual navigation, and thus significant attention has been paid to accurately extracting ship imaging positions from maritime videos. Previous studies mainly perform horizontal ship detection on maritime image sequences. This can lead to unsatisfactory ship detection performance because some background pixels may be wrongly identified as ship contours. To address...

Full text to download in external service
Distributed Framework for Visual Event Detection in Parking Lot Area
Publication
- Communications in Computer and Information Science - Year 2011
The paper presents a framework for automatic detection of various events occurring in a parking lot, based on multi-camera video analysis. The framework is massively distributed, both in the logical and physical sense. It consists of several entities called node stations that use the XMPP protocol for internal communication and the SRTP protocol with the Jingle extension for video streaming. Recognized events include detecting parking...

Full text to download in external service
Visual Detection of People Movement Rules Violation in Crowded Indoor Scenes
Publication
- P. Dalka
- P. Bratoszewski
- Year 2013
The paper presents a camera-independent framework for detecting violations of two typical people movement rules that are in force in many public transit terminals: moving in the wrong direction or across designated lanes. Low-level image processing is based on object detection with Gaussian Mixture Models and employs Kalman filters with conflict resolving extensions for the object tracking. In order to allow an effective event...

Full text to download in external service
Human memory enhancement through electrical stimulation in the temporal cortex
Publication
- M. T. Kucewicz
- B. M. Berry
- L. R. Miller
- F. Khadjevand
- Y. Ezzyat
- J. M. Stein
- V. Kremen
- B. H. Brinkmann
- P. Wanda
- M. R. Sperling... and 10 others
- Brain: A Journal of Neurology - Year 2018
Direct electrical stimulation of the human brain can elicit sensory and motor perceptions as well as recall of memories. Stimulating higher order association areas of the lateral temporal cortex in particular was reported to activate visual and auditory memory representations of past experiences (Penfield and Perot, 1963). We hypothesized that this effect could be used to modulate memory processing. Recent attempts at memory enhancement...

Full text available to download
Advanced Visual Interfaces

Conferences
Visual Analytics [VA]

Conferences
Visual Information Communication and Interaction (Visual Information Communications International)

Conferences
Network oscillations modulate interictal epileptiform spike rate during human memory
Publication
- J. Matsumoto
- M. Stead
- M. T. Kucewicz
- A. Matsumoto
- P. Peters
- B. Brinkmann
- J. C. Danstrom
- S. Goerss
- W. Marsh
- F. Meyer
- G. Worrell
- Brain: A Journal of Neurology - Year 2013
Eleven patients being evaluated with intracranial electroencephalography for medically resistant temporal lobe epilepsy participated in a visual recognition memory task. Interictal epileptiform spikes were manually marked and their rate of occurrence compared between baseline and three 2 s periods spanning a 6 s viewing period. During successful, but not unsuccessful, encoding of the images there was a significant reduction in...

Full text to download in external service
Human Feedback and Knowledge Discovery: Towards Cognitive Systems Optimization
Publication
- C. S. de Oliveira
- C. Sanin
- E. Szczerbicki
- Procedia Computer Science - Year 2020
Current computer vision systems, especially those using machine learning techniques, are data-hungry and frequently only perform well when dealing with patterns they have seen before. As an alternative, cognitive systems have become a focus of attention for applications that involve complex visual scenes, and in which conditions may vary. In theory, cognitive applications use current machine learning algorithms, such as deep learning,...

Full text available to download
“Shadow” vs. “Phase 3D” method within endoscopic examinations of marine engines
Publication
- Z. Korczewski
- J. Rudnicki
- Combustion Engines - Year 2013
A visual investigation of the surfaces forming the internal working spaces of marine combustion engines by means of specialized view-finders, so-called endoscopes, is at present almost a basic method of technical diagnostics. The surface structure of the constructional material is visible during investigations as if through a magnifying glass (usually with a precisely determined magnification), which makes possible a detection, recognition...

Full text available to download
User experience evaluation study on the quality of 1K, 2K, and 4K H.265/HEVC video content
Publication
- P. Falkowski-Gilski
- T. Uhl
- C. Hoppe
- Zeszyty Naukowe Akademii Morskiej w Szczecinie - Year 2024
Nowadays, most content creators focus on distributing rich media at the highest possible resolution. Currently, the majority of sold consoles, media players, computer hardware, as well as displays and TVs are advertised as 4K-compatible. The same trend is observed in the case of popular online streaming services and terrestrial TV broadcasts. Generally speaking, it is assumed that higher bitrates provide higher subjective judgements....

Full text available to download
Testing the Effect of Bathymetric Data Reduction on the Shape of the Digital Bottom Model
Publication
- W. Mujta
- M. Wlodarczyk-Sielicka
- A. Stateczny
- SENSORS - Year 2023
Depth data and the digital bottom model created from it are very important in the inland and coastal water zones studies and research. The paper undertakes the subject of bathymetric data processing using reduction methods and examines the impact of data reduction according to the resulting representations of the bottom surface in the form of numerical bottom models. Data reduction is an approach that is meant to reduce the size...

Full text available to download
Knowledge Visualization and Visual Thinking

Conferences
Visual Languages and Formal Methods

Conferences
International Symposium on Visual Computing

Conferences
BETWEEN IDEA AND INTERPRETATION - DESIGN PROCESS AUGMENTATION
Publication
- K. Radziszewski
- P. Świderski
- Year 2018
The following paper investigates the idea of reducing the human digital intervention to a minimum during the advanced design process. Augmenting the outcome attributes beyond the designer's capabilities by computational design methods, data collection, data computing and digital fabrication, altogether imitating the human design process. The primary technical goal of the research was verification of restrictions and abilities used...
International Conference on Visual Information Systems

Conferences
BP-EVD: Forward Block-Output Propagation for Efficient Video Denoising
Publication
- IEEE TRANSACTIONS ON IMAGE PROCESSING - Year 2022
Denoising videos in real-time is critical in many applications, including robotics and medicine, where varying light conditions, miniaturized sensors, and optics can substantially compromise image quality. This work proposes the first video denoising method based on a deep neural network that achieves state-of-the-art performance on dynamic scenes while running in real-time on VGA video resolution with no frame latency. The backbone...

Full text to download in external service
SPIE Conference on Visual Data Exploration and Analysis

Conferences
IEEE Symposium on Visual Analytics Science and Technology

Conferences
IFIP Working Conference on Visual Database Systems

Conferences
IEEE Workshop on Computational Intelligence for Visual Intelligence

Conferences
Special forms of echo visual representation in an ahead looking sonar.
Publication
- HYDROACOUSTICS - Year 2002
The paper discusses ways to organise visual representation in multi-beam ahead-looking sonars whose function is to detect objects on the bottom and in pelagic zones. Forms of visual representation are shown and illustrated on the basic screen (panoramic representation and setting, alarms) and on the auxiliary screen (type A, B and special). Special forms of visual representation are mainly used in detecting objects in difficult...

Full text to download in external service
Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study
Publication
- B. Mróz
- B. Kostek
- Archives of Acoustics - Year 2022
This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

Full text available to download
Visual Management as the support in building the concept of continuous improvement in the enterprise
Publication
- A. Lewiński
- Przedsiębiorczość i Zarządzanie - Year 2018
The following article presents one of the selected tools of the Lean Management concept – visual management. This method enables enterprises to strengthen their process of continuous improvement. With the support of visual management, the managerial board can manage information more effectively and improve the communication process within the particular company. In the first part, the author describes the concept...

Full text available to download
Visual TreeCmp : Comprehensive Comparison of Phylogenetic Trees on the Web
Publication
- T. Goluch
- D. Bogdanowicz
- K. Giaro
- Methods in Ecology and Evolution - Year 2020
1. We present Visual TreeCmp—a package of applications for comparing phylogenetic tree sets. 2. Visual TreeCmp includes a graphical web interface allowing the visualization of compared trees and command line application extended by comparison methods recently proposed in the literature. 3. The phylogenetic tree similarity analysis in Visual TreeCmp can be performed using eighteen metrics, of which 11 are dedicated to rooted trees...

Full text available to download
Multimodal Attention Stimulator
Publication
- LECTURE NOTES IN COMPUTER SCIENCE - Year 2016
A multimodal attention stimulator was proposed and tested for improving auditory and visual attention, including that of pupils with developmental dyslexia. Results of the conducted experiments showed that the designed stimulator can be used to improve comprehension during reading tasks. The changes in visual attention, observed in reading test results, translate into the overall reading performance.

Full text to download in external service
The short-term flicker severity level measured in the industrial power system supplying the rolling mill motors
Open Research Data
open access
- B. Pałczyńska
- series: The flicker measurement in the industrial power system supplying the rolling mill motors
The dataset presents a short-term flicker severity level measured on the bus bars of the main switchgear of the industrial power network for the supply of rolling mills. The data were obtained during an experiment whose purpose was to determine a level of short-term and long-term flicker caused by voltage fluctuations. In the virtual application of...
Visual Capacity Assessment of the Open Landscape in Terms of Protection and Shaping: Case Study of a Village in Poland
Publication
- A. Górka
- Sustainability - Year 2020
This article describes the methodology and results of research on landscape visual capacity. The aim of the project was to develop a tool that would support planning and design decisions at the level of communal management in rural areas in Poland through systematic application of visual criteria. Their importance in the protection, management and shaping of space is underlined by the document produced at the European Landscape...

Full text available to download
Visual Lip Contour Detection for the Purpose of Speech Recognition
Publication
- Year 2014
A method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from the mouth region. Different Active Appearance Models are employed for finding lips in video frames and for the statistical description of lip shape and texture. A search initialization procedure is proposed and error measure values are...
Exploiting audio-visual correlation by means of gaze tracking
Publication
- B. Kunka
- B. Kostek
- International Journal of Computer Science and Applications - Year 2010
This paper presents a novel means of increasing the reliability of audio-visual correlation analysis. It is based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the history of and current research in the area of audio-visual perception analysis are briefly reviewed. Then the methodology employing gaze tracking is presented along with the...

Full text to download in external service
IEEE Symposium on Visual Languages and Human-Centric Computing (was VL)

Conferences
Modelling Of Commercial Websites. A New Perspective On Usability And Customer Relation
Publication
- I. Garnik
- B. Basińska
- Studia Ekonomiczne. Zeszyty Naukowe Uniwersytetu Ekonomicznego w Katowicach - Year 2013
From an economic point of view, a critical aspect of online services is their ability to retain customers. The aim of the presented study was to apply the layered VIPR model (Visual - Interaction - Process - Relation) to commercial online services. The indicators of trust and of establishing lasting relationships were assessments collected from experienced users of commercial online services (n = 207), obtained by means of Web Credibility...

Full text available to download
Objectivization of Audio-Visual Correlation analysis
Publication
- B. Kunka
- B. Kostek
- Archives of Acoustics - Year 2012
Simultaneous perception of audio and visual stimuli often causes the concealment or misrepresentation of information actually contained in these stimuli. Such effects are called the "image proximity effect" or the "ventriloquism effect" in the literature. Until recently, most research carried out to understand their nature was based on subjective assessments. The authors of this paper propose a methodology based on both subjective...

Full text available to download
A comparative study of English viseme recognition methods and algorithms
Publication
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018
An elementary visual unit, the viseme, is considered in the paper in the context of preparing the feature vector as the main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative study of their effectiveness. In the course of the study an optimal feature vector construction...

Full text available to download
A comparative study of English viseme recognition methods and algorithm
Publication
- D. Jachimski
- A. Czyżewski
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018
An elementary visual unit, the viseme, is considered in the paper in the context of preparing the feature vector as the main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative study of their effectiveness. In the course of the study an optimal feature vector...

Full text available to download
Visual content representation and retrieval for Cognitive Cyber Physical Systems
Publication
- C. S. d. Oliveira
- C. Sanin
- E. Szczerbicki
- Procedia Computer Science - Year 2019
Cognitive Cyber Physical Systems have gained significant attention from academia and industry during the past few decades. One of the main reasons behind this interest is the potential of such technologies to revolutionize human life, since they are intended to work robustly under complex visual scenes in which environmental conditions may vary, adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior...

Full text available to download
Lighting conditions in Home Office and occupant’s perception: an international study
Publication
- C. Naves David Amorim
- N. Giraldo Vasquez
- B. Matusiak
- J. Kanno
- N. Sokół
- J. Martyniuk-Pęczek
- S. Sibilio
- Y. Koga
- G. Ciampi
- M. Radziszewska
- ENERGY AND BUILDINGS - Year 2022
The global pandemic and physical distancing restrictions are forcing us to rethink how residential buildings are used regarding the visual environment. This paper describes home office lighting conditions within different countries and continents. The aim is to define the current limitations of home offices in providing a resilient visual environment. The work was developed by a team of international experts working together on...

Full text available to download
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing acoustic and visual speech representations is developed. It combines audio and visual modalities, which provides a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Full text to download in external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing acoustic, electromagnetic articulography and visual speech representations is developed. It combines audio and visual modalities, which provides a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Full text to download in external service
A new method of audio-visual correlation analysis
Publication
- B. Kunka
- B. Kostek
- Year 2009
This paper presents a new methodology for conducting audio-visual correlation analysis employing a gaze tracking system. The interaction and mutual reinforcement of two perceptual modalities, seeing and hearing, in a complex relationship has been the subject of many research studies. An earlier stage of the experiments carried out at the Multimedia Systems Department (MSD) showed that there exists a relationship between...

Full text to download in external service
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publication
- Year 2018
In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

Full text to download in external service
Vocalic Segments Classification Assisted by Mouth Motion Capture
Publication
- Year 2018
Visual features convey important information for automatic speech recognition (ASR), especially in noisy environments. The purpose of this study is to evaluate to what extent visual data (i.e. lip reading) can enhance recognition accuracy in the multi-modal approach. For that purpose motion capture markers were placed on speakers' faces to obtain lip-tracking data during speaking. Different parameterization strategies were tested...

Full text to download in external service
Simple gait parameterization and 3D animation for anonymous visual monitoring based on augmented reality
Publication
- P. Szczuko
- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2016
The article presents a method for video anonymization and replacing real human silhouettes with virtual 3D figures rendered on a screen. The video stream is processed to detect and track objects, whereas the anonymization stage animates avatars according to the behavior of detected persons. Location, movement speed, direction, and person height are taken into account during the animation and rendering phases. This approach requires...

Full text available to download
Robust and Efficient Machine Learning Algorithms for Visual Recognition
Publication
- S. Cygert
- Year 2022
In visual recognition, the task is to identify and localize all objects of interest in the input image. With the ubiquitous presence of visual data in modern days, the role of object recognition algorithms is becoming more significant than ever and ranges from autonomous driving to computer-aided diagnosis in medicine. Current models for visual recognition are dominated by models based on Convolutional Neural Networks (CNNs), which...

Full text available to download
Smart Modeling of Maritime Vessels
Publication
- P. Falkowski-Gilski
- J. Stefański
- Journal of Shipping and Ocean Engineering - Year 2015
Currently, the market offers many visualization tools available to graphic designers, engineers, managers and academics working on maritime environments. The practice of visualization involves making and manipulating images that convey novel phenomena and ideas. Visual communication, together with virtual reality environments, is an emerging and rapidly evolving discipline. It brings great advantage over written word or voice alone,...

Full text available to download
Visual and auditory attention stimulator for assisting pedagogical therapy. Stymulator uwagi wzrokowej i słuchowej do wspomagania terapii pedagogicznej
Publication
- Year 2015
The visual and auditory attention stimulator is a system developed to improve reading skills using simultaneous presentation of text in its visual form and in a transformed auditory form, accompanied by related movie material. The described research involved 40 children aged 8–13 years with difficulties in learning to read, who were diagnosed as having developmental dyslexia. It was shown that application...
Visual and Auditory Attention Stimulator for Assisting Pedagogical Therapy
Publication
- Ł. Kosikowski
- A. Czyżewski
- A. Senderski
- Year 2018
The visual and auditory attention stimulator is a system developed to improve reading skills using simultaneous presentation of text in its visual form and in a transformed auditory form, accompanied by related movie material. The described research involved 40 children aged 8–13 years with difficulties in learning to read, who were diagnosed as having developmental dyslexia. It was shown that application...

Full text available to download
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication
- Year 2014
The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
Light formed through urban morphology and different organism groups: First findings from a systematic review
Publication
- S. Dincel
- U. Besenecker
- D. Koch
- K. Zielińska-Dąbkowska
- IOP Conference Series: Earth and Environmental Science - Year 2024
The prevailing implementation and usage of contemporary lighting technologies and design practices in cities have created over-illuminated built environments. Recent studies indicate that exposure to electric lighting effects formed through spatial characteristics has visual, physiological, and behavioural effects on both humans and non-humans, such as wildlife. In order to gain a better understanding of the impact that electric...

Full text available to download
Public spaces connecting cities. Green and Blue Infrastructures potential.
Publication
- A. Sas-Bojarska
- M. Rembeza
- Year 2015
City fragmentation causes many negative effects in the urban environment, such as the disconnection of environmental, functional and compositional relations, a loss of urban compactness, chaotic development, visual chaos, the domination of technical landscape, and reduced security. This is why one of the main challenges for urban planners is to connect the fragmented structures by creating friendly, attractive and safe public space....

Full text to download in external service
Augmented Reality for Privacy-Sensitive Visual Monitoring
Publication
- P. Szczuko
- Year 2014
The paper presents a method for video anonymization and replacing real human silhouettes with virtual 3D figures rendered on the screen. The video stream is processed to detect and track objects, whereas the anonymization stage employs a fast blurring method. Substitute 3D figures are animated according to the behavior of detected persons. Their location, movement speed, direction, and person height are taken into account during the animation...

Full text to download in external service
Remote Estimation of Video-Based Vital Signs in Emotion Invocation Studies
Publication
- Year 2018
The goal of this study is to examine the influence of various imitated and video invoked emotions on the vital signs (respiratory and pulse rates). We also perform an analysis of the possibility to extract signals from sequences acquired with cost-effective cameras. The preliminary results show that the respiratory rate allows for better separation of some emotions than the pulse rate, yet this relation highly depends...

Full text available to download
Objectivization of audio-video correlation assessment experiments
Publication
- B. Kunka
- B. Kostek
- Year 2010
The purpose of this paper is to present a new method of conducting an audio-visual correlation analysis employing a head-motion-free gaze tracking system. First, a review of related works in the domain of sound and vision correlation is presented. Then assumptions concerning audio-visual scene creation are briefly described. The objectivization process of carrying out correlation tests employing a gaze-tracking system is outlined....

Full text to download in external service
Preferences of the Facade Composition in the Context of Its Regularity and Irregularity
Publication
- Buildings - Year 2022
The aim of this study is to determine the preferences of Polish society towards building facades depending on the degree of the composition regularity of the facade elements. The subject matter is inspired by the authors’ observations in relation to the current architectural trends. The purposefulness of the conducted research results from several issues. Firstly, the reports of psychology and neurosciences clearly indicate...

Full text available to download
New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception
Publication
- B. Kunka
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013
The influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...

Full text to download in external service
Support for argument structures review and assessment
Publication
- Ł. Cyra
- J. Górski
- RELIABILITY ENGINEERING & SYSTEM SAFETY - Year 2011
Argument structures are commonly used to develop and present cases for safety, security and for other properties of systems. Such structures tend to grow excessively, which causes problems with their review and assessment. Two issues are of particular interest: (1) systematic and explicit assessment of the compelling power of an argument, and (2) communication of the result of such an assessment to relevant recipients. The paper...

Full text available to download
