Department of Multimedia Systems

Publications

Year 2022

Mining Knowledge of Respiratory Rate Quantification and Abnormal Pattern Prediction
Publication
- P. Szczuko
- A. Kurowski
- P. Odya
- A. Czyżewski
- B. Kostek
- B. Graff
- K. Narkiewicz
- Cognitive Computation - Year 2022
The described application of granular computing is motivated because cardiovascular disease (CVD) remains a major killer globally. There is increasing evidence that abnormal respiratory patterns might contribute to the development and progression of CVD. Consequently, a method that would support a physician in respiratory pattern evaluation should be developed. Group decision-making, tri-way reasoning, and rough set–based analysis...

Full text available to download
Medical Image Segmentation Using Deep Semantic-based Methods: A Review of Techniques, Applications and Emerging Trends
Publication
- I. Qureshi
- J. Yan
- Q. Abbas
- K. Shaheed
- A. B. Riaz
- A. Wahid
- M. W. J. Khan
- P. Szczuko
- Information Fusion - Year 2022
Semantic-based segmentation (Semseg) methods play an essential part in medical imaging analysis to improve the diagnostic process. In Semseg technique, every pixel of an image is classified into an instance, where each class is corresponded by an instance. In particular, the semantic segmentation can be used by many medical experts in the domain of radiology, ophthalmologists, dermatologist, and image-guided radiotherapy. The authors...

Full text to download in external service
Machine learning applied to acoustic-based road traffic monitoring
Publication
- K. Marciniuk
- B. Kostek
- Year 2022
The motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...

Full text available to download
Machine learning applied to acoustic-based road traffic monitoring
Publication
- K. Marciniuk
- B. Kostek
- Procedia Computer Science - Year 2022
The motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...

Full text available to download
Klasyfikacja emocji w muzyce filmowej z wykorzystaniem uczenia głębokiego
Publication
- T. Ciborowski
- S. Reginis
- D. Weber
- A. Kurowski
- B. Kostek
- Year 2022
Praca przedstawia zagadnienia związane z klasyfikacją emocji w muzyce filmowej. W artykule zaproponowano model emocji zawierający dziewięć stanów emocjonalnych, do których przypisany jest kolor zgodnie z teorią koloru w filmie. Kolejne kroki eksperymentu obejmowały wybór muzyki filmowej do testów (baza Epidemic Sound), przygotowanie założeń ankiety oraz modelu emocji wykorzystywanych w testach odsłuchowych, a także konstrukcję...

Full text to download in external service
Intracranial electrophysiological recordings from the human brain during memory tasks with pupillometry
Publication
- J. Cimbalnik
- J. Dolezal
- C. Topcu
- M. Lech
- V. Marks
- B. Joseph
- M. Dobias
- J. Van Gompel
- G. Worrell
- M. T. Kucewicz (formerly: M. Kucewicz)
- Scientific Data - Year 2022
Data comprise intracranial EEG (iEEG) brain activity represented by stereo EEG (sEEG) signals, recorded from over 100 electrode channels implanted in any one patient across various brain regions. The iEEG signals were recorded in epilepsy patients (N=10) undergoing invasive monitoring and localization of seizures when they were performing a battery of four memory tasks lasting approx. 1 hour in total. Gaze tracking on the task...

Full text available to download
Hotspot of human verbal memory encoding in the left anterior prefrontal cortex
Publication
- Ç. Topçu
- V. Marks
- K. Saboo
- M. Lech
- P. Nejedly
- V. Kremen
- G. A. Worrell
- M. T. Kucewicz (formerly: M. Kucewicz)
- EBioMedicine - Year 2022
Background: Treating memory and cognitive deficits requires knowledge about anatomical sites and neural activities to be targeted with particular therapies. Emerging technologies for local brain stimulation offer attractive therapeutic options but need to be applied to target specific neural activities, at distinct times, and in specific brain regions that are critical for memory formation. Methods: The areas that are critical...

Full text available to download
Examining Impact of Speed Recommendation Algorithm Operating in Autonomous Road Signs on Minimum Distance between Vehicles
Publication
- A. Sroczyński
- A. Kurowski
- S. Zaporowski
- A. Czyżewski
- Remote Sensing - Year 2022
An approach to a new kind of recommendation system design that suggests safe speed on the road is presented. Real data obtained on roads were used for the simulations. As part of a project related to autonomous road sign development, a number of measurements were carried out on both local roads and expressways. A speed recommendation model was created based on gathered traffic data employing the traffic simulator. Depending on...

Full text available to download
Evaluation of Decision Fusion Methods for Multimodal Biometrics in the Banking Application
Publication
- SENSORS - Year 2022
An evaluation of decision fusion methods based on Dempster-Shafer Theory (DST) and its modifications is presented in the article, studied over real biometric data from the engineered multimodal banking client verification system. First, the approaches for multimodal biometric data fusion for verification are explained. Then the proposed implementation of comparison scores fusion is presented, including details on the application...

Full text available to download
Edge-Computing based Secure E-learning Platforms
Publication
- S. A. Bhat
- D. Alyahya
- M. A. Dar
- S. Shah
- Year 2022
Implementation of Information and Communication Technologies (ICT) in E-Learning environments have brought up dramatic changes in the current educational sector. Distance learning, online learning, and networked learning are few examples that promote educational interaction between students, lecturers and learning communities. Although being an efficient form of real learning resource, online electronic resources are subject to...

Full text available to download
Detection of Anomalies in the Operation of a Road Lighting System Based on Data from Smart Electricity Meters
Publication
- T. Śmiałkowski
- A. Czyżewski
- ENERGIES - Year 2022
Smart meters in road lighting systems create new opportunities for automatic diagnostics of undesirable phenomena such as lamp failures, schedule deviations, or energy theft from the power grid. Such a solution fits into the smart cities concept, where an adaptive lighting system creates new challenges with respect to the monitoring function. This article presents research results indicating the practical feasibility of real‐time...

Full text available to download
Creating new voices using normalizing flows
Publication
- P. Biliński
- T. Merritt
- A. Ezzerg
- K. Pokora
- S. Cygert
- K. Yanagisawa
- R. Barra-Chicote
- D. Korzekwa
- Year 2022
Creating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...

Full text available to download
Creating a Remote Choir Performance Recording Based on an Ambisonic Approach
Publication
- Applied Sciences-Basel - Year 2022
The aim of this paper is three-fold. First, the basics of binaural and ambisonic techniques are briefly presented. Then, details related to audio-visual recordings of a remote performance of the Academic Choir of the Gdańsk University of Technology are shown. Due to the COVID-19 pandemic, artists had a choice, namely, to stay at home and not perform or stay at home and perform. In fact, staying at home brought in the possibility...

Full text available to download
Computer-Aided Detection of Hypertensive Retinopathy Using Depth-Wise Separable CNN
Publication
- I. Qureshi
- Q. Abbas
- J. Yan
- A. Hussain
- K. Shaheed
- A. R. Baig
- Applied Sciences-Basel - Year 2022
Hypertensive retinopathy (HR) is a retinal disorder, linked to high blood pressure. The incidence of HR-eye illness is directly related to the severity and duration of hypertension. It is critical to identify and analyze HR at an early stage to avoid blindness. There are presently only a few computer-aided systems (CADx) designed to recognize HR. Instead, those systems concentrated on collecting features from many retinopathy-related...

Full text available to download
Cognitive neuroscience: Theta network oscillations coordinate development of episodic memory
Publication
- M. T. Kucewicz (formerly: M. Kucewicz)
- CURRENT BIOLOGY - Year 2022
Our ability to remember life events matures through childhood and adolescence. A new study has revealed how theta oscillations between two anatomical brain regions supporting memory and executive functions are synchronized and develop across age through functional and structural connectivity.

Full text available to download
Broadening the scope of measurement and analysis of vibrations of an organ pipe employing intensity probe, simulations, and highspeed camera
Publication
- P. Bordoni
- J. Kotus
- P. Odya
- F. Antonacci
- B. Kostek
- Journal of the Acoustical Society of America - Year 2022
This paper shows an integrated approach to measure, analyze, and model phenomena occurring in an organ pipe driven by pressurized air. The aim of this paper is two-fold, i.e., to measure the pressure signal and the intensity field around the mouth by means of an intensity probe and to visualize and observe the motion of the air jet, which represents the excitation mechanism of the system. This is realized through two techniques,...

Full text to download in external service
BP-EVD: Forward Block-Output Propagation for Efficient Video Denoising
Publication
- IEEE TRANSACTIONS ON IMAGE PROCESSING - Year 2022
Denoising videos in real-time is critical in many applications, including robotics and medicine, where varying light conditions, miniaturized sensors, and optics can substantially compromise image quality. This work proposes the first video denoising method based on a deep neural network that achieves state-of-the-art performance on dynamic scenes while running in real-time on VGA video resolution with no frame latency. The backbone...

Full text to download in external service
Blockchain based Secure Data Exchange between Cloud Networks and Smart Hand-held Devices for use in Smart Cities
Publication
- M. A. Dar
- A. Askar
- S. A. Bhat
- Year 2022
In relation to smart city planning and management, processing huge amounts of generated data and execution of non-lightweight cryptographic algorithms on resource constraint devices at disposal, is the primary focus of researchers today. To enable secure exchange of data between cloud networks and mobile devices, in particular smart hand held devices, this paper presents Blockchain based approach that disperses a public/free key...

Full text available to download
Architecture Design of a Networked Music Performance Platform for a Chamber Choir
Publication
- J. Cychnerski
- B. Mróz
- Communications in Computer and Information Science - Year 2022
This paper describes an architecture design process for Networked Music Performance (NMP) platform for medium-sized conducted music ensembles, based on remote rehearsals of Academic Choir of Gdańsk University of Technology. The issues of real-time remote communication, in-person music performance, and NMP are described. Three iterative steps defining and extending the architecture of the NMP platform with additional features to...

Full text to download in external service
Algoritmically improved microwave radar monitors breathing more acurrate than sensorized belt
Publication
- A. Czyżewski
- B. Kostek
- A. Kurowski
- K. Narkiewicz
- B. Graff
- P. Odya
- T. Śmiałkowski
- A. Sroczyński
- Scientific Reports - Year 2022
This paper describes a novel way to measure, process, analyze, and compare respiratory signals acquired by two types of devices: a wearable sensorized belt and a microwave radar-based sensor. Both devices provide breathing rate readouts. First, the background research is presented. Then, the underlying principles and working parameters of the microwave radar-based sensor, a contactless device for monitoring breathing, are described....

Full text available to download

Year 2021

Towards Cancer Patients Classification Using Liquid Biopsy
Publication
- S. Cygert
- F. Górski
- P. Juszczyk
- S. Lewalski
- K. Pastuszak
- A. Czyżewski
- A. Supernat
- Year 2021
Liquid biopsy is a useful, minimally invasive diagnostic and monitoring tool for cancer disease. Yet, developing accurate methods, given the potentially large number of input features, and usually small datasets size remains very challenging. Recently, a novel feature parameterization based on the RNA-sequenced platelet data which uses the biological knowledge from the Kyoto Encyclopedia of Genes and Genomes, combined with a classifier...

Full text to download in external service
Techniki wielokanałowe wykorzystywane w koncertach i nagraniach muzycznych na odległość
Publication
- Year 2021
W czasie pandemii koronawirusa COVID-19 nowego znaczenia nabrały możliwości transmisji dźwięku z obrazem – zwłaszcza do pracy zdalnej, która w przypadku muzyków jest szczególnym wyzwaniem zarówno w kontekście wspólnych ćwiczeń i prób, jak i koncertów. Wynikła konieczność wieloźródłowego połączenia ujawniła potrzebę uprzestrzennienia dźwięku w celu łatwiejszej lokalizacji źródeł dźwięku. Tworzenie zdalnych nagrań muzycznych stało...

Full text to download in external service
Skuteczność klasyfikacji gatunków muzycznych za pomocą sieci neuronowej w zależności od typu danych wejściowych
Publication
- Year 2021
Rozpoznawanie gatunku muzycznego jest jednym z podstawowych elementów inteligentnych systemów tworzenia automatycznych list muzyki. Platformy strumieniowe oferujące taką usługę wymagają rozwiązań, które umożliwią jak najdokładniej określić przynależność utworu do gatunku muzycznego. Zgodnie z aktualnym stanem wiedzy – najskuteczniejszym klasyfikatorem są sztuczne sieci neuronowe (w tym w wersji uczenia głębokiego), dla których...

Full text to download in external service
Selective monitoring of noise emitted by vehicles involved in road traffic
Publication
- A. Czyżewski
- T. Śmiałkowski
- Journal of the Acoustical Society of America - Year 2021
An acoustic intensity probe was developed measures the sound intensity in three orthogonal directions, making possible to calculate the azimuth and elevation angles, describing the sound source position. The acoustic sensor is made in the form of a cube with a side of 10 mm, on the inner surfaces of which the digital MEMS microphones are mounted. The algorithm works in two stages. The first stage is based on the analysis of sound...

Full text available to download
Robustness in Compressed Neural Networks for Object Detection
Publication
- S. Cygert
- A. Czyżewski
- Year 2021
Model compression techniques allow to significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. Simultaneously, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving...

Full text available to download
Production of six-degrees-of-freedom (6DoF) navigable audio using 30 Ambisonic microphones
Publication
- B. Mróz
- M. Kabaciński
- T. Ciotucha
- A. Rumiński
- T. Żernicki
- Year 2021
This paper describes a method for planning, recording, and post-production of six-degrees-of-freedom audio recorded with multiple 3rd order Ambisonic microphone arrays. The description is based on the example of recordings conducted in August 2020 with the Poznan Philharmonic Orchestra using 30 units of Zylia ZM-1S. A convenient way to prepare and organize such a big project is proposed – this involves details of stage planning,...

Full text to download in external service
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
Publication
- D. Korzekwa
- J. Lorenzo-trueba
- S. Zaporowski
- S. Calamaro
- T. Drugman
- B. Kostek
- Year 2021
A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

Full text to download in external service
Leveraging spatio-temporal features for joint deblurring and segmentation of instruments in dental video microscopy
Publication
- E. Katsaros
- A. Jezierska
- D. Węsierski
- Year 2021
In dentistry, microscopes have become indispensable optical devices for high-quality treatment and micro-invasive surgery, especially in the field of endodontics. Recent machine vision advances enable more advanced, real-time applications including but not limited to dental video deblurring and workflow analysis through relevant metadata obtained by instrument motion trajectories. To this end, the proposed work addresses dental...

Full text to download in external service
Independent dynamics of slow, intermediate, and fast intracranial EEG spectral activities during human memory formation
Publication
- V. S. Marks
- K. V. Saboo
- C. Topcu
- T. P. Thayib
- P. Nejedly
- V. Kremen
- G. A. Worrell
- M. T. Kucewicz
- Year 2021
A wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various low and high frequencies are spatiotemporally coordinated across the human brain during memory processing is inconclusive. They can either be coordinated together across a wide range of the frequency spectrum or induced in specific bands. We used a large dataset of human intracranial electroencephalography...

Full text to download in external service
Independent dynamics of low, intermediate, and high frequency spectral intracranial EEG activities during human memory formation
Publication
- V. Marks
- K. Saboo
- Ç. Topçu
- M. Lech
- T. Thayib
- P. Nejedly
- V. Kremen
- G. A. Worrell
- M. T. Kucewicz (formerly: M. Kucewicz)
- NEUROIMAGE - Year 2021
A wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various frequency ranges are coordinated across the space of the human cortex and time of memory processing is inconclusive. They can either be coordinated together across the frequency spectrum at the same cortical site and time or induced independently in particular bands. We used a large dataset of human intracranial...

Full text available to download
Highlighting interlanguage phoneme differences based on similarity matrices and convolutional neural network
Publication
- G. Korvel
- P. Treigys
- B. Kostek
- Journal of the Acoustical Society of America - Year 2021
The goal of this research is to find a way of highlighting the acoustic differences between consonant phonemes of the Polish and Lithuanian languages. For this purpose, similarity matrices are employed based on speech acoustic parameters combined with a convolutional neural network (CNN). In the first experiment, we compare the effectiveness of the similarity matrices applied to discerning acoustic differences between consonant...

Full text available to download
Evaluation of aspiration problems in L2 English pronunciation employing machine learning
Publication
- M. Piotrowska
- A. Czyżewski
- T. Ciszewski
- G. Korvel
- A. Kurowski
- B. Kostek
- Journal of the Acoustical Society of America - Year 2021
The approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...

Full text available to download
Estimation of Average Speed of Road Vehicles by Sound Intensity Analysis
Publication
- J. Kotus
- G. Szwoch
- SENSORS - Year 2021
Constant monitoring of road traffic is important part of modern smart city systems. The proposed method estimates average speed of road vehicles in the observation period, using a passive acoustic vector sensor. Speed estimation based on sound intensity analysis is a novel approach to the described problem. Sound intensity in two orthogonal axes is measured with a sensor placed alongside the road. Position of the apparent sound...

Full text available to download
Direct electrical stimulation of the human brain has inverse effects on the theta and gamma neural activities
Publication
- M. Lech
- B. M. Berry
- C. Topcu
- V. Kremen
- P. Nejedly
- B. Lega
- R. E. Gross
- M. R. Sperling
- B. C. Jobst
- S. A. Sheth... and 4 others
- IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING - Year 2021
Objective: Our goal was to analyze the electrophysiological response to direct electrical stimulation (DES) systematically applied at a wide range of parameters and anatomical sites, with particular focus on neural activities associated with memory and cognition. Methods: We used a large set of intracranial EEG (iEEG) recordings with DES from 45 subjects with electrodes...

Full text available to download
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Publication
- D. Korzekwa
- R. Barra-Chicote
- S. Zaporowski
- G. Beringer
- J. Lorenzo-trueba
- A. Serafinowicz
- J. Droppo
- T. Drugman
- B. Kostek
- Year 2021
This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

Full text available to download
Designing acoustic scattering elements using machine learning methods
Publication
- A. Kurowski
- Year 2021
In the process of the design and correction of room acoustic properties, it is often necessary to select the appropriate type of acoustic treatment devices and make decisions regarding their size, geometry, and location of the devices inside the room under the treatment process. The goal of this doctoral dissertation is to develop and validate a mathematical model that allows predicting the effects of the application of the scattering...

Full text available to download
CyberEye: New Eye-Tracking Interfaces for Assessment and Modulation of Cognitive Functions beyond the Brain
Publication
- SENSORS - Year 2021
The emergence of innovative neurotechnologies in global brain projects has accelerated research and clinical applications of BCIs beyond sensory and motor functions. Both invasive and noninvasive sensors are developed to interface with cognitive functions engaged in thinking, communication, or remembering. The detection of eye movements by a camera offers a particularly attractive external sensor for computer interfaces to monitor,...

Full text available to download
Concurrent Video Denoising and Deblurring for Dynamic Scenes
Publication
- IEEE Access - Year 2021
Dynamic scene video deblurring is a challenging task due to the spatially variant blur inflicted by independently moving objects and camera shakes. Recent deep learning works bypass the ill-posedness of explicitly deriving the blur kernel by learning pixel-to-pixel mappings, which is commonly enhanced by larger region awareness. This is a difficult yet simplified scenario because noise is neglected when it is omnipresent in a wide...

Full text available to download
Closer Look at the Uncertainty Estimation in Semantic Segmentation under Distributional Shift
Publication
- Year 2021
While recent computer vision algorithms achieve impressive performance on many benchmarks, they lack robustness - presented with an image from a different distribution, (e.g. weather or lighting conditions not considered during training), they may produce an erroneous prediction. Therefore, it is desired that such a model will be able to reliably predict its confidence measure. In this work, uncertainty estimation for the task...

Full text available to download
AUTOMATYCZNE GENEROWANIE KOLEJNOŚCI LIST UTWORÓW MUZYCZNYCH
Publication
- K. Pietrusińska
- A. Kurowski
- B. Kostek
- Year 2021
W niniejszym rozdziale przedstawiono przygotowanie algorytmu do automa-tycznego układania kolejności utworów muzycznych i zgrywającego je do postaci jednego, długiego miksu. Dzięki algorytmowi dobierane są utwory na podstawie analizy podobieństwa fragmentów końcowych i początkowych utworów. Podo-bieństwo to jest obliczane za pomocą odległości euklidesowej między wektorami parametrów wyznaczonymi przez autoenkoder oraz na podstawie...

Full text to download in external service
Analiza zależności muzyczno-graficznej okładek albumów z użyciem algorytmów uczących się
Publication
- A. Dorochowicz
- Year 2021
Celem rozprawy jest analiza zależności muzyczno-graficznej okładek albumów z użyciem algorytmów uczących się. Brane są pod uwagę parametry badanych gatunków muzycznych, zależności pomiędzy gatunkami muzycznymi a typami osobowości, jak również cechy okładek albumów muzycznych i ich korelacje z gatunkami muzycznymi. Opracowana metodologia jest wykorzystana w celu sprawdzenia możliwości automatycznej klasyfikacji gatunku muzycznego...

Full text available to download
An Automated Method for Biometric Handwritten Signature Authentication Employing Neural Networks
Publication
- M. Kurowski
- A. Sroczyński
- G. Bogdanis
- A. Czyżewski
- Electronics - Year 2021
Handwriting biometrics applications in e-Security and e-Health are addressed in the course of the conducted research. An automated graphomotor analysis method for the dynamic electronic representation of the handwritten signature authentication was researched. The developed algorithms are based on dynamic analysis of electronically handwritten signatures employing neural networks. The signatures were acquired with the use of the...

Full text available to download
Ambisoniczna mapa wybranych miejsc w Trójmieście z obrazem 360°
Publication
- C. Pietrzak
- P. Odya
- Year 2021
W projekcie, który zostanie opisany w niniejszym rozdziale, założonym celem było stworzenie ambisonicznej mapy Trójmiasta w formie aplikacji internetowej. Materiały wideo w technologii 360° z dźwiękiem w postaci sygnału ambisonicznego zostały zarejestrowane w wybranych lokalizacjach uznanych za charakterystyczne dla tej aglomeracji. Celem badawczym projektu było porównanie dostępnych algorytmów miksowania sygnałów ambisonicznych...

Full text to download in external service
Adaptive Method for Modeling of Temporal Dependencies between Fields of Vision in Multi-Camera Surveillance Systems
Publication
- K. Lisowski
- A. Czyżewski
- Electronics - Year 2021
A method of modeling the time of object transition between given pairs of cameras based on the Gaussian Mixture Model (GMM) is proposed in this article. Temporal dependencies modeling is a part of object re-identification based on the multi-camera experimental framework. The previously utilized Expectation-Maximization (EM) approach, requiring setting the number of mixtures arbitrarily as an input parameter, was extended with the...

Full text available to download
Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions
Publication
- SENSORS - Year 2021
The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...

Full text available to download
Acoustic Detector of Road Vehicles Based on Sound Intensity
Publication
- G. Szwoch
- J. Kotus
- SENSORS - Year 2021
A method of detecting and counting road vehicles using an acoustic sensor placed by the road is presented. The sensor measures sound intensity in two directions: parallel and perpendicular to the road. The sound intensity analysis performs acoustic event detection. A normalized position of the sound source is tracked and used to determine if the detected event is related to a moving vehicle and to establish the direction of movement....

Full text available to download

Year 2020

Vehicle Detection with Self-Training for Adaptative Video Processing Embedded Platform
Publication
- S. Cygert
- A. Czyżewski
- Applied Sciences-Basel - Year 2020
Traffic monitoring from closed-circuit television (CCTV) cameras on embedded systems is the subject of the performed experiments. Solving this problem encounters difficulties related to the hardware limitations, and possible camera placement in various positions which affects the system performance. To satisfy the hardware requirements, vehicle detection is performed using a lightweight Convolutional Neural Network (CNN), named...

Full text available to download
Toward Robust Pedestrian Detection With Data Augmentation
Publication
- S. Cygert
- A. Czyżewski
- IEEE Access - Year 2020
In this article, the problem of creating a safe pedestrian detection model that can operate in the real world is tackled. While recent advances have led to significantly improved detection accuracy on various benchmarks, existing deep learning models are vulnerable to invisible to the human eye changes in the input image which raises concerns about its safety. A popular and simple technique for improving robustness is using data...

Full text available to download
System for monitoring road slippery based on CCTV cameras and convolutional neural networks
Publication
- D. Grabowski
- A. Czyżewski
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2020
The slipperiness of the surface is essential for road safety. The growing number of CCTV cameras opens the possibility of using them to automatically detect the slippery surface and inform road users about it. This paper presents a system of developed intelligent road signs, including a detector based on convolutional neural networks (CNNs) and the transferlearning method employed to the processing of images acquired with video...

Full text available to download
Ranking Speech Features for Their Usage in Singing Emotion Classification
Publication
- S. Zaporowski
- B. Kostek
- Year 2020
This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available to download

Search

Department of Multimedia Systems

Publications

Filters

Category

Year

Options

Catalog Publications

Year 2022

Year 2021

Year 2020