Katedra Systemów Multimedialnych - Jednostki Administracyjne - MOST Wiedzy

Wyszukiwarka

Katedra Systemów Multimedialnych

Filtry

wszystkich: 890

  • Kategoria
  • Rok
  • Opcje

wyczyść Filtry wybranego katalogu niedostępne

Katalog Publikacji

Rok 2024
  • Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-free Continual Learning
    Publikacja
    • F. Szatkowski
    • M. Pyła
    • M. Przewięźlikowski
    • S. Cygert
    • B. Twardowski
    • T. Trzciński

    - Rok 2024

    In this work, we investigate exemplar-free class incremental learning (CIL) with knowledge distillation (KD) as a regularization strategy, aiming to prevent forgetting. KDbased methods are successfully used in CIL, but they often struggle to regularize the model without access to exemplars of the training data from previous tasks. Our analysis reveals that this issue originates from substantial representation shifts in the teacher...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Deep learning techniques for biometric security: A systematic review of presentation attack detection systems
    Publikacja

    - ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE - Rok 2024

    Biometric technology, including finger vein, fingerprint, iris, and face recognition, is widely used to enhance security in various devices. In the past decade, significant progress has been made in improving biometric sys- tems, thanks to advancements in deep convolutional neural networks (DCNN) and computer vision (CV), along with large-scale training datasets. However, these systems have become targets of various attacks, with...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Infographics in Educational Settings: A Literature Review
    Publikacja

    - IEEE Access - Rok 2024

    Infographics are visual representations of data that utilize various graphic elements, including pie charts, bar graphs, line graphs, and histograms. Educators and designers can maximize the potential of infographics as powerful educational tools by carefully addressing challenges and capitalizing on emerging technologies. However, current education systems showcase the need for development guidelines and the best practices targeted...

    Pełny tekst do pobrania w portalu

  • Looking through the past: better knowledge retention for generative replay in continual learning
    Publikacja
    • V. Khan
    • S. Cygert
    • K. Deja
    • T. Trzciński
    • B. Twardowski

    - IEEE Access - Rok 2024

    In this work, we improve the generative replay in a continual learning setting to perform well on challenging scenarios. Because of the growing complexity of continual learning tasks, it is becoming more popular, to apply the generative replay technique in the feature space instead of image space. Nevertheless, such an approach does not come without limitations. In particular, we notice the degradation of the continually trained...

    Pełny tekst do pobrania w portalu

  • Missing Puzzle Pieces in Dementia Research: HCN Channels and Theta Oscillations
    Publikacja

    - Aging and Disease - Rok 2024

    Increasing evidence indicates a role of hyperpolarization activated cation (HCN) channels in controlling the resting membrane potential, pacemaker activity, memory formation, sleep, and arousal. Their disfunction may be associated with the development of epilepsy and age-related memory decline. Neuronal hyperexcitability involved in epileptogenesis and EEG desynchronization occur in the course of dementia in human Alzheimer’s Disease...

    Pełny tekst do pobrania w portalu

  • Sounding Mechanism of a Flue Organ Pipe—A Multi-Sensor Measurement Approach
    Publikacja

    - SENSORS - Rok 2024

    This work presents an approach that integrates the results of measuring, analyzing, and modeling air flow phenomena driven by pressurized air in a flue organ pipe. The investigation concerns a Bourdon organ pipe. Measurements are performed in an anechoic chamber using the Cartesian robot equipped with a 3D acoustic vector sensor (AVS) that acquires both acoustic pressure and air particle velocity. Also, a high-speed camera is employed...

    Pełny tekst do pobrania w portalu

Rok 2023
Rok 2022
Rok 2021
  • Acoustic Detector of Road Vehicles Based on Sound Intensity
    Publikacja

    - SENSORS - Rok 2021

    A method of detecting and counting road vehicles using an acoustic sensor placed by the road is presented. The sensor measures sound intensity in two directions: parallel and perpendicular to the road. The sound intensity analysis performs acoustic event detection. A normalized position of the sound source is tracked and used to determine if the detected event is related to a moving vehicle and to establish the direction of movement....

    Pełny tekst do pobrania w portalu

  • Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions
    Publikacja

    The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...

    Pełny tekst do pobrania w portalu

  • Adaptive Method for Modeling of Temporal Dependencies between Fields of Vision in Multi-Camera Surveillance Systems
    Publikacja

    A method of modeling the time of object transition between given pairs of cameras based on the Gaussian Mixture Model (GMM) is proposed in this article. Temporal dependencies modeling is a part of object re-identification based on the multi-camera experimental framework. The previously utilized Expectation-Maximization (EM) approach, requiring setting the number of mixtures arbitrarily as an input parameter, was extended with the...

    Pełny tekst do pobrania w portalu

  • Ambisoniczna mapa wybranych miejsc w Trójmieście z obrazem 360°
    Publikacja

    - Rok 2021

    W projekcie, który zostanie opisany w niniejszym rozdziale, założonym celem było stworzenie ambisonicznej mapy Trójmiasta w formie aplikacji internetowej. Materiały wideo w technologii 360° z dźwiękiem w postaci sygnału ambisonicznego zostały zarejestrowane w wybranych lokalizacjach uznanych za charakterystyczne dla tej aglomeracji. Celem badawczym projektu było porównanie dostępnych algorytmów miksowania sygnałów ambisonicznych...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • An Automated Method for Biometric Handwritten Signature Authentication Employing Neural Networks
    Publikacja

    Handwriting biometrics applications in e-Security and e-Health are addressed in the course of the conducted research. An automated graphomotor analysis method for the dynamic electronic representation of the handwritten signature authentication was researched. The developed algorithms are based on dynamic analysis of electronically handwritten signatures employing neural networks. The signatures were acquired with the use of the...

    Pełny tekst do pobrania w portalu

  • Analiza zależności muzyczno-graficznej okładek albumów z użyciem algorytmów uczących się
    Publikacja

    - Rok 2021

    Celem rozprawy jest analiza zależności muzyczno-graficznej okładek albumów z użyciem algorytmów uczących się. Brane są pod uwagę parametry badanych gatunków muzycznych, zależności pomiędzy gatunkami muzycznymi a typami osobowości, jak również cechy okładek albumów muzycznych i ich korelacje z gatunkami muzycznymi. Opracowana metodologia jest wykorzystana w celu sprawdzenia możliwości automatycznej klasyfikacji gatunku muzycznego...

    Pełny tekst do pobrania w portalu

  • AUTOMATYCZNE GENEROWANIE KOLEJNOŚCI LIST UTWORÓW MUZYCZNYCH
    Publikacja

    - Rok 2021

    W niniejszym rozdziale przedstawiono przygotowanie algorytmu do automa-tycznego układania kolejności utworów muzycznych i zgrywającego je do postaci jednego, długiego miksu. Dzięki algorytmowi dobierane są utwory na podstawie analizy podobieństwa fragmentów końcowych i początkowych utworów. Podo-bieństwo to jest obliczane za pomocą odległości euklidesowej między wektorami parametrów wyznaczonymi przez autoenkoder oraz na podstawie...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Closer Look at the Uncertainty Estimation in Semantic Segmentation under Distributional Shift

    While recent computer vision algorithms achieve impressive performance on many benchmarks, they lack robustness - presented with an image from a different distribution, (e.g. weather or lighting conditions not considered during training), they may produce an erroneous prediction. Therefore, it is desired that such a model will be able to reliably predict its confidence measure. In this work, uncertainty estimation for the task...

    Pełny tekst do pobrania w portalu

  • Concurrent Video Denoising and Deblurring for Dynamic Scenes

    Dynamic scene video deblurring is a challenging task due to the spatially variant blur inflicted by independently moving objects and camera shakes. Recent deep learning works bypass the ill-posedness of explicitly deriving the blur kernel by learning pixel-to-pixel mappings, which is commonly enhanced by larger region awareness. This is a difficult yet simplified scenario because noise is neglected when it is omnipresent in a wide...

    Pełny tekst do pobrania w portalu

  • CyberEye: New Eye-Tracking Interfaces for Assessment and Modulation of Cognitive Functions beyond the Brain

    The emergence of innovative neurotechnologies in global brain projects has accelerated research and clinical applications of BCIs beyond sensory and motor functions. Both invasive and noninvasive sensors are developed to interface with cognitive functions engaged in thinking, communication, or remembering. The detection of eye movements by a camera offers a particularly attractive external sensor for computer interfaces to monitor,...

    Pełny tekst do pobrania w portalu

  • Designing acoustic scattering elements using machine learning methods
    Publikacja

    - Rok 2021

    In the process of the design and correction of room acoustic properties, it is often necessary to select the appropriate type of acoustic treatment devices and make decisions regarding their size, geometry, and location of the devices inside the room under the treatment process. The goal of this doctoral dissertation is to develop and validate a mathematical model that allows predicting the effects of the application of the scattering...

    Pełny tekst do pobrania w portalu

  • Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
    Publikacja

    - Rok 2021

    This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

    Pełny tekst do pobrania w portalu

  • Direct electrical stimulation of the human brain has inverse effects on the theta and gamma neural activities
    Publikacja
    • M. Lech
    • B. M. Berry
    • C. Topcu
    • V. Kremen
    • P. Nejedly
    • B. Lega
    • R. E. Gross
    • M. R. Sperling
    • B. C. Jobst
    • S. A. Sheth... i 4 innych

    - IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING - Rok 2021

    Objective: Our goal was to analyze the electrophysiological response to direct electrical stimulation (DES) systematically applied at a wide range of parameters and anatomical sites, with particular focus on neural activities associated with memory and cognition. Methods: We used a large set of intracranial EEG (iEEG) recordings with DES from 45 subjects with electrodes...

    Pełny tekst do pobrania w portalu

  • Estimation of Average Speed of Road Vehicles by Sound Intensity Analysis
    Publikacja

    - SENSORS - Rok 2021

    Constant monitoring of road traffic is important part of modern smart city systems. The proposed method estimates average speed of road vehicles in the observation period, using a passive acoustic vector sensor. Speed estimation based on sound intensity analysis is a novel approach to the described problem. Sound intensity in two orthogonal axes is measured with a sensor placed alongside the road. Position of the apparent sound...

    Pełny tekst do pobrania w portalu

  • Evaluation of aspiration problems in L2 English pronunciation employing machine learning

    The approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...

    Pełny tekst do pobrania w portalu

  • Highlighting interlanguage phoneme differences based on similarity matrices and convolutional neural network
    Publikacja

    - Journal of the Acoustical Society of America - Rok 2021

    The goal of this research is to find a way of highlighting the acoustic differences between consonant phonemes of the Polish and Lithuanian languages. For this purpose, similarity matrices are employed based on speech acoustic parameters combined with a convolutional neural network (CNN). In the first experiment, we compare the effectiveness of the similarity matrices applied to discerning acoustic differences between consonant...

    Pełny tekst do pobrania w portalu

  • Independent dynamics of low, intermediate, and high frequency spectral intracranial EEG activities during human memory formation
    Publikacja

    - NEUROIMAGE - Rok 2021

    A wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various frequency ranges are coordinated across the space of the human cortex and time of memory processing is inconclusive. They can either be coordinated together across the frequency spectrum at the same cortical site and time or induced independently in particular bands. We used a large dataset of human intracranial...

    Pełny tekst do pobrania w portalu

  • Independent dynamics of slow, intermediate, and fast intracranial EEG spectral activities during human memory formation
    Publikacja

    - Rok 2021

    A wide spectrum of brain rhythms are engaged throughout the human cortex in cognitive functions. How the rhythms of various low and high frequencies are spatiotemporally coordinated across the human brain during memory processing is inconclusive. They can either be coordinated together across a wide range of the frequency spectrum or induced in specific bands. We used a large dataset of human intracranial electroencephalography...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Leveraging spatio-temporal features for joint deblurring and segmentation of instruments in dental video microscopy
    Publikacja

    - Rok 2021

    In dentistry, microscopes have become indispensable optical devices for high-quality treatment and micro-invasive surgery, especially in the field of endodontics. Recent machine vision advances enable more advanced, real-time applications including but not limited to dental video deblurring and workflow analysis through relevant metadata obtained by instrument motion trajectories. To this end, the proposed work addresses dental...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
    Publikacja

    - Rok 2021

    A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Production of six-degrees-of-freedom (6DoF) navigable audio using 30 Ambisonic microphones
    Publikacja
    • B. Mróz
    • M. Kabaciński
    • T. Ciotucha
    • A. Rumiński
    • T. Żernicki

    - Rok 2021

    This paper describes a method for planning, recording, and post-production of six-degrees-of-freedom audio recorded with multiple 3rd order Ambisonic microphone arrays. The description is based on the example of recordings conducted in August 2020 with the Poznan Philharmonic Orchestra using 30 units of Zylia ZM-1S. A convenient way to prepare and organize such a big project is proposed – this involves details of stage planning,...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Robustness in Compressed Neural Networks for Object Detection
    Publikacja

    Model compression techniques allow to significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. Simultaneously, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving...

    Pełny tekst do pobrania w portalu

  • Selective monitoring of noise emitted by vehicles involved in road traffic

    An acoustic intensity probe was developed measures the sound intensity in three orthogonal directions, making possible to calculate the azimuth and elevation angles, describing the sound source position. The acoustic sensor is made in the form of a cube with a side of 10 mm, on the inner surfaces of which the digital MEMS microphones are mounted. The algorithm works in two stages. The first stage is based on the analysis of sound...

    Pełny tekst do pobrania w portalu

  • Skuteczność klasyfikacji gatunków muzycznych za pomocą sieci neuronowej w zależności od typu danych wejściowych
    Publikacja

    Rozpoznawanie gatunku muzycznego jest jednym z podstawowych elementów inteligentnych systemów tworzenia automatycznych list muzyki. Platformy strumieniowe oferujące taką usługę wymagają rozwiązań, które umożliwią jak najdokładniej określić przynależność utworu do gatunku muzycznego. Zgodnie z aktualnym stanem wiedzy – najskuteczniejszym klasyfikatorem są sztuczne sieci neuronowe (w tym w wersji uczenia głębokiego), dla których...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Techniki wielokanałowe wykorzystywane w koncertach i nagraniach muzycznych na odległość
    Publikacja

    W czasie pandemii koronawirusa COVID-19 nowego znaczenia nabrały możliwości transmisji dźwięku z obrazem – zwłaszcza do pracy zdalnej, która w przypadku muzyków jest szczególnym wyzwaniem zarówno w kontekście wspólnych ćwiczeń i prób, jak i koncertów. Wynikła konieczność wieloźródłowego połączenia ujawniła potrzebę uprzestrzennienia dźwięku w celu łatwiejszej lokalizacji źródeł dźwięku. Tworzenie zdalnych nagrań muzycznych stało...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Towards Cancer Patients Classification Using Liquid Biopsy

    Liquid biopsy is a useful, minimally invasive diagnostic and monitoring tool for cancer disease. Yet, developing accurate methods, given the potentially large number of input features, and usually small datasets size remains very challenging. Recently, a novel feature parameterization based on the RNA-sequenced platelet data which uses the biological knowledge from the Kyoto Encyclopedia of Genes and Genomes, combined with a classifier...

    Pełny tekst do pobrania w serwisie zewnętrznym

Rok 2020
  • 1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
    Publikacja

    A network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence....

  • A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces
    Publikacja
    • G. Tamulevicius
    • G. Korvel
    • A. B. Yayak
    • P. Treigys
    • J. Bernataviciene
    • B. Kostek

    - Electronics - Rok 2020

    In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation...

    Pełny tekst do pobrania w portalu

  • Adaptive traffic optimization using Variable Speed Limits; Adaptacyjna optymalizacja ruchu drogowego przy pomocy zmiennych ograniczeń prędkości
    Publikacja

    - Rok 2020

    Variable speed limits (VSL) is an intelligent transportation system (ITS) solution for traffic management. The speed limits can be changed dynamically in order to adapt to traffic, weather, or road surface conditions. This paper presents an approach for such an adaptive traffic control where the primary goal is to ensure traffic safety and efficiency of the traffic control system (fast response to dynamically changing traffic,...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Ambisoniczna mapa wybranych miejsc w Trójmieście
    Publikacja

    - Rok 2020

    Projekt miał na celu stworzenie ambisonicznej mapy Trójmiasta w formie aplikacji internetowej. Materiały wideo w technologii 360 z dźwiękiem w postaci sygnału ambisonicznego zostały zarejestrowane w lokalizacjach Trójmiasta, które uznano za charakterystyczne dla tej aglomeracji. Celem badawczym projektu było porównanie dostępnych algorytmów miksowania sygnałów ambisonicznych poprzez przeprowadzenie testów odsłuchowych. Przeprowadzono...

    Pełny tekst do pobrania w portalu