Department of Multimedia Systems - Administrative Units - Bridge of Knowledge

Department of Multimedia Systems

total: 911

Catalog Publications

Year 2024
  • 3-D Printable Metal-Dielectric Metasurface for Risley Prism-Based Beam-Steering Antennas
    Publication

    - IEEE Access - Year 2024

    A 3-D printable, planar, metal-dielectric metasurface-based, 2-D beam-steering system for aperture-type antennas is presented in this paper. This beam-steering system, also known as the near-field meta-steering system, comprises two fully passive phase-gradient metasurfaces placed in the antenna’s near-field region to steer the radiation beam. To address the non-uniform electric field phase of the aperture antenna, phase correction...

    Full text available to download

  • A Comparison of Directional Beamforming Capabilities: High-Order Ambisonic Microphone vs. Shotgun Microphones

    This article presents the practical implications of the directional beamforming capability of a higher-order ambisonic microphone compared with popular shotgun microphones. Five different microphones were used in the study: Sennheiser MKH 416, Rode NTG2, Panasonic AG-MC200, Zoom SGH-6, and Zylia ZM-1 (ambisonic microphone). The results highlight the versatility of higher-order ambisonics for non-immersive use, which allows for...

    Full text to download in external service

  • A Mammography Data Management Application for Federated Learning
    Publication

    This study aimed to develop and assess an application designed to enhance the management of a local client database consisting of mammographic images with a focus on ensuring that images are suitably and uniformly prepared for federated learning applications. The application supports a comprehensive approach, starting with a versatile image-loading function that supports DICOM files from various medical imaging devices and settings....

    Full text to download in external service

  • Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-free Continual Learning
    Publication
    • F. Szatkowski
    • M. Pyła
    • M. Przewięźlikowski
    • S. Cygert
    • B. Twardowski
    • T. Trzciński

    - Year 2024

    In this work, we investigate exemplar-free class incremental learning (CIL) with knowledge distillation (KD) as a regularization strategy, aiming to prevent forgetting. KD-based methods are successfully used in CIL, but they often struggle to regularize the model without access to exemplars of the training data from previous tasks. Our analysis reveals that this issue originates from substantial representation shifts in the teacher...

    Full text to download in external service

  • Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery
    Publication
    • G. Rypeść
    • D. Marczak
    • S. Cygert
    • T. Trzciński
    • B. Twardowski

    - Year 2024

    Generalized Continual Category Discovery (GCCD) tackles learning from sequentially arriving, partially labeled datasets while uncovering new categories. Traditional methods depend on feature distillation to prevent forgetting the old knowledge. However, this strategy restricts the model’s ability to adapt and effectively distinguish new categories. To address this, we introduce a novel technique integrating a learnable projector...

    Full text to download in external service

  • Decoding imagined speech for EEG-based BCI
    Publication

    - Year 2024

    Brain–computer interfaces (BCIs) are systems that transform the brain's electrical activity into commands to control a device. To create a BCI, it is necessary to establish the relationship between a certain stimulus, internal or external, and the brain activity it provokes. A common approach in BCIs is motor imagery, which involves imagining limb movement. Unfortunately, this approach allows few commands. As an alternative, this...

    Full text to download in external service

  • Deep learning techniques for biometric security: A systematic review of presentation attack detection systems
    Publication

    - ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE - Year 2024

    Biometric technology, including finger vein, fingerprint, iris, and face recognition, is widely used to enhance security in various devices. In the past decade, significant progress has been made in improving biometric systems, thanks to advancements in deep convolutional neural networks (DCNN) and computer vision (CV), along with large-scale training datasets. However, these systems have become targets of various attacks, with...

    Full text to download in external service

  • Developing a Low SNR Resistant, Text Independent Speaker Recognition System for Intercom Solutions - A Case Study
    Publication

    This article presents a case study on the development of a biometric voice verification system for an intercom solution, utilizing the DeepSpeaker neural network architecture. Despite the variety of solutions available in the literature, there is a noted lack of evaluations for "text-independent" systems under real conditions and with varying distances between the speaker and the microphone. This article aims to bridge this gap....

    Full text available to download

  • Divide and not forget: Ensemble of selectively trained experts in Continual Learning
    Publication
    • G. Rypeść
    • S. Cygert
    • V. Khan
    • T. Trzciński
    • B. Zieliński
    • B. Twardowski

    - Year 2024

    Class-incremental learning is becoming more popular as it helps models widen their applicability while not forgetting what they already know. A trend in this area is to use a mixture-of-experts technique, where different models work together to solve the task. However, the experts are usually trained all at once using whole task data, which makes them all prone to forgetting and increases the computational burden. To address this limitation,...

    Full text available to download

  • English Language Learning Employing Developments in Multimedia IS
    Publication

    In the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...

    Full text available to download

  • Exploring music listening patterns: an online survey
    Publication

    An online survey was carried out to explore how respondents listen to music recordings. It was anticipated that the listener’s preferences would be influenced by various factors, such as age, music genre, the contexts in which they listen, and their favored methods of music consumption. Consequently, the data were collected to analyze these relationships. The survey, structured as a web application, encompassed 23 questions,...

    Full text available to download

  • Finger Vein Presentation Attack Detection Method Using a Hybridized Gray-Level Co-Occurrence Matrix Feature with Light-Gradient Boosting Machine Model
    Publication

    - Year 2024

    Presentation Attack Detection (PAD) is crucial in biometric finger vein recognition. The susceptibility of these systems to forged finger vein images is a significant challenge. Existing approaches to mitigate presentation attacks have computational complexity limitations and limited data availability. This study proposed a novel method for identifying presentation attacks in finger vein biometric systems. We have used optimal...

    Full text available to download

  • High frequency oscillations in human memory and cognition: a neurophysiological substrate of engrams?

    Despite advances in understanding the cellular and molecular processes underlying memory and cognition, and recent successful modulation of cognitive performance in brain disorders, the neurophysiological mechanisms remain underexplored. High frequency oscillations beyond the classic electroencephalogram spectrum have emerged as a potential neural correlate of fundamental cognitive processes. High frequency oscillations are detected...

    Full text available to download

  • Identification of a Musical Instrument from an Audio Recording Using Artificial Neural Networks (Identyfikacja instrumentu muzycznego z nagrania fonicznego za pomocą sztucznych sieci neuronowych)
    Publication

    - Year 2024

    The aim of the dissertation is to investigate algorithms for identifying instruments present in a polyphonic signal using artificial neural networks. The theoretical part recalls the fundamentals of audio signal processing in the context of extracting the signal parameters used in neural network training. Additionally, the development of machine learning methods is analyzed, taking into account the division into neural networks of the first,...

    Full text available to download

  • Improving platelet‐RNA‐based diagnostics: a comparative analysis of machine learning models for cancer detection and multiclass classification
    Publication

    - Molecular Oncology - Year 2024

    Liquid biopsy demonstrates excellent potential in patient management by providing a minimally invasive and cost-effective approach to detecting and monitoring cancer, even at its early stages. Due to the complexity of liquid biopsy data, machine-learning techniques are increasingly gaining attention in sample analysis, especially for multidimensional data such as RNA expression profiles. Yet, there is no agreement in the community...

    Full text available to download

  • Learning sperm cells part segmentation with class-specific data augmentation
    Publication

    - Year 2024

    Infertility affects around 15% of couples worldwide. Male fertility problems include poor sperm quality and low sperm count. Advanced fertility treatment methods such as ICSI are now supported by vision systems that assist embryologists in selecting good-quality sperm. Computer-Assisted Semen Analysis (CASA) provides quantitative and qualitative sperm analysis concerning concentration, motility, morphology, vitality, and fragmentation....

    Full text to download in external service

  • Leveraging Activation Maps for Improved Acoustic Events Detection and Classification
    Publication

    This paper presents a novel approach to enhance the accuracy of deep learning models for acoustic event detection and classification in real-world environments. We introduce a method that leverages activation maps to identify and address model overfitting, combined with an expert-knowledge-based event detection algorithm for data pre-processing. Our approach significantly improved classification performance, increasing the F1 score...

    Full text to download in external service

  • Looking through the past: better knowledge retention for generative replay in continual learning
    Publication
    • V. Khan
    • S. Cygert
    • K. Deja
    • T. Trzciński
    • B. Twardowski

    - IEEE Access - Year 2024

    In this work, we improve generative replay in a continual learning setting to perform well on challenging scenarios. Because of the growing complexity of continual learning tasks, it is becoming more popular to apply the generative replay technique in the feature space instead of image space. Nevertheless, such an approach does not come without limitations. In particular, we notice the degradation of the continually trained...

    Full text available to download

  • MagMax: Leveraging Model Merging for Seamless Continual Learning
    Publication
    • D. Marczak
    • B. Twardowski
    • T. Trzciński
    • S. Cygert

    - Year 2024

    This paper introduces a continual learning approach named MagMax, which utilizes model merging to enable large pre-trained models to continuously learn from new data without forgetting previously acquired knowledge. Distinct from traditional continual learning methods that aim to reduce forgetting during task training, MagMax combines sequential fine-tuning with a maximum magnitude weight selection for effective knowledge integration...

    Full text to download in external service

  • Missing Puzzle Pieces in Dementia Research: HCN Channels and Theta Oscillations
    Publication

    - Aging and Disease - Year 2024

    Increasing evidence indicates a role of hyperpolarization-activated cation (HCN) channels in controlling the resting membrane potential, pacemaker activity, memory formation, sleep, and arousal. Their dysfunction may be associated with the development of epilepsy and age-related memory decline. Neuronal hyperexcitability involved in epileptogenesis and EEG desynchronization occur in the course of dementia in human Alzheimer’s Disease...

    Full text available to download

  • Mobilenet-V2 Enhanced Parkinson's Disease Prediction with Hybrid Data Integration
    Publication

    - Year 2024

    This study investigates the role of deep learning models, particularly MobileNet-v2, in Parkinson's Disease (PD) detection through handwriting spiral analysis. Handwriting difficulties often signal early signs of PD, necessitating early detection tools due to potential impacts on patients' work capacities. The study utilizes a three-fold approach, including data augmentation, algorithm development for simulated PD image datasets,...

    Full text available to download

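  • Development of a Methodology for Recognizing and Classifying Emotions in Films Using Artificial Neural Networks (Opracowanie metodologii rozpoznawania i klasyfikowania emocji w filmach przy użyciu sztucznych sieci neuronowych)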
    Publication

    - Year 2024

    The aim of the doctoral dissertation is to develop a methodology for recognizing and classifying emotions in film using artificial neural networks. The work presents topics related to the coloring of a film scene in the context of the effect of color on the viewer's emotions. To analyze the influence of films on the viewer's emotions, a selection of film titles was made, followed by a series of preliminary subjective tests allowing...

    Full text available to download

  • Reverberation divergence in VR applications

    This project aimed to investigate the correlation between virtual reality (VR) imagery and ambisonic sound. With the increasing popularity of VR applications, understanding how sound is perceived in virtual environments is crucial for enhancing the immersiveness of the experience. In the experiment, participants were immersed in a virtual environment that replicated a concert hall. Their task was to assess the correspondence between...

    Full text to download in external service

  • Revisiting Supervision for Continual Representation Learning
    Publication
    • D. Marczak
    • S. Cygert
    • T. Trzciński
    • B. Twardowski

    - Year 2024

    In the field of continual learning, models are designed to learn tasks one after the other. While most research has centered on supervised continual learning, there is a growing interest in unsupervised continual learning, which makes use of the vast amounts of unlabeled data. Recent studies have highlighted the strengths of unsupervised methods, particularly self-supervised learning, in providing robust representations. The improved...

    Full text to download in external service

  • Sounding Mechanism of a Flue Organ Pipe—A Multi-Sensor Measurement Approach
    Publication

    - SENSORS - Year 2024

    This work presents an approach that integrates the results of measuring, analyzing, and modeling air flow phenomena driven by pressurized air in a flue organ pipe. The investigation concerns a Bourdon organ pipe. Measurements are performed in an anechoic chamber using the Cartesian robot equipped with a 3D acoustic vector sensor (AVS) that acquires both acoustic pressure and air particle velocity. Also, a high-speed camera is employed...

    Full text available to download

  • Task-recency bias strikes back: Adapting covariances in Exemplar-Free Class Incremental Learning
    Publication
    • G. Rypeść
    • S. Cygert
    • T. Trzciński
    • B. Twardowski

    - Year 2024

    Exemplar-Free Class Incremental Learning (EFCIL) tackles the problem of training a model on a sequence of tasks without access to past data. Existing state-of-the-art methods represent classes as Gaussian distributions in the feature extractor's latent space, enabling Bayes classification or training the classifier by replaying pseudo features. However, we identify two critical issues that compromise their efficacy when the feature...

    Full text available to download

  • The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish
    Publication

    - Year 2024

    The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...

    Full text available to download

Year 2023