Search results for: AUTOMATIC AUDIO RECONSTRUCTION - Bridge of Knowledge


  • Measurement of Latency in the Android Audio Path

    Publication

    This paper describes experimental investigations comparing the audio path characteristics of various Android versions. First, the changes introduced in each system version are presented in the context of the latency they cause. Then, a measurement procedure employing available applications to measure latency is described and compared with results published on the Internet. Finally, a comparison...

    Full text to download in external service

  • SYNAT_MUSIC_GENRE_FV_173

    Open Research Data

    This is the original dataset containing 51582 music tracks (22 music genres) and a 173-element feature vector [1-6,9]. A collection of more than 50000 music excerpts described with a set of descriptors obtained through the analysis of 30-second mp3 recordings was gathered in a database called SYNAT. The SYNAT database was realized by the Gdansk University...

  • Retrospecting Polish Audio Engineering Society Membership on 20th Anniversary of the Polish Section of the Audio Engineering Society

    Publication

    - Archives of Acoustics - Year 2011

    This article presents some key events concerning the founding of the Polish Section of the Audio Engineering Society. In addition, the history of the International Symposia on Sound Engineering and Mastering is outlined, and the papers contained in this issue are briefly reviewed.

    Full text available to download

  • A new method of audio-visual correlation analysis

    Publication

    - Year 2009

    This paper presents a new methodology for conducting audio-visual correlation analysis employing a gaze tracking system. The interaction between two perceptual modalities, seeing and hearing, and their mutual reinforcement in a complex relationship have been the subject of many research studies. An earlier stage of the experiments carried out at the Multimedia Systems Department (MSD) showed that there exists a relationship between...

    Full text to download in external service

  • Objectivization of audio-video correlation assessment experiments

    Publication

    - Year 2010

    The purpose of this paper is to present a new method of conducting an audio-visual correlation analysis employing a head-motion-free gaze tracking system. First, a review of related works in the domain of sound and vision correlation is presented. Then, assumptions concerning audio-visual scene creation are briefly described. The objectivization process of carrying out correlation tests employing a gaze-tracking system is outlined....

    Full text to download in external service

  • Intelligent video and audio applications for learning enhancement

    The role of computers in school education is briefly discussed. The history of multimodal interface development is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with facial expressions, and a speech-stretching audio interface representing the audio modality....

    Full text available to download

  • System for automatic singing voice recognition

    The article presents a system for automatic recognition of singing voice quality and type. The database and the implemented parameters are presented. The decision algorithm is an artificial neural network. The trained decision system achieves an accuracy of approx. 90% in both recognition categories. Additionally, it was shown by means of statistical methods that the results of the system for automatic assessment of technical quality...

  • Detection of impulsive disturbances in archive audio signals

    Publication

    In this paper, the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...

    Full text available to download

  • Exploiting audio-visual correlation by means of gaze tracking

    This paper presents a novel means of increasing the reliability of audio-visual correlation analysis. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the history of and current research in the area of audio-visual perception analysis are briefly reviewed. Then the methodology employing gaze tracking is presented along with the...

    Full text available to download

  • Weighted 2-sections and hypergraph reconstruction

    Publication

    In the paper we introduce the notion of weighted 2-sections of hypergraphs with integer weights and study the following hypergraph reconstruction problems: (1) Given a weighted graph (G, w), is there a hypergraph H such that (G, w) is its weighted 2-section? (2) Given a weighted 2-section (G, w), find a hypergraph H such that (G, w) is its weighted 2-section. We show that (1) is NP-hard even if G is a complete graph or integer weights w do not exceed...

    Full text to download in external service

  • 2022/2023_zima SCADA Systems in Automatic Control

    e-Learning Courses
    • P. A. Kaczmarek

    SCADA Systems in Automatic Control - project materials

  • 2021/2022_zima SCADA Systems in Automatic Control

    e-Learning Courses
    • P. A. Kaczmarek

    SCADA Systems in Automatic Control - project materials

  • Personal adaptive tuning of mobile computer audio

    An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned to the user's preferences....

  • Elimination of impulsive disturbances from stereo audio recordings

    Publication

    This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. On-line tracking of signal model parameters is performed using the stability-preserving Whittle-Wiggins-Robinson algorithm with exponential data weighting. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples...

    Full text to download in external service

  • 4D Reconstruction and Visualisation of Krakow Fortress

    Publication
    • E. G. Głowienka
    • K. Michałowska
    • P. Opaliński
    • B. Hejmanowska
    • S. Mikrut
    • P. Kramarczyk
    • A. Struś

    - Year 2017

    The specific aim of the European project named "Cultural Heritage Through Time" (CHT2), reported in this paper, is to fully integrate the fourth dimension (4D) into Cultural Heritage studies for analysing structures and landscapes over time. Krakow, the Fortress City (Poland), is one of four CHT2 case studies, which are used for time-varying reconstruction, analysis, visualization, and preservation. The goal of...

  • Digital Audio Broadcasting or Webcasting: A Network Quality Perspective

    In recent years, many alternative technologies for delivering audio content have emerged, with different advantages and disadvantages. In this paper, the pros and cons of digital audio broadcasting and webcasting transmission techniques are described from a network quality perspective. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...

    Full text available to download

  • System for prototyping wireless intelligent audio-video monitoring devices (System do prototypowania bezprzewodowych inteligentnych urządzeń monitoringu audio-video)

    Publication

    - Year 2013

    The communication presents a system for prototyping wireless audio-video monitoring devices. The system is based on Virtex6 FPGA devices and many additional supporting components, such as fast DDR3 memory, a small HD camera, a microphone with an A/D converter, a WiFi radio module, etc. The functionality of the system is described in detail in the communication. The system has been optimized to run under the Linux operating system, and...

  • Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering

    This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponentially weighted least squares algorithm. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples is realized using an adaptive, variable-order...

    Full text available to download

  • Testing Watermark Robustness against Application of Audio Restoration Algorithms

    Publication

    The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to the application of audio restoration algorithms. Several restoration routines, such as noise reduction, spectrum expansion, and clipping or click reduction, were applied in an online web system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...

    Full text to download in external service

  • Multibeam data processing for 3D object shape reconstruction

    Publication

    The technology of hydroacoustic scanning offers an efficient and widely-used source of geospatial information regarding underwater environments, providing measurement data which usually have the structure of irregular groups of points known as point clouds. Since this data model has known disadvantages, a different form of representation based on representing surfaces with simple geometric structures, such as edges and facets,...

    Full text available to download

  • Reconstruction of 3D image of corona discharge streamer

    Publication
    • M. Kocik
    • M. Tański
    • J. Mizeraczyk
    • R. Ichiki
    • S. Kanazawa
    • J. Dembski

    - Year 2010

    In this paper, a method of reconstructing the 3D structure of streamers in DC positive corona discharge in a nozzle-to-plate electrode configuration is presented. To reconstruct the 3D image of a corona discharge streamer, we propose a stereographic method in which streamers are observed from several directions simultaneously. The multi-directional observation made it possible to obtain fine positional coordinates of streamers for a...

    Full text to download in external service

  • A double-talk detector using audio watermarking

    A novel approach to double-talk detection in the acoustic echo canceler is proposed. A hidden signature is embedded into the arriving signal using the echo-hiding method. Next, detection of the presence of this signature in the microphone signal is performed. The results of the signature detection may be used by the acoustic echo canceler to stop or restart the adaptation process.

    Full text to download in external service

  • Processing of Hydroacoustic and LiDAR Data for Three-dimensional Surface Reconstruction

    The technologies of sonar and laser scanning are commonly used for obtaining spatial information about underwater and above-ground environments in the form of point clouds. Since this data model has known disadvantages, a more practical solution for visualising such data involves the creation of solid three-dimensional meshes composed of edges and facets. In this paper, several methods for 3D shape reconstruction of data obtained...

  • Bożena Kostek prof. dr hab. inż.

  • SYNAT Music Genre Parameters PCA 19

    Open Research Data

    The dataset contains feature vectors obtained after performing Principal Component Analysis (PCA), so there are 11 music genres and a 19-element vector derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of 52532 music excerpts described...

  • SYNAT_PCA_48

    Open Research Data

    There is a series of datasets containing feature vectors derived from music tracks. The dataset contains 51582 music tracks (22 music genres) and feature vectors obtained after performing Principal Component Analysis (PCA), so there are 48-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier...

  • SYNAT_PCA_11

    Open Research Data

    The dataset contains 51582 music tracks (22 music genres) and feature vectors obtained after performing Principal Component Analysis (PCA), so there are 11-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of more than...

  • Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

    Publication
    • D. Korzekwa
    • R. Barra-Chicote
    • B. Kostek
    • T. Drugman
    • M. Łajszczak

    - Year 2019

    We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model's key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

    Full text available to download

  • Localization of impulsive disturbances in audio signals using template matching

    In this paper, a new solution to the problem of elimination of impulsive disturbances from audio signals, based on the matched filtering technique, is proposed. The new approach stems from the observation that a large proportion of noise pulses corrupting audio recordings have highly repetitive shapes that match several typical “patterns”. In many cases a representative set of exemplary pulse waveforms can be extracted from the...

    Full text available to download

  • 3D-Breast System for Determining the Volume of Tissue Needed for Breast Reconstruction

    3D imaging systems can be used to effectively determine breast volumes for surgical applications. This article presents methods for surface reconstruction and volume determination based on the point cloud created by 3D imaging. Such a system would be used to accurately estimate breast volume in patients classified for breast reconstruction surgery at plastic surgery centers. To develop such a system, various methods of determining...

    Full text to download in external service

  • "3D-Breast System for Determining the Volume of Tissue Needed for Breast Reconstruction"

    Publication

    This article presents methods for surface reconstruction and volume determination based on the point cloud created by 3D imaging. Such a system would be used to accurately estimate breast volume in patients classified for breast reconstruction surgery at plastic surgery centers. To develop such a system, various methods of determining volume, based on images from the Intel D435i camera, were tested. In addition, an application...

  • Craniomaxillofacial Trauma & Reconstruction

    Journals

    ISSN: 1943-3875 , eISSN: 1943-3883

  • Objectivization of Audio-Visual Correlation analysis

    Publication

    Simultaneous perception of audio and visual stimuli often causes the concealment or misrepresentation of information actually contained in these stimuli. Such effects are called the "image proximity effect" or the "ventriloquism effect" in the literature. Until recently, most research carried out to understand their nature was based on subjective assessments. The authors of this paper propose a methodology based on both subjective...

    Full text available to download

  • Employing flowgraphs for forward route reconstruction in video surveillance system

    Pawlak’s flowgraphs were utilized as a base idea and knowledge container for prediction and decision-making algorithms applied to an experimental video surveillance system. The system is used for tracking people inside buildings in order to obtain information about their appearance and movement. The fields of view of the cameras did not overlap. Therefore, when an object was moving through unsupervised areas, prediction was needed...

    Full text available to download

  • Cartographic Representation of Route Reconstruction Results in Video Surveillance System

    Publication

    The video streams available in a surveillance system distributed over a wide area may be accompanied by metadata obtained as a result of video processing. Many algorithms applied to surveillance systems, e.g. event detection or object tracking, are strictly connected with localization of the object and reconstruction of its route. Drawing related information on a plan of a building or on a map of the city can facilitate the...

    Full text to download in external service

  • Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture

    The aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed,...

    Full text available to download

  • Text classifiers for automatic articles categorization

    Publication

    The article concerns the problem of automatic classification of textual content. We present selected methods for generating document representations and evaluate them in classification tasks. The experiments were performed on Wikipedia articles, classified automatically into the categories assigned by Wikipedia editors.

  • Thermal sequences database of the skin flaps in breast reconstruction and burns

    Publication

    This paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered throughout a 6-year study on ADT application in skin flap blood perfusion monitoring and burn wound diagnosis. For skin flap monitoring, the database comprises data collected during three different breast reconstruction procedures. The patients were monitored pre-, intra- and postoperatively within a 90-day period. The sequences were used...

  • Thermal sequences database of the skin flaps in breast reconstruction and burns

    This paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered throughout a 6-year study on ADT application in skin flap blood perfusion monitoring and burn wound diagnosis. For skin flap monitoring, the database comprises data collected during three different breast reconstruction procedures. The patients were monitored pre-, intra- and postoperatively within a 90-day period. The sequences were used...

    Full text available to download

  • Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization

    Publication

    - Year 2017

    An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...

  • The use of the static thermography in monitoring flap perfusion in breast reconstruction with TRAM flap

    Publication

    - Year 2016

    This paper presents the results of static thermography for intraoperative and postoperative imaging of TRAM flap perfusion. The results were compared with the clinical examination of flap perfusion. The study was conducted on a group of 38 female patients who underwent breast reconstruction.

    Full text to download in external service

  • Parametric impulsive noise detector for corrupted audio signals based on hidden Markov model

    Publication

    - Year 2008

    The paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modeling of signals is introduced. The noise detection algorithm, based on a discrete-time hidden Markov model (HMM) of the whitened audio signal, is elaborated.

  • Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?

    Publication

    - Year 2022

    In this paper, examples of intelligent audio signal processing are briefly described. The focus is, however, on the machine learning approach and the datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

    Full text available to download

  • Using concentrated spectrogram for analysis of audio acoustic signals

    Publication

    The paper presents results of time-frequency analysis of audio acoustic signals using the method of Concentrated Spectrograph, also known as the "cross-spectral method" or "reassignment method". The presented algorithm uses the signal's local group delay and channelized instantaneous frequency to appropriately redistribute all short-time Fourier transform lines in the time-frequency plane. The main intention of the paper is to compare various...

    Full text available to download

  • RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING

    Publication

    The paper presents a new approach to elimination of broadband noise and impulsive disturbances from archive audio recordings. The proposed adaptive Kalman-like algorithm, based on a sparse autoregressive model of the audio signal, simultaneously detects noise pulses, interpolates the irrevocably distorted samples and performs signal smoothing. It is shown that bidirectional (forward-backward) processing of the archive signal improves...

    Full text to download in external service

  • Active dynamic thermography method for TRAM flap blood perfusion mapping in breast reconstruction

    Publication

    - QIRT Journal - Year 2017

    This paper presents a new method of transverse rectus abdominis musculocutaneous flap blood perfusion mapping based on active dynamic thermography. The method is aimed at aiding a surgeon during a breast reconstruction procedure. A pair of parameters, dTnorm and t90_10, was used as parametric image descriptors of the flap blood perfusion. The method was tested on 38 patients who underwent a breast reconstruction procedure....

    Full text available to download

  • Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances

    Publication

    Archive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...

    Full text to download in external service

  • Objectivization of phonological evaluation of speech elements by means of audio parametrization

    This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as particularly difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...

  • Quality Analysis of Audio-Video Transmission in an OFDM-Based Communication System

    Publication

    - Year 2022

    The application of a reliable audio-video communication system brings many advantages. With the spoken word we can exchange ideas, provide descriptive information, and give aid to another person. With the availability of visual information one can monitor the surroundings, the working environment, etc. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission. Currently, orthogonal frequency...

    Full text to download in external service

  • Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study

    Publication

    This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are presented to the listeners with or without visual cues. The subjective test participants are asked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

    Full text available to download