Search results for: AUTOMATIC AUDIO RECONSTRUCTION - Bridge of Knowledge

Search

Search results for: AUTOMATIC AUDIO RECONSTRUCTION

Filters

total: 1193
filtered: 895

clear all filters


Chosen catalog filters

  • Category

  • Year

  • Options

clear Chosen catalog filters disabled

Search results for: AUTOMATIC AUDIO RECONSTRUCTION

  • Intelligent video and audio applications for learning enhancement

    The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....

    Full text available to download

  • System for automatic singing voice recognition

    W artykule przedstawiono system automatycznego rozpoznawania jakości i typu głosu śpiewaczego. Przedstawiono bazę danych oraz zaimplementowane parametry. Algorytmem decyzyjnym jest algorytm sztucznych sieci neuronowych. Wytrenowany system decyzyjny osiąga skuteczność ok. 90% w obydwu kategoriach rozpoznawania. Dodatkowo wykazano przy pomocy metod statystycznych, że wyniki działania systemu automatycznej oceny jakości technicznej...

  • Detection of impulsive disturbances in archive audio signals

    Publication

    In this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...

    Full text available to download

  • Exploiting audio-visual correlation by means of gaze tracking

    This paper presents a novel means for increasing audio-visual correlation analysis reliability. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the past history and current research in the area of audio-visual perception analysis are shortly reviewed. Then the methodology employing gaze tracking is presented along with the...

    Full text available to download

  • Weighted 2-sections and hypergraph reconstruction

    Publication

    In the paper we introduce the notion of weighted 2-sections of hypergraphs with integer weights and study the following hypergraph reconstruction problems: (1) Given a weighted graph , is there a hypergraph H such that is its weighted 2-section? (2) Given a weighted 2-section , find a hypergraph H such that is its weighted 2-section. We show that (1) is NP-hard even if G is a complete graph or integer weights w does not exceed...

    Full text to download in external service

  • Personal adaptive tuning of mobile computer audio

    An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....

  • Elimination of impulsive disturbances from stereo audio recordings

    Publication

    This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. On-line tracking of signal model parameters is performed using the stability-preserving Whittle-Wiggins-Robinson algorithm with exponential data weighting. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples...

    Full text to download in external service

  • 4D Reconstruction and Visualisation of Krakow Fortress

    Publication
    • E. G. Głowienka
    • K. Michałowska
    • P. Opaliński
    • B. Hejmanowska
    • S. Mikrut
    • P. Kramarczyk
    • A. Struś

    - Year 2017

    The specific aim of the European project named "Cultural Heritage Through Time" (CHT2) and reported in this paper is to fully integrate the fourth dimension (4D) into Cultural Heritage studies for analysing structures and landscapes over time. Krakow-the Fortress City (Poland) is the one of four case studies of the CHT2, which are used for the time varying reconstruction, analysis, visualization, and preservation. The goal of...

  • Digital Audio Broadcasting or Webcasting: A Network Quality Perspective

    In recent years, many alternative technologies of delivering audio content have emerged, with different advantages and disadvantages. In this paper pros and cons of digital audio broadcasting and webcasting transmission techniques in a network quality perspective are described. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...

    Full text available to download

  • System do prototypowania bezprzewodowych inteligentnych urządzeń monitoringu audio-video

    Publication

    - Year 2013

    W komunikacie przedstawiono system prototypowania bezprzewodowych urządzeń do monitoringu audio-video. System bazuje na układach FPGA Virtex6 i wielu dodatkowych wspierających urządzeniach jak: szybka pamięć DDR3, mała kamera HD, mikrofon z konwerterem A/C, moduł radiowy WiFi, itp. Funkcjonalność systemu została szczegółowo opisana w komunikacie. System został zoptymalizowany do pracy pod kontrolą systemu operacyjnego Linux, zostały...

  • Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering

    This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...

    Full text available to download

  • Testing Watermark Robustness against Application of Audio Restoration Algorithms

    Publication

    The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to audio restoration algorithm performing. Several restoration routines such as noise reduction, spectrum expansion, clipping or clicks reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...

    Full text to download in external service

  • Multibeam data processing for 3D object shape reconstruction

    Publication

    The technology of hydroacoustic scanning offers an efficient and widely-used source of geospatial information regarding underwater environments, providing measurement data which usually have the structure of irregular groups of points known as point clouds. Since this data model has known disadvantages, a different form of representation based on representing surfaces with simple geometric structures, such as edges and facets,...

    Full text available to download

  • Reconstruction of 3D image of corona discharge streamer

    Publication
    • M. Kocik
    • M. Tański
    • J. Mizeraczyk
    • R. Ichiki
    • S. Kanazawa
    • J. Dembski

    - Year 2010

    In this paper, the method of reconstruction of the 3D structure of streamers in DC positive corona discharge in nozzle-to-plate electrode configuration is presented. For reconstructing of 3D image of corona discharge streamer we propose a stereographical method, where streamers are observed from several directions simultaneously. The multi-directional observation enabled to obtain fine positional coordinates of streamers for a...

    Full text to download in external service

  • A double-talk detector using audio watermarking

    a novel approach to double-talk detection in the acoustic echo canceler is proposed. a hidden signature is embedded into the arriving signal, using the echo-hiding method. next detection of the presence of this signature in the microphone signal is performed. the results of the signature detection may be used by the acoustic echo canceler to stop or restart the adaptation process.

    Full text to download in external service

  • Processing of Hydroacoustic and LiDAR Data for Three-dimensional Surface Reconstruction

    The technologies of sonar and laser scanning are commonly used for obtaining spatial information about underwater and over ground environments in the form of point clouds. Since this data model has known disadvantages, a more practical solution of visualising such data involves the creation of solid three-dimensional meshes composed of edges and facets. In this paper, several methods for 3D shape reconstruction of data obtained...

  • Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

    Publication
    • D. Korzekwa
    • R. Barra-Chicote
    • B. Kostek
    • T. Drugman
    • M. Łajszczak

    - Year 2019

    We present a novel deep learning model for the detection and reconstruction of dysarthric speech. We train the model with a multi-task learning technique to jointly solve dysarthria detection and speech reconstruction tasks. The model key feature is a low-dimensional latent space that is meant to encode the properties of dysarthric speech. It is commonly believed that neural networks are black boxes that solve problems but do not...

    Full text available to download

  • Localization of impulsive disturbances in audio signals using template matching

    In this paper, a new solution to the problem of elimination of impulsive disturbances from audio signals, based on the matched filtering technique, is proposed. The new approach stems from the observation that a large proportion of noise pulses corrupting audio recordings have highly repetitive shapes that match several typical “patterns”. In many cases a representative set of exemplary pulse waveforms can be extracted from the...

    Full text available to download

  • 3D-Breast System for Determining the Volume of Tissue Needed for Breast Reconstruction

    3D imaging systems can be used to effectively determine breast volumes for surgical applications. This article presents methods for surface reconstruction and volume determination based on the point cloud created by 3D imaging. Such a system would be used to accurately estimate breast volume in patients classified for breast reconstruction surgery at plastic surgery centers. To develop such a system, various methods of determining...

    Full text to download in external service

  • "3D-Breast System for Determining the Volume of Tissue Needed for Breast Reconstruction"

    Publication

    This article presents methods for surface reconstruction and volume determination based on the point cloud created by 3D imaging. Such a system would be used to accurately estimate breast volume in patients classified for breast reconstruction surgery at plastic surgery centers. To develop such a system, various methods of determining volume, based on images from the Intel D435i camera, were tested. In addition, an application...

  • Objectivization of Audio-Visual Correlation analysis

    Publication

    Simultaneous perception of audio and visual stimuli often causes the concealment or misrepresentation of information actually contained in these stimuli. Such effects are called the ''image proximity effect'' or the ''ventriloquism effect'' in literature. Until recently, most research carried out to understand their nature was based on subjective assessments. The Authors of this paper propose a methodology based on both subjective...

    Full text available to download

  • Employing flowgraphs for forward route reconstruction in video surveillance system

    Pawlak’s flowgraphs were utilized as a base idea and knowledge container for prediction and decision making algorithms applied to experimental video surveillance system. The system is used for tracking people inside buildings in order to obtain information about their appearance and movement. The fields of view of the cameras did not overlap. Therefore, when an object was moving through unsupervised areas, prediction was needed...

    Full text available to download

  • Cartographic Representation of Route Reconstruction Results in Video Surveillance System

    Publication

    The video streams available in a surveillance system distributed on the wide area may be accompanied by metadata are obtained as a result of video processing. Many algorithms applied to surveillance systems, e.g. event detection or object tracking, are strictly connected with localization of the object and reconstruction of its route. Drawing related information on a plan of a building or on a map of the city can facilitate the...

    Full text to download in external service

  • Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture

    The aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed,...

    Full text available to download

  • Text classifiers for automatic articles categorization

    Publication

    The article concerns the problem of automatic classification of textual content. We present selected methods for generation of documents representation and we evaluate them in classification tasks. The experiments have been performed on Wikipedia articles classified automatically to their categories made by Wikipedia editors.

  • Thermal sequences database of the skin flaps in breast reconstruction and burns

    Publication

    This paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered throughout 6 year study on ADT application in skin flap blood perfusion monitoring and burn wounds diagnosis. For skin flap monitoring the database comprises of data collected during three different breast reconstruction procedures. The patients were monitored pre, intra and post surgically within 90 days period. The sequences were used...

  • Thermal sequences database of the skin flaps in breast reconstruction and burns

    This paper presents a database of Active Dynamic Thermography (ADT) thermal sequences gathered throughout 6 year study on ADT application in skin flap blood perfusion monitoring and burn wounds diagnosis. For skin flap monitoring the database comprises of data collected during three different breast reconstruction procedures. The patients were monitored pre, intra and post surgically within 90 days period. The sequences were used...

    Full text available to download

  • Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization

    Publication

    - Year 2017

    An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...

  • The use of the static thermography in monitoring flap perfusion in breast reconstruction with TRAM flap

    Publication

    - Year 2016

    This paper shows results of the static thermography for intraoperative and postoperative imaging of TRAM flap perfusion. The results were compared with the clinical examination of flap perfusion. The study was conducted on a group of 38 female patients who underwent breast reconstruction.

    Full text to download in external service

  • Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model

    Publication

    - Year 2008

    The paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modelingof signals was introduced. the algorithm of the noise detection, based on discrete-time hidden Markow model (HMM)of whitened audio signal is elaborated

  • Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?

    Publication

    - Year 2022

    In this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

    Full text available to download

  • Using concentrated spectrogram for analysis of audio acoustic signals

    Publication

    The paper presents results of time-frequency analysis of audio acoustic signals using the method of Concentrated Spectrograph also known as ''Cross-spectral method'' or ''Reassignment method''. Presented algorithm involves signal's local group delay and channelized instantaneous frequency to relevantly redistribute all Short-time Fourier transform lines in time-frequency plain. The main intention of the paper is to compare various...

    Full text available to download

  • RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING

    Publication

    The paper presents a new approach to elimination of broadband noise and impulsive disturbances from archive audio recordings. The proposed adaptive Kalman-like algorithm, based on a sparse autoregressive model of the audio signal, simultaneously detects noise pulses, interpolates the irrevocably distorted samples and performs signal smoothing. It is shown that bidirectional (forward-backward) processing of the archive signal improves...

    Full text to download in external service

  • Active dynamic thermography method for TRAM flap blood perfusion mapping in breast reconstruction

    Publication

    - QIRT Journal - Year 2017

    This paper presents the new method of the transverse rectus abdominis musculocutaneous flap blood perfusion mapping based on the active dynamic thermography. The method is aimed at aiding a surgeon during breast reconstruction procedure. A pair of dTnorm and t90_10 parameters were used as parametric image descriptors of the flap blood perfusion. The method was tested on 38 patients that were subjected to breast reconstruction procedure....

    Full text available to download

  • Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances

    Publication

    Archive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...

    Full text to download in external service

  • Objectivization of phonological evaluation of speech elements by means of audio parametrization

    This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...

  • Quality Analysis of Audio-Video Transmission in an OFDM-Based Communication System

    Publication

    - Year 2022

    Application of a reliable audio-video communication system, brings many advantages. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. With the availability of visual information one can monitor the surrounding, working environment, etc. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission. Currently, orthogonal frequency...

    Full text to download in external service

  • Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study

    Publication

    This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

    Full text available to download

  • Seafloor relief reconstruction from side scan sonar data

    Publication

    Side scan sonar is one of the most widely used imaging systems in the underwater environment. It is relatively cheap and easy to deploy, in comparison with more powerful sensors. Although side scan sonar does not provide seafloor bathymetry directly, its records are directly related to seafloor images. In the paper, the method for 3D seafloor relief reconstruction from side scan sonar data is presented. The method is based on the...

    Full text available to download

  • Reconstruction of 3D structure of positive corona streamer by local methods

    Publication
    • M. Kocik
    • M. Tański
    • J. Mizeraczyk
    • R. Ichiki
    • S. Kanazawa
    • J. Dembski

    - Year 2009

    The computer algorithms were used for reconstruction of streamer 3D structure. We propose the 3D tree structure model of corona discharge streamer composed with nodes and edges between chosen couples of nodes, which enables easy computation of some important parameters ofstreamers. The 3D model can be derived directly from two projection images by global methods like evolutionary searching or particle simulations. In this paper...

    Full text to download in external service

  • Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.

    In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, audio signal is analyzed indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video , i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

    Full text to download in external service

  • Damage detection in plates based on Lamb wavefront shape reconstruction

    Publication

    - MEASUREMENT - Year 2021

    Many of the current studies in the area of damage detection using elastic wave propagation are based on deploying sensor networks with a large number of piezoelectric transducers to detect small-size cracks. A major limitation of these studies is that cracks are usually larger and have different shapes in real cases. Moreover, using a large number of sensing nodes for damage detection is both costly and computationally intensive....

    Full text available to download

  • An attempt to create a digital reconstruction of the Copper Ship = Próba cyfrowej rekonstrukcji kadłuba wraku Miedziowca

    Publication

    - Year 2014

    This study presents an attempt to create a digital reconstruction of the W-5 shipwreck (the Copper Ship) based on data acquired by 3D scanning of structural components held at the National Maritime Museum in Gdańsk and on a physical reconstruction model of the ship’s hull. A digital reconstruction would facilitate analysis of various possible options for the structural design of the hull, and would enable the preparation of a model for...

    Full text to download in external service

  • A study on of music features derived from audio recordings examples – a quantitative analysis

    Publication

    The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician, whose style evolved over time. Firstly, the origin and the background of the division of music genres were shortly presented. Then, several objective parameters of an audio signal were recalled that...

    Full text available to download

  • Gaze-tracking based audio-visual correlation analysis employing quality of experience methodology

    This paper investigates a new approach to audio-visual correlation assessment based on the gaze-tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT). The gaze-tracking methodology, having roots in Human-Computer Interaction borrows the relevance feedback through gaze-tracking and applies it to the new area of interests, which is Quality of Experience. Results of subjective...

    Full text to download in external service

  • Localization of impulsive disturbances in archive audio signals using predictive matched filtering

    Publication

    The problem of elimination of impulsive disturbances from archive audio signals is considered and its new solution, called predictive matched filtering, is proposed. The new approach is based on the observation that a large percentage of noise pulses corrupting archive audio recordings have highly repetitive shapes that match several typical “patterns”, called click templates. To localize noise pulses, click templates can be correlated...

    Full text to download in external service

  • Automatic Rhythm Retrieval from Musical Files

    Publication

    - Year 2008

    This paper presents a comparison of the effectiveness of two computational intelligence approaches applied to the task of retrieving rhythmic structure from musical files. The method proposed by the authors of this paper generates rhythmic levels first, and then uses these levels to compose rhythmic hypotheses. Three phases: creating periods, creating simplified hypotheses and creating full hypotheses are examined within this study....

    Full text to download in external service

  • Quality Evaluation of Novel DTD Algorithm Based on Audio Watermarking

    Publication

    Echo cancellers typically employ a doubletalk detection (DTD) algorithm in order to keep the adaptive filter from diverging in the presence of near-end speech signal or other disruptive sounds in the microphone signal. A novel doubletalk detection algorithm based on techniques similar to those used for audio signal watermarking was introduced by the authors. The application of the described DTD algorithm within acoustic echo cancellation...

    Full text to download in external service

  • Automatic Analysis of Trajectories of Moving Objects

    Publication

    Ongoing monitoring is essential to providing security and safety of maritime and air operations. This paper presents the research in the area of automatic analysis of movement of unrestricted vehicles like ships and air-planes. The analysis is aimed at extraction of trajectory information, and the results can be used to identify anomalous behaviour in archived and real-time data. In this paper we focus on data acquired using the...

    Full text available to download

  • Automatic Classification of Polish Sign Language Words

    In the article we present the approach to automatic recognition of hand gestures using eGlove device. We present the research results of the system for detection and classification of static and dynamic words of Polish language. The results indicate the usage of eGlove allows to gain good recognition quality that additionally can be improved using additional data sources such as RGB cameras.

    Full text available to download