Search results for: audio processing objects - Bridge of Knowledge

Search

Search results for: audio processing objects

Search results for: audio processing objects

  • Intelligent video and audio applications for learning enhancement

    The role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....

    Full text available to download

  • SYNAT_MUSIC_GENRE_FV_173

    Open Research Data

    This is the original dataset containing 51582 music tracks (22 music genres) and 173 element-feature vector [1-6,9]. A collection of more than 50000 music excerpts described with a set of descriptors obtained through the analysis of 30-second mp3 recordings was gathered in a database called SYNAT. The SYNAT database was realized by the Gdansk University...

  • Detection of impulsive disturbances in archive audio signals

    Publication

    In this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...

    Full text available to download

  • Localization and identyfication of ferromagnetic objects

    Publication

    - Year 2008

    A compact ferromagnetic object placed in the earthly magnetic field causes disturbance of this field. This disturbance is associated with magnetization of the object. Ferromagnetic objects have induced and can also have permanent magnetization. In methods of locating and identifying ferromagnetic objects usually is using the model of the dipol moment. Determination of the position and values of the extremes of the magnetic field...

  • Exploiting audio-visual correlation by means of gaze tracking

    This paper presents a novel means for increasing audio-visual correlation analysis reliability. This is done based on gaze tracking technology engineered at the Multimedia Systems Department of the Gdansk University of Technology, Poland. In the paper, the past history and current research in the area of audio-visual perception analysis are shortly reviewed. Then the methodology employing gaze tracking is presented along with the...

    Full text available to download

  • Image Processing in Robotics (2021/2022)

    e-Learning Courses
    • P. Chudziak

    For ISD M.Sc. (II degr.) 2 sem. Participants are to learn image processing algorithms related to transformation, filtration, feature detection (image descriptors), image processing algorithms in robotic industrial systems.

  • Elimination of impulsive disturbances from stereo audio recordings

    Publication

    This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. On-line tracking of signal model parameters is performed using the stability-preserving Whittle-Wiggins-Robinson algorithm with exponential data weighting. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples...

    Full text to download in external service

  • CHEMICAL ENGINEERING AND PROCESSING

    Journals

    ISSN: 0255-2701 , eISSN: 1873-3204

  • Surveillance camera tracking of GEO positioned objects

    Rozdział opisuje system sterowania kamerami ruchomymi PTZ realizujący śledzenie poruszającego się obiektu o znanej pozycji GPS. Przedstawione są idea systemu oraz możliwości jego wykorzystania. Opisane są: procedura kalibracji pola widzenia kamery i sposób powiązania z danymi o lokalizacji, procedura predykcji ruchu w celu kompensacji opóźnień czasowych. Omówiony jest zaimplementowany system modułowy, w którego skład wchodzą: terminale...

    Full text to download in external service

  • Digital Audio Broadcasting or Webcasting: A Network Quality Perspective

    In recent years, many alternative technologies of delivering audio content have emerged, with different advantages and disadvantages. In this paper pros and cons of digital audio broadcasting and webcasting transmission techniques in a network quality perspective are described. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...

    Full text available to download

  • Layered background modeling for automatic detection of unattended objects in camera images

    Publication

    - Year 2011

    An algorithm for automatic detection of unattended objects in video camera images is presented. First, background subtraction is performed, using an approach based on the codebook method. Results of the detection are then processed by assigning the background pixels to time slots, based on the codeword age. Using this data, moving objects detected during a chosen period may be extracted from the background model. The proposed approach...

    Full text to download in external service

  • System do prototypowania bezprzewodowych inteligentnych urządzeń monitoringu audio-video

    Publication

    - Year 2013

    W komunikacie przedstawiono system prototypowania bezprzewodowych urządzeń do monitoringu audio-video. System bazuje na układach FPGA Virtex6 i wielu dodatkowych wspierających urządzeniach jak: szybka pamięć DDR3, mała kamera HD, mikrofon z konwerterem A/C, moduł radiowy WiFi, itp. Funkcjonalność systemu została szczegółowo opisana w komunikacie. System został zoptymalizowany do pracy pod kontrolą systemu operacyjnego Linux, zostały...

  • Testing Watermark Robustness against Application of Audio Restoration Algorithms

    Publication

    The purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to audio restoration algorithm performing. Several restoration routines such as noise reduction, spectrum expansion, clipping or clicks reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...

    Full text to download in external service

  • Designing everydayness: 4 objects, place, atmosphere / Elective design II

    e-Learning Courses
    • M. Malewczyk
    • J. Borucka

    Students will become familiar with the principal theories within the philosophical trend called Everyday Aesthetics.They will learn how objects and places acquire an aesthetic value and produce aesthetic experience and how to create an atmosphere by stimulating all the senses.

  • A double-talk detector using audio watermarking

    a novel approach to double-talk detection in the acoustic echo canceler is proposed. a hidden signature is embedded into the arriving signal, using the echo-hiding method. next detection of the presence of this signature in the microphone signal is performed. the results of the signature detection may be used by the acoustic echo canceler to stop or restart the adaptation process.

    Full text to download in external service

  • Physicochemical Problems of Mineral Processing

    Journals

    ISSN: 1643-1049 , eISSN: 2084-4735

  • Automatic audio signal mixing system based on one-dimensional Wave-U-Net autoencoders

    Publication

    - Year 2023

    The purpose of this dissertation is to develop an automatic song mixing system that is capable of automatically mixing a song with good quality in any music genre. This work recalls first the audio signal processing methods used in audio mixing, and it describes selected methods for automatic audio mixing. Then, a novel architecture built based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. Models...

    Full text available to download

  • Reconstruction Methods for 3D Underwater Objects Using Point Cloud Data

    Publication

    Existing methods for visualizing underwater objects in three dimensions are usually based on displaying the imaged objects either as unorganised point sets or in the form of edges connecting the points in a trivial way. To allow the researcher to recognise more details and characteristic features of an investigated object, the visualization quality may be improved by transforming the unordered point clouds into higher order structures....

    Full text available to download

  • Exploring contexts of use of cultural objects in virtual museums

    Publication

    - Year 2008

    This paper presents a system which facilitates discovering knowledge about cultural objects. The system is based on semantic modeling of a virtual museum which consists of cultural objects placed in a virtual 3D space. The article describes an extension to the concept of cultural objects which includes information on the use of these objects. This extension enables to place objects in an appropriate context in a virtual museum....

  • Automatic audio-visual threat detection

    Publication

    - Year 2010

    The concept, practical realization and application of a system for detection and classification of hazardous situations based on multimodal sound and vision analysis are presented. The device consists of new kind multichannel miniature sound intensity sensors, digital Pan Tilt Zoom and fixed cameras and a bundle of signal processing algorithms. The simultaneous analysis of multimodal signals can significantly improve the accuracy...

  • Objectivization of Audio-Visual Correlation analysis

    Publication

    Simultaneous perception of audio and visual stimuli often causes the concealment or misrepresentation of information actually contained in these stimuli. Such effects are called the ''image proximity effect'' or the ''ventriloquism effect'' in literature. Until recently, most research carried out to understand their nature was based on subjective assessments. The Authors of this paper propose a methodology based on both subjective...

    Full text available to download

  • Pose-Configurable Generic Tracking of Elongated Objects

    Publication

    - Year 2013

    Elongated objects have various shapes and can shift, rotate, change scale, and be rigid or deform by flexing, articulating, and vibrating, with examples as varied as a glass bottle, a robotic arm, a surgical suture, a finger pair, a tram, and a guitar string. This generally makes tracking of poses of elongated objects very challenging. We describe a unified, configurable framework for tracking the pose of elongated objects, which...

  • Integrated acoustical-optical system for inventory of hydrotechnical objects

    Publication

    - HYDROACOUSTICS - Year 2017

    The knowledge of the location, shape and other characteristics of spatial objects in the coastal areas has a significant impact on the functioning of ports, shipyards, and other waterinfrastructure facilities, both offshore and inland. Therefore, measurements of the underwater part of the waterside zone are taken, which means the bottom of the water and other underwater objects (e.g. breakwaters, docks, etc.), and objects above...

    Full text available to download

  • SYNAT Music Genre Parameters PCA 19

    Open Research Data

    The dataset contains feature vector after  Principal Component Analysis (PCA) performing, so there are 11 music genres and 19-element vector derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of 52532 music excerpts described...

  • SYNAT_PCA_48

    Open Research Data

    There is a series of datasets containing feature vectors derived from music tracks. The dataset contains 51582 music tracks (22 music genres) and feature vector after  Principal Component Analysis (PCA) performing, so there are 48-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier...

  • SYNAT_PCA_11

    Open Research Data

    The dataset contains 51582 music tracks (22 music genres) and feature vector after  Principal Component Analysis (PCA) performing, so there are 11-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of more than...

  • MECHANICAL SYSTEMS AND SIGNAL PROCESSING

    Journals

    ISSN: 0888-3270

  • SIGNAL PROCESSING

    Journals

    ISSN: 0165-1684 , eISSN: 1872-7557

  • Detection of moving objects in images combined from video and thermal cameras

    Publication

    - Year 2013

    An algorithm for detection of moving objects in video streams from the monitoring cameras is presented. A system composed of a standard video camera and a thermal camera, mounted in close proximity to each other, is used for object detection. First, a background subtraction is performed in both video streams separately, using the popular Gaussian Mixture Models method. For the next processing stage, the authors propose an algorithm...

    Full text to download in external service

  • New semi-causal and noncausal techniques for detection of impulsive disturbances in multivariate signals with audio applications

    This paper deals with the problem of localization of impulsive disturbances in nonstationary multivariate signals. Both unidirectional and bidirectional (noncausal) detection schemes are proposed. It is shown that the strengthened pulse detection rule, which combines analysis of one-step-ahead signal prediction errors with critical evaluation of leave-one-out signal interpolation errors, allows one to noticeably improve detection results...

    Full text available to download

  • Analysis of degaussing process of ferromagnetic objects

    Results of the analytical and numerical analysis of the degaussing process phenomena of ferromagnetic objects were presented in this paper. The screening effectiveness of the electromagnetic field of magnetic screens in most cases depends on thickness, conductivity, magnetic permeability of the screen and angular frequency of degaussing currents. The magnetic field inside thin-layer ferromagnetic object was presented in this paper....

  • Signatures and acoustic images of objects moving in water

    Publication

    Observation of underwater space is part of a generaltrend, which primary purpose is to protect and increasesafety in the selected area. The basic aim of the paper ispresentation of designated acoustic characteristics typicalfor objects moving on the water surface and under water,which represent some knowledge about detection of theseobjects. Create a catalog of acoustic signature and not onlyacoustic, as well as acoustic images...

  • Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture

    The aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed,...

    Full text available to download

  • Implementation of localization and identification of ferromagnetic objects algorithm in labview enviroment

    The problem with detecting dangerous objects is still a matter of concern today. One of the methods of detecting dangerous objects is the magnetic method. While measuring a magnetic field in the surrounding of objects with ferromagnetic properties, it is possible to detect, localize and identify such object.

  • Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization

    Publication

    - Year 2017

    An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...

  • Classifying type of vehicles on the basis of data extracted from audio signal characteristics

    The aim of this study is to find and optimize a feature vector for an automatic recognition of the type of vehicles, extracted form an audio signal. First, the influence of weather-based conditions of road surface on spectral characteristic of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...

    Full text to download in external service

  • Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model

    Publication

    - Year 2008

    The paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modelingof signals was introduced. the algorithm of the noise detection, based on discrete-time hidden Markow model (HMM)of whitened audio signal is elaborated

  • Using concentrated spectrogram for analysis of audio acoustic signals

    Publication

    The paper presents results of time-frequency analysis of audio acoustic signals using the method of Concentrated Spectrograph also known as ''Cross-spectral method'' or ''Reassignment method''. Presented algorithm involves signal's local group delay and channelized instantaneous frequency to relevantly redistribute all Short-time Fourier transform lines in time-frequency plain. The main intention of the paper is to compare various...

    Full text available to download

  • Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online

    Publication

    - Year 2023

    Radio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...

    Full text to download in external service

  • Objectivization of phonological evaluation of speech elements by means of audio parametrization

    This study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...

  • Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances

    Publication

    Archive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...

    Full text to download in external service

  • Quality Analysis of Audio-Video Transmission in an OFDM-Based Communication System

    Publication

    - Year 2022

    Application of a reliable audio-video communication system, brings many advantages. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. With the availability of visual information one can monitor the surrounding, working environment, etc. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission. Currently, orthogonal frequency...

    Full text to download in external service

  • Data visualization of marine objects on digital maps

    The paper presents the implementation of two multithreaded applications for data visualization of marine objects written in C#, designed to run on operator consoles with 32-bit or 64-bit Windows 7 OS. The article describes the most important functionality and features of the developed C# .NET user controls for data visualization on digital maps and in the configurable tables.

    Full text to download in external service

  • Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions - Headphones vs Loudspeakers: A Case Study

    Publication

    This study investigates listeners’ perceptual responses in audio-visual interactions concerning binaural spatial audio. Audio stimuli are coupled with or without visual cues to the listeners. The subjective test participants are tasked to indicate the direction of the incoming sound while listening to the audio stimulus via loudspeakers or headphones with the head-related transfer function (HRTF) plugin. First, the methodology...

    Full text available to download

  • Elimination of impulsive disturbances from archive audio files – comparison of three noise pulse detection schemes

    Publication

    The problem of elimination of impulsive disturbances (such as clicks, pops, ticks, crackles, and record scratches) from archive audio recordings is considered and solved using autoregressive modeling. Three classical noise pulse detection schemes are examined and compared: the approach based on open-loop multi-step-ahead signal prediction, the approach based on decision-feedback signal prediction, and the double threshold approach,...

    Full text to download in external service

  • Music Data Processing and Mining in Large Databases for Active Media

    Publication

    - Year 2014

    The aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...

    Full text to download in external service

  • Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.

    In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, audio signal is analyzed indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video , i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...

    Full text to download in external service

  • A study on of music features derived from audio recordings examples – a quantitative analysis

    Publication

    The paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician, whose style evolved over time. Firstly, the origin and the background of the division of music genres were shortly presented. Then, several objective parameters of an audio signal were recalled that...

    Full text available to download

  • Searching of the buried objects in the sea bottom by means of noninvasive methods

    Publication

    - Year 2012

    Searching of objects on the seabed or under its surface currently is a challenge for a number of researchers interested in the sea bottom. The problem relates to the objects on the depths of up to several tens of meters from the surface of the seabed. Finding the objects is the subject of interest for a wide group of users starting from archaeologists, and ending on groups interested in marine safety, as well as in military application...

  • Tracing of dynamic objects in distributed interactive simulation systems

    Publication

    - Year 2003

    Distributed interactive simulation systems require integration of several areas of computer science and applied mathematics to enable each individual simulation object to visualize effectively dynamic states of other objects. Objects are unpredictable,i.e., controlled by their local operators, and are remote, i.e., must rely on some transmission media to visualize dynamic scene from their local perspectives. The paper...

    Full text available to download