displaying 1000 best results Help
Search results for: audio-visual correlation
-
In uence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classi cation
PublicationWe present a comprehensive evaluation of the infuence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classi cation. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...
-
Network and Operating System Support for Digital Audio and Video (Network and OS Support for Digital A/V)
Conferences -
Edyta Urwanowicz dr sztuki
People -
Online sound restoration system for digital library applications.
PublicationAudio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...
-
Multimodal Attention Stimulator
PublicationMultimodal attention stimulator was proposed and tested for improving auditory and visual attention, including pupils with developmental dyslexia. Results of the conducted experiments shown that the designed stimulator can be used in order to improve comprehension during reading tasks. The changes in the visual attention, observed in reading test results, translate into the overall reading performance.
-
Reduction of parasitic pitch variations in archival musical recordings
PublicationA new method for reducing parasitic pitch variations in archival audio recordings is presented. The method is intended for analyzing movie soundtracks recorded in optical films. It utilizes image processing for calculating and reducing effects of tape shrinkage being one of the main reasons for parasitic pitch variations in audio accompanying moving images. As long as the film tape characteristics are known the new method can be...
-
Fitting the mobile device characteristics to the user's hearing preferences
PublicationA method for fitting the mobile computer audio characteristics to the user's hearing preferences is proposed. The process consists of two stages: calibration and dynamics processing. During the calibration phase the user performs a loudness scaling test giving their response regarding the perceived loudness. The dynamics processing made on above basis sets the loudness to the most comfortable level. The processing accounts both...
-
Data, Information, Knowledge, Wisdom Pyramid Concept Revisited in the Context of Deep Learning
PublicationIn this paper, the data, information, knowledge, and wisdom (DIKW) pyramid is revisited in the context of deep learning applied to machine learningbased audio signal processing. A discussion on the DIKW schema is carried out, resulting in a proposal that may supplement the original concept. Parallels between DIWK pertaining to audio processing are presented based on examples of the case studies performed by the author and her collaborators....
-
1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
PublicationA network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence....
-
IEEE Symposium on Visual Languages and Human-Centric Computing (was VL)
Conferences -
Art Composition
e-Learning CoursesPerson in charge: prof. Krzysztof Wróblewski, Department of Visual Arts Teacher: mgr Patryk Różycki, Department of Visual Arts Five Words. Society and Politics. What? By What? General assumptions. The aim of the proposed two artistic compositions is a creative processing of emotions related to the socio-political issues. In general, it is about personal views and feelings, but it must be also considered that architects are...
-
Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...
-
Joint fingerprinting and decryption method for color images based on quaternion rotation with cipher quaternion chaining
PublicationThis paper addresses the problem of unauthorized redistribution of multimedia content by malicious users (pirates). In this method three color channels of the image are considered a 3D space and each component of the image is represented as a point in this 3D space. The distribution side uses a symmetric cipher to encrypt perceptually essential components of the image with the encryption key and then sends the encrypted data via...
-
Classification of Music Genres by Means of Listening Tests and Decision Algorithms
PublicationThe paper compares the results of audio excerpt assignment to a music genre obtained in listening tests and classification by means of decision algorithms. A short review on music description employing music styles and genres is given. Then, assumptions of listening tests to be carried out along with an online survey for assigning audio samples to selected music genres are presented. A framework for music parametrization is created...
-
Music genre classification applied to bass enhancement for mobile technology
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm is related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt. The classification of music genres is automatically executed employing MPEG 7 parameters and the Principal Component Analysis method applied to reduce information...
-
Machine learning applied to acoustic-based road traffic monitoring
PublicationThe motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...
-
Machine learning applied to acoustic-based road traffic monitoring
PublicationThe motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...
-
Lighting conditions in Home Office and occupant’s perception: an international study
PublicationThe global pandemic and physical distancing restrictions are forcing us to rethink how residential buildings are used regarding the visual environment. This paper describes home office lighting conditions within different countries and continents. The aim is to define the current limitations of home offices in providing a resilient visual environment. The work was developed by a team of international experts working together on...
-
Music Data Processing and Mining in Large Databases for Active Media
PublicationThe aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Zaawansowane Przetwarzanie Sygnału
e-Learning CoursesPrzedmiot prezentuje wybrane metody przetwarzania sygnałów w bardzo szerokim obszarze zastosowań. Ilustruje najnowsze osiągnięcia w tym zakresie, wsparte wybranymi publikacjami. Zajęcia są podzielone na wykład (15 h) i seminarium (15 h). Podstawowe pojęcia dotyczące cyfrowego przetwarzania sygnałów, zalecana literatura Analiza widmowa gęstość widmowa mocy, widmo falkowe, polispektra i gęstość widmowa mocy skrośnej Efekty...
-
Further Developments of the Online Sound Restoration System for Digital Library Applications
PublicationNew signal processing algorithms were introduced to the online service for audio restoration available at the web address: www.youarchive.net. Missing or distorted audio samples are estimated using a specific implementation of the Jannsen interpolation method. The algorithm is based on the autoregressive model (AR) combined with the iterative complementation of signal samples. Since the interpolation algorithm is computationally...
-
An Approach to Bass Enhancement in Portable Computers Employing Smart Virtual Bass Synthesis Algorithms
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The developed algorithms are related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt and to the type of a portable device in use. To find optimum synthesis parameters of the VBS algorithms, subjective listening tests based on a parametric procedure...
-
Sparse autoregressive modeling
PublicationIn the paper the comparison of the popular pitch determination (PD) algorithms for thepurpose of elimination of clicks from archive audio signals using sparse autoregressive (SAR)modeling is presented. The SAR signal representation has been widely used in code-excitedlinear prediction (CELP) systems. The appropriate construction of the SAR model is requiredto guarantee model stability. For this reason the signal representation...
-
Vocalic Segments Classification Assisted by Mouth Motion Capture
PublicationVisual features convey important information for automatic speech recognition (ASR), especially in noisy environment. The purpose of this study is to evaluate to what extent visual data (i.e. lip reading) can enhance recognition accuracy in the multi-modal approach. For that purpose motion capture markers were placed on speakers' faces to obtain lips tracking data during speaking. Different parameterizations strategies were tested...
-
Innovative method of localization airplanes in VCS (VCS-MLAT) distributed system
PublicationThe article presents the concept and the structure of the localization module. The prototype module is the part of the VCS (VCS-MLAT) localization distributed system. The device receives the audio signal transmitted in airplanes band (118 MHz – 136 MHz). Received data with the timestamps are send to the main server. The data from multiple devices estimates the localization of the airplane. The main aim of the project is the analysis...
-
Smart Modeling of Maritime Vessels
PublicationCurrently, the market offers many visualization tools available to graphic designers, engineers, managers and academics working on maritime environments. The practice of visualization involves making and manipulating images that convey novel phenomena and ideas. Visual communication, together with virtual reality environments, is an emerging and rapidly evolving discipline. It brings great advantage over written word or voice alone,...
-
Subjective and Objective Comparative Study of DAB+ Broadcast System
PublicationBroadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...
-
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
PublicationThis paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...
-
Architectural project VI 2021/2022 _Erasmus _JB
e-Learning CoursesThe design task is to develop a draft revitalisation plan and a conceptual proposal for a complex of architectural objects for an area in Dolne Miasto, Gdańsk. The proposal should include spatial, social and environmental solutions to restore the attractiveness of a historical district and enhance its visual coherence and functionality. Moreover, the aim is to develop detailed architectural and functional solutions for a selected...
-
Light formed through urban morphology and different organism groups: First findings from a systematic review
PublicationThe prevailing implementation and usage of contemporary lighting technologies and design practices in cities have created over-illuminated built environments. Recent studies indicate that exposure to electric lighting effects formed through spatial characteristics has visual, physiological, and behavioural effects on both humans and non-humans, such as wildlife. In order to gain a better understanding of the impact that electric...
-
Public spaces connecting cities. Green and Blue Infrastructures potential.
PublicationA city fragmentation causes a lot of negative effects in urban environment such as: disconnecting the environmental, functional and compositional relations, a loss of urban compactness, chaotic development, visual chaos, a domination of technical landscape, reduction of security. This is why one of main challenges for urban planners is to connect the fragmented structures by creating friendly, attractive and safe public space....
-
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
PublicationIn this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...
-
Remote Estimation of Video-Based Vital Signs in Emotion Invocation Studies
PublicationAbstract— The goal of this study is to examine the influence of various imitated and video invoked emotions on the vital signs (respiratory and pulse rates). We also perform an analysis of the possibility to extract signals from sequences acquired with cost-effective cameras. The preliminary results show that the respiratory rate allows for better separation of some emotions than the pulse rate, yet this relation highly depends...
-
Potential energy curves of LiCs dimer
Open Research DataThis data presents potential energy curves of LiCs dimer in Hund's case (a). Calculated using Born-Oppenheimer approximation with scalar relativistic effects are included via large effective core potentials. Custom basis sets, core polarization potentials and MRCI method are used to accurately describe electron correlation. Dataset consists of 22 potential...
-
Potential energy curves of NaRb dimer
Open Research DataThis data presents potential energy curves of NaRb dimer in Hund's case (a). Calculated using Born-Oppenheimer approximation with scalar relativistic effects are included via large effective core potentials. Core polarization potentials and MRCI method is used to describe electron correlation. Dataset consists of 18 potential energy curves of ground...
-
Preferences of the Facade Composition in the Context of Its Regularity and Irregularity
PublicationAbstract: The aim of this study is to determine the preferences of Polish society towards building facades depending on the degree of the composition regularity of the facade elements. The subject matter is inspired by the authors’ observations in relation to the current architectural trends. The purposefulness of the conducted research results from several issues. Firstly, the reports of psychology and neurosciences clearly indicate...
-
Sound engineering as our commitment to its creators in Poland
PublicationSound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other ones constituting multimedia technology. Moreover, machine learning methods applied to the topics...
-
Novel Fault Identification for Electromechanical Systems via Spectral Technique and Electrical Data Processing
PublicationIt is proposed, developed, investigated, and validated by experiments and modelling for the first time in worldwide terms new data processing technologies, higher order spectral multiple correlation technologies for fault identification for electromechanical systems via electrical data processing. Investigation of the higher order spectral triple correlation technology via modelling has shown that the proposed data processing technology...
-
A Meta-Analysis of Pulse Arrival Time Based Blood Pressure Estimation
PublicationThe paper presents a preliminary meta-analysis of the sample correlation between pulse arrival time (PAT) and blood pressure (BP). The aim of the study was to verify sample correlation coefficient between PAT and BP using an affine model BP = a · P AT + b for systolic and diastolic blood pressure. The databases included in the search were the IEEE Xplore Digital Library, Springer Link and Google Scholar. Only papers from 2005 to...
-
VSC converters control for offshore wind farms HVDC grid connection
PublicationThe paper proposes a voltage sourced converter (VSC) new control method. A well-known in the electric power systems correlation between voltage angle and active power and correlation between voltage and reactive power is used instead of feedforward control. This allows for a fast and almost independent control of active and reactive power flow.
-
Support for argument structures review and assessment
PublicationArgument structures are commonly used to develop and present cases for safety, security and for other properties of systems. Such structures tend to grow excessively, which causes problems with their review and assessment. Two issues are of particular interest: (1) systematic and explicit assessment of the compelling power of an argument, and (2) communication of the result of such an assessment to relevant recipients. The paper...
-
The ab initio and experimental study of the spectroscopic and magnetic properties of Ho(III)-EDTA
Open Research DataIn this dataset, the ab initio calculations of the electronic structure and the magnetic properties are discussed in the context of the experimental data for the Ho–EDTA complex. In the calculations different models of the cluster have been applied to examine the influence of various parts of the environment of the Ho(III)-EDTA complex on its properties....
-
Full CI ground state potential energy curves and one-electron relativistic corrections for hydrogen molecule in various basis sets
Open Research DataThis dataset consists of Full CI ground state Born-Oppenheimer potential energy curves and one-electron relativistic corrections for hydrogen dimer. Nonrelativistic energies, as well as one electron relativistic corrections (treated perturbatively with help of the Cowan-Griffin Hamiltonian) are presented for internuclear distances between 0.8 and 10...
-
Antiviral activity of bee bread derived from polish apiaries.
Open Research DataBee bread is a product of fermentation of bee-collected pollen and revealed a high nutritional value. Other bee products, such as honey and propolis, are known for their antiviral activity, but bee bread is still under investigation, thus its antiviral potential is still unspecified. For investigation antiviral activity of bee bread samples, cytotoxicity...
-
Ljung-Box test values of selected companies of the Warsaw Stock Exchange
Open Research DataThe following dataset includes the Warsaw Stock Exchange market analysis using the Ljung-Box test. Partial autocorrelations up to the 5th order were analyzed, because it will allow to observe the relationship within one week of stock exchange quotations. In the case of the WIG index, the 1st and 2nd order correlation turned out to be statistically significant....
-
Analysis of Vibration and Acoustic Signals for Noncontact Measurement of Engine Rotation Speed
PublicationThe non-contact measurement of engine speed can be realized by analyzing engine vibration frequency. However, the vibration signal is distorted by harmonics and noise in the measurement. This paper presents a novel method for the measurement of engine rotation speed by using the cross-correlation of vibration and acoustic signals. This method can enhance the same frequency components in engine vibration and acoustic signal. After...
-
Information management enhancement with simulation: case studies.
PublicationW rozdziale omówiono rolę symulacji w procesach zarządzania wiedzą i informacją, wskazując na jej znaczenie jako podstawowego narzędzia w tym obszarze.Przybliżono wybraną platformę symulacyjną opartą na systemie Visual SLAM.
-
Determination of time dependence of coated metal electrical and electrochemical parameters during exposure using principal component analysis
PublicationThe use of the principal component analysis (PCA) permits the complex and quantitative analysis of the time dependence of electrical and electrochemical parameters of coated metal obtained by fitting impedance data. So far, changes in electrical and electrochemical parameters during exposure were analyzed independently. In this way, some of the information contained in the relationship between changes in parameters over time are...
-
Free Convection Heat Transfer from Horizontal Cylinders
PublicationThe results of experimental investigation of free convection heat transfer in a rectangular container are presented. The ability of the commonly accepted correlation equations to reproduce present experimental data was tested as well. It was assumed that the examined geometry fulfils the requirement of no-interaction between heated cylinder and bounded surfaces. In order to check this assumption recently published correlation equations...
-
Effect of Storage Conditions of Rutile Flux Cored Welding Wires on Properties of Welds
PublicationThe influence of storage locations of two grades of rutile flux cored welding wires on their surface condition and the strength of the welds made with them were studied. Wires were stored in real urban conditions (Gdańsk and Katowice) for 1 month, simultaneously recording changes in conditions: temperature and relative humidity of the environment. Visual tests of wires in the delivered and stored condition as well as visual and...