displaying 1000 best results Help
Search results for: AUDIO-VISUAL SIGNALS
-
Examining Acoustic Emission of Engineered Ultrasound Loudspeakers
PublicationMeasurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides the realistic reproduction of...
-
Measurements and Simulations of Engineered Ultrasound Loudspeakers
PublicationSimulation and measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides realistic reproduction...
-
Quality Aspects in Digital Broadcasting and Webcasting Systems: Bitrate versus Loudness
PublicationIn this paper the quality aspects of bitrate and loudness in digital broadcasting and webcasting systems are examined. The authors discuss a survey concerning user preferences related with processing and managing audio content. The coding efficiency of a popular audio format is analyzed in the context of storing media. An objective study on a representative group of signal samples, as well as a subjective study of the perceived...
-
Intelligent multimedia solutions supporting special education needs.
PublicationThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
Adam Kupryjanow mgr inż.
People -
Online sound restoration system for digital library applications
PublicationAudio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...
-
IEEE International Conference on Visual Communications and Image Processing
Conferences -
Pan-Sydney Area Workshop on Visual Information Processing
Conferences -
Wow defect reduction based on interpolation techniques
PublicationW referacie przedstawiono wyniki badania różnych technik interpolacji wykorzystanych w redukcji kołysania dźwięku. W badaniach użyto: interpolację liniową, dwie techniki interpolacji wielomianowej (Hermite i spline), i technikę sumowania okienkowanych funkcji sink. Jakość rekonstrukcji wykonano wykorzystując sztucznie spreparowany sygnał audio, rekonstruowany wymienionymi metodami interpolacji. Jakość rekonstrukcji oceniono wykorzystując...
-
Creating a Realible Music Discovery and Recomendation System
PublicationThe aim of this paper is to show problems related to creating a reliable music dis-covery system. The SYNAT database that contains audio files is used for the purpose of experiments. The files are divided into 22 classes corresponding to music genres with different cardinality. Of utmost importance for a reliable music recommendation system are the assignment of audio files to their appropriate gen-res and optimum parameterization...
-
Transmitting Alarm Information in DAB+ Broadcasting System
PublicationThe main goal of digital broadcasting is to deliver high-quality content with the lowest possible bitrate. This paper is focused on transmitting alarm information, such as emergency warning and alerting, in the DAB+ (Digital Audio Broadcasting plus) broadcasting system. These additional services should be available at the lowest possible bitrate, in order to provide a clear and understandable voice message to people. Furthermore, additional...
-
Edyta Urwanowicz dr sztuki
People -
Multimodal Attention Stimulator
PublicationMultimodal attention stimulator was proposed and tested for improving auditory and visual attention, including pupils with developmental dyslexia. Results of the conducted experiments shown that the designed stimulator can be used in order to improve comprehension during reading tasks. The changes in the visual attention, observed in reading test results, translate into the overall reading performance.
-
Network and Operating System Support for Digital Audio and Video (Network and OS Support for Digital A/V)
Conferences -
Online sound restoration system for digital library applications.
PublicationAudio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...
-
Fitting the mobile device characteristics to the user's hearing preferences
PublicationA method for fitting the mobile computer audio characteristics to the user's hearing preferences is proposed. The process consists of two stages: calibration and dynamics processing. During the calibration phase the user performs a loudness scaling test giving their response regarding the perceived loudness. The dynamics processing made on above basis sets the loudness to the most comfortable level. The processing accounts both...
-
Reduction of parasitic pitch variations in archival musical recordings
PublicationA new method for reducing parasitic pitch variations in archival audio recordings is presented. The method is intended for analyzing movie soundtracks recorded in optical films. It utilizes image processing for calculating and reducing effects of tape shrinkage being one of the main reasons for parasitic pitch variations in audio accompanying moving images. As long as the film tape characteristics are known the new method can be...
-
Data, Information, Knowledge, Wisdom Pyramid Concept Revisited in the Context of Deep Learning
PublicationIn this paper, the data, information, knowledge, and wisdom (DIKW) pyramid is revisited in the context of deep learning applied to machine learningbased audio signal processing. A discussion on the DIKW schema is carried out, resulting in a proposal that may supplement the original concept. Parallels between DIWK pertaining to audio processing are presented based on examples of the case studies performed by the author and her collaborators....
-
Postprodukcja nagrania wideo z dzwiekiem dookolnym
PublicationOne of the aims of this paper is to present issues related to audio-video correlation. This is presented on the basis of a short film realization employing surround microphone techniques. First, some related works in the domain of sound and vision correlation are presented. Then assumptions concerning scene creation related to both audio and video are shortly described. Another objective is to discuss results of subjective tests...
-
1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
PublicationA network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence....
-
IEEE Symposium on Visual Languages and Human-Centric Computing (was VL)
Conferences -
Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...
-
Joint fingerprinting and decryption method for color images based on quaternion rotation with cipher quaternion chaining
PublicationThis paper addresses the problem of unauthorized redistribution of multimedia content by malicious users (pirates). In this method three color channels of the image are considered a 3D space and each component of the image is represented as a point in this 3D space. The distribution side uses a symmetric cipher to encrypt perceptually essential components of the image with the encryption key and then sends the encrypted data via...
-
Classification of Music Genres by Means of Listening Tests and Decision Algorithms
PublicationThe paper compares the results of audio excerpt assignment to a music genre obtained in listening tests and classification by means of decision algorithms. A short review on music description employing music styles and genres is given. Then, assumptions of listening tests to be carried out along with an online survey for assigning audio samples to selected music genres are presented. A framework for music parametrization is created...
-
Art Composition
e-Learning CoursesPerson in charge: prof. Krzysztof Wróblewski, Department of Visual Arts Teacher: mgr Patryk Różycki, Department of Visual Arts Five Words. Society and Politics. What? By What? General assumptions. The aim of the proposed two artistic compositions is a creative processing of emotions related to the socio-political issues. In general, it is about personal views and feelings, but it must be also considered that architects are...
-
Music genre classification applied to bass enhancement for mobile technology
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm is related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt. The classification of music genres is automatically executed employing MPEG 7 parameters and the Principal Component Analysis method applied to reduce information...
-
Machine learning applied to acoustic-based road traffic monitoring
PublicationThe motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...
-
Machine learning applied to acoustic-based road traffic monitoring
PublicationThe motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...
-
Zaawansowane Przetwarzanie Sygnału
e-Learning CoursesPrzedmiot prezentuje wybrane metody przetwarzania sygnałów w bardzo szerokim obszarze zastosowań. Ilustruje najnowsze osiągnięcia w tym zakresie, wsparte wybranymi publikacjami. Zajęcia są podzielone na wykład (15 h) i seminarium (15 h). Podstawowe pojęcia dotyczące cyfrowego przetwarzania sygnałów, zalecana literatura Analiza widmowa gęstość widmowa mocy, widmo falkowe, polispektra i gęstość widmowa mocy skrośnej Efekty...
-
Lighting conditions in Home Office and occupant’s perception: an international study
PublicationThe global pandemic and physical distancing restrictions are forcing us to rethink how residential buildings are used regarding the visual environment. This paper describes home office lighting conditions within different countries and continents. The aim is to define the current limitations of home offices in providing a resilient visual environment. The work was developed by a team of international experts working together on...
-
Digital microcontroller for sonar waveform generator
PublicationGenerating sounding signals is essential for the operation of active sonar. The system should be highly reliable. This can be achieved through architecture, communication between the devices, and a well-designed and self–testing software. The system presented in the article is responsible for the generation of hydroacoustic sounding signals, and ensures proper interaction between power amplifiers and power supplies. Thanks to its...
-
Surface EMG-based signal acquisition for decoding hand movements
Open Research DataBiosignal processing plays a crucial role in modern hand prosthetics. The challenge is to restore functionality of a lost limb based on the signals acquired from the surface of the stump. The number of sensors (emg channels) used for signal acquisition influence the quality of a prosthetic hand. Modern algorithms (including neural networks) can significantly...
-
Music Data Processing and Mining in Large Databases for Active Media
PublicationThe aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...
-
Multistatyczny, Dopplerowski System określania położenia i prędkości ruchomych celów w wodzie
PublicationW omawianym w pracy multistatycznym, dopplerowskim systemie określania położenia i prędkości ruchomych celów w wodzie źródłem sygnału są dwa nadajniki emitujące sinusoidalne, akustyczne fale ciągłe o różnych częstotliwościach, które po odbiciu od ruchomego celu są obierane przez cztery hydrofony. W artykule przedstawiono analize teoretyczna efektu Dopplera, na którym oparte jest działanie systemu oraz metodę rozwiązania głównych...
-
Further Developments of the Online Sound Restoration System for Digital Library Applications
PublicationNew signal processing algorithms were introduced to the online service for audio restoration available at the web address: www.youarchive.net. Missing or distorted audio samples are estimated using a specific implementation of the Jannsen interpolation method. The algorithm is based on the autoregressive model (AR) combined with the iterative complementation of signal samples. Since the interpolation algorithm is computationally...
-
Applicability of null-steering for spoofing mitigation in civilian GPS
PublicationCivilian GPS signals are currently used in many critical applications, such as precise timing for power grids and telecommunication networks. Spoofing may cause their improper functioning. It is a threat which emerges with the growing availability of GPS constellation simulators and other devices which may be used to perform such attack. Development of the effective countermeasures, covering detection and mitigation, is necessary...
-
Comparison of near infrared spectroscopy (NIRS) and near-infrared transillumination-backscattering sounding (NIR-T/BSS) methods
PublicationThe aim of the study was to compare simultaneously recorded a NIR-T/BSS and NIRS signals from healthy volunteers. NIR-T/BSS is a device which give an ability to non-invasively detect and monitor changes in the subarachnoid space width (SAS). Experiments were performed on a group of 30 healthy volunteers (28 males and 2 females, age 30.8 ± 13.4 years, BMI = 24.5 ± 2.3 kg/m2). We analysed recorded signals using analysis methods based...
-
A Wearable System Developed to Monitor People Suffering from Vasovagal Syncope
PublicationA wearable system for monitoring non-invasively signals invaluable when examining person suffering from vasovagal syncope is presented in the paper. Following signals are continuously recorded: electrocardiogram, photopletysmogram, impedance cardiogram and electrodermal resistance.
-
An Approach to Bass Enhancement in Portable Computers Employing Smart Virtual Bass Synthesis Algorithms
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The developed algorithms are related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt and to the type of a portable device in use. To find optimum synthesis parameters of the VBS algorithms, subjective listening tests based on a parametric procedure...
-
Vocalic Segments Classification Assisted by Mouth Motion Capture
PublicationVisual features convey important information for automatic speech recognition (ASR), especially in noisy environment. The purpose of this study is to evaluate to what extent visual data (i.e. lip reading) can enhance recognition accuracy in the multi-modal approach. For that purpose motion capture markers were placed on speakers' faces to obtain lips tracking data during speaking. Different parameterizations strategies were tested...
-
Reliability of Pulse Measurements in Videoplethysmography
PublicationReliable, remote pulse rate measurement is potentially very important for medical diagnostics and screening. In this paper the Videoplethysmography was analyzed especially to verify the possible use of signals obtained for the YUV color model in order to estimate the pulse rate, to examine what is the best pulse estimation method for short video sequences and finally, to analyze how potential PPG-signals can be distinguished from...
-
Self diagnostics using smart glasses - preliminary study
Publicationn this preliminary study we analyzed the possibility of the reliable measurement of biomedical signals with some potential hardware extensions of smart glasses. Using specially designed experimental prototypes four category of biomedical signals were measured: electrocardiograms, electromyograms, electroencephalograms and respiration waveforms. Experi- ments with volunteers proved that using even simple construc- tion of sensors...
-
Innovative method of localization airplanes in VCS (VCS-MLAT) distributed system
PublicationThe article presents the concept and the structure of the localization module. The prototype module is the part of the VCS (VCS-MLAT) localization distributed system. The device receives the audio signal transmitted in airplanes band (118 MHz – 136 MHz). Received data with the timestamps are send to the main server. The data from multiple devices estimates the localization of the airplane. The main aim of the project is the analysis...
-
The dynamic signature verification using population-based vertical partitioning
PublicationThe dynamic signature is an attribute used in behavioral biometrics for verifying the identity of an individual. This attribute, apart from the shape of the signature, also contains information about the dynamics of the signing process described by the signals which tend to change over time. It is possible to process those signals in order to obtain descriptors of the signature characteristic of an individual user. One of the methods...
-
Receiver of Doppler multistatic system for moving target detection and tracking
PublicationThe article presents a method for solving major structural problems that occur in the receiver used in the multistatic Doppler system, aimed at determination of the trajectory and velocity of a moving target. In the system two transmitters emit acoustic continuous sinusoidal waves at different frequencies. The signals, scattered from a moving target are received by four hydrophones. Beside of the echoes, much larger signals coming...
-
Smart Modeling of Maritime Vessels
PublicationCurrently, the market offers many visualization tools available to graphic designers, engineers, managers and academics working on maritime environments. The practice of visualization involves making and manipulating images that convey novel phenomena and ideas. Visual communication, together with virtual reality environments, is an emerging and rapidly evolving discipline. It brings great advantage over written word or voice alone,...
-
A multisensor detector of a sleep apnea for using at home
PublicationDiagnosis of obstructive sleep apnea usually involves polysomnographic analysis, which unfortunately requires overnight stay in a specialized clinic and is very uncomfortable for a patient. This paper describes the method and apparatus for recording a set of signals to detect sleep apnea. The device records the following signals simultaneously: three-channel ECG, respiratory functions, signals from the accelerometer, and snoring...
-
Effectiveness of the robust PSS design
PublicationThe paper discusses optimal PSS of synchronous generator synthesis. The optimal controller is an Hinf controller, what means that minimises Hinf norm of transfer function between the exogenous signals such as reference inputs and disturbances, and the error signals which are to be minimised to meet the control objective. The dynamic properties of the plant are shaped by choosing appropriate weighting function applied to the plant...
-
Zastosowanie sygnałów o projektowanych kształtach do diagnostyki obiektów wysoko-impedancyjnych metodą spektroskopii impedancyjnej
PublicationW artykule przedstawiono metodę szybkiej spektroskopii impedancyjnej obiektów o wysokich impedancjach (|Zx| > 1 GOhm) z zastosowaniem sygnałów o projektowanych kształtach. Sygnał pobudzenia wytwarzany jest w module DAQ U2531A i doprowadzany na wejście badanego obiektu za pośrednictwem przetwornika cyfrowo-analogowego (CA). Sygnały odpowiedzi proporcjonalne do napięcia na mierzonej impedancji Zx oraz prądu płynącego przez Zx są...
-
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
PublicationThis paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...