Filtry
wszystkich: 1594
wybranych: 1241
-
Katalog
Filtry wybranego katalogu
wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: audio-visual correlation
-
Creating a Remote Choir Performance Recording Based on an Ambisonic Approach
PublikacjaThe aim of this paper is three-fold. First, the basics of binaural and ambisonic techniques are briefly presented. Then, details related to audio-visual recordings of a remote performance of the Academic Choir of the Gdańsk University of Technology are shown. Due to the COVID-19 pandemic, artists had a choice, namely, to stay at home and not perform or stay at home and perform. In fact, staying at home brought in the possibility...
-
Rough Sets Applied to Mood of Music Recognition
PublikacjaWith the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered as one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt to music emotion recognition...
-
Reverberation divergence in VR applications
PublikacjaThis project aimed to investigate the correlation between virtual reality (VR) imagery and ambisonic sound. With the increasing popularity of VR applications, understanding how sound is perceived in virtual environments is crucial for enhancing the immersiveness of the experience. In the experiment, participants were immersed in a virtual environment that replicated a concert hall. Their task was to assess the correspondence between...
-
Stability evaluation of the marine propulsion unit mechanical system bymeans of vibrations measurements and their analysis.
PublikacjaIn paper selected issues concerning vibration diagnosis of the mechanical system within marine propulsion units have been given. The analysis of vibration parameters was carried out on the basis of experimental examinations of a real object in which an exceedance of the allowable vibration's level had been observed. In the beginning, the vibration apparatus being on laboratory equipment of the Department of Ship Power Plants of...
-
Stability evaluation of the marine propulsion unit's mechanical system by means of vibration measurements and their analysis
PublikacjaIn this paper selected issues concerning vibration diagnosis of the mechanical system within marine propulsion units have been given. The analysis of vibration parameters was carried out on the basis of experimental examinations of a real object in which an exceedance of the allowable vibration's level had been observed. In the beginning, the vibration apparatus being on laboratory equipment of the Department of Ship Power Plants...
-
New Applications of Multimodal Human-Computer Interfaces
PublikacjaMultimodal computer interfaces and examples of their applications to education software and for the disabled people are presented. The proposed interfaces include the interactive electronic whiteboard based on video image analysis, application for controlling computers with gestures and the audio interface for speech stretching for hearing impaired and stuttering people. Application of the eye-gaze tracking system to awareness...
-
Biological responses to environmental contamination. How can metal pollution impact signal honesty in avian species?
PublikacjaEnvironmental pollution, for example with metals, can significantly affect the ecosystem balance leading to severe changes. Biologically active pigments are relevant for the appearance and condition of birds. Melanin and carotenoid particles are the most frequently deposited pigments in avian integument. They are responsible for the majority of colors of bird plumage. The phenotypic expression can be affected by metal contamination....
-
Brain-computer interaction based on EEG signal and gaze-tracking information = Analiza interackji mózg-komputer wykorzystująca sygnał EEg i informacje z systemu śledzenia punktu fiksacji wzroku
PublikacjaThe article presents an attempt to integrate EEG signal analysis with information about human visual activities, i.e. gaze fixation point. The results from gaze-tracking-based measurement were combined with the standard EEG analysis. A search for correlation between the brain activity and the region of the screen observed by the user was performed. The preliminary stage of the study consists in electrooculography (EOG) signal processing....
-
Polish Adaptation of the Pregnancy-Related Anxiety Questionnaire—Revised 2 for All Pregnant Women
PublikacjaPregnancy-related anxiety (PrA) is a specific type of anxiety characteristic of the perinatal period. PrA can affect pregnancy and birth. However, no validated tool exists to measure PrA in Polish obstetric practice. The aim of this study was to translate the Pregnancy-Related Anxiety Questionnaire—Revised 2 (PRAQ-R2) into Polish and to evaluate its reliability and factorial and construct validity. This study was conducted in Poland...
-
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
PublikacjaIn this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...
-
Measurement of Latency in the Android Audio Path
PublikacjaThis paper provides a description of experimental investigations concerning comparison between the audio path characteristics of various Android versions. First, information about the changes in each system version in the context of latency caused by them is presented. Then, a measurement procedure employing available applications to measure latency is described comparing to results contained in the Internet. Finally, a comparison...
-
Visual Dimensions of Modeling Languages in Interdisciplinary Perspective
PublikacjaUżyteczność języków modelowania wizualnego zależy od notacji. Notacja może być postrzegana jako zestaw wizualnych komponentów, które w określony sposób oddziałują na ludzkie oko i ludzki mózg. Referat przedstawia analizę interdyscyplinarną wykonaną w celu lepszego zrozumienia wizualnych wymiarów języków modelowania. Wizualne wymiary pochodzą z teorii opisujących percepcję wzrokową, wizualizację danych oraz reprezentacje poznawcze....
-
Retrospecting Polish Audio Engineering Society Membership on 20th Anniversary of the Polish Section of the Audio Engineering Society
PublikacjaIn this article some key events concerning founding Polish Section of the Audio Engineering Society were presented. In addition, the history covering International Symposia on Sound Engineering and Mastering was outlined. Also, papers contained in this issue were shortly reviewed.
-
Intelligent video and audio applications for learning enhancement
PublikacjaThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
Detection of impulsive disturbances in archive audio signals
PublikacjaIn this paper the problem of detection of impulsive disturbances in archive audio signals is considered. It is shown that semi-causal/noncausal solutions based on joint evaluation of signal prediction errors and leave-one-out signal interpolation errors, allow one to noticeably improve detection results compared to the prediction-only based solutions. The proposed approaches are evaluated on a set of clean audio signals contaminated...
-
Personal adaptive tuning of mobile computer audio
PublikacjaAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....
-
Elimination of impulsive disturbances from stereo audio recordings
PublikacjaThis paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. On-line tracking of signal model parameters is performed using the stability-preserving Whittle-Wiggins-Robinson algorithm with exponential data weighting. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples...
-
Auditory-visual attention stimulator
PublikacjaNew approach to lateralization irregularities formation was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, displayed text is modified using several techniques i.e. zooming, highlighting etc. In the experimental part of...
-
Visual Features for Endoscopic Bleeding Detection
PublikacjaAims: To define a set of high-level visual features of endoscopic bleeding and evaluate their capabilities for potential use in automatic bleeding detection. Study Design: Experimental study. Place and Duration of Study: Department of Computer Architecture, Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology, between March 2014 and May 2014. Methodology: The features have...
-
Digital Audio Broadcasting or Webcasting: A Network Quality Perspective
PublikacjaIn recent years, many alternative technologies of delivering audio content have emerged, with different advantages and disadvantages. In this paper pros and cons of digital audio broadcasting and webcasting transmission techniques in a network quality perspective are described. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...
-
A Study on Audio Signal Processed by "Instant Mastering"
PublikacjaAn increasing amount of music produced in home- and project-studios results in development and growth of "automatic mastering services". The presented investigation explores changes introduced to audio signal by various online mastering platforms. A music set consisting of 10 songs produced in small facilities was processed by eight on-line automatic mastering services. Additionally, some laboratory-constructed signals were tested....
-
System do prototypowania bezprzewodowych inteligentnych urządzeń monitoringu audio-video
PublikacjaW komunikacie przedstawiono system prototypowania bezprzewodowych urządzeń do monitoringu audio-video. System bazuje na układach FPGA Virtex6 i wielu dodatkowych wspierających urządzeniach jak: szybka pamięć DDR3, mała kamera HD, mikrofon z konwerterem A/C, moduł radiowy WiFi, itp. Funkcjonalność systemu została szczegółowo opisana w komunikacie. System został zoptymalizowany do pracy pod kontrolą systemu operacyjnego Linux, zostały...
-
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
PublikacjaThis paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...
-
Testing Watermark Robustness against Application of Audio Restoration Algorithms
PublikacjaThe purpose of this study was to test to what extent watermarks embedded in distorted audio signals are immune to audio restoration algorithm performing. Several restoration routines such as noise reduction, spectrum expansion, clipping or clicks reduction were applied in the online website system. The online service was extended with some copyright protection mechanisms proposed by the authors. They contain low-level music features...
-
Visual TreeCmp : Comprehensive Comparison of Phylogenetic Trees on the Web
Publikacja1. We present Visual TreeCmp—a package of applications for comparing phylogenetic tree sets. 2. Visual TreeCmp includes a graphical web interface allowing the visualization of compared trees and command line application extended by comparison methods recently proposed in the literature. 3. The phylogenetic tree similarity analysis in Visual TreeCmp can be performed using eighteen metrics, of which 11 are dedicated to rooted trees...
-
ARCHITECTURE IN BRANDING AND OVERBRANDING PROCESS. HOW GLOBALIZATION INFLUENCED ARCHITECTURE QUALITY.
PublikacjaGlobalization is unification. In the field of architecture and in the social structure alike, the globalization leads to a blending of economies and cultures that produces uniformly branded public space as a response to universal assumptions of integration and correlation. While branding consists in creating a recognizable and outstanding image a strong contradiction exists between both of these approaches. As a result, regional...
-
A double-talk detector using audio watermarking
Publikacjaa novel approach to double-talk detection in the acoustic echo canceler is proposed. a hidden signature is embedded into the arriving signal, using the echo-hiding method. next detection of the presence of this signature in the microphone signal is performed. the results of the signature detection may be used by the acoustic echo canceler to stop or restart the adaptation process.
-
Augmented Reality for Privacy-Sensitive Visual Monitoring
PublikacjaThe paper presents a method for video anonymization and replacing real human silhouettes with virtual 3D figures rendered on the screen. Video stream is processed to detect and to track objects, whereas anonymization stage employs fast blurring method. Substitute 3D figures are animated accordingly to behavior of detected persons. Their location, movement speed, direction, and person height are taken into account during the animation...
-
Audio content analysis in the urban area telemonitoring system
PublikacjaArtykuł przedstawia możliwości rozwinięcie monitoringu miejskiego o automatyczną analizę dźwięku. Przedstawiono metody parametryzacji dźwięku, które możliwe są do zastosowania w takim systemie oraz omówiono aspekty techniczne implementacji. W kolejnej części przedstawiono system decyzyjny oparty na drzewach zastosowany w systemie. System ten rozpoznaje dźwięki niebezpieczne (strzał, rozbita szyba, krzyk) wśród dźwięków zarejestrowanych...
-
Visual Lip Contour Detection for the Purpose of Speech Recognition
PublikacjaA method for visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition with visual features extracted from a mouth region. Different Active Appearance Models are employed for finding lips in video frames and for lip shape and texture statistical description. Search initialization procedure is proposed and error measure values are...
-
Special forms of echo visual representation in an ahead looking sonar.
PublikacjaThe paper discusses ways to organise visual representation in a multi-beam ahead looking sonars whose function is to detect objects on the bottom and in pelagic zones. Forms of visual representation are shown and illustrated on the basic screen (panoramic representation and setting, alarms) and on the auxiliary screen (type A, B and special). Special forms of visual representation are mainly used in detecting objects in difficult...
-
Visual Management as the support in building the concept of continuous improvement in the enterprise
PublikacjaThe following article presents one of the selected tools of the Lean Management concept – visual management. This method enables enterprises to strengthen their process of continuous improvement. Due to the support of visual management, it is possible to manage information more effectively by the managerial board and to improve communication process within in the particular company. In the first part, the author describes the concept...
-
Localization of impulsive disturbances in audio signals using template matching
PublikacjaIn this paper, a new solution to the problem of elimination of impulsive disturbances from audio signals, based on the matched filtering technique, is proposed. The new approach stems from the observation that a large proportion of noise pulses corrupting audio recordings have highly repetitive shapes that match several typical “patterns”. In many cases a representative set of exemplary pulse waveforms can be extracted from the...
-
Robust and Efficient Machine Learning Algorithms for Visual Recognition
PublikacjaIn visual recognition, the task is to identify and localize all objects of interest in the input image. With the ubiquitous presence of visual data in modern days, the role of object recognition algorithms is becoming more significant than ever and ranges from autonomous driving to computer-aided diagnosis in medicine. Current models for visual recognition are dominated by models based on Convolutional Neural Networks (CNNs), which...
-
New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception
PublikacjaThe influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...
-
Visual and Auditory Attention Stimulator for Assisting Pedagogical Therapy
PublikacjaVisual and auditory attention stimulator provides a system developed in order to improve reading skills using simultaneous presentation of text in its visual form and in transformed auditory form accompanied by related movie material. The described research employed 40 children at the age of 8 13 years having difficulties in learning of reading, who were diagnosed as having developmental dyslexia. It was shown that application...
-
Visual Data Encryption for Privacy Enhancement in Surveillance Systems
PublikacjaIn this paper a methodology for employing reversible visual encryption of data is proposed. The developed algorithms are focused on privacy enhancement in distributed surveillance architectures. First, motivation of the study performed and a short review of preexisting methods of privacy enhancement are presented. The algorithmic background, system architecture along with a solution for anonymization of sensitive regions of interest...
-
Visual content representation and retrieval for Cognitive Cyber Physical Systems
PublikacjaCognitive Cyber Physical Systems have gained significant attention from academia and industry during the past few decade. One of the main reasons behind this interest is the potential of such technologies to revolutionize human life since they intend to work robustly under complex visual scenes, which environmental conditions may vary, adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior...
-
Analysis of impact of audio modifications on the robustness of watermark for non-blind architecture
PublikacjaThe aim of this paper is to assess the robustness of the non-blind audio content watermarking scheme proposed by the authors. The authors present the architecture of the designed system along with the employed workflows for embedding and extracting the watermark followed by the implementation phase description and the analysis of the experimental results. Some possible attack simulations on the embedded watermarks are reviewed,...
-
Parametric impulsive noise detector for corrupted audio signals based on hidden Markow model
PublikacjaThe paper addresses the problem of impulsive noise detection for audio signals. A structure of threshold parameter detectors using modelingof signals was introduced. the algorithm of the noise detection, based on discrete-time hidden Markow model (HMM)of whitened audio signal is elaborated
-
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
PublikacjaIn this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...
-
Using concentrated spectrogram for analysis of audio acoustic signals
PublikacjaThe paper presents results of time-frequency analysis of audio acoustic signals using the method of Concentrated Spectrograph also known as ''Cross-spectral method'' or ''Reassignment method''. Presented algorithm involves signal's local group delay and channelized instantaneous frequency to relevantly redistribute all Short-time Fourier transform lines in time-frequency plain. The main intention of the paper is to compare various...
-
RENOVATION OF ARCHIVE AUDIO RECORDINGS USING SPARSE AUTOREGRESSIVE MODELING AND BIDIRECTIONAL PROCESSING
PublikacjaThe paper presents a new approach to elimination of broadband noise and impulsive disturbances from archive audio recordings. The proposed adaptive Kalman-like algorithm, based on a sparse autoregressive model of the audio signal, simultaneously detects noise pulses, interpolates the irrevocably distorted samples and performs signal smoothing. It is shown that bidirectional (forward-backward) processing of the archive signal improves...
-
Sparse vector autoregressive modeling of audio signals and its application to the elimination of impulsive disturbances
PublikacjaArchive audio files are often corrupted by impulsive disturbances, such as clicks, pops and record scratches. This paper presents a new method for elimination of impulsive disturbances from stereo audio signals. The proposed approach is based on a sparse vector autoregressive signal model, made up of two components: one taking care of short-term signal correlations, and the other one taking care of long-term correlations. The method...
-
Objectivization of phonological evaluation of speech elements by means of audio parametrization
PublikacjaThis study addresses two issues related to both machine- and subjective-based speech evaluation by investigating five phonological phenomena related to allophone production. Its aim is to use objective parametrization and phonological classification of the recorded allophones. These allophones were selected as specifically difficult for Polish speakers of English: aspiration, final obstruent devoicing, dark lateral /l/, velar nasal...
-
Visual perception of vowels from static and dynamic cues
PublikacjaThe purpose of the study was to analyse human identification of Polish vowels from static and dynamic durationally slowed visual cues. A total of 152 participants identified 6 Polish vowels produced by 4 speakers from static (still images) and dynamic (videos) cues. The results show that 59% of static vowels and 63% of dynamic vowels were successfully identified. There was a strong confusion between vowels within front, central,...
-
A study on of music features derived from audio recordings examples – a quantitative analysis
PublikacjaThe paper presents a comparative study of music features derived from audio recordings, i.e. the same music pieces but representing different music genres, excerpts performed by different musicians, and songs performed by a musician, whose style evolved over time. Firstly, the origin and the background of the division of music genres were shortly presented. Then, several objective parameters of an audio signal were recalled that...
-
Visual Content Representation for Cognitive Systems: Towards Augmented Intelligence
PublikacjaCognitive Vision Systems have gained significant attention from academia and industry during the past few decades. One of the main reasons behind this interest is the potential of such technologies to revolutionize human life since they intend to work robustly under complex visual scenes (which environmental conditions may vary), adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination...
-
Localization of impulsive disturbances in archive audio signals using predictive matched filtering
PublikacjaThe problem of elimination of impulsive disturbances from archive audio signals is considered and its new solution, called predictive matched filtering, is proposed. The new approach is based on the observation that a large percentage of noise pulses corrupting archive audio recordings have highly repetitive shapes that match several typical “patterns”, called click templates. To localize noise pulses, click templates can be correlated...
-
Comparison of Acoustic and Visual Voice Activity Detection for Noisy Speech Recognition
PublikacjaThe problem of accurate differentiating between the speaker utterance and the noise parts in a speech signal is considered. The influence of utilizing a voice activity detection in speech signals on the accuracy of the automatic speech recognition (ASR) system is presented. The examined methods of voice activity detection are based on acoustic and visual modalities. The problem of detecting the voice activity in clean and noisy...