Search results for: AUDIO-VISUAL SIGNALS

A comparative study of English viseme recognition methods and algorithms

Publication

- MULTIMEDIA TOOLS AND APPLICATIONS - Year 2018

An elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...

Full text available for download in the portal

Intelligent equalizer solution employing music genre and the room characteristics analysis

Publication

- Elektronika : konstrukcje, technologie, zastosowania - Year 2017

The paper presents an intelligent equalizer solution based on room acoustic conditions and music genre analysis. A series of acoustic characteristic measurements are performed for checking the concept proposed. White noise (reference signal) and audio excerpts belonging to six music genres are utilized as excitation signals in measurements. This results in registration of frequency responses of rooms and reverberation times. Signals...

Full text available for download from an external service

Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification

Publication

A. Rosner
F. Weninger
B. Schuller
M. Michalak
B. Kostek

- Year 2013

We present a comprehensive evaluation of the influence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classification. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...

Multimodal human-computer interfaces based on advanced video and audio analysis

Publication

- Year 2013

Multimodal interfaces development history is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for the disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and the audio interface for speech stretching for hearing impaired and stuttering people. The Smart...

Full text available for download from an external service

Gesture-controlled Sound Mixing System With a Sonified Interface

Publication

- Year 2013

In this paper the Authors present a novel approach to sound mixing. It is materialized in a system that enables mixing sound with hand gestures recognized in a video stream. The system has been developed in such a way that mixing operations can be performed both with and without visual support. To check the hypothesis that the mixing process needs only an auditory display, the influence of audio information visualization on sound...

Full text available for download from an external service

A concept of Signal Equalization Method Based on Music Genre and the Listener's Room Characteristics

Publication

- Year 2016

A research study that investigates the influence of the room acoustics environment on the frequency characteristic of the audio signal playback is presented. First, a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the frequency response of the room, a system for room acoustics compensation based on eight-band equalizer is proposed. The system settings depend on music genre. In...

Cross-domain applications of multimodal human-computer interfaces

Publication

A. Czyżewski

- Year 2015

Developed multimodal interfaces for education applications and for disabled people are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with mouth gestures and audio interface for speech stretching for hearing impaired and stuttering people and intelligent pen allowing for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...

Sparse autoregressive modeling

Publication

M. Ciołek

- Year 2012

In the paper the comparison of the popular pitch determination (PD) algorithms for the purpose of elimination of clicks from archive audio signals using sparse autoregressive (SAR) modeling is presented. The SAR signal representation has been widely used in code-excited linear prediction (CELP) systems. The appropriate construction of the SAR model is required to guarantee model stability. For this reason the signal representation...

Remote Estimation of Video-Based Vital Signs in Emotion Invocation Studies

Publication

- Year 2018

The goal of this study is to examine the influence of various imitated and video invoked emotions on the vital signs (respiratory and pulse rates). We also perform an analysis of the possibility to extract signals from sequences acquired with cost-effective cameras. The preliminary results show that the respiratory rate allows for better separation of some emotions than the pulse rate, yet this relation highly depends...

Full text available for download in the portal

Implementation Of The Innovative Radiolocalization System VCS-MLAT (Voice Communication System Multilateration)

Publication

- Year 2020

In the article the concept of the radiolocalization subsystem of the VHF communication for aviation VCS-MLAT (Voice Communication System – Multilateration) is presented. The distributed localization system can estimate the position of the aircraft using the audio signals from aircraft transmitters in the VHF band (118-136 MHz). This paper shows initial verification of the possibility to use voice airband communication to estimate...

Full text available for download from an external service

Subjective and Objective Comparative Study of DAB+ Broadcast System

Publication

- Archives of Acoustics - Year 2017

Broadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...

Full text available for download in the portal

Digital Transformation of Terrestrial Radio: An Analysis of Simulcasted Broadcasts in FM and DAB+ for a Smart and Successful Switchover

Publication

P. Falkowski-Gilski

- Applied Sciences-Basel - Year 2021

The process of digitizing radio is far from over. It is an important interdisciplinary aspect, involving Big Data and AI (Artificial Intelligence) when it comes to classifying and handling content, and an organizational challenge in the Industry 4.0 concept. There exist several methods for delivering audio signals, including terrestrial broadcasting and internet streaming. Among them, the DAB+ (Digital Audio Broadcasting plus)...

Full text available for download in the portal

Application of gaze tracking technology to quality of experience domain

Publication

- Year 2010

A new methodological approach to study subjective assessment results employing gaze tracking technology is shown. Notions of Human-Computer Interaction (HCI) and Quality of Experience (QoE) are briefly introduced in the context of their common application. Then, the gaze tracking system developed at the Multimedia Systems Department (MSD) of Gdansk University of Technology (GUT) is presented. A series of audio-visual subjective...

Automatic system for audio-video material reconstruction and archiving

Publication

- Year 2008

The paper presents a proposed model of a system for automatic archiving and reconstruction of audio-video recordings. The premise of this solution is to make the reconstruction process less dependent on human involvement. The aim is to reduce the cost of reconstructing the processed recordings. Because of the large number of archival audio-video recordings, there is a need for a system that enables automatic indexing of their content. It will help...

Evaluation of Sound Enhancement in Mobile Device Using Virtual Bass Synthesis Algorithm

Publication

- Year 2013

An experiment conducted to validate the possibility of using a virtual bass synthesis (VBS) algorithm in a portable computer is presented. The subjective listening tests based on the procedure of pairwise comparison between VBS, based on the so-called missing fundamental phenomenon, and standard bass boost technique are employed. The evaluation was carried out in two types of conditions: in a professional listening room and employing an...

Comparing traffic intensity estimates employing passive acoustic radar and microwave Doppler radar sensor

Publication

A. Czyżewski

- Journal of the Acoustical Society of America - Year 2020

The purpose of our applied research project is to develop an autonomous road sign with built-in radar devices of our design. In this paper, we show that it is possible to calibrate the acoustic vector sensor so that it can be used to measure traffic volume and count the vehicles involved in the traffic through the analysis of the noise emitted by them. Signals obtained from a Doppler radar are used as a reference source. Although...

Full text available for download from an external service

Detection of debonding in adhesive joints using Lamb wave propagation

Publication

- MATEC Web of Conferences - Year 2019

Adhesively bonded joints are widely used in many branches of industry. Mechanical degradation of this type of connections does not have significant symptoms that can be noticed during visual assessment, so non-destructive testing becomes a very important issue. The paper deals with experimental investigations of adhesively bonded steel plates with different defects. Five samples (an intact one and four with damages in the form...

Full text available for download in the portal

Recognition of hazardous acoustic events employing parallel processing on a supercomputing cluster. Rozpoznawanie niebezpiecznych zdarzeń dźwiękowych z wykorzystaniem równoległego przetwarzania na klastrze superkomputerowym

Publication

- Year 2015

A method for automatic recognition of hazardous acoustic events operating on a supercomputing cluster is introduced. The methods employed for detecting and classifying the acoustic events are outlined. The evaluation of the recognition engine is provided: both on the training set and using real-life signals. The algorithms yield sufficient performance in practical conditions to be employed in security surveillance systems. The...

Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling

Publication

S. Raczyński
E. Vincent
S. Sagayama

- IEEE Transactions on Audio Speech and Language Processing - Year 2013

Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of analyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch structure. These models are formulated as linear or log-linear interpolations of up to five sub-models, each of which is...

Full text available for download from an external service

Language material for English audiovisual speech recognition system development. Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej

Publication

A. Czyżewski
B. Kostek
T. Ciszewski
D. Majewicz

- Year 2013

The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training database of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...

Bimodal Emotion Recognition Based on Vocal and Facial Features

Publication

- Year 2023

Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...

Full text available for download in the portal

Traffic Noise Analysis Applied to Automatic Vehicle Counting and Classification

Publication

- Communications in Computer and Information Science - Year 2017

Problems related to determining traffic noise characteristics are discussed in the context of automatic dynamic noise analysis based on noise level measurements and traffic prediction models. The obtained analytical results provide the second goal of the study, namely automatic vehicle counting and classification. Several traffic prediction models are presented and compared to the results of in-situ noise level measurements. Synchronized...

Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets

Publication

- Electronics - Year 2022

Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Full text available for download in the portal

Image Representation for Cognitive Systems Using SOEKS and DDNA: A Case Study for PPE Compliance

Publication

C. Silva de Oliveira
C. Sanin
E. Szczerbicki

- Year 2020

Cognitive Vision Systems have gained significant interest from academia and industry during the past few decades, and one of the main reasons behind this is the potential of such technologies to revolutionize human life as they intend to work under complex visual scenes, adapting to a comprehensive range of unforeseen changes, and exhibiting prospective behavior. The combination of these properties aims to mimic the human capabilities...

Full text available for download in the portal

Bimodal classification of English allophones employing acoustic speech signal and facial motion capture

Publication

- Journal of the Acoustical Society of America - Year 2018

A method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Full text available for download from an external service

Study Analysis of Transmission Efficiency in DAB+ Broadcasting System

Publication

P. Falkowski-Gilski

- Year 2018

DAB+ is a very innovative and universal multimedia broadcasting system. Thanks to its updated multimedia technologies and metadata options, digital radio keeps pace with changing consumer expectations and the impact of media convergence. Broadcasting analog and digital radio services does vary, concerning devices on both transmitting and receiving side, as well as content processing mechanisms. However, the biggest difference is...

Full text available for download in the portal

In vivo imaging of the human eye using a two-photon excited fluorescence scanning laser ophthalmoscope

Publication

J. Boguslawski
G. Palczewska
S. Tomczewski
J. Milkiewicz
P. Kasprzycki
D. Stachowiak
K. Komar
M. J. Marzejon
B. L. Sikorski
A. Hudzikowski... and 6 others

- JOURNAL OF CLINICAL INVESTIGATION - Year 2022

BACKGROUND. Noninvasive assessment of metabolic processes that sustain regeneration of human retinal visual pigments (visual cycle) is essential to improve ophthalmic diagnostics and to accelerate development of new treatments to counter retinal diseases. Fluorescent vitamin A derivatives, which are the chemical intermediates of these processes, are highly sensitive to UV light; thus, safe analyses of these processes in humans...

Full text available for download in the portal

Music Information Retrieval – Soft Computing versus Statistics. Wyszukiwanie informacji muzycznej - algorytmy uczące versus metody statystyczne

Publication

B. Kostek

- Year 2015

Music Information Retrieval (MIR) is an interdisciplinary research area that covers automated extraction of information from audio signals, music databases and services enabling the indexed information searching. In the early stages the primary focus of MIR was on music information through Query-by-Humming (QBH) applications, i.e. on identifying a piece of music by singing (singing/whistling), while more advanced implementations...

Full text available for download from an external service

ANALIZA PARAMETRÓW SYGNAŁU MOWY W KONTEKŚCIE ICH PRZYDATNOŚCI W AUTOMATYCZNEJ OCENIE JAKOŚCI EKSPRESJI ŚPIEWU [Analysis of speech signal parameters in the context of their usefulness in the automatic assessment of the quality of singing expression]

Publication

- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2019

The work concerns an approach to parameterization for the classification of emotions in singing and a comparison with the classification of emotions in speech. For this purpose, the RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song) database of emotionally expressive speech and singing was used, containing recordings of professional actors presenting six different emotions. Next, the mel-cepstral coefficients (MFCC) were calculated, along with selected descriptors...

Full text available for download in the portal

Multimodal system for diagnosis and polysensory stimulation of subjects with communication disorders

Publication

- Year 2017

An experimental multimodal system, designed for polysensory diagnosis and stimulation of persons with impaired communication skills or even non-communicative subjects is presented. The user interface includes an eye tracking device and the EEG monitoring of the subject. Furthermore, the system consists of a device for objective hearing testing and an autostereoscopic projection system designed to stimulate subjects through their...

How Can We Identify Electrophysiological iEEG Activities Associated with Cognitive Functions?

Publication

M. T. Kucewicz
G. A. Worrell
K. Saboo

- Year 2023

Electrophysiological activities of the brain are engaged in its various functions and give rise to a wide spectrum of low and high frequency oscillations in the intracranial EEG (iEEG) signals, commonly known as the brain waves. The iEEG spectral activities are distributed across networks of cortical and subcortical areas arranged into hierarchical processing streams. It remains a major challenge to identify these activities in...

Full text available for download from an external service

Multimodal Surveillance Based Personal Protection System

Publication

- Year 2013

A novel, multimodal approach for automatic detection of abduction of a protected individual, employing dedicated personal protection device and a city monitoring system is proposed and overviewed. The solution is based on combining four modalities (signals coming from: Bluetooth, fixed and PTZ cameras, thermal camera, acoustic sensors). The Bluetooth signal is used continuously to monitor the protected person's presence, and in case...

Ranking Speech Features for Their Usage in Singing Emotion Classification

Publication

- Year 2020

This paper aims to retrieve speech descriptors that may be useful for the classification of emotions in singing. For this purpose, Mel Frequency Cepstral Coefficients (MFCC) and selected Low-Level MPEG 7 descriptors were calculated based on the RAVDESS dataset. The database contains recordings of emotional speech and singing of professional actors presenting six different emotions. Employing the algorithm of Feature Selection based...

Full text available for download in the portal

Creating a Remote Choir Performance Recording Based on an Ambisonic Approach

Publication

- Applied Sciences-Basel - Year 2022

The aim of this paper is three-fold. First, the basics of binaural and ambisonic techniques are briefly presented. Then, details related to audio-visual recordings of a remote performance of the Academic Choir of the Gdańsk University of Technology are shown. Due to the COVID-19 pandemic, artists had a choice, namely, to stay at home and not perform or stay at home and perform. In fact, staying at home brought in the possibility...

Full text available for download in the portal

Subjective and Objective Quality Evaluation Study of BPL-PLC Wired Medium

Publication

G. Debita
P. Falkowski-Gilski
M. Habrych
B. Miedziński
B. Polnik
J. Wandzio
P. Jedlikowski

- Elektronika Ir Elektrotechnika - Year 2020

This paper presents results of research on the effectiveness of bi-directional voice transmission in a 6 kV mine cable network using BPL-PLC (Broadband over Power Line - Power Line Communication) technology. It concerns both emergency cable state (supply outage with cable shorted at both ends) and loaded with distorted current waveforms. The narrowband (0.5 MHz–15 MHz) and broadband (two different modes, frequency range of 3 MHz–7.5...

Full text available for download in the portal

Broadening the scope of measurement and analysis of vibrations of an organ pipe employing intensity probe, simulations, and highspeed camera

Publication

P. Bordoni
J. Kotus
P. Odya
F. Antonacci
B. Kostek

- Journal of the Acoustical Society of America - Year 2022

This paper shows an integrated approach to measure, analyze, and model phenomena occurring in an organ pipe driven by pressurized air. The aim of this paper is two-fold, i.e., to measure the pressure signal and the intensity field around the mouth by means of an intensity probe and to visualize and observe the motion of the air jet, which represents the excitation mechanism of the system. This is realized through two techniques,...

Full text available for download from an external service

Detection and Imaging of Debonding in Adhesive Joints of Concrete Beams Strengthened with Steel Plates Using Guided Waves and Weighted Root Mean Square

Publication

- Materials - Year 2020

Strengthening of engineering structures is an important issue, especially for elements subjected to variable loads. In the case of concrete beams or slabs, one of the most popular approaches assumes mounting an external reinforcement in the form of steel or composite elements by structural adhesives. A significant disadvantage of adhesive joints is the lack of access to the adhesive film for visual condition assessment, thus, there...

Full text available for download in the portal

Guided wave propagation in debonding detection in CFRP-reinforced steel plate-like structures

Publication

B. Zima
Ł. Breńkacz

- OCEAN ENGINEERING - Year 2024

The present study investigates the guided wave propagation in multilayered steel specimens reinforced with carbon fiber reinforced polymer (CFRP) through theoretical, numerical, and experimental means. The effectiveness of externally bonded reinforcement (EBR) relies heavily on the bonding quality between the CFRP and the substrate. Premature debonding, a prevalent and hazardous defect, can arise from suboptimal manufacturing processes,...

Full text available for download from an external service

Wave Frequency Effects on Damage Imaging in Adhesive Joints Using Lamb Waves and RMS

Publication

- Materials - Year 2019

Structural adhesive joints have numerous applications in many fields of industry. The gradual deterioration of adhesive material over time causes a possibility of unexpected failure and the need for non-destructive testing of existing joints. The Lamb wave propagation method is one of the most promising techniques for the damage identification of such connections. The aim of this study was experimental and numerical research on...

Full text available for download in the portal

New Applications of Multimodal Human-Computer Interfaces

Publication

A. Czyżewski

- Year 2012

Multimodal computer interfaces and examples of their applications to education software and for the disabled people are presented. The proposed interfaces include the interactive electronic whiteboard based on video image analysis, application for controlling computers with gestures and the audio interface for speech stretching for hearing impaired and stuttering people. Application of the eye-gaze tracking system to awareness...

Analysis of Lombard speech using parameterization and the objective quality indicators in noise conditions

Publication

K. Kąkol
G. Korvel
B. Kostek

- Year 2018

The aim of the work is to analyze the Lombard speech effect in recordings and then modify the speech signal in order to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The modifications made to the signal are based on the characteristics of the Lombard speech, and in particular on the effect of increasing the fundamental frequency...

MACHINE LEARNING–BASED ANALYSIS OF ENGLISH LATERAL ALLOPHONES

Publication

M. Piotrowska
G. Korvel
B. Kostek
T. Ciszewski
A. Czyżewski

- International Journal of Applied Mathematics and Computer Science - Year 2019

Automatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and self-organizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’...

Full text available for download in the portal

Measurement of Latency in the Android Audio Path

Publication

- Year 2018

This paper provides a description of experimental investigations concerning comparison between the audio path characteristics of various Android versions. First, information about the changes in each system version in the context of latency caused by them is presented. Then, a measurement procedure employing available applications to measure latency is described comparing to results contained in the Internet. Finally, a comparison...

Full text available for download from an external service

Visual Dimensions of Modeling Languages in Interdisciplinary Perspective

Publication

A. Bobkowska

- Year 2013

The usability of visual modeling languages depends on their notation. A notation can be seen as a set of visual components that affect the human eye and brain in a particular way. The paper presents an interdisciplinary analysis carried out to better understand the visual dimensions of modeling languages. The visual dimensions derive from theories describing visual perception, data visualization, and cognitive representations....

Full text available for download from an external service

Retrospecting Polish Audio Engineering Society Membership on 20th Anniversary of the Polish Section of the Audio Engineering Society

Publication

B. Kostek
M. Sankiewicz

- Archives of Acoustics - Year 2011

In this article some key events concerning the founding of the Polish Section of the Audio Engineering Society are presented. In addition, the history of the International Symposia on Sound Engineering and Mastering is outlined. Also, papers contained in this issue are briefly reviewed.

Full text available for download in the portal

Intelligent video and audio applications for learning enhancement

Publication

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2011

The role of computers in school education is briefly discussed. Multimodal interfaces development history is briefly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....

Full text available for download in the portal

Personal adaptive tuning of mobile computer audio

Publication

- Year 2015

An integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....

Auditory-visual attention stimulator

Publication

- Year 2013

New approach to lateralization irregularities formation was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, displayed text is modified using several techniques i.e. zooming, highlighting etc. In the experimental part of...

Full text available for download from an external service

Digital Audio Broadcasting or Webcasting: A Network Quality Perspective

Publication

- Journal of Telecommunications and Information Technology - Year 2016

In recent years, many alternative technologies of delivering audio content have emerged, with different advantages and disadvantages. In this paper pros and cons of digital audio broadcasting and webcasting transmission techniques in a network quality perspective are described. A case study of user expectations with respect to currently available services is analyzed, and the perceived quality of real digital broadcasted and webcasted...

Full text available for download in the portal
