Search results for: BINAURAL SPATIAL AUDIO

total: 1448

displaying 1000 best results

Further Developments of the Online Sound Restoration System for Digital Library Applications
Publication
- Year 2014
New signal processing algorithms were introduced to the online service for audio restoration available at the web address: www.youarchive.net. Missing or distorted audio samples are estimated using a specific implementation of the Janssen interpolation method. The algorithm is based on the autoregressive (AR) model combined with the iterative complementation of signal samples. Since the interpolation algorithm is computationally...

Full text to download in external service
MIASTO PORTOWE – STRUKTURA, WYZWANIA FUNKCJONALNE I MODELE ROZWOJU (Port City: Structure, Functional Challenges and Development Models)
Publication
- K. Krośnicka
- Studia Komitetu Przestrzennego Zagospodarowania Kraju PAN - Year 2018
Port cities have a different spatial structure from cities located inland. As a result of their seaside location, they face specific administrative and functional problems on a daily basis. In the economic and settlement structure of the country, they usually play the role of a "gate" through which streams of cargo are distributed further over the whole hinterland. It is the transport and logistics function of port cities,...

Full text available to download
An Approach to Bass Enhancement in Portable Computers Employing Smart Virtual Bass Synthesis Algorithms
Publication
- Year 2014
The aim of this paper is to present a novel approach to Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The developed algorithms rely on intelligent, rule-based setting of synthesis parameters according to the music genre of an audio excerpt and the type of portable device in use. To find optimum synthesis parameters of the VBS algorithms, subjective listening tests based on a parametric procedure...

Full text to download in external service
Sparse autoregressive modeling
Publication
- M. Ciołek
- Year 2012
The paper presents a comparison of popular pitch determination (PD) algorithms for the purpose of eliminating clicks from archive audio signals using sparse autoregressive (SAR) modeling. The SAR signal representation has been widely used in code-excited linear prediction (CELP) systems. The appropriate construction of the SAR model is required to guarantee model stability. For this reason the signal representation...
Innovative method of localization airplanes in VCS (VCS-MLAT) distributed system
Publication
- S. Wiszniewski
- Year 2019
The article presents the concept and the structure of the localization module. The prototype module is part of the VCS (VCS-MLAT) distributed localization system. The device receives the audio signal transmitted in the aircraft band (118 MHz – 136 MHz). The received data, together with timestamps, are sent to the main server. Data from multiple devices are used to estimate the location of the airplane. The main aim of the project is the analysis...
Hanna Obracht-Prondzyńska dr inż. arch.

People

Hanna Obracht-Prondzyńska, PhD MArch, Eng. Assistant Professor at the University of Gdańsk, Department of Spatial Management, academic teacher of urban design and spatial data analysis. Architect and urban planner experienced in data-driven urban design and planning. She defended her PhD with distinction in engineering and technical sciences in the discipline of architecture and urban planning in 2020 at the Faculty of Architecture...
Monument of History in Gdynia - Problems with Protection of the Main Representative Axis of the City
Publication
- M. Sołtysik
- Ochrona Dziedzictwa Kulturowego - Year 2019
The spatial idea of the main representative axis was presented in 1938 in the project of the Representative District of Gdynia and partly realised in 1938-1939. Although the whole area was included in Gdynia's Monument of History, its spatial integration is now under threat from controversial current development plans.

Full text available to download
THE 3D MODEL OF WATER SUPPLY NETWORK WITH APPLICATION OF THE ELEVATION DATA
Publication
- A. Sobieraj-Żłobińska
- B. Wieczorek
- Year 2017
3D visualization is a key element of research and analysis and serves as a source used by experts in various fields, e.g. experts in water and sewage systems. The aim of this study was to visualize a model of a water supply network with terrain relief in three-dimensional space. The path of technological development of GESUT data (Geodezyjna Ewidencja Sieci Uzbrojenia Terenu – geodetic records of public utilities) for water supply and measurement...

Full text to download in external service
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Publication
- D. Korzekwa
- R. Barra-Chicote
- S. Zaporowski
- G. Beringer
- J. Lorenzo-trueba
- A. Serafinowicz
- J. Droppo
- T. Drugman
- B. Kostek
- Year 2021
This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

Full text available to download
Subjective and Objective Comparative Study of DAB+ Broadcast System
Publication
- P. Falkowski-Gilski
- J. Stefański
- Archives of Acoustics - Year 2017
Broadcasting services seek to optimize their use of bandwidth in order to maximize user’s quality of experience. They aim to transmit high-quality digital speech and music signals at the lowest bitrate. They intend to offer the best quality under available conditions. Due to bandwidth limitations, audio quality is in conflict with the number of transmitted radio programs. This paper analyzes whether the quality of real-time digital...

Full text available to download
Cross-domain applications of multimodal human-computer interfaces
Publication
- A. Czyżewski
- Year 2015
Multimodal interfaces developed for educational applications and for disabled people are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with mouth gestures, an audio interface for speech stretching for hearing-impaired and stuttering people, and an intelligent pen for diagnosing and ameliorating developmental dyslexia. The eye-gaze tracking system named...
A spatio-temporal approach to intersectoral labour and wage mobility
Publication
- K. Flisikowski
- Year 2017
The article presents the spatio-temporal approach for intersectoral labor and wage mobility. Analyses of interindustry mobility were performed with the use of general entropy mobility indices (GEMM). Spatio-temporal approach was obtained thanks to the separate measurement of spatial autocorrelation and regression for each set of sectoral wage and employment structure and was conducted in each year of the research period separately....

Full text available to download
A Spatio-temporal Approach to Intersectoral Labour and Wage Mobility
Publication
- K. Flisikowski
- Year 2017
The article presents the spatio-temporal approach for intersectoral labor and wage mobility. Analyses of interindustry mobility were performed with the use of general entropy mobility indices (GEMM). Spatio-temporal approach was obtained thanks to the separate measurement of spatial autocorrelation and regression for each set of sectoral wage and employment structure and was conducted in each year of the research period separately....

Full text available to download
Client-server Approach in the Navigation System for the Blind
Publication
- TransNav - The International Journal on Marine Navigation and Safety of Sea Transportation - Year 2013
The article presents the client‐server approach in the navigation system for the blind ‐ “Voice Maps”. The authors were among the main creators of the prototype and currently the commercialization phase is being finished. In the implemented prototype only exemplary, limited spatial data were used, therefore they could be stored and analysed (for path-finding process) in the mobile device’s memory without any difficulties. The...

Full text available to download
Space friendly for the blind = Przestrzeń przyjazna dla niewidomych
Publication
- M. Wysocki
- Year 2012
The article presents issues connected with the accessibility of public space for people with eyesight disabilities. The use of extravisual spatial stimuli in shaping the urban environment has been analysed. Spaces in which multisensory spatial reception is feasible become user-friendly, as they come to meet the changing needs of their users. The article introduces a system of textures aiding spatial orientation, navigation and...
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing acoustic and visual speech representations is developed. It combines audio and visual modalities, which provides a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e. the changes in the articulatory setting of speech organs for...

Full text to download in external service
Methodology and technology for the polymodal allophonic speech transcription
Publication
- Journal of the Acoustical Society of America - Year 2016
A method for automatic audiovisual transcription of speech employing acoustic, electromagnetic articulography and visual speech representations is developed. It combines audio and visual modalities, which provides a synergy effect in terms of speech recognition accuracy. To establish a robust solution, basic research concerning the relation between the allophonic variation of speech, i.e., the changes in the articulatory...

Full text to download in external service
Sound engineering as our commitment to its creators in Poland
Publication
- B. Kostek
- A. Czyżewski
- Archives of Acoustics - Year 2019
Sound engineering is an interdisciplinary and rapidly expanding domain. It covers many aspects, such as sound perception, studio and sound mastering technology, music information retrieval including content-based search systems and automatic music transcription frameworks, sound synthesis, sound restoration, electroacoustics, and other areas that make up multimedia technology. Moreover, machine learning methods applied to the topics...

Full text to download in external service
The condition of economies. Do most valuable global brands matter?
Publication
- K. Flisikowski
- W. Kucharska
- EQUILIBRIUM Quarterly Journal of Economics and Economic Policy - Year 2018
Research background: Brands are considered to be the most valuable asset of a company. Some of them achieve spectacular global results. The significance of global brands is shown by the fact that their value is often greater than the sum of all of the company's net assets. Purpose of the article: The aim of this article is to highlight that brand value not only creates company value, but also leverages economies. The Authors claim...

Full text available to download
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 17 - SEQUENCE S6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
Landscape protection - the challenge for sustainable planning
Publication
- A. Sas-Bojarska
- Year 2010
Growing spatial chaos reminds us of the need for systematic, complex approaches related to environmental and landscape issues within different planning, organizational, operational, legal and political activities. It has been proved that LVIA within EIA plays an important role in the enhancement of the spatial planning system, and that there is a need to use the potential of EIA/LVIA in spatial development and management. But...
Measurements of QoS/QoE parameters for media streaming in a PMIPv6 testbed with 802.11 b/g/n WLANs
Publication
- Metrology and Measurement Systems - Year 2012
A growing number of mobile devices and the increasing popularity of multimedia services result in a new challenge of providing mobility in access networks. The paper describes experimental research on media (audio and video) streaming in a mobile IEEE 802.11 b/g/n environment realizing network-based mobility. It is an approach to mobility that requires little or no modification of the mobile terminal. Assessment of relevant parameters...

Full text available to download
UAV measurements and AI-driven algorithms fusion for real estate good governance principles support
Publication
- P. Tysiąc
- A. Janowski
- M. Walacik
- International Journal of Applied Earth Observation and Geoinformation - Year 2024
The paper introduces an original method for effective spatial data processing, particularly important for land administration and real estate governance. This approach integrates Unmanned Aerial Vehicle (UAV) data acquisition and processing with Artificial Intelligence (AI) and Geometric Transformation algorithms. The results reveal that: (1) while the separate applications of YOLO and Hough Transform algorithms achieve building detection...

Full text to download in external service
Assessment of Alterations in Settlement Patterns of Agricultural Landscape in the Example of Kashubia in Poland
Publication
- A. Górka
- Sustainability - Year 2024
Traditional agricultural landscapes are heavily exposed to change due to their relatively low agricultural productivity. However, they represent cultural values of great importance in maintaining the resilience of the environment and society. Although their cultural potential is important for sustainable development, it is still insufficiently recognized. The article fills this gap by examining old farmstead buildings as a distinguishing...

Full text available to download
System for automatic singing voice recognition
Publication
- P. Żwan
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2008
The article presents a system for automatic recognition of the quality and type of a singing voice. The database and the implemented parameters are described. The decision algorithm is an artificial neural network. The trained decision system achieves an accuracy of about 90% in both recognition categories. In addition, statistical methods were used to show that the results of the system for automatic assessment of technical quality...
Expert system for automatic classification and quality assessment of singing voices
Publication
- P. Żwan
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2006

Full text to download in external service
DSP techniques for determining "Wow" distortions
Publication
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2007
The article describes algorithms for determining the characteristic of wow distortion in audio. The algorithms are: power-line hum tracking, tracking of the magnetic remnant of the high-frequency bias current, and adaptive analysis of the spectral centre of gravity for a selected part of the distorted signal. The presented algorithms are suitable for both software and hardware implementation.
Tonality Estimation and Frequency Tracking of Modulated Tonal Components
Publication
- M. Kulesza
- A. Czyżewski
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2009
A novel method for tonality estimation and frequency tracking of tonal components modulated in frequency and amplitude is presented. The algorithm detects the local maxima of magnitude spectra corresponding to three contiguous frames of a signal and matches them into the tonal track candidates. The magnitude-based and phase-based methods are used to estimate the frequency jumps between spectrum maxima belonging to the tonal track...

Full text to download in external service
Measurements and Visualization of Sound Intensity Around the Human Head in Free Field Using Acoustic Vector Sensor
Publication
- J. Kotus
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2015
This paper presents measurements and visualization of sound intensity around the human head simulator in a free field. A Cartesian robot, applied for precise positioning of the acoustic vector sensor, was used to measure sound intensity. Measurements were performed in a free field using a head and torso simulator and the setup consisting of four different loudspeaker configurations. The acoustic vector sensor was positioned around...

Full text available to download
New Aspects of Virtual Sound Source Localization Research—Impact of Visual Angle and 3-D Video Content on Sound Perception
Publication
- B. Kunka
- B. Kostek
- JOURNAL OF THE AUDIO ENGINEERING SOCIETY - Year 2013
The influence of image on virtual sound source localization, called the “image proximity effect” or the “ventriloquism effect”, is a well known phenomenon. This paper focuses on other aspects related to this effect, namely the impact of the visual angle of the presented object and 3D video content on sound perception. The research conducted confirmed that the visual angle of the presented object determines the image proximity effect...

Full text available to download
MODALITY corpus - SPEAKER 35 - COMMANDS C1
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - COMMANDS C5
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 10 - SEQUENCE S1
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 39 - COMMANDS C1
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - COMMANDS C3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - SEQUENCE S2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 33 - SEQUENCE S1
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - COMMANDS C2
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 21 - COMMANDS C3
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S4
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
MODALITY corpus - SPEAKER 01 - SEQUENCE S6
Open Research Data
- series: MODALITY corpus
The MODALITY corpus is one of the multimodal databases of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...