Search results for: AUTOMATIC AUDIO RECONSTRUCTION
-
Production of six-degrees-of-freedom (6DoF) navigable audio using 30 Ambisonic microphones
Publication: This paper describes a method for planning, recording, and post-production of six-degrees-of-freedom audio recorded with multiple 3rd-order Ambisonic microphone arrays. The description is based on the example of recordings conducted in August 2020 with the Poznan Philharmonic Orchestra using 30 units of Zylia ZM-1S. A convenient way to prepare and organize such a big project is proposed; this involves details of stage planning,...
-
Multimodal English corpus for automatic speech recognition
Publication: A multimodal corpus developed for research on speech recognition based on audio-visual data is presented. Besides the usual video and sound excerpts, the prepared database also contains thermovision images and depth maps. All streams were recorded simultaneously; therefore, the corpus makes it possible to examine the importance of the information provided by different modalities. Based on the recordings, it is also possible to develop a speech...
-
Application of smart glasses for fast and automatic color correction in health care
Publication: In recent years, different applications of smart glasses in health care have been proposed. In this paper we present experiments related to automatic color correction using the smart glasses platform developed within the eGlasses project. A color pattern is proposed and tested, enabling automatic detection of the pattern and automatic correction of colors. Additionally, the method for encoding and decoding of patient ID in...
-
Analysis of the Usefulness of Cheap Audio Recorders for Spectral Measurement of Environmental Noise
Publication: Environmental noise pollution is nowadays one of the most serious health threats. The impact of noise on the human body depends not only on the sound level but also on its spectral distribution. Reliable measurements of the environmental noise spectrum are often hampered by the very high price of top quality measuring devices. This paper explores the possibility of using much cheaper audio recorders for the frequency analysis....
-
Exploring Neural Networks for Musical Instrument Identification in Polyphonic Audio
Publication: The purpose of this paper is to introduce neural network-based methods that surpass state-of-the-art (SOTA) models, either by training faster or having simpler architecture, while maintaining comparable effectiveness in musical instrument identification in polyphonic music. Several approaches are presented, including two authors’ proposals, i.e., spiking neural networks (SNN) and a modular deep learning model named FMCNN (Fully...
-
Signal Reconstruction from Sparse Measurements Using Compressive Sensing Technique
Publication: The paper presents the possibility of applying a new class of mathematical methods, known as Compressive Sensing (CS), for recovering a signal from a small set of measured samples. CS allows the faithful reconstruction of the original signal from fewer random measurements by making use of some non-linear reconstruction techniques. Because of all these features, CS finds its applications especially in areas where sensing is...
-
In Memoriam Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering
Publication: Biography and scientific achievements of Professors Marianna Sankiewicz-Budzyński and Gustaw K.E. Budzyński - Founders of the Polish Audio Engineering.
-
Automatic Discovery of IaaS Cloud Workload Types
Publication: The paper presents an approach to the automatic discovery of workload types. We characterize the functional properties of the workloads executed in our cloud environment, which have been used to create a model of the computations. To categorize resource utilization we used the K-means algorithm, which allowed us to automatically select six types of computations. We analyze the discovered types against typical computational benchmarks,...
-
3D Object Shape Reconstruction from Underwater Multibeam Data and Over Ground Lidar Scanning
Publication: Sonar and laser scanning technologies are an efficient and widely used source of spatial information about underwater and over-ground environments, respectively. The measurement data are usually available in the form of groups of separate points located irregularly in three-dimensional space, known as point clouds. This data model has known disadvantages; therefore, in many applications a different form of representation,...
-
Automatic Reduction-Order Selection for Finite-Element Macromodels
Publication: An automatic reduction-order selection algorithm for macromodels in finite-element analysis is presented. The algorithm is based on a goal-oriented a posteriori error estimator that operates on low-order reduced blocks of matrices, and hence, it can be evaluated extremely quickly.
-
Processing of LiDAR and Multibeam Sonar Point Cloud Data for 3D Surface and Object Shape Reconstruction
Publication: An unorganised point cloud dataset, as a transitional data model in several applications, usually contains a considerable amount of undesirable irregularities, such as strong variability of local point density, missing data, overlapping points and noise caused by scattering characteristics of the environment. For these reasons, further processing of such data, e.g. for construction of higher order geometric models of the topography...
-
3D seafloor reconstruction using data from side scan and synthetic aperture sonar
Publication: Side scan and synthetic aperture sonars are widely used imaging systems in the underwater environment. They are relatively cheap and easy to deploy in comparison with more powerful sensors, like multibeam echosounders. Although side scan and synthetic aperture sonars do not provide seafloor bathymetry directly, their records are finally related to seafloor images. Moreover, the analysis of such images performed by the human eye...
-
Post‐Second World War Reconstruction of Polish Cities: The Interplay Between Politics and Paradigms
Publication: By the end of the Second World War, many Polish cities, and especially their historic centres, were in ruins. This was caused by both bombings and sieges conducted by the Nazis and Soviets. A particular group of cities is associated with former German lands, now called the “Recovered Territories”, which were incorporated into the borders of Poland as compensation for its Eastern Borderlands lost to the Soviet Union. These...
-
Audio Content and Crowdsourcing: A Subjective Quality Evaluation of Radio Programs Streamed Online
Publication: Radio broadcasting has been present in our lives for over 100 years. The transmission of speech and music signals accompanies us from an early age. Broadcasts provide the latest information from home and abroad. They also shape musical tastes and allow many artists to share their creativity. Modern distribution involves transmission over a number of terrestrial systems. The most popular are analog FM (Frequency Modulation) and...
-
Expert system for automatic classification and quality assessment of singing voices
Publication.
-
A New Method for Automatic Generation of Animated Motion
Publication: A new method for generating animation with a quality comparable to natural motion is presented. The proposed algorithm is based on a fuzzy description of motion parameters and subjective features. It is assumed that such processing increases the naturalness and quality of motion, which is verified by subjective evaluation tests. First, reference motion data are gathered utilizing a motion capture system, then these data are reduced...
-
Comparison of perforator location in dynamic and static thermographic imaging with Doppler ultrasound in breast reconstruction surgery
Publication: This paper compares the effectiveness of the dTnorm and t90_10 parametrizations in dynamic thermography for imaging the location of perforators in TRAM flaps in the intraoperative period. The results were compared with the location detected in a Doppler ultrasound examination. Cold and heat stimulation was used in dynamic thermography. Additionally, these results were compared with static...
-
Automatic road traffic safety management system in urban areas
Publication: Traffic incidents and accidents contribute to decreasing levels of transport system reliability and safety. Traffic management and emergency systems on the road, using, among others, automatic detection, video surveillance, communication technologies and institutional solutions improve the organization of the work of various departments involved in traffic and safety management. Automation of incident management helps to reduce...
-
Audio Feature Analysis for Precise Vocalic Segments Classification in English
Publication: An approach to identifying the most meaningful Mel-Frequency Cepstral Coefficients representing selected allophones and vocalic segments for their classification is presented in the paper. For this purpose, experiments were carried out using algorithms such as Principal Component Analysis, Feature Importance, and Recursive Parameter Elimination. The data used were recordings made within the ALOFON corpus containing audio signal...
-
Layered background modeling for automatic detection of unattended objects in camera images
Publication: An algorithm for automatic detection of unattended objects in video camera images is presented. First, background subtraction is performed, using an approach based on the codebook method. Results of the detection are then processed by assigning the background pixels to time slots, based on the codeword age. Using this data, moving objects detected during a chosen period may be extracted from the background model. The proposed approach...
-
Automatic Analysis System of TV Commercial Emission Level
Publication: The purpose of the study was to determine whether the commercial emission level is higher than the emission level of a regular program and to check whether broadcasters of commercials follow the recommended loudness levels. The paper briefly reviews some chosen methods of volume measurement specified in the ITU and EBU recommendations. Then, it describes a prototype of a system implemented in Embarcadero C++ Builder 2010 which...
-
Computer vision techniques applied for reconstruction of seafloor 3D images from side scan and synthetic aperture sonars data
Publication: Side Scan Sonar and Synthetic Aperture Sonar are well-known echo signal processing technologies that produce 2D images of the seafloor. Both systems combine a number of acoustic pings to form a high-resolution image of the seafloor. It has been shown in numerous papers that 2D images acquired by such systems can be transformed into 3D models of the seafloor surface by an algorithmic approach using intensity information, contained in a grayscaled...
-
Automatic system for optical parameters measurements of biological tissues
Publication: In this paper, a system that performs automatic measurements of the optical parameters of scattering materials in an efficient and accurate manner is proposed and described. The system is designed especially for measurements of biological tissues, including phantoms, which closely imitate the optical characteristics of real tissue. The system has a modular construction and is based on the ISEL system, luminance and color meter and...
-
An Automatic Self-Tuning Control System Design for an Inverted Pendulum
Publication: A control problem of an inverted pendulum in the presence of parametric uncertainty has been investigated in this paper. In particular, synthesis and implementation of an automatic self-tuning regulator for a real inverted pendulum have been given. The main cores of the control system are a swing-up control method and a stabilisation regulator. The first one is based on the energy of an inverted pendulum, whereas the second one...
-
Resolving conflicts in object tracking for automatic detection of events in video
Publication: An algorithm for resolving conflicts in the tracking of moving objects is presented. The proposed approach utilizes predicted states calculated by Kalman filters to estimate tracker positions, then uses color and texture descriptors to match moving objects with trackers. Problematic situations, such as splitting objects, are addressed. Test results are presented and discussed. The algorithm may be used in the system...
-
Multimodal human-computer interfaces based on advanced video and audio analysis
Publication: The history of multimodal interface development is reviewed briefly in the introduction. Examples of applications of multimodal interfaces to education software and for disabled people are presented, including an interactive electronic whiteboard based on video image analysis, an application for controlling computers with mouth gestures and an audio interface for speech stretching for hearing impaired and stuttering people. The Smart...
-
New perspective on the in vivo use of cold stress dynamic thermography in integumental reconstruction with the use of skin-muscle flaps
Publication: Among the problems encountered by plastic surgeons is the reconstruction of defects following tumors. One of the reconstructive options is the TRAM flap. Although the anatomy is well explored, marginal flap necrosis may develop. To minimize complications, imaging examinations were designed to determine the degree of flap perfusion. One of them is the thermographic examination.
-
A framework for automatic detection of abandoned luggage in airport terminal
Publication: A framework for automatic detection of events in a video stream transmitted from a monitoring system is presented. The framework is based on the widely used background subtraction and object tracking algorithms. The authors developed an algorithm for detection of left and removed objects based on morphological processing and edge detection. The event detection algorithm collects and analyzes data of all the moving objects in...
-
Analysis of impact of lossy audio compression on the robustness of watermark embedded in the DWT domain for non-blind copyright protection
Publication: A methodology for non-blind watermarking of audio content is proposed. An outline of the audio copyright problem and the motivation for practical applications are discussed. The algorithmic theory pertaining to watermarking techniques is briefly introduced. The system architecture, together with the workflows employed for embedding and extracting the watermarks, is described. The implemented approach is described and the obtained results are reported....
-
Automatic evaluation of information credibility in Semantic Web and Knowledge Grid
Publication: This article presents a novel algorithm for automatic estimation of information credibility. It concerns information collected in Knowledge Grid and Semantic Web. Possibilities to evaluate the credibility of information in such structures are much greater than those available for WWW sites which use natural language. The rating system presented in this paper estimates credibility automatically on the basis of the following metrics:...
-
Verification of the Parameterization Methods in the Context of Automatic Recognition of Sounds Related to Danger
Publication: The paper describes an application that automatically detects sound events such as breaking glass, a gunshot, an explosion, and a scream. The described system consists of a parameterization block and a classifier. The paper compares parameters dedicated to this application with standard MPEG-7 descriptors. Two classifiers are also compared: one based on a perceptron (neural networks) and the other based on a support vector machine....
-
Automatic Singing Voice Recognition Employing Neural Networks and Rough Sets
Publication: The aim of the research is the automatic recognition of singing voices in terms of voice type and the technical quality of singing. The paper describes the created voice database, which contains voice samples of professional and amateur singers. Next, parameters defined on the basis of biomechanical phenomena in the vocal organ during singing are described. Based on the resulting parameter matrices, automatic classifiers were trained and compared...
-
AUDIO SIGNAL EQUALIZATION BASED ON IMPULSE RESPONSE OF A LISTENING ROOM AND MUSIC CONTENT REPRODUCED
Publication: This research study presents investigations of the influence of room acoustics on the frequency characteristic of audio signal playback. First, a concept of a novel method for spectral equalization of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on a designed equalizer is proposed. The system settings depend on the automatically recognized music genre....
-
Application of passive acoustic radar to automatic localization, tracking and classification of sound sources
Publication: The concept, practical realization and applications of the passive acoustic radar for automatic localization, tracking and classification of sound sources are presented in the paper. The device consists of a new kind of multichannel miniature sound intensity sensors and a group of digital signal processing algorithms. Contrary to active radars, it does not emit a scanning beam; instead, after receiving surrounding sounds, it provides...
-
Automatic detection of abandoned luggage employing a dual camera system
Publication: A system for automatic detection of events using a set of fixed and PTZ (pan-tilt-zoom) cameras is described. Images from the fixed camera are analyzed by means of object detection and tracking. The event detection system uses a set of rules to analyze data on the tracked moving objects and to detect defined events. A PTZ camera is used to obtain a detailed view of a selected object. A procedure for conversion between the pixel...
-
Energy Efficiency Study of Audio-video Content Consumption on Selected Android Mobile Terminals
Publication: Mobile devices are widely used by billions of users worldwide. Thanks to their main advantage, which is portability, they should be fully operational as long as possible, without the need to recharge or connect them to external power sources. This paper describes a study, carried out on four different mobile devices, with different hardware and software parameters, running the Android operating system. The research campaign involved...
-
Towards Audio Signal Equalization Based on Spectral Characteristics of a Listening Room and Music Content Reproduced
Publication: This study presents investigations of the influence of room acoustics on the frequency characteristic of audio signal playback. First, the concept of a novel method for spectral equalization of the room acoustic conditions is introduced. On the basis of the room spectral response, a system for room acoustics compensation based on a designed equalizer is proposed. The system settings depend on the automatically recognized music genre....
-
New semi-causal and noncausal techniques for detection of impulsive disturbances in multivariate signals with audio applications
Publication: This paper deals with the problem of localization of impulsive disturbances in nonstationary multivariate signals. Both unidirectional and bidirectional (noncausal) detection schemes are proposed. It is shown that the strengthened pulse detection rule, which combines analysis of one-step-ahead signal prediction errors with critical evaluation of leave-one-out signal interpolation errors, allows one to noticeably improve detection results...
-
Time reconstruction and performance of the CMS electromagnetic calorimeter
Publication
-
Reconstruction of input signal of sensor with frequency output
Publication
-
An EIT reconstruction algorithm based on noisy data.
Publication: The paper presents a reconstruction algorithm based on a modified Gauss-Newton algorithm. The algorithm takes into account the presence of measurement electrodes in electrical impedance tomography. The electrodes are characterized by their size and impedance. In addition, the algorithm assumes the presence of noise in the measured signal. It is shown that selecting an optimal excitation pattern significantly improves the robustness of the reconstruction algorithm to noise in the data. Two...
-
Methods for quality improvement of multibeam and LiDAR point cloud data in the context of 3D surface reconstruction
Publication: A point cloud dataset is a transitional data model used in several marine and land remote-sensing applications. During further steps of processing, the transformation of point cloud spatial data to more complex models containing higher order geometric structures like edges and facets may be possible, if an appropriate quality level of input data is provided. Point cloud datasets usually contain a considerable amount of undesirable...
-
Automatic analysis of the aggressive behavior of laboratory animals using thermal video processing
Publication: Bite detection is a very important but difficult element of social interaction analysis. Standard observation methods, such as a human observer or a visible-light camcorder, fail in this case. However, it is possible to discern cooler spots on the rodent's body that appear after body contact with another individual and vanish after a short time. These spots are assumed to be a saliva trace left on the fur after a bite. In...
-
Next generation automatic IP configuration deployment issues
Publication: Although the Dynamic Host Configuration Protocol for IPv6 (DHCPv6) was defined in 2003, it was designed as a framework rather than a complete solution for automatic configuration in IPv6 networks. There are still some unsolved problems and new options yet to be defined. One such example is the Fully Qualified Domain Name (FQDN) option, the final version of which was published in late 2007. It describes DHCPv6 client...
-
Automatic Breath Analysis System Using Convolutional Neural Networks
Publication: Diseases related to the human respiratory system have always been a burden for the entire society. The situation has become particularly difficult now after the outbreak of the COVID-19 pandemic. Even now, however, it is common for people to consult their doctor too late, after the disease has developed. To protect patients from severe disease, it is recommended that any symptoms disturbing the respiratory system be detected as...
-
Automatic Breath Analysis System Using Convolutional Neural Networks
Publication: Diseases related to the human respiratory system have always been a burden for the entire society. The situation has become particularly difficult now after the outbreak of the COVID-19 pandemic. Even now, however, it is not uncommon for people to consult their doctor too late, after the disease has developed. To protect patients from severe disease, it is recommended that any symptoms disturbing the respiratory system be detected...
-
Elimination of impulsive disturbances from archive audio files – comparison of three noise pulse detection schemes
Publication: The problem of elimination of impulsive disturbances (such as clicks, pops, ticks, crackles, and record scratches) from archive audio recordings is considered and solved using autoregressive modeling. Three classical noise pulse detection schemes are examined and compared: the approach based on open-loop multi-step-ahead signal prediction, the approach based on decision-feedback signal prediction, and the double threshold approach,...
-
Intelligent algorithms for optical track audio restoration
Publication: The paper presents two algorithms dedicated to reducing parasitic sound distortions found in optical soundtracks. The first algorithm enables the reduction of wideband noise in audio recordings. It uses a psychoacoustic hearing model based on the Unpredictability Measure of the signal. The quality of the noise reduction was assessed using intelligent methods....
-
A Device for Measuring Auditory Brainstem Responses to Audio
Publication: Standard ABR devices use clicks and tone bursts to assess subjects’ hearing in an objective way. A new device was developed that extends the functionality of a standard ABR audiometer by collecting and analyzing auditory brainstem responses (ABR). The developed accessory allows for the use of complex sounds (e.g., speech or music excerpts) as stimuli. Therefore, it is possible to find out how efficiently different types of sounds...
-
Multimodal Audio-Visual Recognition of Traffic Events
Publication: A demonstrator of a system for detecting dangerous road traffic events, based on the simultaneous analysis of video and acoustic data, is presented. The system is part of an automatic safety surveillance system. It uses cameras and microphones as data sources. The algorithms used, namely sound event recognition and image analysis algorithms, are presented. The results of the algorithms are shown on the example of...