displaying 1000 best results Help
Search results for: AUDIO COMPRESSION
-
Joanna Kabrońska dr inż. arch.
PeoplePhD with honours: Forma architektoniczna jako droga realizacji idei biblioteki przyszłości (Form of Architectural Solutions as a Means of Implementing the idea of Libraries of the Future), 1994 IV International Biennale of Architecture in Cracow Prize winner, 1991 DAAD post-doctoral scholarship, Berlin, 2002 Author of publications on architecture, art and memory, including the monograph Architektura jako forma pamięci. Rola architektury...
-
Material for Automatic Phonetic Transcription of Speech Recorded in Various Conditions
PublicationAutomatic speech recognition (ASR) is under constant development, especially in cases when speech is casually produced or it is acquired in various environment conditions, or in the presence of background noise. Phonetic transcription is an important step in the process of full speech recognition and is discussed in the presented work as the main focus in this process. ASR is widely implemented in mobile devices technology, but...
-
Analiza stanu nawierzchni i klas pojazdów na podstawie parametrów ekstrahowanych z sygnału fonicznego
PublicationCelem badań jest poszukiwanie parametrów wektora cech ekstrahowanego z sygnału fonicznego w kontekście automatycznego rozpoznawania stanu nawierzchni jezdni oraz typu pojazdów. W pierwszej kolejności przedstawiono wpływ warunków pogodowych na charakterystykę widmową sygnału fonicznego rejestrowanego przy przejeżdżających pojazdach. Następnie, dokonano parametryzacji sygnału fonicznego oraz przeprowadzano analizę korelacyjną w celu...
-
Testing A Novel Gesture-Based Mixing Interface
PublicationWith a digital audio workstation, in contrast to the traditional mouse-keyboard computer interface, hand gestures can be used to mix audio with eyes closed. Mixing with a visual representation of audio parameters during experiments led to broadening the panorama and a more intensive use of shelving equalizers. Listening tests proved that the use of hand gestures produces mixes that are aesthetically as good as those obtained using...
-
Compressed Projection Bases for Model-Order Reduction of Multiport Microwave Components Using FEM
PublicationThis paper presents a projection basis compression technique for generating compact reduced-order models (ROM) in the FE analysis of microwave devices. In this approach redundancy is removed from the projection basis by means of the proper orthogonal decomposition technique applied to the projected system of linear equations. Compression allows for keeping the size of a reduced-order model as small as possible without compromising...
-
Auxetic foams
PublicationThis paper presents a method of producing auxetic polyurethane foams (PUFs) and their unique properties. The experience was based on a synthesis of traditional flexible polyurethane foam and thermal conversion to auxetic foam. The foam specimens were transformed from conventional Poisson's ratio to auxetic (negative Poisson's ratio) [1]. Basic research was performed to determine the mechanical properties and cell structure. Auxetic...
-
Examining Acoustic Emission of Engineered Ultrasound Loudspeakers
PublicationMeasurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides the realistic reproduction of...
-
The effect of wax foundation addition to PCL filaments on mechanical properties.
Open Research DataThe dataset includes the effect of wax foundation addition on the basic mechanical properties of the filaments. PCL and wax foundation addition at 10 and 15% were used for extrusion. The mechanical properties of the resulting filaments were evaluated by a double compression test using an Instron model 5543 universal testing machine. Parameters such...
-
A concept of Signal Equalization Method Based on Music Genre and the Listener's Room Characteristics
PublicationA research study that investigates the influence of the room acoustics environment on the frequency characteristic of the audio signal playback is presented. First, a novel spectral equalization method of the room acoustic conditions is introduced. On the basis of the frequency response of the room, a system for room acoustics compensation based on eight-band equalizer is proposed. The system settings depend on music genre. In...
-
Measurements and Simulations of Engineered Ultrasound Loudspeakers
PublicationSimulation and measurement results of the sound emitted from an ultrasound custom-made system with high spatial directivity are presented. The proposed system is using modulated ultrasound waves which demodulate in nonlinear medium resulting in audible sound. The system is aimed at enhancing the users’ personal audio space, therefore the measurements are performed using the Head and Torso Simulator which provides realistic reproduction...
-
Quality Aspects in Digital Broadcasting and Webcasting Systems: Bitrate versus Loudness
PublicationIn this paper the quality aspects of bitrate and loudness in digital broadcasting and webcasting systems are examined. The authors discuss a survey concerning user preferences related with processing and managing audio content. The coding efficiency of a popular audio format is analyzed in the context of storing media. An objective study on a representative group of signal samples, as well as a subjective study of the perceived...
-
Intelligent multimedia solutions supporting special education needs.
PublicationThe role of computers in school education is briefly discussed. Multimodal interfaces development history is shortly reviewed. Examples of applications of multimodal interfaces for learners with special educational needs are presented, including interactive electronic whiteboard based on video image analysis, application for controlling computers with facial expression and speech stretching audio interface representing audio modality....
-
Bimodal deep learning model for subjectively enhanced emotion classification in films
PublicationThis research delves into the concept of color grading in film, focusing on how color influences the emotional response of the audience. The study commenced by recalling state-of-the-art works that process audio-video signals and associated emotions by machine learning. Then, assumptions of subjective tests for refining and validating an emotion model for assigning specific emotional labels to selected film excerpts were presented....
-
Online sound restoration system for digital library applications
PublicationAudio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...
-
Porównanie mocy strat energetycznych w pompie wyporowej o zmiennej wydajności określonych bez uwzględnienia bądź z uwzględnieniem mocy ściskania oleju hydraulicznego
PublicationPorównano moce strat energetycznych w pompie wyporowej o zmiennej wydajności, określone bez uwzględnienia bądź z uwzględnieniem mocy ściskania oleju hydraulicznego. Ocena mocy ściskania cieczy w pompie stała się możliwa dzięki zastosowaniu, zaproponowanej przez autora, metody określenia stopnia zapowietrzenia cieczy w pompie. W metodzie określenia stopnia zapowietrzenia cieczy w pompie oraz w ocenie mocy strat objętościowych ściskania...
-
Applications of permeability, oedometer and direct shear tests to the sand mixed with waste tire crumb
PublicationThe amount of the used waste rubbers in the world has been increasing every year, and their utilization, become a major environmental problem worldwide. The present experimental work has been performed to investigate the influence of rubber inclusion on the behavior of a sand. Geotechnical properties of the sand, and sand with tire crumb at various ratios mixtures (0%, 2.5%, 7.5%, and 15%) were investigated through a series of...
-
Wow defect reduction based on interpolation techniques
PublicationW referacie przedstawiono wyniki badania różnych technik interpolacji wykorzystanych w redukcji kołysania dźwięku. W badaniach użyto: interpolację liniową, dwie techniki interpolacji wielomianowej (Hermite i spline), i technikę sumowania okienkowanych funkcji sink. Jakość rekonstrukcji wykonano wykorzystując sztucznie spreparowany sygnał audio, rekonstruowany wymienionymi metodami interpolacji. Jakość rekonstrukcji oceniono wykorzystując...
-
Robustness in Compressed Neural Networks for Object Detection
PublicationModel compression techniques allow to significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. Simultaneously, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving...
-
Compliance tests of the polymer layers used as hydrodynamic bearing coatings
PublicationOperational experience and scientific investigations results showed that polymer lined hydrodynamic bearings can withstand more severe operating conditions compared than white metal bearings. PTFE and PEEK-based coatings are the most frequently used as Babbitt alternatives. Both polymers differ significantly from the each other in material properties. According to catalogue data compression modulus of PTFE, it is about an order...
-
Creating a Realible Music Discovery and Recomendation System
PublicationThe aim of this paper is to show problems related to creating a reliable music dis-covery system. The SYNAT database that contains audio files is used for the purpose of experiments. The files are divided into 22 classes corresponding to music genres with different cardinality. Of utmost importance for a reliable music recommendation system are the assignment of audio files to their appropriate gen-res and optimum parameterization...
-
Transmitting Alarm Information in DAB+ Broadcasting System
PublicationThe main goal of digital broadcasting is to deliver high-quality content with the lowest possible bitrate. This paper is focused on transmitting alarm information, such as emergency warning and alerting, in the DAB+ (Digital Audio Broadcasting plus) broadcasting system. These additional services should be available at the lowest possible bitrate, in order to provide a clear and understandable voice message to people. Furthermore, additional...
-
In uence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classi cation
PublicationWe present a comprehensive evaluation of the infuence of 'harmonic' and rhythmic sections contained in an audio file on automatic music genre classi cation. The study is performed using the ISMIS database composed of music files, which are represented by vectors of acoustic parameters describing low-level music features. Non-negative Matrix Factorization serves for blind separation of instrument components. Rhythmic components...
-
Network and Operating System Support for Digital Audio and Video (Network and OS Support for Digital A/V)
Conferences -
Online sound restoration system for digital library applications.
PublicationAudio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...
-
Self-compacting grout to produce two-stage concrete
PublicationTraditional concrete (TC) is primarily composed of a mixture of cement, fine and coarse aggregates, and water. TC is made by mixing together all the components before placing them. Using non-traditional concrete (two-stage concrete) to solve and to eliminate the problem of the aggregate segregation which appears in TC and in the self-compacting concrete. Two-stage concrete (TSC) consists of two main components, namely the grout...
-
Building Knowledge for the Purpose of Lip Speech Identification
PublicationConsecutive stages of building knowledge for automatic lip speech identification are shown in this study. The main objective is to prepare audio-visual material for phonetic analysis and transcription. First, approximately 260 sentences of natural English were prepared taking into account the frequencies of occurrence of all English phonemes. Five native speakers from different countries read the selected sentences in front of...
-
Data, Information, Knowledge, Wisdom Pyramid Concept Revisited in the Context of Deep Learning
PublicationIn this paper, the data, information, knowledge, and wisdom (DIKW) pyramid is revisited in the context of deep learning applied to machine learningbased audio signal processing. A discussion on the DIKW schema is carried out, resulting in a proposal that may supplement the original concept. Parallels between DIWK pertaining to audio processing are presented based on examples of the case studies performed by the author and her collaborators....
-
Fitting the mobile device characteristics to the user's hearing preferences
PublicationA method for fitting the mobile computer audio characteristics to the user's hearing preferences is proposed. The process consists of two stages: calibration and dynamics processing. During the calibration phase the user performs a loudness scaling test giving their response regarding the perceived loudness. The dynamics processing made on above basis sets the loudness to the most comfortable level. The processing accounts both...
-
Reduction of parasitic pitch variations in archival musical recordings
PublicationA new method for reducing parasitic pitch variations in archival audio recordings is presented. The method is intended for analyzing movie soundtracks recorded in optical films. It utilizes image processing for calculating and reducing effects of tape shrinkage being one of the main reasons for parasitic pitch variations in audio accompanying moving images. As long as the film tape characteristics are known the new method can be...
-
Audit of the existing surfaces /pavements of sidewalks and roads in the Gdańsk-Oliwa district, with particular emphasis on the location of the "Polanki" market and its direct neighbourhood in the contexts of pavement design of the "Polanki"market; stage from 2019 year.
Open Research DataThe document presents a valorization of paved surfaces (sidewalks and roads) in the Gdańsk-Oliwa district, prapared on the basis of an preliminary inventory work – 21 tables (one table for each street ) with a description of the street and materials used, mostly supplemented with photographic material. The valorization, after the initial inventory,...
-
Influence of the presence of rhamnolipids and ionic cross-linking conditions on the mechanical properties of alginate hydrogels.
Open Research DataThe dataset contains the results of determination the effect of rhamnolipids concentration, calcium chloride concentration and ionic cross-linking time on the mechanical properties of alginate hydrogels prepared by immersing the alginate mixture limited by the dialysis membrane in an appropriate cross-linking solution containing calcium ions. The mechanical...
-
Postprodukcja nagrania wideo z dzwiekiem dookolnym
PublicationOne of the aims of this paper is to present issues related to audio-video correlation. This is presented on the basis of a short film realization employing surround microphone techniques. First, some related works in the domain of sound and vision correlation are presented. Then assumptions concerning scene creation related to both audio and video are shortly described. Another objective is to discuss results of subjective tests...
-
Cyfrowy akcelerator wybranych modułów standardu kompresji wideo H.264
PublicationW artykule przedstawiono konfigurowalny cyfrowy akcelerator estymacji ruchu przeznaczony dla enkodera wideo standardu H.264. Akcelerator został zaimplementowany w technologii FPGA oraz w układzie ASIC w technologii UMC 90 nm. Obie implementacje zostały zweryfikowane, a szczegółowe wyniki pomiarów akceleratora ASIC zostały porównane z innymi dostępnymi w literaturze propozycjami. System został zoptymalizowany do współpracy z oprogramowaniem...
-
1D convolutional context-aware architectures for acoustic sensing and recognition of passing vehicle type
PublicationA network architecture that may be employed to sensing and recognition of a type of vehicle on the basis of audio recordings made in the proximity of a road is proposed in the paper. The analyzed road traffic consists of both passenger cars and heavier vehicles. Excerpts from recordings that do not contain vehicles passing sounds are also taken into account and marked as ones containing silence....
-
Numerical analysis of lumbar spine injury during road safety barrier collision
PublicationPurpose: Enhancing road safety is a critical goal worldwide, necessitating the development of clear standards for road safety systems. This study focuses on lumbar spine (L-spine) compression injuries during collisions with concrete road safety barriers (RSBs). It aims to analyze internal forces during impact to understand L-spine injury biomechanics in such accidents. Methods: The research included a literature review, analyzing...
-
Evaluation of a Novel Approach to Virtual Bass Synthesis Strategy
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) strategy applied to portable computers. The developed algorithms involve intelligent, rule-based settings of bass synthesis parameters with regard to music genre of an audio excerpt and the type of a portable device in use. The Smart VBS algorithm performs the synthesis based on a nonlinear device (NLD) with artificial controlling synthesis...
-
Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing
PublicationDeveloping signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings....
-
Classification of Music Genres by Means of Listening Tests and Decision Algorithms
PublicationThe paper compares the results of audio excerpt assignment to a music genre obtained in listening tests and classification by means of decision algorithms. A short review on music description employing music styles and genres is given. Then, assumptions of listening tests to be carried out along with an online survey for assigning audio samples to selected music genres are presented. A framework for music parametrization is created...
-
Mechanical properties of cement pastes containing pristine and silica-coated bismuth oxide (Bi2O3) and gadolinium oxide (Gd2O3) structures
Open Research DataExcel file containing raw mechanical (compressive strength) data of cement pastes containing variable amount of Bi2O3, Gd2O3, Bi2O3-SiO2 and Gd2O3-SiO2 structures. Sample designation in the Excel file is in line with sample designation in the manuscript associated with dataset.
-
Source code - AI models (MLM1-5 - series I-III - QNM opt)
Open Research DataSource code - AI models (MLM1-5 - series I-III - QNM opt) for the paper "Computational Complexity and Its Influence on Concrete Compressive Strength Prediction Capabilities of Machine Learning Models for Concrete Mix Design Support" accepted for publication.
-
Music genre classification applied to bass enhancement for mobile technology
PublicationThe aim of this paper is to present a novel approach to the Virtual Bass Synthesis (VBS) algorithms applied to portable computers. The proposed algorithm is related to intelligent, rule-based setting of synthesis parameters according to music genre of an audio excerpt. The classification of music genres is automatically executed employing MPEG 7 parameters and the Principal Component Analysis method applied to reduce information...
-
Machine learning applied to acoustic-based road traffic monitoring
PublicationThe motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...
-
Machine learning applied to acoustic-based road traffic monitoring
PublicationThe motivation behind this study lies in adapting acoustic noise monitoring systems for road traffic monitoring for driver’s safety. Such a system should recognize a vehicle type and weather-related pavement conditions based on the audio level measurement. The study presents the effectiveness of the selected machine learning algorithms in acoustic-based road traffic monitoring. Bases of the operation of the acoustic road traffic...
-
Zaawansowane Przetwarzanie Sygnału
e-Learning CoursesPrzedmiot prezentuje wybrane metody przetwarzania sygnałów w bardzo szerokim obszarze zastosowań. Ilustruje najnowsze osiągnięcia w tym zakresie, wsparte wybranymi publikacjami. Zajęcia są podzielone na wykład (15 h) i seminarium (15 h). Podstawowe pojęcia dotyczące cyfrowego przetwarzania sygnałów, zalecana literatura Analiza widmowa gęstość widmowa mocy, widmo falkowe, polispektra i gęstość widmowa mocy skrośnej Efekty...
-
Implementacja pomierzonych imperfekcji geometrycznych powłokicylindrycznej do modelu obliczeniowego
PublicationThis paper presents the method of implementation of the geometrical imperfections used by the author during the experimental and numerical research into the load-carrying capacity of cylindrical steel shells subjected to uniform circumferential compression.
-
Simple empirical formula to estimate the main geomechanical parameters of preplaced aggregate concrete and conventional concrete
PublicationPreplaced aggregate concrete (PAC) or two-stage concrete is a specific type of concrete successfully employed in many projects including underwater concrete structures, massive concrete structures, structures made of reinforced concrete, and improvement of concrete structures. PAC is significantly different than the conventional concrete. In this type of concrete, aggregates are initially poured into the mold, the voids between...
-
Study on some of the strength properties of soft clay stabilized with plastic waste strips
PublicationIt is well known that if plastic wastes are not well managed, it has a negative impact on the environment as well as on human health. In this study, recycling plastic waste in form of strips for stabilizing weak subgrade soil is proposed. For this purpose, a weak clay soil sample was mixed with 0.2%, 0.3%, and 0.4% of plastic strips by weight of soil, and the experimental results were compared to the control soil sample with 0%...
-
Krystian Zawadzki dr hab. inż.
PeopleKrystian Zawadzki is associate professor in the Department of Finance at the Faculty of Management and Economics of the Gdańsk University of Technology; member of the Polish Olympic Academy at the Polish Olympic Committee; coordinator of the Stock Exchange School (Warsaw Stock Exchange) in Gdańsk; Head of postgraduate studies "Capital Investments and Personal Finance Management"; member of the audit committee of the sports club...
-
Adaptive Personal Tuning of Sound in Mobile Computers
PublicationAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...
-
Music Data Processing and Mining in Large Databases for Active Media
PublicationThe aim of this paper was to investigate the problem of music data processing and mining in large databases. Tests were performed on a large data-base that included approximately 30000 audio files divided into 11 classes cor-responding to music genres with different cardinalities. Every audio file was de-scribed by a 173-element feature vector. To reduce the dimensionality of data the Principal Component Analysis (PCA) with variable...