Filters
total: 731
filtered: 674
Search results for: ACOUSTIC%20DIFFUSER
-
Waveguide modeling as a tool for fitting a hearing aid
Publication -
Transient detection algorithms for speech coding applications
Publication -
Recent developments in automatic classification of musical instruments
PublicationW referacie dokonano przeglądu aktualnego stanu badań w dziedzinie automatycznego rozpoznawania muzyki. Przedstawiono też eksperymenty prowadzone aktualnie w Katedrze Dźwięku i Obrazu PG. Prace te dotyczyły rozpoznawania klas instrumentów muzycznych i separacji duetów muzycznych. Pokazano przykładowe wyniki i przedstawiono projekt prac do zrealizowania w przyszłych eksperymentach.
-
Comparing some convolution-based methods for creation of surround sound
PublicationW referacie przedstawiono eksperymenty związane z symulacją dźwięku dookólnego w sali koncertowej. W tym celu wykorzystano splot odpowiedzi impulsowej z danego wnętrza (wielokanałowe nagrania odpowiedzi impulsowej) z nagraniami z komory bezechowej. Uzyskany w ten sposób sygnał został następnie przypisany do odpowiednich kanałów w systemie dookólnym. Uzyskane w ten sposób nagrania były następnie porównywane w testach subiektywnych...
-
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
PublicationText-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...
-
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
PublicationThe main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...
-
Testing a Variety of Features for Music Mood Recognition. Testowanie zestawu parametrów w celu rozpoznawania nastroju w muzyce
PublicationMusic collections are organized in a very different way depending on a target, number of songs or a distribution method, etc. One of the high-level feature, which can be useful and intuitive for listeners, is “mood”. Even if it seems to be the easiest way to describe music for people who are non-experts, it is very difficult to find the exact correlation between physical features and perceived impressions. The paper presents experiments...
-
Online sound restoration system for digital library applications.
PublicationAudio signal processing algorithms were introduced to the new online non-commercial service for audio restoration intended to enhance the content of digitized audio repositories. Missing or distorted audio samples are predicted using neural networks and a specific implementation of the Jannsen interpolation method based on the autoregressive model (AR) combined with the iterative restoring of missing signal samples. Since the distortion...
-
The importance of the bottom layer in double-layer porous asphalt for noise reduction
PublicationDouble-layer porous asphalt concrete (DPAC) surfaces are generally considered to be the acoustically most effective low noise road surfaces ready for implementation. While DPAC used on highways in warm climates may have an average life of around 8 years, in Scandinavia with severe winter climate DPAC usually survive only about 3 years; partly due to wear of studded tyres. An ongoing project in Sweden, applying DPAC and single-layer...
-
Classifying type of vehicles on the basis of data extracted from audio signal characteristics
PublicationThe aim of this study is to find and optimize a feature vector for an automatic recognition of the type of vehicles, extracted form an audio signal. First, the influence of weather-based conditions of road surface on spectral characteristic of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...
-
Analysis of allophones based on audio signal recordings and parameterization
PublicationThe aim of this study is to develop an allophonic description of English plosive consonants based on recordings of 600 specially selected words. Allophonic variations addressed in the study may have two sources: positional and contextual. The former one depends on the syllabic or prosodic position in which a particular phoneme occurs. Contextual allophony is conditioned by the local phonetic environment. Co-articulation overlapping...
-
Assessment of hearing in coma patients employing auditory brainstem response, electroencephalography, and eye-gaze-tracking
PublicationThe results of the study conducted by Tagliaferri et al. in 12 European countries indicate that the ratio of registered brain injury cases in Europe amounts to 150-300 per 100 000 people, with the European mean value of 235 cases per 100 000 people. The project presented in the paper assumes development of a combined metric of patients’ state remaining in coma by intelligent fusion of GCS (subjective Glasgow Coma Scale or its derivatives)...
-
Noise profiling for speech enhancement employing machine learning models
PublicationThis paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...
-
Evaluation of aspiration problems in L2 English pronunciation employing machine learning
PublicationThe approach proposed in this study includes methods specifically dedicated to the detection of allophonic variation in English. This study aims to find an efficient method for automatic evaluation of aspiration in the case of Polish second-language (L2) English speakers’ pronunciation when whole words are analyzed instead of particular allophones extracted from words. Sample words including aspirated and unaspirated allophones...
-
Analysis-by-synthesis paradigm evolved into a new concept
PublicationThis work aims at showing how the well-known analysis-by-synthesis paradigm has recently been evolved into a new concept. However, in contrast to the original idea stating that the created sound should not fail to pass the foolproof synthesis test, the recent development is a consequence of the need to create new data. Deep learning models are greedy algorithms requiring a vast amount of data that, in addition, should be correctly...
-
Broadening the scope of measurement and analysis of vibrations of an organ pipe employing intensity probe, simulations, and highspeed camera
PublicationThis paper shows an integrated approach to measure, analyze, and model phenomena occurring in an organ pipe driven by pressurized air. The aim of this paper is two-fold, i.e., to measure the pressure signal and the intensity field around the mouth by means of an intensity probe and to visualize and observe the motion of the air jet, which represents the excitation mechanism of the system. This is realized through two techniques,...
-
Application of autoencoder to traffic noise analysis
PublicationThe aim of an autoencoder neural network is to transform the input data into a lower-dimensional code and then to reconstruct the output from this code representation. Applications of autoencoders to classifying sound events in the road traffic have not been found in the literature. The presented research aims to determine whether such an unsupervised learning method may be used for deploying classification algorithms applied to...
-
Discovering Rule-Based Learning Systems for the Purpose of Music Analysis
PublicationMusic analysis and processing aims at understanding information retrieved from music (Music Information Retrieval). For the purpose of music data mining, machine learning (ML) methods or statistical approach are employed. Their primary task is recognition of musical instrument sounds, music genre or emotion contained in music, identification of audio, assessment of audio content, etc. In terms of computational approach, music databases...
-
Music information retrieval—The impact of technology, crowdsourcing, big data, and the cloud in art.
PublicationThe exponential growth of computer processing power, cloud data storage, and crowdsourcing model of gathering data bring new possibilities to music information retrieval (mir) field. Mir is no longer music content retrieval only; the area also comprises the discovery of expressing feelings and emotions contained in music, incorporating other than hearing modalities for helping this issue, users’ profiling, merging music with social...
-
Improving the quality of speech in the conditions of noise and interference
PublicationThe aim of the work is to present a method of intelligent modification of the speech signal with speech features expressed in noise, based on the Lombard effect. The recordings utilized sets of words and sentences as well as disturbing signals, i.e., pink noise and the so-called babble speech. Noise signal, calibrated to various levels at the speaker's ears, was played over two loudspeakers located 2 m away from the speaker. In...
-
Visual perception of vowels from static and dynamic cues
PublicationThe purpose of the study was to analyse human identification of Polish vowels from static and dynamic durationally slowed visual cues. A total of 152 participants identified 6 Polish vowels produced by 4 speakers from static (still images) and dynamic (videos) cues. The results show that 59% of static vowels and 63% of dynamic vowels were successfully identified. There was a strong confusion between vowels within front, central,...
-
Experimental investigations on the mechanical properties and damage detection of carbon nanotubes modified crumb rubber concrete
PublicationThis study presents a modified crumb rubber (MCR) concrete design mix reinforced with multi-walled carbon nanotubes (MWCNTs), mechanical characterization, and cracking monitoring using the acoustic emission (AE) technique. The results showed that the bridging effect of MWCNTs and MCR in the concrete mix mitigated the shortcomings of MWCNT-MCR concrete and improved the flexural and compressive strengths by 18.3% and 26.5%, respectively,...
-
Creating a Realible Music Discovery and Recomendation System
PublicationThe aim of this paper is to show problems related to creating a reliable music dis-covery system. The SYNAT database that contains audio files is used for the purpose of experiments. The files are divided into 22 classes corresponding to music genres with different cardinality. Of utmost importance for a reliable music recommendation system are the assignment of audio files to their appropriate gen-res and optimum parameterization...
-
Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublicationThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Distortion of speech signals in the listening area: its mechanism and measurements
PublicationThe paper deals with a problem of the influence of the number and distribution of loudspeakers in speech reinforcement systems on the quality of publicly addressed voice messages, namely on speech intelligibility in the listening area. Linear superposition of time-shifted broadband waves of a same form and slightly different magnitudes that reach a listener from numerous coherent sources, is accompanied by interference effects...
-
Bitumen-Based Poroelastic Pavements: Successful Improvements and Remaining Issues
PublicationThis article presents the development process of designing and testing poroelastic pavement based on highly polymer-modified bitumen. Poroelastic wearing course was composed of mineral and rubber aggregate mixed with highly polymer-modified bitumen, in contrast to previous trials, during which polyurethane resins were mainly used as binder, which led to several serious technological problems concerning difficult production, insufficient...
-
ECHOES REDUCTION DURING DIGITAL DATA TRANSMISSION IN HYDROACOUSTIC CHANNEL – LABORATORY EXPERIMENT
PublicationThe possibility of using a hydroacoustic channel for digital data transmission is very limited. This is due to the effect of multipath propagation of the emitted acoustic wave and the damping of the mechanical wave in this medium, which increase with frequency. The first of these phenomena results in inter-symbol interference disturbances in data transmission systems, including even hundreds of symbols. Due to the number of reflections...
-
System oceny efektywności użytkowania aparatów słuchowych
PublicationCelem rozprawy jest opracowanie metody oceny efektywności protezowania słuchu przy użyciu aparatów słuchowych, która pozwoli w łatwy sposób poddawać ocenie korzyść z użytkowania protez słuchowych w najbardziej typowych sytuacjach akustycznych. Przedstawiono genezę podjętych badań i na tej podstawie zaproponowano cele i tezy rozprawy doktorskiej. W pracy w pierwszej kolejności zawarto przegląd dotyczący rodzajów ubytku słuchu i...
-
Examining Feature Vector for Phoneme Recognition / Analiza parametrów w kontekście automatycznej klasyfikacji fonemów
PublicationThe aim of this paper is to analyze usability of descriptors coming from music information retrieval to the phoneme analysis. The case study presented consists in several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...
-
A spice equivalent circuit for modeling the performance of dual frequency echo-sounder
PublicationThe paper presents novel network equivalent circuit of piezoceramic circular disc transducers that takes into account thickness and radial mode of vibrations. The starting point of the analysis is 4-port description of circular disc element representing the solution of wave equation set in radial and thickness directions. The approximate solution for harmonic case is represented in the form of 4x4 matrix, which is synthesised and...
-
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
PublicationSymbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...
-
Multisensor System for the Protection of Critical Infrastructure of Seaport
PublicationThere are many separated infrastructural objects within a harbor area that may be considered “critical”, such as gas and oil terminals or anchored naval vessels. Those objects require special protection, including security systems capable of monitoring both surface and underwater areas, because an intrusion into the protected area may be attempted using small surface vehicles (boats, kayaks, rafts, floating devices with weapons...
-
An audio-visual corpus for multimodal automatic speech recognition
Publicationreview of available audio-visual speech corpora and a description of a new multimodal corpus of English speech recordings is provided. The new corpus containing 31 hours of recordings was created specifically to assist audio-visual speech recognition systems (AVSR) development. The database related to the corpus includes high-resolution, high-framerate stereoscopic video streams from RGB cameras, depth imaging stream utilizing Time-of-Flight...
-
Data Compression in Ultrasonic Network Communication via Sparse Signal Processing
PublicationThis document presents the approach of using compressed sensing in signal encoding and information transferring within a guided wave sensor network, comprised of specially designed frequency steerable acoustic transducers (FSATs). Wave propagation in a damaged plate was simulated using commercial FEM-based software COMSOL. Guided waves were excited by means of FSATs, characterized by the special shape of its electrodes, and modeled...
-
Evaluation of Medical Staff Satisfaction for Workplace Architecture in Temporary COVID-19 Hospital: A Case Study in Gdańsk, Poland
PublicationThis article analyses the architecture that was used in the temporary AmberExpo hospital in Gdańsk, Poland which was installed during the COVID-19 pandemic. The construction of this type of facility is often based on experimental approaches, aimed at caring for patients suffering from an infectious disease in emergency conditions. In order to assess the level of employee satisfaction with the architectural and technical elements...
-
Effective sonophotocatalytic degradation of tetracycline in water: Optimization, kinetic modeling, and degradation pathways
PublicationHybrid advanced oxidation processes (AOPs) are gaining interest in degradation of variety of recalcitrant compounds for water and wastewater treatment, due to possible synergistic effects. The present study systematically evaluated the degradation of tetracycline (TC) with a sonophotocatalytic process combining acoustic cavitation (sonocavitation) and photocatalysis based on N-doped TiO2 catalyst. The TC degradation rate constant...
-
Influence of temperature and anion type on thermophysical properties of aqueous solutions of morpholine based amino acid ionic liquids
PublicationDensities and sound velocities of aqueous solutions of N-butyl-N-methylmorpholine based amino acid ionic liquids (AAILs), including N-acetyl-L-alanine, N-acetyl-Lvalinate, N-acetyl-L-leucinate, and N-acetyl-L-izoleucinate anions were measured at a temperature from 293.15 to 313.15 K at 5 K intervals and atmospheric pressure. These data were used to calculate the apparent molar volumes and the apparent molar compressibilities in...
-
Mobile inventory system for hydrotechnical objects using data from multiple sensors operating simultaneously
PublicationThe knowledge of the location, shape and other characteristics of spatial objects in the coastal areas has a significant impact on the functioning of ports, shipyards, and other water-infrastructure facilities, both offshore and inland. Therefore, measurements are taken of the underwater part of the waterside zone, which means the bottom of water and other underwater objects (e.g. breakwaters, docks, etc.), and objects above the...
-
Comparative study on the effectiveness of various types of road traffic intensity detectors
PublicationVehicle detection and speed measurements are crucial tasks in traffic monitoring systems. In this work, we focus on several types of electronic sensors, operating on different physical principles in order to compare their effectiveness in real traffic conditions. Commercial solutions are based on road tubes, microwave sensors, LiDARs, and video cameras. Distributed traffic monitoring systems require a high number of monitoring...
-
Soft-mode enhanced type-I superconductivity in LiPd2Ge
PublicationThe synthesis, crystal structure, and physical properties (magnetization, resistivity, heat capacity) in combination with theoretical calculations of the electronic structure and phonon properties are reported for intermetallic compounds LiPd2X (X = Si, Ge, and Sn). LeBail refinement of powder x-ray diffraction data confirms that all compounds belong to the Heusler family (space group Fm-3m, No. 225). The lattice parameter increases...
-
ENERGY ANALYSIS OF THE PROPULSION SHAFT FATIGUE PROCESS IN A ROTATING MECHANICAL SYSTEM PART II IDENTIFICATION STUDIES – DEVELOPING THE FATIGUE DURABILITY MODEL OF A DRIVE SHAFT
PublicationThe article presents a continuation of research carried out concerning identification of energy consequences of mechanical fatigue within a propeller shaft in a rotating mechanical system, while working under conditions of the loss of the required alignment of shaft lines. Experimental research was carried out on a physical model reflecting a full-sized real object: i.e., the propulsion system of the ship. It is proven, by means...
-
Interactions of N-alkyl-N-methylmorpholinium based ionic liquids with acetonitrile studied by density and velocity of sound measurements and molecular dynamics simulations
PublicationMorpholinium-based ionic liquids (ILs) and their mixtures with polar co-solvents are an interesting class of emerging electrolytes in electrochemistry that is relatively poorly studied. In this work, densities and sound velocities of four ILs, N-ethyl-N-methylmorpholinium tetrafluoroborate, N-butyl-N-methylmorpholinium tetrafluoroborate, N-octyl-N-methylmorpho-linium tetrafluoroborate and N-decyl-N-methylmorpholinium tetrafluoroborate...
-
Examining Feature Vector for Phoneme Recognition
PublicationThe aim of this paper is to analyze usability of descriptors coming from music information retrieval to the phoneme analysis. The case study presented consists in several steps. First, a short overview of parameters utilized in speech analysis is given. Then, a set of time and frequency domain-based parameters is selected and discussed in the context of stop consonant acoustical characteristics. A toolbox created for this purpose...
-
Waveguide model of the hearing aid earmold system
PublicationBackground The earmold system of the Behind-The-Ear hearing aid is an acoustic system that modifies the spectrum of the propagated sound waves. Improper selection of the earmold system may result in deterioration of sound quality and speech intelligibility. Computer modeling methods may be useful in the process of hearing aid fitting, allowing physician to examine various earmold system configurations and choose the optimum one...
-
Waveguide model of the hearing aid earmold system
PublicationBackground The earmold system of the Behind-The-Ear hearing aid is an acoustic system that modifies the spectrum of the propagated sound waves. Improper selection of the earmold system may result in deterioration of sound quality and speech intelligibility. Computer modeling methods may be useful in the process of hearing aid fitting, allowing physician to examine various earmold system configurations and choose the optimum one...
-
The effect of fishing basin construction on the behaviour of a footbrdge over the port channel
PublicationThe paper analyses possible causes of failure of the rotating footbridge over the Ustka port channel. In July, 2015, strange behaviour of this object was observed in the form of excessive vibrations of bridge platform suspension rods, with the accompanying acoustic effects. A preliminary geotechnical analysis has revealed that this destructive effect was caused by the nearby construction works, namely construction of a fishing...
-
Cavitation-Based Processes for Water and Wastewater Treatment
PublicationCavitation based on advanced oxidation processes (Cav-AOPs) is interesting alternatives for already implemented wastewater treatment technologies. Destructive and strongly undesirable phenomena in the industry, i.e., cavitation, revealed to be useful in a positive manner as a source of energy for chemical reactions. During the implosion of cavitation bubbles, focused energy and resulting high temperature and pressure allows to...
-
Comprehensive Investigation of Stoichiometry–Structure–Performance Relationships in Flexible Polyurethane Foams
PublicationPolyurethane (PU) foams are versatile materials with a broad application range. Their performance is driven by the stoichiometry of polymerization reaction, which has been investigated in several works. However, the analysis was often limited only to selected properties and compared samples differing in apparent density, significantly influencing their performance. In the bigger picture, there is still a lack of comprehensive studies...
-
Management of ground tire rubber waste by incorporation into polyurethane-based composite foams
PublicationRapid economic growth implicated the developing multiple industry sectors, including the automotive branch, increasing waste generation since recycling and utilization methods have not been established simultaneously. A very severe threat is the generation of enormous amounts of post-consumer tires considered burdensome waste, e.g., due to the substantial emissions of volatile organic compounds (VOCs). Therefore, it is essential...
-
A pilot study to assess manufacturing processes using selected point measures of vibroacoustic signals generated on a multitasking machine
PublicationThe article presents the method for the evaluation of selected manufacturing processes using the analysis of vibration and sound signals. This method is based on the use of sensors installed outside the machining zone, allowing to be used quickly and reliably in real production conditions. The article contains a developed measurement methodology based on the specific location of microphones and vibration transducers mounted on...