total: 2634
filtered: 1895
displaying 1000 best results
Search results for: IMAGINED SPEECH
-
Vision-based system for monitoring the condition of current collector contact strips = Wizyjny system monitoringu stanu nakładek stykowych odbieraków prądu
Publication
Until recently, the contact strips of railway pantographs used in Poland were made of copper. At the beginning of 2011, a comprehensive change was carried out in the type of contact strips used by operators of the PKP PLK S.A. infrastructure, from copper to metallized carbon. With the introduction of carbon strips, there is a need to develop new diagnostic methods for this important element of the power supply circuit...
-
Instantaneous complex frequency for pipeline pitch estimation
Publication
In the paper a pipeline algorithm for estimating the pitch of a speech signal is proposed. The algorithm uses instantaneous complex frequencies estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...
-
XVIII Międzynarodowe Sympozjum Inżynierii i Reżyserii Dźwięku
Publication
The subjective assessment of speech signals takes into account previous experiences and habits of an individual. Since the perception process deteriorates with age, differences should be noticeable among people from dissimilar age groups. In this work, we investigated the difference of speech quality assessment between high school students and university students. The study involved 60 participants, with 30 people in both the adolescents...
-
Orientation-aware ship detection via a rotation feature decoupling supported deep learning approach
Publication
Ship imaging position plays an important role in visual navigation, and thus significant attention has been paid to accurately extracting ship imaging positions in maritime videos. Previous studies are mainly conducted in the horizontal ship detection manner from maritime image sequences. This can lead to unsatisfactory ship detection performance because some background pixels may be wrongly identified as ship contours. To address...
-
Vibration surveillance for efficient milling of flexible details fixed in adjustable stiffness holder
Publication
The paper presents the results of research related to the possibility of using an intelligent workpiece holder with adjustable stiffness during the end milling process. Machining of a flexible workpiece supported on one side will be performed with constant spindle speed and feed speed. In order to avoid hazardous vibration, the stiffness of the specially designed spring (mounted in a workpiece holder) will be modified off-line. In order to...
-
Analysis of the Surface Stereometry of Alloyed Austenitic Steel after Fibre Laser Cutting using Confocal Microscopy
Publication
The paper extends the concept of cut edge quality and examines the fibre laser cutting process. A Prima Power Platino Fiber Evo device with a reference speed (RS) of 3500 mm/min was used for laser cutting. In order to analyse the influence of the laser cutting speed on the cut edge quality of X5CrNi18-10 stainless steel sheets, macroscopic studies were conducted on a stereoscopic microscope and surface stereometry on a confocal...
-
Graphical presentation of the power of energy losses and power developed in the elements hydrostatic drive and control system. Part II. Rotational hydraulic motor speed parallel throtling control and volumetric control systems
Publication
A graphical interpretation is presented of the power of energy losses occurring in the elements of hydrostatic drive and control systems, as well as of the power developed by these elements. An analysis was carried out of an individual system with parallel throttling control of the rotational hydraulic motor speed, an individual system with volumetric control of the rotational hydraulic motor speed by a variable-capacity pump,...
-
Creating new voices using normalizing flows
Publication
Creating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...
-
PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS
Publication
The quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...
-
AUTOMATIC CLASSIFICATION OF PATHOLOGICAL SPEECH = AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ
Publication
The application presented in this chapter is used for automatic detection of pathological speech based on a database of recordings. First, the assumptions underlying the research are presented, together with the choice of a pathological speech database. The algorithms used and the speech signal features that make it possible to distinguish undisturbed speech from pathological speech are also presented. The trained neural networks were then...
-
Integration of thermographic data with the 3D object model
Publication
The aim of the paper is to present a new method for merging the 3D model data of the measured object with thermograms. Our technique is based on the combination of a visual 3D imaging technique and a thermal imaging technique, which maps the 2D thermograms onto a 3D anatomical mesh model. The combination of these imaging modalities allows the generation of combined 3D and thermal data from which thermal signatures can be verified and...
-
Comparison of Methods for Real and Imaginary Motion Classification from EEG Signals
Publication
A method for feature extraction and results of classification of EEG signals obtained from performed and imagined motion are presented. A set of 615 features was obtained to serve for the recognition of type and laterality of motion using 8 different classification approaches. A comparison of the accuracy achieved by the classifiers is presented in the paper, and then conclusions and discussion are provided. Among applied algorithms the...
-
Human voice modification using instantaneous complex frequency
Publication
The paper presents the possibilities of changing human voice by modifying instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
-
Thermal Image Processing for Respiratory Estimation from Cubical Data with Expandable Depth
Publication
As healthcare costs continue to rise, finding affordable and non-invasive ways to monitor vital signs is increasingly important. One of the key metrics for assessing overall health and identifying potential issues early on is respiratory rate (RR). Most of the existing methods require multiple steps that consist of image and signal processing. This might be difficult to deploy on edge devices that often do not have specialized...
-
IMPROVEMENT OF OBJECTIVE SPEECH QUALITY INDICATORS IN NOISY CONDITIONS = POPRAWA OBIEKTYWNYCH WSKAŹNIKÓW JAKOŚCI MOWY W WARUNKACH HAŁASU
Publication
The aim of the work is to modify the speech signal so as to improve objective speech quality indicators after mixing the useful signal with noise or with an interfering signal. The signal modifications are based on the features of Lombard speech, in particular on the effect of raising the fundamental frequency F0. The recording session covered sets of words and sentences in Polish, recorded in quiet conditions as well as in...
-
Investigating Feature Spaces for Isolated Word Recognition
Publication
The study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...
-
3D scanning system for railway current collector contact strips
Publication
Undisturbed collection of current from a contact wire of the catenary constitutes one of the basic elements in reliable operation of electrified rail transport, particularly when vehicles move at high speed. Quality of current collection is influenced by the construction of catenary and current collectors, as well as by the technical condition and regulation of these two elements. Total contact force of a current collector head...
-
Consideration of dynamic loads in the determination of axle load spectra for pavement design
Publication
Axle load spectra constitute a crucial part of the data for pavement design and pavement distress analysis. Typically, axle load spectra represent static load from vehicles and do not include dynamic loads generated by vehicles in motion. While dynamic loads can significantly contribute to faster pavement distress, this fact is mostly omitted in pavement design methods. The paper presents a methodology for consideration of dynamic...
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
Publication
In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor process priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bigram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of n-grams with a topic model,...
-
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
Publication
This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponentially weighted least squares algorithm. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples is realized using an adaptive, variable-order...
-
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
Publication
The purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...
-
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
Publication
Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of analyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch structure. These models are formulated as linear or log-linear interpolations of up to five sub-models, each of which is...
-
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
Publication
In this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing: noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...
-
New approach for determining the QoS of MP3-coded voice signals in IP networks
Publication
Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...
-
Auditory-visual attention stimulator
Publication
A new approach to the formation of lateralization irregularities was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, the displayed text is modified using several techniques, e.g. zooming and highlighting. In the experimental part of...
-
Measurement system for nonlinear surface spectroscopy by atomic force microscopy for corrosion processes monitoring
Publication
In addition to traditional surface imaging, atomic force microscopy (AFM) enables a wide variety of additional measurements. One of them is higher harmonic imaging. In tapping mode the nonlinear contact between tip and specimen results in higher frequency vibrations. More information available from the higher harmonics analysis proves to be helpful for more detailed imaging. Such visualization is especially useful for heterogeneous...
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
Publication
The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
A measurement system for nonlinear surface spectroscopy with an atomic force microscope during corrosion process monitoring
Publication
In addition to traditional surface imaging, atomic force microscopy (AFM) enables a wide variety of additional measurements. One of them is higher harmonic imaging. In tapping mode the nonlinear contact between tip and specimen results in higher frequency vibrations. More information available from the higher harmonics analysis proves to be helpful for more detailed imaging. Such visualization is especially useful for heterogeneous...
-
HYDROGRAPHIC SURVEY PLANNING FOR THE DETERMINATION OF TERRITORIAL SEA BASELINE ON THE EXAMPLE OF SELECTED POLISH SEA AREAS
Publication
-
THE USE OF GNSS GEODETIC NETWORKS ON THE APPROACH TO THE PORTS - GULF OF GDANSK STUDY
Publication
-
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publication
In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...
-
A method and algorithms for signal modification to support speech understanding by people with impaired temporal resolution of hearing = Metoda i algorytmy modyfikacji sygnału do celu wspomagania rozumienia mowy przez osoby z pogorszoną rozdzielczością czasową słuchu
Publication
The subject of the research carried out in this dissertation is real-time methods of Time Scale Modification (TSM) of speech and the assessment of their impact on the understanding of utterances by people with impaired temporal resolution of hearing. Impaired hearing resolution is one of the symptoms associated with Central Auditory Processing Disorder (CAPD). In contrast to...
-
Concept of an Innovative System for Dimensioning and Predicting Changes in the Coastal Zone Topography Using UAVs and USVs (4DBatMap System)
Publication
This publication is aimed at developing a concept of an innovative system for dimensioning and predicting changes in the coastal zone topography using Unmanned Aerial Vehicles (UAVs) and Unmanned Surface Vehicles (USVs). The 4DBatMap system will consist of four components: 1. Measurement data acquisition module. Bathymetric and photogrammetric measurements will be carried out with a specific frequency in the coastal zone using...
-
USE OF NEURAL NETWORKS FOR SYNTHESIS OF SPEECH EXPRESSING EMOTIONS = WYKORZYSTANIE SIECI NEURONOWYCH DO SYNTEZY MOWY WYRAŻAJĄCEJ EMOCJE
Publication
This article presents an analysis of speech-based emotion recognition solutions and the possibilities of using them in emotional speech synthesis with neural networks. Current solutions for emotion recognition in speech and methods of speech synthesis using neural networks are presented. A significant increase in interest in and use of deep learning is currently observed in applications related to...
-
Optimization of Bread Production Using Neuro-Fuzzy Modelling
Publication
Automation of food production is an actively researched domain. One of the areas where automation is still not progressing significantly is bread making. The process still relies on expert knowledge regarding how to react to procedure changes depending on environmental conditions, quality of the ingredients, etc. In this paper, we propose an ANFIS-based model for changing the mixer speed during the kneading process. Although the...
-
Speed, alcohol and safety belts as important factors influencing the number of road accident fatalities in voivodships = Prędkość, alkohol i pasy bezpieczeństwa jako istotne czynniki wpływające na liczbę ofiar śmiertelnych wypadków drogowych na obszarze województw
Publication
This paper presents the preliminary results of a broader research programme on road traffic safety in voivodships.
-
The shallow sea experiment with usage of linear hydrophone array
Publication
The purpose of this article is to present a linear hydrophone array that was designed and built, and the results obtained during in situ trials in the Gulf of Gdańsk. The measuring system allowed the hydrophones to be placed at selected points and measurements to be performed with the antenna positioned both horizontally and vertically. Recordings made in this way allow accurate 3D imaging of sound intensity/propagation to be created. During research three floating objects...
-
Modelling and Simulation of a New Variable Stiffness Holder for Milling of Flexible Details
Publication
Modern industry expectations in terms of milling operations often demand the milling of flexible details using slender ball-end tools. This is a difficult task because of possible vibration occurrence. Due to the existence of certain conditions (small depths of cutting, regeneration phenomena), the cutting process may become unstable and self-excited chatter vibration may appear. Frequency of the chatter vibration is close to dominant...
-
Molecularly targeted nanoparticles: an emerging tool for evaluation of expression of the receptor for advanced glycation end products in a murine model of peripheral artery disease
Publication
Background: Molecular imaging with molecularly targeted probes is a powerful tool for studying the spatio-temporal interactions between complex biological processes. The pivotal role of the receptor for advanced glycation end products (RAGE) in numerous pathological processes has aroused demand for RAGE targeted imaging in various diseases. In the study, we evaluated the use of a diagnostic imaging agent for RAGE quantification...
-
Active Dynamic Thermography in Medical Diagnostics
Publication
This is an overview of active thermal imaging methods in medical diagnostics using external thermal stimulation. In this chapter, several clinical cases diagnosed using the active dynamic thermography method, ADT, are presented. Features of this technology are discussed and main advantages underlined. Applications in skin burn diagnostics and quantitative evaluation leading to modern classification of burned patients for further...
-
Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter
Publication
In this paper a sample rate conversion algorithm which allows for continuously changing resampling ratio has been presented. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...
-
ENGLISH SPEECH CORPUS FOR MULTIMODAL AUTOMATIC SPEECH RECOGNITION = KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY
Publication
The paper presents an audiovisual speech corpus containing 31 hours of speech recordings in English. The corpus is intended for automatic audiovisual speech recognition. It contains video recordings from a high-speed stereo camera and audio recorded by a microphone array and a laptop microphone. Thanks to the inclusion of recordings made in noisy conditions, the corpus...
-
Effect of Processing Parameters on Strength and Corrosion Resistance of Friction Stir-Welded AA6082
Publication
The friction stir welding method is increasingly attracting interest in the railway sector due to its environmental friendliness, low cost, and ease of producing high-quality joints. Using aluminum alloys reduces the weight of structures, increasing their payload and reducing fuel consumption and running costs. The following paper presents studies on the microstructure, strength, and corrosion resistance of AA6082 aluminum alloy...
-
Investigation of Weigh-in-Motion Measurement Accuracy on the Basis of Steering Axle Load Spectra
Publication
Weigh-in-motion systems are installed in pavements or on bridges to identify and reduce the number of overloaded vehicles and minimise their adverse effect on road infrastructure. Moreover, the collected traffic data are used to obtain axle load characteristics, which are very useful in road infrastructure design. Practical application of data from weigh-in-motion has become more common recently, which calls for adequate attention to...
-
Prof. Haitham Abu-Rub - A Visit to Poland's Gdansk University of Technology
Publication
Report on the visit of Prof. Haitham Abu-Rub to Gdansk University of Technology. Speech on the Smart Grid Centre. Visit to the new smart grid laboratory of the GUT, the Laboratory for Innovative Power Technologies and Integration of Renewable Energy Sources (LINTE^2).
-
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
Publication
This paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...
-
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
Publication
A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare them to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...
-
Thermal degradation of butyl acrylate-methyl acrylate-acrylic acid-copolymers
Publication
-
A History of Maritime Radio-Navigation Positioning Systems used in Poland
Publication
-
Application of an Autonomous/Unmanned Survey Vessel (ASV/USV) in Bathymetric Measurements
Publication