Filtry
wszystkich: 1948
wybranych: 1519
-
Katalog
Filtry wybranego katalogu
wyświetlamy 1000 najlepszych wyników Pomoc
Wyniki wyszukiwania dla: SPEECH STRETCHING
-
PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS
PublikacjaThe quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...
-
AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ
PublikacjaAplikacja przedstawiona w niniejszym rozdziale służy do automatycznego wykrywania mowy patologicznej na podstawie bazy nagrań. W pierwszej kolejności przedstawiono założenia leżące u podstaw przeprowadzonych badan wraz z wyborem bazy mowy patologicznej. Zaprezentowano również zastosowane algorytmy oraz cechy sygnału mowy, które pozwalają odróżnić mowę niezaburzoną od mowy patologicznej. Wytrenowane sieci neuronowe zostały następnie...
-
Human voice modification using instantaneous complex frequency
PublikacjaThe paper presents the possibilities of changing human voice by modifying instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
-
Electron collisions with cyanoacetylene HC3N : Vibrational excitation and dissociative electron attachment
PublikacjaWe experimentally probe electron collisions with HC3N in the energy range from 0 to 10 eV with the focus on vibrational excitation and dissociative electron attachment. The vibrational excitation cross sections show a number of resonances which are mode specific: the two dominant π∗ resonances are visible in the excitation of all the vibrational modes; however, broad σ ∗ resonances are visible only in certain bond-stretching vibrational...
-
The bismuth vanadate thin layers modified by cobalt hexacyanocobaltate as visible-light active photoanodes for photoelectrochemical water oxidation
PublikacjaBismuth vanadate thin films deposited using the pulsed laser deposition technique were modified using cobalt hexacyanocobaltate (Cohcc). The 2-step method of Cohcc nanocubes preparation was applied: i) metallic cobalt deposition and ii) cobalt electrooxidation in Co(CN)63− containing electrolyte. The presence of CN stretching vibrations was confirmed by Raman spectroscopy. The energy band gap was equal to 2.5 eV and was estimated...
-
POPRAWA OBIEKTYWNYCH WSKAŹNIKÓW JAKOŚCI MOWY W WARUNKACH HAŁASU
PublikacjaCelem pracy jest modyfikacja sygnału mowy, aby uzyskać zwiększenie poprawy obiektywnych wskaźników jakości mowy po zmiksowaniu sygnału użytecznego z szumem bądź z sygnałem zakłócającym. Wykonane modyfikacje sygnału bazują na cechach mowy lombardzkiej, a w szczególności na efekcie podniesienia częstotliwości podstawowej F0. Sesja nagraniowa obejmowała zestawy słów i zdań w języku polskim, nagrane w warunkach ciszy, jak również w...
-
Investigating Feature Spaces for Isolated Word Recognition
PublikacjaThe study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...
-
New Applications of Multimodal Human-Computer Interfaces
PublikacjaMultimodal computer interfaces and examples of their applications to education software and for the disabled people are presented. The proposed interfaces include the interactive electronic whiteboard based on video image analysis, application for controlling computers with gestures and the audio interface for speech stretching for hearing impaired and stuttering people. Application of the eye-gaze tracking system to awareness...
-
Consideration of dynamic loads in the determination of axle load spectra for pavement design
PublikacjaAxle load spectra constitute a crucial part of the data for pavement design and pavement distress analysis. Typically, axle load spectra represent static load from vehicles and do not include dynamic loads generated by vehicles in motion. While dynamic loads can significantly contribute to faster pavement distress, this fact is mostly omitted in pavement design methods. The paper presents a methodology for consideration of dynamic...
-
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
PublikacjaThe purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
PublikacjaIn this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic model,...
-
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
PublikacjaIn this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...
-
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
PublikacjaSymbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...
-
New approach for determining the QoS of MP3-coded voice signals in IP networks
PublikacjaPresent-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...
-
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
PublikacjaThis paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...
-
Auditory-visual attention stimulator
PublikacjaNew approach to lateralization irregularities formation was proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time scale modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, displayed text is modified using several techniques i.e. zooming, highlighting etc. In the experimental part of...
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
PublikacjaThe Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
HYDROGRAPHIC SURVEY PLANNING FOR THE DETERMINATION OF TERRITORIAL SEA BASELINE ON THE EXAMPLE OF SELECTED POLISH SEA AREAS
Publikacja -
THE USE OF GNSS GEODETIC NETWORKS ON THE APPROACH TO THE PORTS � GULF OF GDANSK STUDY
Publikacja -
Influence of the femoral offset on the muscles passive resistance in total hip arthroplasty
PublikacjaBackground Soft tissue tension is treated as a crucial factor influencing the post-THA dislocation. The femoral offset is regarded as one of the major parameters responsible for the stabilization of the prosthesis. It is unclear which soft tissue is mostly affected by the offset changes. Methods A finite element model of the hip was created. The model comprised muscles, bones, a stem, the acetabular component and a liner. The muscles...
-
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
PublikacjaIn this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, audio signal is analyzed indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect, present in the video , i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...
-
Concept of an Innovative System for Dimensioning and Predicting Changes in the Coastal Zone Topography Using UAVs and USVs (4DBatMap System)
PublikacjaThis publication is aimed at developing a concept of an innovative system for dimensioning and predicting changes in the coastal zone topography using Unmanned Aerial Vehicles (UAVs) and Unmanned Surface Vehicles (USVs). The 4DBatMap system will consist of four components: 1. Measurement data acquisition module. Bathymetric and photogrammetric measurements will be carried out with a specific frequency in the coastal zone using...
-
WYKORZYSTANIE SIECI NEURONOWYCH DO SYNTEZY MOWY WYRAŻAJĄCEJ EMOCJE
PublikacjaW niniejszym artykule przedstawiono analizę rozwiązań do rozpoznawania emocji opartych na mowie i możliwości ich wykorzystania w syntezie mowy z emocjami, wykorzystując do tego celu sieci neuronowe. Przedstawiono aktualne rozwiązania dotyczące rozpoznawania emocji w mowie i metod syntezy mowy za pomocą sieci neuronowych. Obecnie obserwuje się znaczny wzrost zainteresowania i wykorzystania uczenia głębokiego w aplikacjach związanych...
-
Optimization of Bread Production Using Neuro-Fuzzy Modelling
PublikacjaAutomation of food production is an actively researched domain. One of the areas, where automation is still not progressing significantly is bread making. The process still relies on expert knowledge regarding how to react to procedure changes depending on environmental conditions, quality of the ingredients, etc. In this paper, we propose an ANFIS-based model for changing the mixer speed during the kneading process. Although the...
-
Speed, alcohol and safety belts as important factors influencing the number voivodship = Prędkość, alkohol i pasy bezpieczeństwa jako istotne czynniki wpływające na liczbę ofiar śmiertelnych wypadków drogowych na obszarze województw
PublikacjaNiniejszy referat prezentuje wyniki wstępne szerszego programu prac badawczych dotyczących bezpieczeństwa ruchu drogowego na obszarach województw.
-
A general isogeometric finite element formulation for rotation‐free shells with in‐plane bending of embedded fibers
PublikacjaThis article presents a general, nonlinear isogeometric finite element formulation for rotation-free shells with embedded fibers that captures anisotropy in stretching, shearing, twisting, and bending - both in-plane and out-of-plane. These capabilities allow for the simulation of large sheets of heterogeneous and fibrous materials either with or without matrix, such as textiles, composites, and pantographic structures. The work...
-
A general theory for anisotropic Kirchhoff–Love shells with in-plane bending of embedded fibers
PublikacjaThis work presents a generalized Kirchhoff–Love shell theory that can explicitly capture fiber-induced anisotropy not only in stretching and out-of-plane bending, but also in in-plane bending. This setup is particularly suitable for heterogeneous and fibrous materials such as textiles, biomaterials, composites and pantographic structures. The presented theory is a direct extension of classical Kirchhoff–Love shell theory to incorporate...
-
A literature survey of the influence of preform reheating and stretch blow moulding with hot mould process parameters on the properties of PET containers – part 2.
PublikacjaThe hot fill process is an inexpensive conventional filling technology for high-acidity products (pH < 4.5). It allows certain drinks (sensitive beverages such as fruit and vegetable juices, nectars, soft drinks, vitaminised water) to be stored at ambient temperature without the need for chemical preservatives. The primary feature of the bottles used in the hot fill process is their temperature stability, i.e. the ability to retain...
-
A literature survey of the influence of preform reheating and stretch blow molding with hot mold process parameters on the properties of PET containers. Part I.
PublikacjaThe hot fill process is an inexpensive conventional filling technology for high-acidity products (pH < 4.5). It allows certain drinks (sensitive beverages such as fruit and vegetable juices, nectars, soft drinks, vitaminized water) to be stored at ambient temperature without the need for chemical preservatives. The primary feature of the bottles used in the hot fill process is their temperature stability, i.e. the ability to retain...
-
Modelling and Simulation of a New Variable Stiffness Holder for Milling of Flexible Details
PublikacjaModern industry expectations in terms of milling operations often demand the milling of the flexible details by using slender ball-end tools. This is a difficult task because of possible vibration occurrence. Due to existence of certain conditions (small depths of cutting, regeneration phenomena), cutting process may become unstable and self-excited chatter vibration may appear. Frequency of the chatter vibration is close to dominant...
-
Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter
PublikacjaIn this paper a sample rate conversion algorithm which allows for continuously changing resampling ratio has been presented. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...
-
City scan as a tool to assess resilience challenges and vulnerabilities at the community level
PublikacjaThe majority of the world’s population lives in cities and cities are the key to achieving resilience. Local governments own only part of the land and can only partially decide about measures that should be taken ‘on the ground’. Local governments are therefore highly dependent on individuals, communities, and businesses to adapt and transform and take action in their own backyards or neighbourhoods. Since, for many people, climate...
-
KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY
PublikacjaW referacie zaprezentowano audiowizualny korpus mowy zawierający 31 godzin nagrań mowy w języku angielskim. Korpus dedykowany jest do celów automatycznego audiowizualnego rozpoznawania mowy. Korpus zawiera nagrania wideo pochodzące z szybkoklatkowej kamery stereowizyjnej oraz dźwięk zarejestrowany przez matrycę mikrofonową i mikrofon komputera przenośnego. Dzięki uwzględnieniu nagrań zarejestrowanych w warunkach szumowych korpus...
-
Effect of Processing Parameters on Strength and Corrosion Resistance of Friction Stir-Welded AA6082
PublikacjaThe friction stir welding method is increasingly attracting interest in the railway sector due to its environmental friendliness, low cost, and ease of producing high-quality joints. Using aluminum alloys reduces the weight of structures, increasing their payload and reducing fuel consumption and running costs. The following paper presents studies on the microstructure, strength, and corrosion resistance of AA6082 aluminum alloy...
-
Investigation of Weigh-in-Motion Measurement Accuracy on the Basis of Steering Axle Load Spectra
PublikacjaWeigh-in-motion systems are installed in pavements or on bridges to identify and reduce the number of overloaded vehicles and minimise their adverse eect on road infrastructure. Moreover, the collected trac data are used to obtain axle load characteristics, which are very useful in road infrastructure design. Practical application of data from weigh-in-motion has become more common recently, which calls for adequate attention to...
-
Prof. Haitham Abu-Rub - A Visit to Poland's Gdansk University of Technology
PublikacjaReport on visit of Prof. Haitham Abu-Rub in Gdansk University of Technology. Speech on the Smart Grid Centre. Visit in the new smart grid laboratory of the GUT, the Laboratory for Innovative Power Technologies and Integration of Renewable Energy Sources (LINTE^2).
-
A significance of multi slip condition for inclined MHD nano-fluid flow with non linear thermal radiations, Dufuor and Sorrot, and chemically reactive bio-convection effect
PublikacjaThe aim of this research is to discuss the significance of slip conditions for magnetized nanofluid flow with the impact of nonlinear thermal radiations, activation energy, inclined MHD, sorrot and dufour, and gyrotactic micro motile organisms over continuous stretching of a two-dimensional sheet. The governing equations emerge in the form of partial differential equations. Since the resultant governing differential equations...
-
New ceramic materials derived from pyrolyzed poly(1,2-dimethylsilazane) and starch as a potential anode for Li-ion batteries
PublikacjaNewmaterialswere obtained by pyrolysis of starch (S) and poly(1,2-dimethylsilazane) (PSN) (weight ratio: PSN/S 30/70) at temperature a) 500 °C, b) 700 °C and c) 900 °C. Ceramic materials were characterized by infrared spectroscopy, TGA, Raman spectroscopy and SEM. New Si\O and shifted Si\C stretching vibration modes emerged confirming direct interaction between silicon originating fromsilazane and oxygen coming fromstarch. The...
-
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
PublikacjaA common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...
-
A Comparison of STI Measured by Direct and Indirect Methods for Interiors Coupled with Sound Reinforcement Systems
PublikacjaThis paper presents a comparison of STI (Speech Transmission Index) coefficient measurement results carried out by direct and indirect methods. First, acoustic parameters important in the context of public address and sound reinforcement systems are recalled. A measurement methodology is presented that employs various test signals to determine impulse responses. The process of evaluating sound system performance, signals enabling...
-
Thermal degradation of butyl acrylate-methyl acrylate-acrylic acid-copolymers
Publikacja -
A History of Maritime Radio-Navigation Positioning Systems used in Poland
Publikacja -
Application of an Autonomous/Unmanned Survey Vessel (ASV/USV) in Bathymetric Measurements
Publikacja -
Assessment of the Steering Precision of a Hydrographic USV along Sounding Profiles Using a High-Precision GNSS RTK Receiver Supported Autopilot
Publikacja -
Testing the Accuracy of the Modified ICP Algorithm with Multimodal Weighting Factors
Publikacja -
Rediscovering Automatic Detection of Stuttering and Its Subclasses through Machine Learning—The Impact of Changing Deep Model Architecture and Amount of Data in the Training Set
PublikacjaThis work deals with automatically detecting stuttering and its subclasses. An effective classification of stuttering along with its subclasses could find wide application in determining the severity of stuttering by speech therapists, preliminary patient diagnosis, and enabling communication with the previously mentioned voice assistants. The first part of this work provides an overview of examples of classical and deep learning...
-
A comparative study of English viseme recognition methods and algorithm
PublikacjaAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector...
-
Comparative analysis of various transformation techniques for voiceless consonants modeling
PublikacjaIn this paper, a comparison of various transformation techniques, namely Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT) and Discrete Walsh Hadamard Transform (DWHT) are performed in the context of their application to voiceless consonant modeling. Speech features based on these transformation techniques are extracted. These features are mean and derivative values of cepstrum coefficients, derived from each transformation....
-
Modeling and Designing Acoustical Conditions of the Interior – Case Study
PublikacjaThe primary aim of this research study was to model acoustic conditions of the Courtyard of the Gdańsk University of Technology Main Building, and then to design a sound reinforcement system for this interior. First, results of measurements of the parameters of the acoustic field are presented. Then, the comparison between measured and predicted values using the ODEON program is shown. Collected data indicate a long reverberation...
-
A comparative study of English viseme recognition methods and algorithms
PublikacjaAn elementary visual unit – the viseme is concerned in the paper in the context of preparing the feature vector as a main visual input component of Audio-Visual Speech Recognition systems. The aim of the presented research is a review of various approaches to the problem, the implementation of algorithms proposed in the literature and a comparative research on their effectiveness. In the course of the study an optimal feature vector construction...