Search results for: SPEECH PARAMETRIZATION
-
Music Mood Visualization Using Self-Organizing Maps
Publication: Due to the increasing amount of music being made available in digital form on the Internet, an automatic organization of music is sought. The paper presents an approach to graphical representation of the mood of songs based on Self-Organizing Maps. Parameters describing the mood of music are proposed and calculated, and then analyzed employing correlation with mood dimensions based on Multidimensional Scaling. A map is created in which...
-
INFLUENCE OF DATA NORMALIZATION ON THE EFFECTIVENESS OF NEURAL NETWORKS APPLIED TO CLASSIFICATION OF PAVEMENT CONDITIONS – CASE STUDY
Publication: In recent years automatic classification employing machine learning seems to be in high demand for tele-informatic-based solutions. An example of such solutions are intelligent transportation systems (ITS), in which various factors are taken into account. The subject of the study presented is the impact of data pre-processing and normalization on the accuracy and training effectiveness of artificial neural networks in the case...
-
Performance Analysis of the OpenCL Environment on Mobile Platforms
Publication: Today’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...
-
Michał Michna dr hab. inż.
People: He graduated from the Faculty of Electrical Engineering of Gdańsk University of Technology (1998) and received his doctoral degree in 2004. Since 2004 he has been employed at the Department of Power Electronics and Electrical Machines of Gdańsk University of Technology (assistant, assistant professor, senior lecturer). From 2010 to 2015 he was deputy head of the department. His research and teaching interests cover a broad range of topics related to the design, modelling and diagnostics of machines...
-
Instantaneous complex frequency for pipeline pitch estimation
Publication: In the paper a pipeline algorithm for estimating the pitch of a speech signal is proposed. The algorithm uses instantaneous complex frequencies (ICFs) estimated for four waveforms obtained by filtering the original speech signal through four bandpass complex Hilbert filters. The imaginary parts of the ICFs from each channel give four candidates for pitch estimates. The decision regarding the final estimate is made based on the real parts of...
-
XVIII Międzynarodowe Sympozjum Inżynierii i Reżyserii Dźwięku (18th International Symposium on Sound Engineering and Tonmeistering)
Publication: The subjective assessment of speech signals takes into account previous experiences and habits of an individual. Since the perception process deteriorates with age, differences should be noticeable among people from dissimilar age groups. In this work, we investigated the difference of speech quality assessment between high school students and university students. The study involved 60 participants, with 30 people in both the adolescents...
-
Towards Cancer Patients Classification Using Liquid Biopsy
Publication: Liquid biopsy is a useful, minimally invasive diagnostic and monitoring tool for cancer disease. Yet developing accurate methods, given the potentially large number of input features and the usually small dataset sizes, remains very challenging. Recently, a novel feature parameterization based on the RNA-sequenced platelet data which uses the biological knowledge from the Kyoto Encyclopedia of Genes and Genomes, combined with a classifier...
-
Low-Level Music Feature Vectors Embedded as Watermarks
Publication: In this paper a method consisting in embedding low-level music feature vectors as watermarks into a musical signal is proposed. First, a review of some recent watermarking techniques and the main goals of development of digital watermarking research are provided. Then, a short overview of parameterization employed in the area of Music Information Retrieval is given. A methodology of non-blind watermarking applied to music-content...
-
Resistant to correlated noise and outliers discrete identification of continuous non-linear non-stationary dynamic objects
Publication: In this article, specific methods of parameter estimation were used to identify the coefficients of continuous models represented by linear and nonlinear differential equations. The necessary discrete-time approximation of the base model is achieved by appropriately tuned FIR linear integral filters. The resulting discrete descriptions, which retain the original continuous parameterization, can then be identified using the classical...
-
Resistant to correlated noise and outliers discrete identification of continuous non-linear non-stationary dynamic objects
Publication: In this study, dedicated methods of parameter estimation were used to identify the coefficients of continuous models represented by linear and nonlinear differential equations. The necessary discrete-time approximation of the base model is achieved by appropriately tuned FIR linear integral filters. The resulting discrete descriptions, which retain the original continuous parameterization, can then be identified using the classical...
-
Vibration surveillance for efficient milling of flexible details fixed in adjustable stiffness holder
Publication: The paper presents the results of research on the possibility of using an intelligent workpiece holder with adjustable stiffness during the end milling process. Machining of a flexible workpiece supported on one side will be performed with constant spindle speed and feed speed. In order to avoid hazardous vibration, the stiffness of a specially designed spring (mounted in the workpiece holder) will be modified off-line. In order to...
-
Analysis of the Surface Stereometry of Alloyed Austenitic Steel after Fibre Laser Cutting using Confocal Microscopy
Publication: The paper extends the concept of cut edge quality and examines the fibre laser cutting process. A Prima Power Platino Fiber Evo device with a reference speed (RS) of 3500 mm/min was used for laser cutting. In order to analyse the influence of the laser cutting speed on the cut edge quality of X5CrNi18-10 stainless steel sheets, macroscopic studies were conducted on a stereoscopic microscope and surface stereometry on a confocal...
-
Graphical presentation of the power of energy losses and power developed in the elements hydrostatic drive and control system. Part II. Rotational hydraulic motor speed parallel throtling control and volumetric control systems
Publication: A graphical interpretation is presented of the power of energy losses occurring in the elements of hydrostatic drive and control systems, as well as of the power developed by those elements. The analysis covers an individual system with parallel throttling control of rotational hydraulic motor speed, an individual system with volumetric control of rotational hydraulic motor speed by a variable-capacity pump,...
-
Creating new voices using normalizing flows
Publication: Creating realistic and natural-sounding synthetic speech remains a big challenge for voice identities unseen during training. As there is growing interest in synthesizing voices of new speakers, here we investigate the ability of normalizing flows in text-to-speech (TTS) and voice conversion (VC) modes to extrapolate from speakers observed during training to create unseen speaker identities. Firstly, we create an approach for TTS...
-
AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ (Automatic Classification of Pathological Speech)
Publication: The application presented in this chapter automatically detects pathological speech on the basis of a database of recordings. First, the assumptions underlying the study are presented, together with the choice of the pathological speech database. The algorithms used and the speech signal features that make it possible to distinguish undisturbed speech from pathological speech are also presented. The trained neural networks were then...
-
PHONEME DISTORTION IN PUBLIC ADDRESS SYSTEMS
Publication: The quality of voice messages in speech reinforcement and public address systems is often poor. The sound engineering projects of such systems take care of sound intensity and possible reverberation phenomena in public space without, however, considering the influence of acoustic interference related to the number and distribution of loudspeakers. This paper presents the results of measurements and numerical simulations of the...
-
Human voice modification using instantaneous complex frequency
Publication: The paper presents the possibilities of changing human voice by modifying the instantaneous complex frequency (ICF) of the speech signal. The proposed method provides a flexible way of altering voice without the necessity of finding fundamental frequency and formants' positions or detecting voiced and unvoiced fragments of speech. The algorithm is simple and fast. Apart from ICF it uses signal factorization into two factors: one fully...
-
Jarosław Guziński prof. dr hab. inż.
People: ACADEMIC DEGREES. 2021: title of professor of engineering and technical sciences. 2012: degree of doktor habilitowany (habilitation) in technical sciences, Faculty of Electrical and Control Engineering, Gdańsk University of Technology (PG). Habilitation thesis: „Układy napędowe z silnikami indukcyjnymi i filtrami wyjściowymi falowników. Zagadnienia wybrane” (Drive systems with induction motors and inverter output filters. Selected issues). Habilitation colloquium and conferral of the degree on 29 May 2012. The monograph received the scientific award of Division IV (Technical Sciences) of the Polish...
-
POPRAWA OBIEKTYWNYCH WSKAŹNIKÓW JAKOŚCI MOWY W WARUNKACH HAŁASU (Improving Objective Speech Quality Indicators in Noisy Conditions)
Publication: The aim of the work is to modify the speech signal so as to improve objective speech quality indicators after the useful signal is mixed with noise or with an interfering signal. The modifications are based on features of Lombard speech, in particular on the raising of the fundamental frequency F0. The recording session covered sets of words and sentences in Polish, recorded in quiet conditions as well as in...
-
Classifying type of vehicles on the basis of data extracted from audio signal characteristics
Publication: The aim of this study is to find and optimize a feature vector, extracted from an audio signal, for automatic recognition of the type of vehicle. First, the influence of weather-based conditions of road surface on the spectral characteristics of the audio signal recorded from a passing vehicle in close proximity to the road is discussed. Next, parameterization of the recorded audio signal is performed. For that purpose, the MIRtoolbox,...
-
Investigating Feature Spaces for Isolated Word Recognition
Publication: The study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...
-
Consideration of dynamic loads in the determination of axle load spectra for pavement design
Publication: Axle load spectra constitute a crucial part of the data for pavement design and pavement distress analysis. Typically, axle load spectra represent static load from vehicles and do not include dynamic loads generated by vehicles in motion. While dynamic loads can significantly contribute to faster pavement distress, this fact is mostly omitted in pavement design methods. The paper presents a methodology for consideration of dynamic...
-
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
Publication: Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of analyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch structure. These models are formulated as linear or log-linear interpolations of up to five sub-models, each of which is...
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
Publication: In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor process priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bigram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of n-grams with a topic model,...
-
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
Publication: This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponentially weighted least squares algorithm. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples is realized using an adaptive, variable-order...
-
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
Publication: In this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing: noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...
-
New approach for determining the QoS of MP3-coded voice signals in IP networks
Publication: Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...
-
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
Publication: The purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...
-
Auditory-visual attention stimulator
Publication: A new approach to the formation of lateralization irregularities is proposed. The emphasis is put on the relationship between visual and auditory attention stimulation. In this approach hearing is stimulated using time-scale-modified speech and sight is stimulated by rendering the text of the currently heard speech. Moreover, the displayed text is modified using several techniques, e.g. zooming and highlighting. In the experimental part of...
-
INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH
Publication: The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters...
-
HYDROGRAPHIC SURVEY PLANNING FOR THE DETERMINATION OF TERRITORIAL SEA BASELINE ON THE EXAMPLE OF SELECTED POLISH SEA AREAS
Publication
-
THE USE OF GNSS GEODETIC NETWORKS ON THE APPROACH TO THE PORTS – GULF OF GDANSK STUDY
Publication
-
Natalia Stawicka-Morawska dr inż.
People: Natalia Stawicka-Morawska, MSc Eng., has worked at Gdańsk University of Technology since October 2017 as an Assistant at the Faculty of Mechanical Engineering and Ship Technology (formerly the Faculty of Mechanical Engineering), in the Institute of Mechanics and Machine Design (formerly the Department of Mechanics and Mechatronics). Her research falls within the technical sciences, in the discipline of machine construction and operation. The main...
-
Lighting education for architects, the barriers and challenges: a survey of architecture students
Publication: Creating a well-lit environment requires the understanding of daylight and electric lighting design principles within the built environment. Recent years have brought a large number of new lighting assessment and design methods. The discovery of new photoreceptor cells in the eye - photosensitive retinal ganglion cells - forced lighting researchers to focus on parametrisation for the image forming (IF) and non-image forming (NIF)...
-
Audio-visual aspect of the Lombard effect and comparison with recordings depicting emotional states.
Publication: In this paper an analysis of audio-visual recordings of the Lombard effect is shown. First, the audio signal is analyzed, indicating the presence of this phenomenon in the recorded sessions. The principal aim, however, was to discuss problems related to extracting differences caused by the Lombard effect that are present in the video, i.e. visible as tension and work of facial muscles aligned to an increase in the intensity of the articulated...
-
Metoda i algorytmy modyfikacji sygnału do celu wspomagania rozumienia mowy przez osoby z pogorszoną rozdzielczością czasową słuchu (Method and algorithms of signal modification for supporting speech understanding by people with impaired temporal resolution of hearing)
Publication: The dissertation investigates real-time methods of Time Scale Modification (TSM) of speech and evaluates their effect on the understanding of utterances by people with impaired temporal resolution of hearing. Impaired hearing resolution is one of the symptoms associated with Central Auditory Processing Disorder (CAPD). In contrast...
-
Concept of an Innovative System for Dimensioning and Predicting Changes in the Coastal Zone Topography Using UAVs and USVs (4DBatMap System)
Publication: This publication is aimed at developing a concept of an innovative system for dimensioning and predicting changes in the coastal zone topography using Unmanned Aerial Vehicles (UAVs) and Unmanned Surface Vehicles (USVs). The 4DBatMap system will consist of four components: 1. Measurement data acquisition module. Bathymetric and photogrammetric measurements will be carried out with a specific frequency in the coastal zone using...
-
WYKORZYSTANIE SIECI NEURONOWYCH DO SYNTEZY MOWY WYRAŻAJĄCEJ EMOCJE (Using Neural Networks for the Synthesis of Speech Expressing Emotions)
Publication: This article analyses speech-based emotion recognition solutions and the possibility of using them in emotional speech synthesis with neural networks. Current solutions for recognizing emotions in speech and methods of speech synthesis using neural networks are presented. A significant increase in interest in, and use of, deep learning is currently observed in applications related...
-
Piotr Chrostowski dr hab. inż.
People: Piotr Chrostowski specializes in railway transport infrastructure. His main research areas concern the mechanical properties of track structure components and the identification and assessment of the geometric layouts of railway tracks using GNSS techniques. In 2004 he obtained a Master of Science in Engineering degree in Civil Engineering, specializing in Railway Engineering, at the Wydział Inżynierii...
-
Optimization of Bread Production Using Neuro-Fuzzy Modelling
Publication: Automation of food production is an actively researched domain. One of the areas where automation is still not progressing significantly is bread making. The process still relies on expert knowledge regarding how to react to procedure changes depending on environmental conditions, quality of the ingredients, etc. In this paper, we propose an ANFIS-based model for changing the mixer speed during the kneading process. Although the...
-
Speed, alcohol and safety belts as important factors influencing the number of road accident fatalities in voivodeships = Prędkość, alkohol i pasy bezpieczeństwa jako istotne czynniki wpływające na liczbę ofiar śmiertelnych wypadków drogowych na obszarze województw
Publication: This paper presents preliminary results of a broader research programme on road traffic safety in voivodeships.
-
Utilization of fuzzy rules in computer character animation
Publication: The chapter presents a method for automatic enhancement of computer character animation utilizing fuzzy inference. First the user designs a prototype version of the animation, with keyframes only for important poses, roughly describing the action. Then the animation is enriched with new motion phases calculated by the fuzzy inference system using descriptors given by the user. Various degrees of motion fluency and naturalness are possible...
-
Rough Sets Applied to Mood of Music Recognition
Publication: With the growth of accessible digital music libraries over the past decade, there is a need for research into automated systems for searching, organizing and recommending music. Mood of music is considered one of the most intuitive criteria for listeners, thus this work is focused on the emotional content of music and its automatic recognition. The research study presented in this work contains an attempt at music emotion recognition...
-
Modelling and Simulation of a New Variable Stiffness Holder for Milling of Flexible Details
Publication: Modern industry expectations in terms of milling operations often demand the milling of flexible details using slender ball-end tools. This is a difficult task because of possible vibration occurrence. Under certain conditions (small depths of cutting, regeneration phenomena), the cutting process may become unstable and self-excited chatter vibration may appear. The frequency of the chatter vibration is close to the dominant...
-
Variable Ratio Sample Rate Conversion Based on Fractional Delay Filter
Publication: In this paper a sample rate conversion algorithm which allows for a continuously changing resampling ratio is presented. The proposed implementation is based on a variable fractional delay filter which is implemented by means of a Farrow structure. Coefficients of this structure are computed on the basis of fractional delay filters which are designed using the offset window method. The proposed approach allows us to freely...
-
KORPUS MOWY ANGIELSKIEJ DO CELÓW MULTIMODALNEGO AUTOMATYCZNEGO ROZPOZNAWANIA MOWY (English Speech Corpus for Multimodal Automatic Speech Recognition)
Publication: The paper presents an audiovisual speech corpus containing 31 hours of English speech recordings. The corpus is intended for automatic audiovisual speech recognition. It contains video recordings from a high-frame-rate stereo camera and audio recorded by a microphone array and a laptop microphone. Because it includes recordings made in noisy conditions, the corpus...
-
Szymon Andrzejewski dr
People: He graduated from the University of Gdańsk in Political Science, specializing in political systems and local government, in 2008. He completed postgraduate studies at Gdańsk University of Technology entitled "Management and evaluation of projects financed from European Union funds" in 2010, and at the AGH University of Science and Technology (Akademia Górniczo-Hutnicza) entitled "Protection of the environment against noise and vibration" in 2012. A student of the Doctoral Studies in Sociology at the University of Gdańsk since...
-
Investigation of Weigh-in-Motion Measurement Accuracy on the Basis of Steering Axle Load Spectra
Publication: Weigh-in-motion systems are installed in pavements or on bridges to identify and reduce the number of overloaded vehicles and minimise their adverse effect on road infrastructure. Moreover, the collected traffic data are used to obtain axle load characteristics, which are very useful in road infrastructure design. Practical application of data from weigh-in-motion has become more common recently, which calls for adequate attention to...
-
Effect of Processing Parameters on Strength and Corrosion Resistance of Friction Stir-Welded AA6082
Publication: The friction stir welding method is increasingly attracting interest in the railway sector due to its environmental friendliness, low cost, and ease of producing high-quality joints. Using aluminum alloys reduces the weight of structures, increasing their payload and reducing fuel consumption and running costs. The following paper presents studies on the microstructure, strength, and corrosion resistance of AA6082 aluminum alloy...
-
Prof. Haitham Abu-Rub - A Visit to Poland's Gdansk University of Technology
Publication: Report on the visit of Prof. Haitham Abu-Rub to Gdansk University of Technology. Speech on the Smart Grid Centre. Visit to the new smart grid laboratory of the GUT, the Laboratory for Innovative Power Technologies and Integration of Renewable Energy Sources (LINTE^2).