Filters
total: 2574
displaying 1000 best results Help
Search results for: SPECH PROCESSING
-
Playback detection using machine learning with spectrogram features approach
PublicationThis paper presents 2D image processing approach to playback detection in automatic speaker verification (ASV) systems using spectrograms as speech signal representation. Three feature extraction and classification methods: histograms of oriented gradients (HOG) with support vector machines (SVM), HAAR wavelets with AdaBoost classifier and deep convolutional neural networks (CNN) were compared on different data partitions in respect...
-
Signal Processing: An International Journal (SPIJ)
Journals -
Journal of Real-Time Image Processing
Journals -
Michał Lech dr inż.
PeopleMichał Lech was born in Gdynia in 1983. In 2007 he graduated from the faculty of Electronics, Telecommunications and Informatics of Gdansk University of Technology. In June 2013, he received his Ph.D. degree. The subject of the dissertation was: “A Method and Algorithms for Controlling the Sound Mixing Processes by Hand Gestures Recognized Using Computer Vision”. The main focus of the thesis was the bias of audio perception caused...
-
Image Processing in Robotics (2021/2022)
e-Learning CoursesFor ISD M.Sc. (II degr.) 2 sem. Participants are to learn image processing algorithms related to transformation, filtration, feature detection (image descriptors), image processing algorithms in robotic industrial systems.
-
Environmental Protection in Energetics, PG_00049751, W, PE-ET, sem.1, winter 2023/24
e-Learning CoursesEnvironmental aspects of energy production and processing.
-
Environmental Protection in Energetics (PG_00049751), W, PE-ET, sem.1, winter 2023/24
e-Learning CoursesEnvironmental aspects of energy production and processing.
-
Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców = Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems
PublicationPrzedmiotem pracy jest system identyfikacji mówców w sposób zależny od tekstu ("text dependent''). Dokonano analizy wielu różnych wypowiedzi kilkudziesięciu mówców. Zastosowana metoda parametryzacji to metoda oparta na wynikach analizy cepstralnej sygnału mowy. Zdefiniowane zostały nowe parametry skojarzone z elementarnymi zdarzeniami w procesie weryfikacji mówców. Na tej podstawie dokonano estymacji funkcji gęstości prawdopodobieństwa...
-
IEEE Automatic Speech Recognition and Understanding Workshop
Conferences -
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
PublicationIn this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic model,...
-
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
PublicationThis paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...
-
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
PublicationSymbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...
-
New approach for determining the QoS of MP3-coded voice signals in IP networks
PublicationPresent-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...
-
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
PublicationThe purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...
-
Przetwarzanie rozproszone
e-Learning CoursesFoundations and rules of distributed and parallel processing in networked computer systems.
-
Spatial data processing technologies
e-Learning Courses -
The effect of current signal filtering method on the value of cutting power while sawing wood
PublicationThe goal of this work was to investigate an effect of various signal pre-processings on the outline of the electrical power curve and its influence on the measured cutting force estimation. Two signal processing methods were selected for the needs of the experiment, including digital filter and wavelet transform. The filter used was Butterworth, 3rd order band-stop with the cut-out band from 45 Hz to 55 Hz. The second approach...
-
Geometry Modeling and Processing
Conferences -
Symposium on Geometry Processing
Conferences -
Marcin Sikorski prof. dr hab. inż.
PeopleMarcin Sikorski is a professor at the Department of Informatics in Management at the Faculty of Management and Economics of the Gdańsk University of Technology. Earlier he had numerous fellowships in academic institutions, among others in Germany (Universities in Bonn and in Heidelberg), Switzerland (ETH Zurich), the Netherlands (TU Eindhoven) and the USA (Harvard University). Professor Sikorski is a representative of Poland in...
-
AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ
PublicationAplikacja przedstawiona w niniejszym rozdziale służy do automatycznego wykrywania mowy patologicznej na podstawie bazy nagrań. W pierwszej kolejności przedstawiono założenia leżące u podstaw przeprowadzonych badan wraz z wyborem bazy mowy patologicznej. Zaprezentowano również zastosowane algorytmy oraz cechy sygnału mowy, które pozwalają odróżnić mowę niezaburzoną od mowy patologicznej. Wytrenowane sieci neuronowe zostały następnie...
-
Intelligent information services 23/24
e-Learning CoursesInformation retrieval Text categorization Natural language processing
-
A Novel Approach to the Assessment of Cough Incidence
PublicationIn this paper we consider the problem of identication of cough events in patients suffering from chronic respiratory diseases. The information about frequency of cough events is necessary to medical treatment. The proposed approach is based on bidirectional processing of a measured vibration signal - cough events are localized by combining the results of forward-time and backward-time analysis. The signal is at rst transformed...
-
ACM/SPEC International Conference on Performance Engineering
Conferences -
Exception handling model influence factors for discributed systems. W: Proceedings. PPAM 2003. Parallel Processing and Applied Mathematics. 5th In- ternational Conference. Częstochowa, 7-10 September 2003.Model obsługi wyjątków uwzględniający wpływ czynników systemu rozproszonego.
PublicationSpecyfikacja programu jest jasno określona w systemach sekwencyjnych, gdzie posiada standardowe i wyjątkowe przejścia. Praca przedstawia rozszerzony model specyfikacji systemu w środowiskach rozproszonych uwzględniający szereg specyficznych czynników. Model zawiera analizę specyfikacji pod kątem obsługi wyjątków dla rozproszonych danych oraz komunikacji międzyprocesorowej. Ogólny model został zaimplementowany w środowisku...
-
POPRAWA OBIEKTYWNYCH WSKAŹNIKÓW JAKOŚCI MOWY W WARUNKACH HAŁASU
PublicationCelem pracy jest modyfikacja sygnału mowy, aby uzyskać zwiększenie poprawy obiektywnych wskaźników jakości mowy po zmiksowaniu sygnału użytecznego z szumem bądź z sygnałem zakłócającym. Wykonane modyfikacje sygnału bazują na cechach mowy lombardzkiej, a w szczególności na efekcie podniesienia częstotliwości podstawowej F0. Sesja nagraniowa obejmowała zestawy słów i zdań w języku polskim, nagrane w warunkach ciszy, jak również w...
-
ISCA Tutorial and Research Workshop Automatic Speech Recognition
Conferences -
Jacek Rak dr hab. inż.
PeopleJacek Rak uzyskał stopień doktora habilitowanego nauk technicznych w dyscyplinie telekomunikacji (specjalność: teleinformatyka) w 2016 r., a stopień doktora nauk technicznych w dyscyplinie informatyka w 2009 r. Obecnie jest pracownikiem naukowo-dydaktycznym Katedry Teleinformatyki Wydziału Elektroniki, Telekomunikacji i Informatyki Politechniki Gdańskiej. Jego działalność naukowa koncentruje się w obszarze doboru tras, projektowania...
-
Przetwarzanie Rozproszone 2021/2022
e-Learning Courses{mlang pl}Przetwarzanie równoległe i rozproszone{mlang}{mlang en}Concurrent and Distributed Processing{mlang}
-
Przetwarzanie Rozproszone 2022/2023
e-Learning Courses{mlang pl}Przetwarzanie równoległe i rozproszone{mlang}{mlang en}Concurrent and Distributed Processing{mlang}
-
Przetwarzanie Rozproszone 2023/2024
e-Learning Courses{mlang pl}Przetwarzanie równoległe i rozproszone{mlang}{mlang en}Concurrent and Distributed Processing{mlang}
-
Piotr Szczuko dr hab. inż.
PeoplePiotr Szczuko received his M.Sc. degree in 2002. His thesis was dedicated to examination of correlation phenomena between perception of sound and vision for surround sound and digital image. He finished Ph.D. studies in 2007 and one year later completed a dissertation "Application of Fuzzy Rules in Computer Character Animation" that received award of Prime Minister of Poland. His interests include: processing of audio and video, computer...
-
DPCTM Data Processing Centre - Task Manager (DPCTM) product
ProjectsProject realized in Department of Geoinformatics
-
Digital Signal Processing - 22/23
e-Learning CoursesPo ukończeniu kursu, student projektuje podstawowe algorytmy cyfrowego przetwarzania sygnałów - filtrów cyfrowych FIR i IIR, i estymuje widmo za pomocą FFT.Student opisuje architektury i ścieżki danych procesorów stało-przecinkowych i zmienno-przecinkowych. Student tłumaczy podstawy arytmetyki procesorów i podaje przykłady zastosowań.
-
Big Data processing frameworks - 2022
e-Learning CoursesInformatics, postgraduate studies Data Engineering, undergraduate studies
-
Big Data processing frameworks - 2023
e-Learning CoursesInformatics, postgraduate studies Data Engineering, undergraduate studies
-
Big Data processing frameworks - 2024
e-Learning CoursesInformatics, postgraduate studies Data Engineering, undergraduate studies
-
Digital Signal Processing-23/24
e-Learning CoursesPo ukończeniu kursu, student projektuje podstawowe algorytmy cyfrowego przetwarzania sygnałów - filtrów cyfrowych FIR i IIR, i estymuje widmo za pomocą FFT.Student opisuje architektury i ścieżki danych procesorów stało-przecinkowych i zmienno-przecinkowych. Student tłumaczy podstawy arytmetyki procesorów i podaje przykłady zastosowań.
-
Zdzisław Kowalczuk prof. dr hab. inż.
PeopleZdzislaw Kowalczuk received his M.Sc. degree in 1978 and Ph.D. degree in 1986, both in Automatic Control from Technical University of Gdańsk (TUG), Gdańsk, Poland. In 1993 he received his D.Sc. degree (Dr Habilitus) in Automatic Control from Silesian Technical University, Gliwice, Poland, and the title of Professor from the President of Poland in 2003. Since 1978 he has been with Faculty of Electronics, Telecommunications and Informatics...
-
Krzysztof Goczyła prof. dr hab. inż.
PeopleKrzysztof Goczyła, full professor of Gdańsk University of Technology, computer scientist, a specialist in software engineering, knowledge engineering and databases. He graduated from the Faculty of Electronics Technical University of Gdansk in 1976 with a degree in electronic engineering, specializing in automation. Since then he has been working at Gdańsk University of Technology. In 1982 he obtained a doctorate in computer science...
-
International Symposium on Information Processing
Conferences -
European Signal Processing Conference
Conferences -
Workshop on Quantum Information Processing
Conferences -
Information Processing in Sensor Networks
Conferences -
International Conference on Parallel Processing
Conferences -
IFIP Congress (Information Processing)
Conferences -
Grzegorz Szwoch dr hab. inż.
PeopleGrzegorz Szwoch was born in 1972 in Gdansk. In 1991-1996 he studied at the Technical University of Gdansk. In 1996 he graduated as a student from the Sound Engineering Department. His thesis was related to physical modeling of musical instruments. Since that time he has been a member of the research staff at the Multimedia Systems Department as a PhD student (1996-2001), Assistant (2001-2004), Assistant professor (2004-2020) and...
-
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
PublicationSpatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...
-
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
PublicationThis paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...
-
Mariusz Kaczmarek dr hab. inż.
PeopleReceived M.Sc., Eng. in Electronics in 1995 from Gdansk University of Technology, Ph.D. in Medical Electronics in 2003 and habilitation in Biocybernetics and Biomedical Engineering in 2017. He was an investigator in about 13 projects receiving a number of awards, including four best papers, practical innovations (7 medals and awards) and also the Andronicos G. Kantsios Award and Siemens Award. Main research activities: the issues...