Search results for: SPECH PROCESSING

Search results for: SPECH PROCESSING

results on page:
embed this view on your website

Displayed results came from alternative search method.

Filters

total: 2574

clear all filters disabled

displaying 1000 best results Help

Playback detection using machine learning with spectrogram features approach
Publication
- J. Dembski
- J. Rumiński
- Year 2017
This paper presents 2D image processing approach to playback detection in automatic speaker verification (ASV) systems using spectrograms as speech signal representation. Three feature extraction and classification methods: histograms of oriented gradients (HOG) with support vector machines (SVM), HAAR wavelets with AdaBoost classifier and deep convolutional neural networks (CNN) were compared on different data partitions in respect...

Full text available to download
Signal Processing: An International Journal (SPIJ)

Journals

ISSN: 1985-2339
Journal of Real-Time Image Processing

Journals

ISSN: 1861-8200 , eISSN: 1861-8219
Michał Lech dr inż.

People

Michał Lech was born in Gdynia in 1983. In 2007 he graduated from the faculty of Electronics, Telecommunications and Informatics of Gdansk University of Technology. In June 2013, he received his Ph.D. degree. The subject of the dissertation was: “A Method and Algorithms for Controlling the Sound Mixing Processes by Hand Gestures Recognized Using Computer Vision”. The main focus of the thesis was the bias of audio perception caused...
Image Processing in Robotics (2021/2022)
e-Learning Courses
- P. Chudziak
For ISD M.Sc. (II degr.) 2 sem. Participants are to learn image processing algorithms related to transformation, filtration, feature detection (image descriptors), image processing algorithms in robotic industrial systems.
Environmental Protection in Energetics, PG_00049751, W, PE-ET, sem.1, winter 2023/24
e-Learning Courses
Environmental aspects of energy production and processing.
Environmental Protection in Energetics (PG_00049751), W, PE-ET, sem.1, winter 2023/24
e-Learning Courses
- R. Liberacki
Environmental aspects of energy production and processing.
Badanie rozkładów parametrów sygnału mowy w zastosowaniach do prognozowania prawdopodobieństwa popełnienia błędów w systemach identyfikacji mówców = Examining distribution of speech signal parameters for the prognosis of error probability in speaker verification systems
Publication
- A. Kaczmarek
- Year 2010
Przedmiotem pracy jest system identyfikacji mówców w sposób zależny od tekstu ("text dependent''). Dokonano analizy wielu różnych wypowiedzi kilkudziesięciu mówców. Zastosowana metoda parametryzacji to metoda oparta na wynikach analizy cepstralnej sygnału mowy. Zdefiniowane zostały nowe parametry skojarzone z elementarnymi zdarzeniami w procesie weryfikacji mówców. Na tej podstawie dokonano estymacji funkcji gęstości prawdopodobieństwa...
IEEE Automatic Speech Recognition and Understanding Workshop

Conferences
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
Publication
- S. Raczyński
- E. Vincent
- IEEE Transactions on Audio Speech and Language Processing - Year 2014
In this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic model,...

Full text to download in external service
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
Publication
- IEEE Transactions on Audio Speech and Language Processing - Year 2015
This paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...

Full text available to download
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
Publication
- S. Raczyński
- E. Vincent
- S. Sagayama
- IEEE Transactions on Audio Speech and Language Processing - Year 2013
Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...

Full text to download in external service
New approach for determining the QoS of MP3-coded voice signals in IP networks
Publication
- T. Uhl
- S. Paulsen
- K. Nowicki
- EURASIP Journal on Audio Speech and Music Processing - Year 2017
Present-day IP transport platforms being what they are, it will never be possible to rule out conflicts between the available services. The logical consequence of this assertion is the inevitable conclusion that the quality of service (QoS) must always be quantifiable no matter what. This paper focuses on one method to determine QoS. It defines an innovative, simple model that can evaluate the QoS of MP3-coded voice data transported...

Full text available to download
Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders
Publication
- D. Koszewski
- T. Görne
- G. Korvel
- B. Kostek
- EURASIP Journal on Audio Speech and Music Processing - Year 2023
The purpose of this paper is to show a music mixing system that is capable of automatically mixing separate raw recordings with good quality regardless of the music genre. This work recalls selected methods for automatic audio mixing first. Then, a novel deep model based on one-dimensional Wave-U-Net autoencoders is proposed for automatic music mixing. The model is trained on a custom-prepared database. Mixes created using the...

Full text available to download
Przetwarzanie rozproszone
e-Learning Courses
- M. Kassjański
- E. Lubecka
Foundations and rules of distributed and parallel processing in networked computer systems.
Spatial data processing technologies
e-Learning Courses
- M. Kulawiak
- Z. Łubniewski
- E. Lubecka
The effect of current signal filtering method on the value of cutting power while sawing wood
Publication
- K. Orłowski
- J. Sandak
- T. Ochrymiuk
- M. Lackowski
- A. Sandak
- Annals of WULS, Forestry and Wood Technology - Year 2015
The goal of this work was to investigate an effect of various signal pre-processings on the outline of the electrical power curve and its influence on the measured cutting force estimation. Two signal processing methods were selected for the needs of the experiment, including digital filter and wavelet transform. The filter used was Butterworth, 3rd order band-stop with the cut-out band from 45 Hz to 55 Hz. The second approach...

Full text available to download
Geometry Modeling and Processing

Conferences
Symposium on Geometry Processing

Conferences
Marcin Sikorski prof. dr hab. inż.

People

Department of Informatics in Management

Marcin Sikorski is a professor at the Department of Informatics in Management at the Faculty of Management and Economics of the Gdańsk University of Technology. Earlier he had numerous fellowships in academic institutions, among others in Germany (Universities in Bonn and in Heidelberg), Switzerland (ETH Zurich), the Netherlands (TU Eindhoven) and the USA (Harvard University). Professor Sikorski is a representative of Poland in...
AUTOMATYCZNA KLASYFIKACJA MOWY PATOLOGICZNEJ
Publication
- M. Włoszczyńska
- B. Kostek
- Year 2023
Aplikacja przedstawiona w niniejszym rozdziale służy do automatycznego wykrywania mowy patologicznej na podstawie bazy nagrań. W pierwszej kolejności przedstawiono założenia leżące u podstaw przeprowadzonych badan wraz z wyborem bazy mowy patologicznej. Zaprezentowano również zastosowane algorytmy oraz cechy sygnału mowy, które pozwalają odróżnić mowę niezaburzoną od mowy patologicznej. Wytrenowane sieci neuronowe zostały następnie...

Full text to download in external service
Intelligent information services 23/24
e-Learning Courses
- J. Szymański
Information retrieval Text categorization Natural language processing
A Novel Approach to the Assessment of Cough Incidence
Publication
- Year 2013
In this paper we consider the problem of identication of cough events in patients suffering from chronic respiratory diseases. The information about frequency of cough events is necessary to medical treatment. The proposed approach is based on bidirectional processing of a measured vibration signal - cough events are localized by combining the results of forward-time and backward-time analysis. The signal is at rst transformed...

Full text to download in external service
ACM/SPEC International Conference on Performance Engineering

Conferences
Exception handling model influence factors for discributed systems. W: Proceedings. PPAM 2003. Parallel Processing and Applied Mathematics. 5th In- ternational Conference. Częstochowa, 7-10 September 2003.Model obsługi wyjątków uwzględniający wpływ czynników systemu rozproszonego.
Publication
- P. Kaczmarek
- H. Krawczyk
- LECTURE NOTES IN COMPUTER SCIENCE - Year 2003
Specyfikacja programu jest jasno określona w systemach sekwencyjnych, gdzie posiada standardowe i wyjątkowe przejścia. Praca przedstawia rozszerzony model specyfikacji systemu w środowiskach rozproszonych uwzględniający szereg specyficznych czynników. Model zawiera analizę specyfikacji pod kątem obsługi wyjątków dla rozproszonych danych oraz komunikacji międzyprocesorowej. Ogólny model został zaimplementowany w środowisku...
POPRAWA OBIEKTYWNYCH WSKAŹNIKÓW JAKOŚCI MOWY W WARUNKACH HAŁASU
Publication
- K. Kąkol
- B. Kostek
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2018
Celem pracy jest modyfikacja sygnału mowy, aby uzyskać zwiększenie poprawy obiektywnych wskaźników jakości mowy po zmiksowaniu sygnału użytecznego z szumem bądź z sygnałem zakłócającym. Wykonane modyfikacje sygnału bazują na cechach mowy lombardzkiej, a w szczególności na efekcie podniesienia częstotliwości podstawowej F0. Sesja nagraniowa obejmowała zestawy słów i zdań w języku polskim, nagrane w warunkach ciszy, jak również w...

Full text available to download
ISCA Tutorial and Research Workshop Automatic Speech Recognition

Conferences
Jacek Rak dr hab. inż.

People

Department of Computer Communications

Jacek Rak uzyskał stopień doktora habilitowanego nauk technicznych w dyscyplinie telekomunikacji (specjalność: teleinformatyka) w 2016 r., a stopień doktora nauk technicznych w dyscyplinie informatyka w 2009 r. Obecnie jest pracownikiem naukowo-dydaktycznym Katedry Teleinformatyki Wydziału Elektroniki, Telekomunikacji i Informatyki Politechniki Gdańskiej. Jego działalność naukowa koncentruje się w obszarze doboru tras, projektowania...
Przetwarzanie Rozproszone 2021/2022
e-Learning Courses
- J. Cychnerski
- P. Weichbroth
- A. Domagalska
- M. Matuszek
- P. Dryja
- T. Gajger
- A. Brzeski
- J. Kuchta
- A. Królicka-Gałązka
- T. Cejrowski
- S. Olewniczak
{mlang pl}Przetwarzanie równoległe i rozproszone{mlang}{mlang en}Concurrent and Distributed Processing{mlang}
Przetwarzanie Rozproszone 2022/2023
e-Learning Courses
- J. Cychnerski
- A. Domagalska
- M. Matuszek
- J. Kuchta
- J. Szłapczyńska
- A. Królicka-Gałązka
{mlang pl}Przetwarzanie równoległe i rozproszone{mlang}{mlang en}Concurrent and Distributed Processing{mlang}
Przetwarzanie Rozproszone 2023/2024
e-Learning Courses
- J. Cychnerski
- A. Domagalska
- M. Matuszek
- J. Kuchta
- J. Szłapczyńska
- A. Królicka-Gałązka
- J. Majkutewicz
- R. Kałaska
{mlang pl}Przetwarzanie równoległe i rozproszone{mlang}{mlang en}Concurrent and Distributed Processing{mlang}
Piotr Szczuko dr hab. inż.

People

Department of Multimedia Systems

Piotr Szczuko received his M.Sc. degree in 2002. His thesis was dedicated to examination of correlation phenomena between perception of sound and vision for surround sound and digital image. He finished Ph.D. studies in 2007 and one year later completed a dissertation "Application of Fuzzy Rules in Computer Character Animation" that received award of Prime Minister of Poland. His interests include: processing of audio and video, computer...
DPCTM Data Processing Centre - Task Manager (DPCTM) product

Projects

Project manager: dr hab. inż. Marek Moszyński Financial Program Name: Europejska Agencja Kosmiczna

Project realized in Department of Geoinformatics
Digital Signal Processing - 22/23
e-Learning Courses
- T. Stefański
Po ukończeniu kursu, student projektuje podstawowe algorytmy cyfrowego przetwarzania sygnałów - filtrów cyfrowych FIR i IIR, i estymuje widmo za pomocą FFT.Student opisuje architektury i ścieżki danych procesorów stało-przecinkowych i zmienno-przecinkowych. Student tłumaczy podstawy arytmetyki procesorów i podaje przykłady zastosowań.
Big Data processing frameworks - 2022
e-Learning Courses
- A. Przybyłek
Informatics, postgraduate studies Data Engineering, undergraduate studies
Big Data processing frameworks - 2023
e-Learning Courses
- A. Przybyłek
Informatics, postgraduate studies Data Engineering, undergraduate studies
Big Data processing frameworks - 2024
e-Learning Courses
- A. Przybyłek
Informatics, postgraduate studies Data Engineering, undergraduate studies
Digital Signal Processing-23/24
e-Learning Courses
Po ukończeniu kursu, student projektuje podstawowe algorytmy cyfrowego przetwarzania sygnałów - filtrów cyfrowych FIR i IIR, i estymuje widmo za pomocą FFT.Student opisuje architektury i ścieżki danych procesorów stało-przecinkowych i zmienno-przecinkowych. Student tłumaczy podstawy arytmetyki procesorów i podaje przykłady zastosowań.
Zdzisław Kowalczuk prof. dr hab. inż.

People

Department of Decision Systems and Robotics

Zdzislaw Kowalczuk received his M.Sc. degree in 1978 and Ph.D. degree in 1986, both in Automatic Control from Technical University of Gdańsk (TUG), Gdańsk, Poland. In 1993 he received his D.Sc. degree (Dr Habilitus) in Automatic Control from Silesian Technical University, Gliwice, Poland, and the title of Professor from the President of Poland in 2003. Since 1978 he has been with Faculty of Electronics, Telecommunications and Informatics...
Krzysztof Goczyła prof. dr hab. inż.

People

Department of Software Engineering

Krzysztof Goczyła, full professor of Gdańsk University of Technology, computer scientist, a specialist in software engineering, knowledge engineering and databases. He graduated from the Faculty of Electronics Technical University of Gdansk in 1976 with a degree in electronic engineering, specializing in automation. Since then he has been working at Gdańsk University of Technology. In 1982 he obtained a doctorate in computer science...
International Symposium on Information Processing

Conferences
European Signal Processing Conference

Conferences
Workshop on Quantum Information Processing

Conferences
Information Processing in Sensor Networks

Conferences
International Conference on Parallel Processing

Conferences
IFIP Congress (Information Processing)

Conferences
Grzegorz Szwoch dr hab. inż.

People

Department of Multimedia Systems

Grzegorz Szwoch was born in 1972 in Gdansk. In 1991-1996 he studied at the Technical University of Gdansk. In 1996 he graduated as a student from the Sound Engineering Department. His thesis was related to physical modeling of musical instruments. Since that time he has been a member of the research staff at the Multimedia Systems Department as a PhD student (1996-2001), Assistant (2001-2004), Assistant professor (2004-2020) and...
Enhanced voice user interface employing spatial filtration of signals from acoustic vector sensor
Publication
- Year 2015
Spatial filtration of sound is introduced to enhance speech recognition accuracy in noisy conditions. An acoustic vector sensor (AVS) is employed. The signals from the AVS probe are processed in order to attenuate the surrounding noise. As a result the signal to noise ratio is increased. An experiment is featured in which speech signals are disturbed by babble noise. The signals before and after spatial filtration are processed...

Full text to download in external service
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Publication
- D. Korzekwa
- R. Barra-Chicote
- S. Zaporowski
- G. Beringer
- J. Lorenzo-trueba
- A. Serafinowicz
- J. Droppo
- T. Drugman
- B. Kostek
- Year 2021
This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

Full text available to download
Mariusz Kaczmarek dr hab. inż.

People

Department of Biomedical Engineering, Gdańsk University of Technology

Received M.Sc., Eng. in Electronics in 1995 from Gdansk University of Technology, Ph.D. in Medical Electronics in 2003 and habilitation in Biocybernetics and Biomedical Engineering in 2017. He was an investigator in about 13 projects receiving a number of awards, including four best papers, practical innovations (7 medals and awards) and also the Andronicos G. Kantsios Award and Siemens Award. Main research activities: the issues...

Search

Filters

Catalog

Search results for: SPECH PROCESSING

Michał Lech dr inż.

Marcin Sikorski prof. dr hab. inż.

Jacek Rak dr hab. inż.

Piotr Szczuko dr hab. inż.

Zdzisław Kowalczuk prof. dr hab. inż.

Krzysztof Goczyła prof. dr hab. inż.

Grzegorz Szwoch dr hab. inż.

Mariusz Kaczmarek dr hab. inż.