Search results for: AUTOMATED PRONUNCIATION ASSESSMENT, SPEECH PROCESSING, SECOND-LANGUAGE LEARNING, DEEP LEARNING

Search results for: AUTOMATED PRONUNCIATION ASSESSMENT, SPEECH PROCESSING, SECOND-LANGUAGE LEARNING, DEEP LEARNING

results on page:
embed this view on your website

Filters

total: 490

clear all filters disabled

Machine-learning-based precise cost-efficient NO2 sensor calibration by means of time series matching and global data pre-processing
Publication
- Engineering Science and Technology-An International Journal-JESTECH - Year 2024
Air pollution remains a considerable contemporary challenge affecting life quality, the environment, and economic well-being. It encompasses an array of pollutants—gases, particulate matter, biological molecules—emanating from sources such as vehicle emissions, industrial activities, agriculture, and natural occurrences. Nitrogen dioxide (NO2), a harmful gas, is particularly abundant in densely populated urban areas. Given its...

Full text available to download
Statistical Data Pre-Processing and Time Series Incorporation for High-Efficacy Calibration of Low-Cost NO2 Sensor Using Machine Learning
Publication
- Scientific Reports - Year 2024
Air pollution stands as a significant modern-day challenge impacting life quality, the environment, and the economy. It comprises various pollutants like gases, particulate matter, biological molecules, and more, stemming from sources such as vehicle emissions, industrial operations, agriculture, and natural events. Nitrogen dioxide (NO2), among these harmful gases, is notably prevalent in densely populated urban regions. Given...

Full text available to download
International Journal of Computer-Assisted Language Learning and Teaching

Journals

ISSN: 2155-7098 , eISSN: 2155-7101
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech
Publication
- D. Korzekwa
- J. Lorenzo-trueba
- T. Drugman
- S. Calamaro
- B. Kostek
- Year 2021
We propose a weakly-supervised model for word-level mispronunciation detection in non-native (L2) English speech. To train this model, phonetically transcribed L2 speech is not required and we only need to mark mispronounced words. The lack of phonetic transcriptions for L2 speech means that the model has to learn only from a weak signal of word-level mispronunciations. Because of that and due to the limited amount of mispronounced...

Full text available to download
Introduction to the special issue on machine learning in acoustics
Publication
- Z. Michalopoulou
- P. Gerstoft
- B. Kostek
- M. A. Roch
- Journal of the Acoustical Society of America - Year 2021
When we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...

Full text available to download
Predicting the Purchase of Electricity Prices for Renewable Energy Sources Based on Polish Power Grids Data Using Deep Learning Models for Controlling Small Hybrid PV Microinstallations
Publication
- M. Pikus
- J. Wąs
- Year 2023
Full text to download in external service
Wiktoria Wojnicz dr hab. inż.

People

Zakład Mechaniki Stosowanej i Biomechaniki, Faculty of Mechanical Engineering and Ship Technology, Institute of Mechanics and Machine Design

DSc in Mechanics (in the field of Biomechanics) - Lodz Univeristy of Technology, 2019 PhD in Mechanics (in the field of Biomechanics) - Lodz Univeristy of Technology, 2009 (with distinction) List of papers (2009 - ) Wojnicz W., Wittbrodt E., Analysis of muscles' behaviour. Part I. The computational model of muscle. Acta of Bioengineering and Biomechanics, Vol. 11, No.4, 2009, p. 15-21 Wojnicz W., Wittbrodt E., Analysis of...
Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling
Publication
- D. Korzekwa
- J. Lorenzo-trueba
- S. Zaporowski
- S. Calamaro
- T. Drugman
- B. Kostek
- Year 2021
A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result...

Full text to download in external service
Conference on Computational Natural Language Learning (Conference on Natural Language Learning)

Conferences
CHALK & TALK OR SWIPE & SKYPE?
Publication
- E. Kozłowska
- R. Howard
- Zeszyty Naukowe Wydziału Elektrotechniki i Automatyki Politechniki Gdańskiej - Year 2019
Technology in classroom is a matter of heated discussions in the field of education development, especially when multidisciplinary education goes along with language skills. Engineers’ education requires theoretical and practical knowledge. Moreover, dedicated computer skills become crucial for both young graduates and experienced educators on the labor market. Teaching online with or without using different Learning Management...

Full text available to download
Investigating Feature Spaces for Isolated Word Recognition
Publication
- G. Korvel
- G. Tamulevicus
- P. Treigys
- J. Bernataviciene
- B. Kostek
- Year 2018
Much attention is given by researchers to the speech processing task in automatic speech recognition (ASR) over the past decades. The study addresses the issue related to the investigation of the appropriateness of a two-dimensional representation of speech feature spaces for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and timefrequency signal representation...
Language Learning in Higher Education

Journals

ISSN: 2191-611X , eISSN: 2191-6128
Joint Conference on New Methods in Language Processing and Computational Natural Language Learning

Conferences
Jaroslaw Spychala dr

People

Oprócz bardzo dobrego wykształcenia osoba posiada również wieloletnie doświadczenie zawodowe, które jest poświadczeniem tego, że potrafi wykorzystać swoją wiedzę teoretyczną w praktycznych działaniach. Doświadczenie zawodowe jest bardzo bogate i rozbudowane. Ze względu na nabyte całkiem nowe umiejętności zwiększa się atrakcyjność doświadczonego pracownika. Są to między innymi kreatywne myślenie, zorientowanie na cel, odporność...
Investigating Feature Spaces for Isolated Word Recognition
Publication
- P. Treigys
- G. Korvel
- G. Tamulevicius
- J. Bernataviciene
- B. Kostek
- Year 2020
The study addresses the issues related to the appropriateness of a two-dimensional representation of speech signal for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and time-frequency signal representation converted to the investigated feature spaces. In particular, waveforms and fractal dimension features of the signal were chosen for the time domain, and...

Full text to download in external service
Exploring the preferences of Polish EFL teachers towards the accents of English
Publication
- B. Grobelna
- Linguistische Treffen in Wrocław - Year 2021
This language attitudes study investigates the preferences of EFL (English as a foreign language) teachers from Poland towards the accents of English they speak and teach. Despite the substantial amount of research on EFL learners, little has been done to investigate the impact of preferences of Polish teachers for different variations of English language on their...

Full text to download in external service
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
Publication
- B. Kostek
- B. Szyca
- Journal of the Acoustical Society of America - Year 2023
The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

Full text available to download
Investigating Noise Interference on Speech Towards Applying the Lombard Effect Automatically
Publication
- G. Korvel
- K. Kąkol
- P. Treigys
- B. Kostek
- Year 2022
The aim of this study is two-fold. First, we perform a series of experiments to examine the interference of different noises on speech processing. For that purpose, we concentrate on the Lombard effect, an involuntary tendency to raise speech level in the presence of background noise. Then, we apply this knowledge to detecting speech with the Lombard effect. This is for preparing a dataset for training a machine learning-based...

Full text available to download
Jan Daciuk dr hab. inż.

People

Department of Intelligent Interactive Systems

Jan Daciuk received his M.Sc. from the Faculty of Electronics of Gdansk University of Technology in 1986, and his Ph.D. from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology in 1999. He has been working at the Faculty from 1988. His research interests include finite state methods in natural language processing and computational linguistics including speech processing. Dr. Daciuk...
International Conference on Intelligent Data Engineering and Automated Learning

Conferences
Book Review
Publication
- E. Szczerbicki
- Intelligent Decision Technologies-Netherlands - Year 2021
Acting over the last three decades as an Editor and Associate Editor for a number of international journals in the general area of cybernetics and AI, as well as a Chair and Co-Chair of numerous conferences in this field, I have had the exciting opportunity to closely witness and to be actively engaged in the stimulating research area of machine learning and its important augmentation with deep learning techniques and technologies. From...

Full text to download in external service
Assessing the attractiveness of human face based on machine learning
Publication
- Year 2023
The attractiveness of the face plays an important role in everyday life, especially in the modern world where social media and the Internet surround us. In this study, an attempt to assess the attractiveness of a face by machine learning is shown. Attractiveness is determined by three deep models whose sum of predictions is the final score. Two annotated datasets available in the literature are employed for training and testing...

Full text available to download
Adaptive Hounsfield Scale Windowing in Computed Tomography Liver Segmentation
Publication
- Year 2024
In computed tomography (CT) imaging, the Hounsfield Unit (HU) scale quantifies radiodensity, but its nonlinear nature across organs and lesions complicates machine learning analysis. This paper introduces an automated method for adaptive HU scale windowing in deep learning-based CT liver segmentation. We propose a new neural network layer that optimizes HU scale window parameters during training. Experiments on the Liver Tumor...

Full text to download in external service
Leszek Ziemczonek dr

People

University education 1973-1978 – Nicolaus Copernicus University in Toruń, University of Gdańsk in Gdańsk, Mathematical Physics, M. Sc. 1979 – Diploma of Postgraduate Studies, Pedagogics 1989 – Institute of Physics, Polish Academy of Sciences in Warsaw, Theoretical Physics, Ph. D. 2010-2012 – Diploma of Postgraduate Studies, Mathematics Training: · 09.1983 – Trieste (Italy) – International Centre for Theoretical Physics...
Vident-synth: a synthetic intra-oral video dataset for optical flow estimation
Open Research Data
embargo
We introduce Vident-synth, a large dataset of synthetic dental videos with corresponding ground truth forward and backward optical flows and occlusion masks. It can be used for:
Optymalizacja zasobów chmury obliczeniowej z wykorzystaniem inteligentnych agentów w zdalnym nauczaniu
Publication
- P. Dryja
- Year 2023
Rozprawa dotyczy optymalizacji zasobów chmury obliczeniowej, w której zastosowano inteligentne agenty w zdalnym nauczaniu. Zagadnienie jest istotne w edukacji, gdzie wykorzystuje się nowoczesne technologie, takie jak Internet Rzeczy, rozszerzoną i wirtualną rzeczywistość oraz deep learning w środowisku chmury obliczeniowej. Zagadnienie jest istotne również w sytuacji, gdy pandemia wymusza stosowanie zdalnego nauczania na dużą skalę...

Full text available to download
Muhammad Usman PhD

People

Muhammad Usman is currently a Computer Vision Researcher at Gdansk University of Technology, working on the BE-LIGHT project, where his research focuses on advancing biomedical diagnostics through the integration of light-based technologies and machine learning techniques. He has completed his Master’s degree in Control Science and Engineering from the University of Science and Technology of China (USTC), Hefei, China. His research...
Model-free and Model-based Reinforcement Learning, the Intersection of Learning and Planning
Publication
- P. Januszewski
- Year 2022
My doctoral dissertation is intended as the compound of four publications considering: structure and randomness in planning and reinforcement learning, continuous control with ensemble deep deterministic policy gradients, toddler-inspired active representation learning, and large-scale deep reinforcement learning costs.

Full text to download in external service
Deep CNN based decision support system for detection and assessing the stage of diabetic retinopathy
Publication
- A. Kwasigroch
- B. Jarzembinski
- M. Grochowski
- Year 2018
The diabetic retinopathy is a disease caused by long-standing diabetes. Lack of effective treatment can lead to vision impairment and even irreversible blindness. The disease can be diagnosed by examining digital color fundus photographs of retina. In this paper we propose deep learning approach to automated diabetic retinopathy screening. Deep convolutional neural networks (CNN) - the most popular kind of deep learning algorithms...

Full text to download in external service
Farzin Kazemi

People

His main research areas are seismic performance assessment of structures and seismic hazard analysis in earthquake engineering. He performed a comprehensive study on the effect of pounding phenomenon and ‎proposed modification factors to modify the seismic collapse capacity of ‎structures or predict the seismic collapse capacity of structures which were ‎retrofitted with linear and nonlinear Fluid Viscous Dampers (FVDs).‎ His current...
Review of the Complexity of Managing Big Data of the Internet of Things
Publication
- D. Gil
- M. Johnsson
- H. Mora
- J. Szymański
- COMPLEXITY - Year 2019
Tere is a growing awareness that the complexity of managing Big Data is one of the main challenges in the developing feld of the Internet of Tings (IoT). Complexity arises from several aspects of the Big Data life cycle, such as gathering data, storing them onto cloud servers, cleaning and integrating the data, a process involving the last advances in ontologies, such as Extensible Markup Language (XML) and Resource Description...

Full text available to download
Mutual recognition of certification systems: The case of SERMO and ACLES
Publication
- J. Zabala-Delgado
- B. Sawicka
- Language Learning in Higher Education - Year 2019
Full text to download in external service
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention
Publication
- D. Korzekwa
- R. Barra-Chicote
- S. Zaporowski
- G. Beringer
- J. Lorenzo-trueba
- A. Serafinowicz
- J. Droppo
- T. Drugman
- B. Kostek
- Year 2021
This paper describes two novel complementary techniques that improve the detection of lexical stress errors in non-native (L2) English speech: attention-based feature extraction and data augmentation based on Neural Text-To-Speech (TTS). In a classical approach, audio features are usually extracted from fixed regions of speech such as the syllable nucleus. We propose an attention-based deep learning model that automatically de...

Full text available to download
Olgun Aydin Dr

People

Olgun Aydin finished his PhD by publishing a thesis about Deep Neural Networks. He works as a Senior Data Scientist in PwC Poland, gives lectures in Gdansk University of Technology in Poland and member of WhyR? Foundation. Olgun is a very big fan of R and author of the book called “R Web Scraping Quick Start Guide” , two video courses are called “Deep Dive into Statistical Modelling using R” and “Applied Machine Learning and Deep...
Instructor Presence in Video Lectures: Preliminary Findings From an Online Experiment
Publication
- Y. Y. Ng
- A. Przybyłek
- IEEE Access - Year 2021
Motivation. Despite the widespread use of video lectures in online and blended learning environments, there is still debate whether the presence of an instructor in the video helps or hinders learning. According to social agency theory, seeing the instructor makes learners believe that s/he is personally teaching them, which leads to deeper cognitive processing and, in turn, better learning outcomes. Conversely, according to cognitive...

Full text available to download
Federated Learning in Healthcare Industry: Mammography Case Study
Publication
- Year 2023
The paper focuses on the role of federated learning in a healthcare environment. The experimental setup involved different healthcare providers, each with their datasets. A comparison was made between training a deep learning model using traditional methods, where all the data is stored in one place, and using federated learning, where the data is distributed among the workers. The experiment aimed to identify possible challenges...

Full text to download in external service
Self-Supervised Learning to Increase the Performance of Skin Lesion Classification
Publication
- Electronics - Year 2020
To successfully train a deep neural network, a large amount of human-labeled data is required. Unfortunately, in many areas, collecting and labeling data is a difficult and tedious task. Several ways have been developed to mitigate the problem associated with the shortage of data, the most common of which is transfer learning. However, in many cases, the use of transfer learning as the only remedy is insufficient. In this study,...

Full text available to download
Determining Pronunciation Differences in English Allophones Utilizing Audio Signal Parameterization
Publication
- B. Kostek
- M. Piotrowska
- T. Ciszewski
- A. Czyżewski
- Year 2017
An allophonic description of English plosive consonants, based on audio-visual recordings of 600 specially selected words, was developed. First, several speakers were recorded while reading words from a teleprompter. Then, every word was played back from the previously recorded sample read by a phonology expert and each examined speaker repeated a particular word trying to imitate correct pronunciation. The next step consisted...
Aktywności stymulujące refleksję w nauczaniu języka pisanego w wirtualnej klasie
Publication
- I. Mokwa-Tarnowska
- NEOFILOLOG - Year 2014
The paper aims to show how to engage students attending an online language course in various activities which by stimulating reflection enhance the learning process and result in better learning outcomes. By blending cognitivist, constructivist, constructionist and behavioural ideas, course developers and tutors can produce materials and use methods which satisfy the varied needs of adults who want to improve their writing skills....

Full text available to download
Intelligent Audio Signal Processing − Do We Still Need Annotated Datasets?
Publication
- B. Kostek
- Year 2022
In this paper, intelligent audio signal processing examples are shortly described. The focus is, however, on the machine learning approach and datasets needed, especially for deep learning models. Years of intense research produced many important results in this area; however, the goal of fully intelligent signal processing, characterized by its autonomous acting, is not yet achieved. Therefore, a review of state-of-the-art concerning...

Full text available to download
Method for Clustering of Brain Activity Data Derived from EEG Signals
Publication
- FUNDAMENTA INFORMATICAE - Year 2019
A method for assessing separability of EEG signals associated with three classes of brain activity is proposed. The EEG signals are acquired from 23 subjects, gathered from a headset consisting of 14 electrodes. Data are processed by applying Discrete Wavelet Transform (DWT) for the signal analysis and an autoencoder neural network for the brain activity separation. Processing involves 74 wavelets from 3 DWT families: Coiflets,...

Full text available to download
Analysis-by-synthesis paradigm evolved into a new concept
Publication
- B. Kostek
- Journal of the Acoustical Society of America - Year 2022
This work aims at showing how the well-known analysis-by-synthesis paradigm has recently been evolved into a new concept. However, in contrast to the original idea stating that the created sound should not fail to pass the foolproof synthesis test, the recent development is a consequence of the need to create new data. Deep learning models are greedy algorithms requiring a vast amount of data that, in addition, should be correctly...

Full text to download in external service
Learning design of a blended course in technical writing
Publication
- I. Mokwa-Tarnowska
- Beyond Philology: An International Journal of Linguistics, Literary Studies and English Language Teaching - Year 2013
Blending face-to-face classes with e-learning components can lead to a very successful outcome if the blend of approaches, methods, content, space, time, media and activities is carefully structured and approached from both the student’s and the tutor’s perspective. In order to blend synchronous and asynchronous e-learning activities with traditional ones, educators should make them inter-dependent and develop them according to...

Full text available to download
MP3vec: A Reusable Machine-Constructed Feature Representation for Protein Sequences
Publication
- S. R. Gupte
- D. S. Jain
- A. Srinivasan
- R. Aduri
- Year 2020
—Machine Learning (ML) methods have been used with varying degrees of success on protein prediction tasks, with two inherent limitations. First, prediction performance often depends upon the features extracted from the proteins. Second, experimental data may be insufficient to construct reliable ML models. Here we introduce MP3vec, a transferable representation for protein sequences that is designed to be used specifically for sequence-to-sequence...

Full text to download in external service
Agnieszka Mikołajczyk-Bareła dr inż.

People
Podstawy uczenia głębokiego 2022
e-Learning Courses
- K. Draszawka
- S. Olewniczak
- J. Szymański
{mlang pl}Kurs podstaw uczenia głębokiego przeznaczony dla studentów kierunku Informatyka.{mlang} {mlang en}This is a course about deep learning basics dedicated for Computer Science students.{mlang}
Reinforcement Learning Algorithm and FDTD-based Simulation Applied to Schroeder Diffuser Design Optimization
Publication
- A. Kurowski
- B. Kostek
- IEEE Access - Year 2021
The aim of this paper is to propose a novel approach to the algorithmic design of Schroeder acoustic diffusers employing a deep learning optimization algorithm and a fitness function based on a computer simulation of the propagation of acoustic waves. The deep learning method employed for the research is a deep policy gradient algorithm. It is used as a tool for carrying out a sequential optimization process the goal of which is...

Full text available to download
Comparison of Lithuanian and Polish Consonant Phonemes Based on Acoustic Analysis – Preliminary Results
Publication
- G. Korvel
- O. Kurasova
- B. Kostek
- Archives of Acoustics - Year 2019
The goal of this research is to find a set of acoustic parameters that are related to differences between Polish and Lithuanian language consonants. In order to identify these differences, an acoustic analysis is performed, and the phoneme sounds are described as the vectors of acoustic parameters. Parameters known from the speech domain as well as those from the music information retrieval area are employed. These parameters are...

Full text available to download
Wpływ struktur wsparcia na efektywność nauczania języka pisanego w środowisku e-learningowym
Publication
- I. Mokwa-Tarnowska
- LINGUODIDACTICA - Year 2014
The process of knowledge and language skills development during an online course can be very effective if student engagement in learning is achieved. This can be attained by introducing general and specific support mechanisms prior to the commencement of the course and during it. The former relates to the technological aspect, that is to familiarizing students with the functionalities of the virtual learning environment they will...
Voice command recognition using hybrid genetic algorithm
Publication
- M. Wroniszewska
- J. Dziedzic
- TASK Quarterly - Year 2010
Abstract: Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...

Full text available to download

Search

Filters

Catalog

Search results for: AUTOMATED PRONUNCIATION ASSESSMENT, SPEECH PROCESSING, SECOND-LANGUAGE LEARNING, DEEP LEARNING

Wiktoria Wojnicz dr hab. inż.

Jaroslaw Spychala dr

Jan Daciuk dr hab. inż.

Leszek Ziemczonek dr

Muhammad Usman PhD

Olgun Aydin Dr

Agnieszka Mikołajczyk-Bareła dr inż.