Search results for: speech-to-text technology

Human-computer interactions in speech therapy using a blowing interface

Publication

- Year 2014

In this paper we present a new human-computer interface for the quantitative measurement of blowing activities. The interface can measure the air flow and air pressure during the blowing activity. The measured values are stored and used to control the state of the graphical objects in the graphical user interface. In speech therapy, children will find it easier to play attractive therapeutic games than to perform repetitive and tedious,...

Full text to download in external service

Virtual reality technology in architectural education

Publication

A. Gębczyńska-Janowicz

- World Transactions on Engineering and Technology Education - Year 2020

Contemporary virtual reality (VR) technology allows the recreation of non-existent architectural objects of which there may be no trace remaining. Virtual reality applications allow access to digital models, which visualise the lost architecture. The popularity of VR has resulted in it being applied not only to computer games, but also in visualising the past. Maps allow movement through historical trails and 3D models of architecture...

Full text available to download

A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies

Publication

- Year 2022

In this paper, the authors, who specialise in part in neo-Latin studies and the history of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...

Full text available to download

Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency

Publication

- International Journal of Image Processing and Visual Communication - Year 2013

In this article, methods for deinterlacing endoscopic video images and for removing on-screen text and light flashes from them are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...

Full text to download in external service

Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets

Publication

- Electronics - Year 2022

Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Full text available to download

Comparison of technology adoption models

Publication

A. Landowska

- Year 2019

There are several technology adoption models that try to explain how and why technologies are adopted and used. Among those widely used to explain how older adults accept technologies, there are some general models and models specific to the group of older users. Among the general ones I would recommend paying attention to the following models: Technology Acceptance Model (TAM) proposed by Davis...

Full text to download in external service

Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience

Publication

- IEEE Access - Year 2019

Significant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...

Full text available to download

Transformations of descriptive geometry education for architecture students at Gdańsk University of Technology, Poland

Publication

- World Transactions on Engineering and Technology Education - Year 2024

In this article, the authors analyse the evolution of teaching descriptive geometry for architecture students at Gdańsk University of Technology (Gdańsk Tech), Poland. The study traces changes in the curriculum in terms of teaching hours, considering also practices at Politechnika Lwowska (Lwów Polytechnic) and the Technische Hochschule Danzig (Technical University of Gdańsk), before World War II....

Full text to download in external service

Study on Speech Transmission under Varying QoS Parameters in a OFDM Communication System

Publication

M. Zamłyńska
P. Falkowski-Gilski
G. Debita
B. Miedziński

- Year 2021

Although there has been a proliferation of multimedia platforms worldwide, speech communication is still the most essential and important type of service. With the spoken word we can exchange ideas, provide descriptive information, as well as aid another person. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission, based most often on multi-valued modulations, multiple...

Full text to download in external service

The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish

Publication

S. Zaporowski

- Year 2024

The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...

Full text available to download

Internet technology in education - offer of Gdansk University of Technology.

Publication

A. Grabowska

- Year 2004

The article presents the training offer of the Distance Education Centre (Centrum Edukacji Niestacjonarnej) of Gdańsk University of Technology and the possibilities of using it at the Faculty of Civil Engineering in doctoral and postgraduate studies, which will be launched under the European Union's Fifth Framework Programme, Centres of Excellence (the CURE project - Centre for Urban Construction and Rehabilitation: Technology Transfer, Research and Education). A teaching model is presented...

Transfer learning in imagined speech EEG-based BCIs

Publication

J. S. Garcia Salinas
L. Villaseñor-Pineda
C. A. Reyes-García
A. A. Torres-García

- Biomedical Signal Processing and Control - Year 2019

Brain–Computer Interfaces (BCIs) based on electroencephalograms (EEG) are systems whose aim is to provide any person with a communication channel to a computer. They were initially proposed to aid people with disabilities, but wider applications have since been proposed. These devices allow users to send messages or control devices using brain signals. There are different neuro-paradigms which evoke brain signals of interest...

Full text available to download

Sliding burnishing technology of holes in hardened steel

Publication

W. Przybylski

- Advances in Manufacturing Science and Technology - Year 2016

A new technology for sliding burnishing of cylindrical holes made in hardened steel (60 HRC) is presented in the paper. After the burnishing operation on a 30 mm diameter hole in a satellite gear wheel, a surface roughness parameter Ra = 0.02-0.04 micrometers was obtained. The method and the results of the research are presented as technological conclusions.

Full text available to download

Estimation of the short-term predictor parameters of speech under noisy conditions

Publication

M. Kuropatwinski
W. Kleijn
M. Kuropatwiński

- IEEE Transactions on Audio Speech and Language Processing - Year 2006

Full text to download in external service

Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning

Publication

K. Kąkol

- Year 2023

The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...

Full text available to download

Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding

Publication

M. Kuropatwinski
W. Kleijn
M. Kuropatwiński

- Year 2001

Full text to download in external service

Text Documents Classification with Support Vector Machines

Publication

P. Majewski

- Year 2008

Towards Effective Processing of Large Text Collections

Publication

- Year 2012

In the article we describe the approach to parallel implementation of elementary operations for textual data categorization. In the experiments we evaluate parallel computations of similarity matrices and k-means algorithm. The test datasets have been prepared as graphs created from Wikipedia articles related with links. When we create the clustering data packages, we compute pairs of eigenvectors and eigenvalues for visualizations of...

Parallel Computations of Text Similarities for Categorization Task

Publication

J. Szymański

- Year 2013

In this chapter we describe an approach to the parallel implementation of similarity computations in high-dimensional spaces. The similarity computations have been used for textual data categorization. The test datasets were created from Wikipedia articles, which together with their hyperlinks formed the graph used in our experiments. Similarities based on Euclidean distance and the cosine measure have been used to process the data with the k-means algorithm....

Bimodal classification of English allophones employing acoustic speech signal and facial motion capture

Publication

- Journal of the Acoustical Society of America - Year 2018

A method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Full text to download in external service

Noise profiling for speech enhancement employing machine learning models

Publication

K. Kąkol
G. Korvel
B. Kostek

- Journal of the Acoustical Society of America - Year 2022

This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

Full text available to download

Intra-subject class-incremental deep learning approach for EEG-based imagined speech recognition

Publication

J. S. Garcia Salinas
A. A. Torres-García
C. A. Reyes-García
L. Villaseñor-Pineda

- Biomedical Signal Processing and Control - Year 2023

Brain–computer interfaces (BCIs) aim to decode brain signals and transform them into commands for device operation. The present study aimed to decode the brain activity during imagined speech. The BCI must identify imagined words within a given vocabulary and thus perform the requested action. A possible scenario when using this approach is the gradual addition of new words to the vocabulary using incremental learning methods....

Full text available to download

Developing a Low SNR Resistant, Text Independent Speaker Recognition System for Intercom Solutions - A Case Study

Publication

- Year 2024

This article presents a case study on the development of a biometric voice verification system for an intercom solution, utilizing the DeepSpeaker neural network architecture. Despite the variety of solutions available in the literature, there is a noted lack of evaluations for "text-independent" systems under real conditions and with varying distances between the speaker and the microphone. This article aims to bridge this gap....

Full text available to download

Intelligent processing of stuttered speech.

Publication

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2003

The article presents several methods for analysing and automatically counting articulation disfluencies associated with stuttering, based on learning algorithms of artificial neural networks and rough sets.

Automatic Image and Speech Recognition Based on Neural Network

Publication

D. Król
B. Szlachetko

- Journal of Information Technology Research - Year 2010

Full text to download in external service

Text categorization with semantic commonsense knowledge: First results

Publication

P. Majewski
J. Szymański

- Year 2008

Text processing typically uses bag-of-words (BOW) representations. However, this approach does not give good results when similar documents share no words. The article presents an approach to constructing a kernel function for SVM classifiers based on an external knowledge base of linguistic concepts.

External Validation Measures for Nested Clustering of Text Documents

Publication

- Year 2011

This article handles the problem of validating the results of nested (as opposed to "flat") clusterings. It shows that standard external validation indices used for partitioning clustering validation, like Rand statistics, Hubert Γ statistic or F-measure are not applicable in nested clustering cases. Additionally to the work, where F-measure was adopted to hierarchical classification as hF-measure, here some methods to...

Towards facts extraction from text in Polish language

Publication

- Year 2017

Natural Language Processing (NLP) finds many uses in different fields of endeavor. Many tools exist that allow analysis of the English language. For the Polish language the situation is different, as the language itself is more complicated. In this paper we show differences between NLP of the Polish and English languages. Existing solutions are presented and the TEAMS software for facts extraction is described. The paper also shows evaluation of...

Full text available to download

A method supporting fault-tolerant optical text recognition from video sequences recorded with handheld cameras

Publication

- Engineering Applications of Artificial Intelligence - Year 2023

In the paper a method supporting the optical character recognition from video sequences recorded with cameras without good stabilization is proposed. Due to the presence of various distortions, such as motion blur, shadows, lossy compression artifacts, auto-focusing errors, etc., the quality of individual video frames, e.g., recorded by a smartphone camera, differs noticeably, influencing the results of text recognition, causing...

Full text to download in external service

Text analytics for co-creation in public sector organizations: a literature review-based research framework

Publication

N. Rizun
A. Revina
N. Edelmann

- ARTIFICIAL INTELLIGENCE REVIEW - Year 2025

The public sector faces considerable challenges that stem from increasing external and internal demands, the need for diverse and complex services, and citizens’ lack of satisfaction and trust in public sector organisations (PSOs). An alternative to traditional public service delivery is the co-creation of public services. Data analytics has been fueled by the availability of immense amounts of data, including textual data, and...

Full text available to download

THE ROLE OF THE POLISH UNIVERSITIES IN SHAPING A NEW MOBILITY CULTURE - ASSUMPTIONS, CONDITIONS, EXPERIENCE. CASE STUDY OF GDANSK UNIVERSITY OF TECHNOLOGY, CRACOW UNIVERSITY OF TECHNOLOGY AND SILESIAN UNIVERSITY OF TECHNOLOGY

Publication

R. Okraszewska
K. Nosal
G. Sierpiński

- Year 2014

The article expresses an idea of the privileged role of universities in the process of shaping the new culture for urban mobility. Mobility management of the academic community is adopted in order to indirectly influence a wider group of the population. The aim of the study was to investigate the situation at the Polish universities and the willingness of their authorities to implement integrated tools to manage the transportation...

Full text to download in external service

Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition

Publication

S. Dziadzio
A. Nabożny
A. Smywiński-Pohl
B. Ziółko

- Year 2015

Full text to download in external service

EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY

Publication

- Year 2014

The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...

Trust Frameworks in Application to Technology in Elections

Publication

D. Duenas Cid
L. Loeber
B. Martin-Rozumiłowicz
R. Macias

- Year 2023

The prevalence of technology in elections has increased in recent decades, both in terms of voting systems as well as ancillary ones. At the same time, the issue of public confidence and trust has come to the fore as certain threat actors have sought to undermine electoral integrity through publicized attacks and disinformation campaigns against such technology. This paper examines the nexus between this public trust and the...

Full text available to download

Mobile application technology in levelling

Publication

M. Bednarczyk
A. Janowski

- Acta Geodynamica et Geomaterialia - Year 2014

The topic of this article is the use of mobile application technology in geodetic measurements with an emphasis on levelling. Reference points were registered as data from levelling benchmarks, performed with a traditional dumpy level. The created application allowed the recording of measurements on location, sending them to a remote server for processing and preparing a report to be saved in a database. The project was to decrease...

Full text available to download

Advanced Modeling of Management Processes in Information Technology

Publication

- Year 2014

This book deals with the issues of modeling management processes of information technology and IT projects while its core is the model of information technology management and its component models (contextual, local) describing initial processing and the maturity capsule as well as a decision-making system represented by a multi-level sequential model of IT technology selection, which acquires a fuzzy rule-based implementation...

The lasting traditions of activities in the field of wood technology researches at the Gdansk University of Technology

Publication

K. Orłowski

- Annals of WULS, Forestry and Wood Technology - Year 2012

This article presents profiles of people from Gdańsk University of Technology who have passed away but whose work was significant in the field of wood science and mechanical wood technology.

A system for Direction-Of-Arrival estimation in ISM 2.4 GHz frequency band based on ESPAR antenna and SDR technology

Publication

P. Kwapisiewicz

- Year 2018

Determination of the direction of the signal arrival (DOA) finds many applications in various areas of science and industry. Knowledge of DOA is used, among others to determine the position of a satellite with a low Earth orbit (LEO), localization of people and things as well as in research of wireless communication systems, for instance the determination of the number of...

Full text available to download

Novel approaches to wideband speech coding

Publication

- Year 2008

Two methods of wideband speech coding are presented. The first method uses an algorithm for time compression and expansion of the speech signal, which enables wideband coding of the speech signal with standardised codecs. This method is intended for use in adaptive speech coding algorithms. The second proposed solution concerns a new method for estimating the spectral envelope of the speech signal...

Full text to download in external service

Broadband interference in speech reinforcement systems

Publication

- Year 2008

The article addresses the underestimated problem of how the number and distribution of loudspeakers in sound reinforcement systems affect the quality of voice transmission, that is, speech intelligibility in auditoria. The superposition of time-shifted broadband signals of the same shape and slightly different magnitudes, which reach the listener from numerous coherent sources, is accompanied by an interference phenomenon that leads to a deep modification of the received signals...

Integration of speech enhancement and coding techniques

Publication

M. Kuropatwinski
D. Leckschat
K. Kroschel
A. Czyzewski
M. Kuropatwiński

- Year 1999

Full text to download in external service

Multitask Noisy Speech Enhancement System

Publication

- Year 2005

The paper describes the Multitask Noisy Speech Enhancement System. It is a specialised software package designed for recording speech signals and for improving their quality and speech intelligibility using advanced digital signal processing procedures. The software package consists of the following programs: Recorder (Rejestrator), Browser (Przeglądarka) and Reconstructor (Rekonstruktor). The software can be used in cases where intelligibility...

A system for multitask noisy speech enhancement.

Publication

A. Czyżewski
A. Kaczmarek
J. Kotus
A. Pawlik
A. Rypulak
P. Żwan

- Year 2004

The article presents a general description of the developed system for speech recording and reconstruction. It describes the components of the system, which is software containing advanced tools for improving speech intelligibility. The tools implemented in the system make it possible to search sound recordings and process them with the implemented plug-ins. The article presents the algorithms used in the system...

Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students

Publication

P. Falkowski-Gilski

- Year 2021

The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...

Full text to download in external service

Language material for English audiovisual speech recognition system development. Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej

Publication

A. Czyżewski
B. Kostek
T. Ciszewski
D. Majewicz

- Year 2013

The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...

Representation of hypertext documents based on terms, Links and text compressibility

Publication

J. Szymański
W. Duch

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2010

Methods of representing text documents based on words, mutual links, and compression methods are described. They were evaluated using an SVM classifier.

Wieloznaczność w języku i tekście [Ambiguity in language and text]

Publication

K. Wojan

- PROGRESS. JOURNAL OF YOUNG RESEARCHERS - Year 2017

Full text to download in external service

Teaching civil engineering in English at Gdansk University of Technology

Publication

R. Jankowski

- Turkish Online Journal of Educational Technology - Year 2016

The effects of globalization, as well as many possibilities of easy and cheap ways of travelling, have led to the increase in number of different types of university studies conducted in English. This paper describes advantages and disadvantages after seven years of experience of conducting three-semester MSc Studies in Civil Engineering in English at Gdansk University of Technology, Poland. The studies started in 2009 after a...

Full text to download in external service

Suitability of LoRaWAN Technology for the Development of Maritime Applications

Publication

Ł. Wiszniewski

- TASK Quarterly - Year 2018

The LoRaWAN Technology opens new possibilities for gathering and analysis of distributed data. In the paper we concentrate on its maritime usability which was tested by us in the period from June to August 2018. Measurements of the LoRaWAN network coverage in the Bay of Gdansk area were carried out. Various conditions and places were tested. The research was planned in such a way as to gradually increase the range and control the...

Full text to download in external service
