Search results for: speech-to-text technology

Text Mining Algorithms for Extracting Brand Knowledge; The fashion Industry Case

Publication

- Year 2018

Brand knowledge is determined by customer knowledge. The opportunity to develop brands based on customer knowledge management has never been greater. Social media as a set of leading communication platforms enable peer to peer interplays between customers and brands. A large stream of such interactions is a great source of information which, when thoroughly analyzed, can become a source of innovation and lead to competitive advantage....

Full text available for download in the portal

Virtual reality technology in architectural education

Publication

A. Gębczyńska-Janowicz

- World Transactions on Engineering and Technology Education - Year 2020

Contemporary virtual reality (VR) technology allows the recreation of non-existent architectural objects of which there may be no trace remaining. Virtual reality applications allow access to digital models, which visualise the lost architecture. The popularity of VR has resulted in it being applied not only to computer games, but also in visualising the past. Maps allow movement through historical trails and 3D models of architecture...

Full text available for download in the portal

A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies

Publication

- Year 2022

In this paper, the authors, who specialise in part in neo-Latin studies and the history of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...

Full text available for download in the portal

Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency

Publication

- International Journal of Image Processing and Visual Communication - Year 2013

In this article, methods for deinterlacing and for removing on-screen text and light flashes from endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...

Full text available for download from an external service

Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets

Publication

- Electronics - Year 2022

Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...

Full text available for download in the portal

Comparison of technology adoption models

Publication

A. Landowska

- Year 2019

There are several technology adoption models that try to explain how and why technologies are adopted and used. Among those that are widely used to explain how older adults accept technologies, there are some general models and models specific to the group of older users. Among the general ones I would recommend paying attention to the following models: Technology Acceptance Model (TAM) proposed by Davis...

Full text available for download from an external service

Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience

Publication

- IEEE Access - Year 2019

Significant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...

Full text available for download in the portal

Transformations of descriptive geometry education for architecture students at Gdańsk University of Technology, Poland

Publication

- World Transactions on Engineering and Technology Education - Year 2024

In this article, the authors analyse the evolution of teaching descriptive geometry for architecture students at Gdańsk University of Technology (Gdańsk Tech), Poland. The study traces changes in the curriculum in terms of teaching hours, considering also practices at Politechnika Lwowska (Lwów Polytechnic) and the Technische Hochschule Danzig (Technical University of Gdańsk), before World War II....

Full text available for download from an external service

Study on Speech Transmission under Varying QoS Parameters in a OFDM Communication System

Publication

M. Zamłyńska
P. Falkowski-Gilski
G. Debita
B. Miedziński

- Year 2021

Although there has been an outbreak of multiple multimedia platforms worldwide, speech communication is still the most essential and important type of service. With the spoken word we can exchange ideas, provide descriptive information, as well as aid to another person. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission, based most often on multi-valued modulations, multiple...

Full text available for download from an external service

The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish

Publication

S. Zaporowski

- Year 2024

The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...

Full text available for download in the portal

Internet technology in education - offer of Gdansk University of Technology.

Publication

A. Grabowska

- Year 2004

The article presents the training offer of the Distance Education Centre of Gdańsk University of Technology and the possibilities of using it at the Faculty of Civil Engineering in doctoral and postgraduate studies, which will be launched under the European Union's Fifth Framework Programme, Centres of Excellence (project CURE - Centre for Urban Construction and Rehabilitation: Technology Transfer, Research and Education). A teaching model is presented...

Transfer learning in imagined speech EEG-based BCIs

Publication

J. S. Garcia Salinas
L. Villaseñor-Pineda
C. A. Reyes-García
A. A. Torres-García

- Biomedical Signal Processing and Control - Year 2019

Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems whose aim is to provide a communication channel to any person with a computer. Initially they were proposed to aid people with disabilities, but wider applications have since been proposed. These devices allow users to send messages or to control devices using brain signals. There are different neuro-paradigms which evoke brain signals of interest...

Full text available for download in the portal

Sliding burnishing technology of holes in hardened steel

Publication

W. Przybylski

- Advances in Manufacturing Science and Technology - Year 2016

A new sliding burnishing technology for cylindrical holes made in hardened steel (60 HRC) is presented in the paper. After the burnishing operation on a 30 mm diameter hole in a satellite gear wheel, a surface roughness parameter Ra = 0.02-0.04 micrometers was obtained. The method, the research results, and the technological conclusions are presented.

Full text available for download in the portal

Estimation of the short-term predictor parameters of speech under noisy conditions

Publication

M. Kuropatwiński
W. Kleijn

- IEEE Transactions on Audio Speech and Language Processing - Year 2006

Full text available for download from an external service

Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning

Publication

K. Kąkol

- Year 2023

The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...

Full text available for download in the portal

Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding

Publication

M. Kuropatwiński
W. Kleijn

- Year 2001

Full text available for download from an external service

Text Documents Classification with Support Vector Machines

Publication

P. Majewski

- Year 2008

Parallel Computations of Text Similarities for Categorization Task

Publication

J. Szymański

- Year 2013

In this chapter we describe an approach to the parallel implementation of similarity computations in high-dimensional spaces. The similarity computations have been used for textual data categorization. We create the test datasets from Wikipedia articles, which together with their hyperlinks form a graph used in our experiments. Similarities based on Euclidean distance and the cosine measure have been used to process the data with the k-means algorithm....

Towards Effective Processing of Large Text Collections

Publication

- Year 2012

In the article we describe the approach to parallel implementation of elementary operations for textual data categorization. In the experiments we evaluate parallel computations of similarity matrices and the k-means algorithm. The test datasets have been prepared as graphs created from Wikipedia articles related with links. When we create the clustering data packages, we compute pairs of eigenvectors and eigenvalues for visualizations of...

Bimodal classification of English allophones employing acoustic speech signal and facial motion capture

Publication

- Journal of the Acoustical Society of America - Year 2018

A method for automatic transcription of English speech into International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...

Full text available for download from an external service

Noise profiling for speech enhancement employing machine learning models

Publication

K. Kąkol
G. Korvel
B. Kostek

- Journal of the Acoustical Society of America - Year 2022

This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...

Full text available for download in the portal

Intra-subject class-incremental deep learning approach for EEG-based imagined speech recognition

Publication

J. S. Garcia Salinas
A. A. Torres-García
C. A. Reyes-García
L. Villaseñor-Pineda

- Biomedical Signal Processing and Control - Year 2023

Brain–computer interfaces (BCIs) aim to decode brain signals and transform them into commands for device operation. The present study aimed to decode the brain activity during imagined speech. The BCI must identify imagined words within a given vocabulary and thus perform the requested action. A possible scenario when using this approach is the gradual addition of new words to the vocabulary using incremental learning methods....

Full text available for download in the portal

Developing a Low SNR Resistant, Text Independent Speaker Recognition System for Intercom Solutions - A Case Study

Publication

- Year 2024

This article presents a case study on the development of a biometric voice verification system for an intercom solution, utilizing the DeepSpeaker neural network architecture. Despite the variety of solutions available in the literature, there is a noted lack of evaluations for "text-independent" systems under real conditions and with varying distances between the speaker and the microphone. This article aims to bridge this gap....

Full text available for download in the portal

Intelligent processing of stuttered speech.

Publication

- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Year 2003

The article presents several methods for analysing and automatically counting articulation disfluencies associated with stuttering, based on learning algorithms of artificial neural networks and rough sets.

Automatic Image and Speech Recognition Based on Neural Network

Publication

D. Król
B. Szlachetko

- Journal of Information Technology Research - Year 2010

Full text available for download from an external service

External Validation Measures for Nested Clustering of Text Documents

Publication

- Year 2011

This article handles the problem of validating the results of nested (as opposed to "flat") clusterings. It shows that standard external validation indices used for partitioning clustering validation, like Rand statistics, Hubert Γ statistic, or F-measure, are not applicable in nested clustering cases. In addition to the work where the F-measure was adapted to hierarchical classification as the hF-measure, here some methods to...

Text categorization with semantic commonsense knowledge: First results

Publication

P. Majewski
J. Szymański

- Year 2008

Text processing typically relies on the bag-of-words (BOW) representation. However, this approach does not give good results when similar documents do not share words. The article presents an approach to constructing a kernel function for SVM classifiers based on an external knowledge base of linguistic concepts.

Towards facts extraction from text in Polish language

Publication

- Year 2017

Natural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exist that allow analysis of the English language. For the Polish language the situation is different, as the language itself is more complicated. In this paper we show differences between NLP of the Polish and English languages. Existing solutions are presented and the TEAMS software for facts extraction is described. The paper also shows evaluation of...

Full text available for download in the portal

A method supporting fault-tolerant optical text recognition from video sequences recorded with handheld cameras

Publication

- Engineering Applications of Artificial Intelligence - Year 2023

In the paper a method supporting the optical character recognition from video sequences recorded with cameras without good stabilization is proposed. Due to the presence of various distortions, such as motion blur, shadows, lossy compression artifacts, auto-focusing errors, etc., the quality of individual video frames, e.g., recorded by a smartphone camera, differs noticeably, influencing the results of text recognition, causing...

Full text available for download from an external service

Text analytics for co-creation in public sector organizations: a literature review-based research framework

Publication

N. Rizun
A. Revina
N. Edelmann

- ARTIFICIAL INTELLIGENCE REVIEW - Year 2025

The public sector faces considerable challenges that stem from increasing external and internal demands, the need for diverse and complex services, and citizens’ lack of satisfaction and trust in public sector organisations (PSOs). An alternative to traditional public service delivery is the co-creation of public services. Data analytics has been fueled by the availability of immense amounts of data, including textual data, and...

Full text available for download in the portal

THE ROLE OF THE POLISH UNIVERSITIES IN SHAPING A NEW MOBILITY CULTURE - ASSUMPTIONS, CONDITIONS, EXPERIENCE. CASE STUDY OF GDANSK UNIVERSITY OF TECHNOLOGY, CRACOW UNIVERSITY OF TECHNOLOGY AND SILESIAN UNIVERSITY OF TECHNOLOGY

Publication

R. Okraszewska
K. Nosal
G. Sierpiński

- Year 2014

The article expresses an idea of the privileged role of universities in the process of shaping the new culture for urban mobility. Mobility management of the academic community is adopted in order to indirectly influence a wider group of the population. The aim of the study was to investigate the situation at the Polish universities and the willingness of their authorities to implement integrated tools to manage the transportation...

Full text available for download from an external service

Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition

Publication

S. Dziadzio
A. Nabożny
A. Smywiński-Pohl
B. Ziółko

- Year 2015

Full text available for download from an external service

EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY

Publication

- Year 2014

The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...

Trust Frameworks in Application to Technology in Elections

Publication

D. Duenas Cid
L. Loeber
B. Martin-Rozumiłowicz
R. Macias

- Year 2023

The prevalence of technology in elections has increased in recent decades, both in terms of voting systems as well as ancillary ones. At the same time, the issue of public confidence and trust has come to the fore as certain threat actors have sought to undermine electoral integrity through publicized attacks and disinformation campaigns against such technology. This paper examines the nexus between this public trust and the...

Full text available for download in the portal

Mobile application technology in levelling

Publication

M. Bednarczyk
A. Janowski

- Acta Geodynamica et Geomaterialia - Year 2014

The topic of this article is the use of mobile application technology in geodetic measurements with an emphasis on levelling. Reference points were registered as data from levelling benchmarks, performed with a traditional dumpy level. The created application allowed the recording of measurements on location, sending them to a remote server for processing and preparing a report to be saved in a database. The project was to decrease...

Full text available for download in the portal

Advanced Modeling of Management Processes in Information Technology

Publication

- Year 2014

This book deals with the issues of modeling management processes of information technology and IT projects while its core is the model of information technology management and its component models (contextual, local) describing initial processing and the maturity capsule as well as a decision-making system represented by a multi-level sequential model of IT technology selection, which acquires a fuzzy rule-based implementation...

The lasting traditions of activities in the field of wood technology researches at the Gdansk University of Technology

Publication

K. Orłowski

- Annals of WULS, Forestry and Wood Technology - Year 2012

This article presents profiles of people from Gdańsk University of Technology who have passed away but whose work was significant in the field of wood science and mechanical wood technology.

A system for Direction-Of-Arrival estimation in ISM 2.4 GHz frequency band based on ESPAR antenna and SDR technology

Publication

P. Kwapisiewicz

- Year 2018

Determination of the direction of arrival (DOA) of a signal finds many applications in various areas of science and industry. Knowledge of DOA is used, among other things, to determine the position of a satellite in low Earth orbit (LEO), to localize people and things, and in research on wireless communication systems, for instance the determination of the number of...

Full text available for download in the portal

A system for multitask noisy speech enhancement.

Publication

A. Czyżewski
A. Kaczmarek
J. Kotus
A. Pawlik
A. Rypulak
P. Żwan

- Year 2004

The article presents a general description of the developed system for speech recording and reconstruction. It describes the components of the system, which is software containing advanced tools for improving speech intelligibility. The implemented tools make it possible to search sound recordings and process them using the implemented plug-ins. The article presents the algorithms used in the system...

Integration of speech enhancement and coding techniques

Publication

M. Kuropatwiński
D. Leckschat
K. Kroschel
A. Czyzewski

- Year 1999

Full text available for download from an external service

Novel approaches to wideband speech coding

Publication

- Year 2008

Two methods of wideband speech coding are presented. The first method uses an algorithm for time compression and expansion of the speech signal, which allows wideband coding of the speech signal with standardized codecs. This method is intended for use in adaptive speech coding algorithms. The second proposed solution concerns a new method of estimating the spectral envelope of the speech signal...

Full text available for download from an external service

Broadband interference in speech reinforcement systems

Publication

- Year 2008

The article addresses the underestimated problem of how the number and arrangement of loudspeakers in sound reinforcement systems affect the quality of voice transmission, that is, speech intelligibility in auditoria. The superposition of time-shifted broadband signals of the same shape and slightly different magnitudes, which reach the listener from numerous coherent sources, is accompanied by interference that leads to a profound modification of the received signals...

Multitask Noisy Speech Enhancement System

Publication

- Year 2005

The paper describes the Multitask Noisy Speech Enhancement System. It is a specialized software package designed for recording the speech signal and for improving its quality and speech intelligibility using advanced digital signal processing procedures. The package consists of three programs: Rejestrator (Recorder), Przeglądarka (Viewer), and Rekonstruktor (Reconstructor). The software can be used in cases where intelligibility...

Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students

Publication

P. Falkowski-Gilski

- Year 2021

The user perceived quality is a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...

Full text available for download from an external service

Language material for English audiovisual speech recognition system development. Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej

Publication

A. Czyżewski
B. Kostek
T. Ciszewski
D. Majewicz

- Year 2013

The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...

Representation of hypertext documents based on terms, Links and text compressibility

Publication

J. Szymański
W. Duch

- LECTURE NOTES IN COMPUTER SCIENCE - Year 2010

Methods of representing text documents based on words, mutual links, and compression methods are described. They are evaluated using an SVM classifier.

Wieloznaczność w języku i tekście [Ambiguity in language and text]

Publication

K. Wojan

- PROGRESS. JOURNAL OF YOUNG RESEARCHERS - Year 2017

Full text available for download from an external service

Teaching civil engineering in English at Gdansk University of Technology

Publication

R. Jankowski

- Turkish Online Journal of Educational Technology - Year 2016

The effects of globalization, as well as many possibilities of easy and cheap ways of travelling, have led to the increase in number of different types of university studies conducted in English. This paper describes advantages and disadvantages after seven years of experience of conducting three-semester MSc Studies in Civil Engineering in English at Gdansk University of Technology, Poland. The studies started in 2009 after a...

Full text available for download from an external service

Suitability of LoRaWAN Technology for the Development of Maritime Applications

Publication

Ł. Wiszniewski

- TASK Quarterly - Year 2018

The LoRaWAN Technology opens new possibilities for gathering and analysis of distributed data. In the paper we concentrate on its maritime usability which was tested by us in the period from June to August 2018. Measurements of the LoRaWAN network coverage in the Bay of Gdansk area were carried out. Various conditions and places were tested. The research was planned in such a way as to gradually increase the range and control the...

Full text available for download from an external service
