Search results for: speech-to-text technology
-
Text Mining Algorithms for Extracting Brand Knowledge; The fashion Industry Case
Publication: Brand knowledge is determined by customer knowledge. The opportunity to develop brands based on customer knowledge management has never been greater. Social media, as a set of leading communication platforms, enable peer-to-peer interactions between customers and brands. A large stream of such interactions is a great source of information which, when thoroughly analyzed, can become a source of innovation and lead to competitive advantage....
-
Virtual reality technology in architectural education
Publication: Contemporary virtual reality (VR) technology allows the recreation of non-existent architectural objects of which there may be no trace remaining. Virtual reality applications allow access to digital models, which visualise the lost architecture. The popularity of VR has resulted in it being applied not only to computer games, but also in visualising the past. Maps allow movement through historical trails and 3D models of architecture...
-
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
Publication: In this paper, the authors, who specialise in part in neo-Latin studies and the history of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...
-
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publication: This article discusses methods for deinterlacing endoscopic video images and for removing on-screen text and light flashes from them. The research is intended to improve the performance of disease recognition algorithms. Four configurations of deinterlacing methods and another four configurations of text and flash removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...
-
Recognition of Emotions in Speech Using Convolutional Neural Networks on Different Datasets
Publication: Artificial Neural Network (ANN) models, specifically Convolutional Neural Networks (CNN), were applied to extract emotions based on spectrograms and mel-spectrograms. This study uses spectrograms and mel-spectrograms to investigate which feature extraction method better represents emotions and how big the differences in efficiency are in this context. The conducted studies demonstrated that mel-spectrograms are a better-suited...
-
Comparison of technology adoption models
Publication: Several technology adoption models try to explain how and why technologies are adopted and used. Among those widely used to explain how older adults accept technologies, some are general models and some are specific to the group of older users. Among the general ones I would recommend paying attention to the following models: Technology Acceptance Model (TAM) proposed by Davis...
-
Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience
Publication: Significant progress has been made in linguistic-based text analytics, particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...
-
Transformations of descriptive geometry education for architecture students at Gdańsk University of Technology, Poland
Publication: In this article, the authors analyse the evolution of teaching descriptive geometry for architecture students at Gdańsk University of Technology (Gdańsk Tech), Poland. The study traces changes in the curriculum in terms of teaching hours, considering also practices at Politechnika Lwowska (Lwów Polytechnic) and the Technische Hochschule Danzig (Technical University of Gdańsk), before World War II....
-
Study on Speech Transmission under Varying QoS Parameters in a OFDM Communication System
Publication: Although there has been an outbreak of multiple multimedia platforms worldwide, speech communication is still the most essential and important type of service. With the spoken word we can exchange ideas, provide descriptive information, and offer aid to another person. As the amount of available bandwidth continues to shrink, researchers focus on novel types of transmission, based most often on multi-valued modulations, multiple...
-
The Impact of Foreign Accents on the Performance of Whisper Family Models Using Medical Speech in Polish
Publication: The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...
-
Internet technology in education - offer of Gdansk University of Technology.
Publication: The article presents the training offer of the Distance Education Centre (Centrum Edukacji Niestacjonarnej) of Gdańsk University of Technology and the possibilities of using it at the Faculty of Civil Engineering in doctoral and postgraduate studies, which will be launched under the European Union's Fifth Framework Programme, Centres of Excellence (project CURE - Centre for Urban Construction and Rehabilitation: Technology Transfer, Research and Education). A teaching model is presented...
-
Transfer learning in imagined speech EEG-based BCIs
Publication: Brain–Computer Interfaces (BCI) based on electroencephalograms (EEG) are systems whose aim is to provide a communication channel between a person and a computer. They were initially proposed to aid people with disabilities, but wider applications have since been proposed. These devices make it possible to send messages or to control devices using brain signals. There are different neuro-paradigms which evoke brain signals of interest...
-
Sliding burnishing technology of holes in hardened steel
Publication: A new technology for sliding burnishing of cylindrical holes in hardened steel (60 HRC) is presented in the paper. After the burnishing operation on a hole of 30 mm diameter in a satellite gear wheel, a surface roughness parameter of Ra = 0.02-0.04 micrometers was obtained. The method, the research results, and the technological conclusions are presented.
-
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication
-
Improvement of speech intelligibility in the presence of noise interference using the Lombard effect and an automatic noise interference profiling based on deep learning
Publication: The Lombard effect is a phenomenon that results in speech intelligibility improvement when applied to noise. There are many distinctive features of Lombard speech that were recalled in this dissertation. This work proposes the creation of a system capable of improving speech quality and intelligibility in real-time measured by objective metrics and subjective tests. This system consists of three main components: speech type detection,...
-
Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding
Publication
-
Text Documents Classification with Support Vector Machines
Publication
-
Parallel Computations of Text Similarities for Categorization Task
Publication: In this chapter we describe an approach to the parallel implementation of similarity computations in high-dimensional spaces. The similarity computations have been used for textual data categorization. We create the test datasets from Wikipedia articles that, together with their hyperlinks, form a graph used in our experiments. Similarities based on Euclidean distance and the cosine measure have been used to process the data using the k-means algorithm....
-
Towards Effective Processing of Large Text Collections
Publication: In the article we describe the approach to parallel implementation of elementary operations for textual data categorization. In the experiments we evaluate parallel computations of similarity matrices and k-means algorithm. The test datasets have been prepared as graphs created from Wikipedia articles related with links. When we create the clustering data packages, we compute pairs of eigenvectors and eigenvalues for visualizations of...
-
Bimodal classification of English allophones employing acoustic speech signal and facial motion capture
Publication: A method for automatic transcription of English speech into the International Phonetic Alphabet (IPA) system is developed and studied. The principal objective of the study is to evaluate to what extent the visual data related to lip reading can enhance the recognition accuracy of the transcription of English consonantal and vocalic allophones. To this end, motion capture markers were placed on the faces of seven speakers to obtain lip...
-
Noise profiling for speech enhancement employing machine learning models
Publication: This paper aims to propose a noise profiling method that can be performed in near real-time based on machine learning (ML). To address challenges related to noise profiling effectively, we start with a critical review of the literature background. Then, we outline the experiment performed consisting of two parts. The first part concerns the noise recognition model built upon several baseline classifiers and noise signal features...
-
Intra-subject class-incremental deep learning approach for EEG-based imagined speech recognition
Publication: Brain–computer interfaces (BCIs) aim to decode brain signals and transform them into commands for device operation. The present study aimed to decode the brain activity during imagined speech. The BCI must identify imagined words within a given vocabulary and thus perform the requested action. A possible scenario when using this approach is the gradual addition of new words to the vocabulary using incremental learning methods....
-
Developing a Low SNR Resistant, Text Independent Speaker Recognition System for Intercom Solutions - A Case Study
Publication: This article presents a case study on the development of a biometric voice verification system for an intercom solution, utilizing the DeepSpeaker neural network architecture. Despite the variety of solutions available in the literature, there is a noted lack of evaluations for "text-independent" systems under real conditions and with varying distances between the speaker and the microphone. This article aims to bridge this gap....
-
Intelligent processing of stuttered speech.
Publication: The article presents several methods for the analysis and automatic counting of articulation disfluencies associated with stuttering, based on learning algorithms of artificial neural networks and rough sets.
-
Automatic Image and Speech Recognition Based on Neural Network
Publication
-
External Validation Measures for Nested Clustering of Text Documents
Publication: This article handles the problem of validating the results of nested (as opposed to "flat") clusterings. It shows that standard external validation indices used for partitioning clustering validation, like Rand statistics, Hubert Γ statistic or F-measure, are not applicable in nested clustering cases. In addition to the work in which the F-measure was adapted to hierarchical classification as the hF-measure, here some methods to...
-
Text categorization with semantic commonsense knowledge: First results
Publication: Text processing typically uses the bag-of-words (BOW) representation. However, this approach does not give good results when similar documents do not share words. The article presents an approach to constructing kernel functions for SVM classifiers based on an external knowledge base of linguistic concepts.
-
Towards facts extraction from text in Polish language
Publication: Natural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exist that allow analysis of the English language. For the Polish language the situation is different, as the language itself is more complicated. In this paper we show differences between NLP of the Polish and English languages. Existing solutions are presented and the TEAMS software for facts extraction is described. The paper shows also evaluation of...
-
A method supporting fault-tolerant optical text recognition from video sequences recorded with handheld cameras
Publication: In the paper a method supporting the optical character recognition from video sequences recorded with cameras without good stabilization is proposed. Due to the presence of various distortions, such as motion blur, shadows, lossy compression artifacts, auto-focusing errors, etc., the quality of individual video frames, e.g., recorded by a smartphone camera, differs noticeably, influencing the results of text recognition, causing...
-
Text analytics for co-creation in public sector organizations: a literature review-based research framework
Publication: The public sector faces considerable challenges that stem from increasing external and internal demands, the need for diverse and complex services, and citizens’ lack of satisfaction and trust in public sector organisations (PSOs). An alternative to traditional public service delivery is the co-creation of public services. Data analytics has been fueled by the availability of immense amounts of data, including textual data, and...
-
THE ROLE OF THE POLISH UNIVERSITIES IN SHAPING A NEW MOBILITY CULTURE - ASSUMPTIONS, CONDITIONS, EXPERIENCE. CASE STUDY OF GDANSK UNIVERSITY OF TECHNOLOGY, CRACOW UNIVERSITY OF TECHNOLOGY AND SILESIAN UNIVERSITY OF TECHNOLOGY
Publication: The article expresses the idea of the privileged role of universities in the process of shaping a new culture of urban mobility. Mobility management of the academic community is adopted in order to indirectly influence a wider group of the population. The aim of the study was to investigate the situation at Polish universities and the willingness of their authorities to implement integrated tools to manage the transportation...
-
Comparison of Language Models Trained on Written Texts and Speech Transcripts in the Context of Automatic Speech Recognition
Publication
-
EXAMINING INFLUENCE OF VIDEO FRAMERATE AND AUDIO/VIDEO SYNCHRONIZATION ON AUDIO-VISUAL SPEECH RECOGNITION ACCURACY
Publication: The problem of video framerate and audio/video synchronization in audio-visual speech recognition is considered. The visual features are added to the acoustic parameters in order to improve the accuracy of speech recognition in noisy conditions. The Mel-Frequency Cepstral Coefficients are used on the acoustic side whereas Active Appearance Model features are extracted from the image. The feature fusion approach is employed. The...
-
Trust Frameworks in Application to Technology in Elections
Publication: The prevalence of technology in elections has increased in recent decades, both in terms of voting systems as well as ancillary ones. At the same time, the issue of public confidence and trust has come to the fore as certain threat actors have sought to undermine electoral integrity through publicized attacks and disinformation campaigns against such technology. This paper examines the nexus between this public trust and the...
-
Mobile application technology in levelling
Publication: The topic of this article is the use of mobile application technology in geodetic measurements with an emphasis on levelling. Reference points were registered as data from levelling benchmarks, performed with a traditional dumpy level. The created application allowed the recording of measurements on location, sending them to a remote server for processing and preparing a report to be saved in a database. The project was to decrease...
-
Advanced Modeling of Management Processes in Information Technology
Publication: This book deals with the issues of modeling management processes of information technology and IT projects, while its core is the model of information technology management and its component models (contextual, local) describing initial processing and the maturity capsule as well as a decision-making system represented by a multi-level sequential model of IT technology selection, which acquires a fuzzy rule-based implementation...
-
The lasting traditions of activities in the field of wood technology researches at the Gdansk University of Technology
Publication: This article presents profiles of people from Gdańsk University of Technology who have passed away but whose work was significant in the field of wood science and mechanical wood technology.
-
A system for Direction-Of-Arrival estimation in ISM 2.4 GHz frequency band based on ESPAR antenna and SDR technology
Publication: Determination of the direction of arrival (DOA) of a signal finds many applications in various areas of science and industry. Knowledge of the DOA is used, among other things, to determine the position of a satellite in low Earth orbit (LEO), to localize people and things, and in research on wireless communication systems, for instance the determination of the number of...
-
A system for multitask noisy speech enhancement.
Publication: The article presents a general description of the developed speech recording and reconstruction system. It describes the components of the system, which is software containing advanced tools for improving speech intelligibility. The implemented tools make it possible to search sound recordings and process them using the implemented plug-ins. The article presents the algorithms used in the system...
-
Integration of speech enhancement and coding techniques
Publication
-
Novel approaches to wideband speech coding
Publication: Two methods of wideband speech coding are presented. The first method uses an algorithm for time-scale compression and expansion of the speech signal, which allows wideband coding of the speech signal using standardized codecs. This method is intended for use in adaptive speech coding algorithms. The second proposed solution concerns a new method for estimating the spectral envelope of the speech signal...
-
Broadband interference in speech reinforcement systems
Publication: The article addresses the underestimated problem of how the number and arrangement of loudspeakers in sound reinforcement systems affect the quality of voice transmission, that is, speech intelligibility in auditoria. The superposition of time-shifted broadband signals of the same shape and slightly different magnitudes, which reach the listener from numerous coherent sources, is accompanied by interference that leads to a deep modification of the received signals...
-
Multitask Noisy Speech Enhancement System
Publication: The paper describes the Multitask Noisy Speech Enhancement System. It is a specialized software package for recording the speech signal and for improving its quality and intelligibility using advanced digital signal processing procedures. The package consists of three programs: Rejestrator (Recorder), Przeglądarka (Browser), and Rekonstruktor (Reconstructor). The software can be used in cases where intelligibility...
-
Difference in Perceived Speech Signal Quality Assessment Among Monolingual and Bilingual Teenage Students
Publication: User-perceived quality depends on a mixture of factors, including the background of an individual. The process of auditory perception is discussed in a wide variety of fields, ranging from engineering to medicine. Many studies examine the difference between musicians and non-musicians. Since musical training develops musical hearing and other various auditory capabilities, similar enhancements should be observable in case of bilingual...
-
Language material for English audiovisual speech recognition system development. Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
Publication: The bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training database of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Representation of hypertext documents based on terms, Links and text compressibility
Publication: Methods of representing text documents based on words, mutual links, and compression methods are described. They are evaluated using an SVM classifier.
-
Wieloznaczność w języku i tekście [Ambiguity in language and text]
Publication
-
Teaching civil engineering in English at Gdansk University of Technology
Publication: The effects of globalization, as well as the many easy and cheap ways of travelling, have led to an increase in the number of different types of university studies conducted in English. This paper describes the advantages and disadvantages observed after seven years of experience of conducting three-semester MSc Studies in Civil Engineering in English at Gdansk University of Technology, Poland. The studies started in 2009 after a...
-
Suitability of LoRaWAN Technology for the Development of Maritime Applications
Publication: The LoRaWAN Technology opens new possibilities for gathering and analysis of distributed data. In the paper we concentrate on its maritime usability which was tested by us in the period from June to August 2018. Measurements of the LoRaWAN network coverage in the Bay of Gdansk area were carried out. Various conditions and places were tested. The research was planned in such a way as to gradually increase the range and control the...