Search results for: TEXT ANALYTICS, NATURAL LANGUAGE PROCESSING
-
Natural Language Semantics
Journals -
Evaluation of Multimedia Stream Processing Modeling Language from the Perspective of Cognitive Dimensions
PublicationW referacie zawarto opis zastosowania wymiarów poznawczych do oceny języka modelowania przetwarzania strumieni multimedialnych, nazwanego MSP-ML, w trakcie tworzenia tego języka. Poszczególne części referatu prezentują kontekst i motywacje oceny MSP-ML, metodę oceny, rezultaty oceny oraz porównanie rezultatów oceny z wynikami otrzymanymi za pomocą innych metod oceny języków modelowania wizualnego.
-
Natural Language & Linguistic Theory
Journals -
Language Context and Text-The Social Semiotics Forum
Journals -
IEEE Transactions on Audio Speech and Language Processing
Journals -
Empirical Methods in Natural Language Processing
Conferences -
Natural Language Processing and Knowledge Engineering
Conferences -
IEEE-ACM Transactions on Audio Speech and Language Processing
Journals -
International Joint Conference on Natural Language Processing
Conferences -
Instytucje demokracji bezpośredniej, partycypacyjnej i deliberacyjnej w Gdańsku od 2010 roku
PublicationTematem tej pracy doktorskiej jest studium przypadku stanu demokracji w Gdańsku. Miasto Gdańsk jest uważane jako jedno z najbardziej demokratycznych miast w Polsce, jednak czy to założenie pokrywa się z faktami? Analiza Autora rozprawy doktorskiej jest skupiona na instytucjach demokratycznych na poziomie lokalnym, ze szczególnym uwzględnieniem obywatelskiej inicjatywy uchwałodawczej jako instrumentu...
-
International Conference on Recent Advances in Natural Language Processing
Conferences -
ACM Transactions on Asian and Low-Resource Language Information Processing
Journals -
Joint Conference on New Methods in Language Processing and Computational Natural Language Learning
Conferences -
Natural Language Engineering
Journals -
Ontology-Aided Software Engineering
PublicationThis thesis is located between the fields of research on Artificial Intelligence (AI), Knowledge Representation and Reasoning (KRR), Computer-Aided Software Engineering (CASE) and Model Driven Engineering (MDE). The modern offspring of KRR - Description Logic (DL) [Baad03] is considered here as a formalization of the software engineering Methods & Tools. The bridge between the world of formal specification (governed by the mathematics)...
-
Conference on Computational Natural Language Learning (Conference on Natural Language Learning)
Conferences -
Jacek Namieśnik prof. dr hab. inż.
PeopleScientific discipline: chemistryRector in 2016-2019 He was born on 10 December, 1949 in Mogilno. He graduated in 1972 at the Faculty of Chemistry at Gdańsk University of Technology, obtaining a master's degree in chemical engineering. In 1972 he started working at Gdańsk University of Technology, where in 1978 he defended his doctoral thesis and in 1985 he completed his habilitation. He was appointed an associate professor in 1991...
-
Jan Daciuk dr hab. inż.
PeopleJan Daciuk received his M.Sc. from the Faculty of Electronics of Gdansk University of Technology in 1986, and his Ph.D. from the Faculty of Electronics, Telecommunications and Informatics of Gdańsk University of Technology in 1999. He has been working at the Faculty from 1988. His research interests include finite state methods in natural language processing and computational linguistics including speech processing. Dr. Daciuk...
-
Semantic OLAP with FluentEditor and Ontorion Semantic Excel Toolchain
PublicationSemantic technologies appear as a step on the way to creating systems capable of representing the physical world as real time computational processes. In this context, the paper presents a toolchain for an ontology based knowledge management system. It consists of the ontology editor, FluentEditor and the distributed knowledge representation system, Ontorion. FluentEditor is a comprehensive tool for editing and manipulating complex...
-
Piotr Krajewski dr
PeoplePiotr Krajewski is a librarian at the Library of Gdańsk University of Technology (GUT) and a PhD student at the Medical University of Gdańsk. His research interests focus on the standardization of the e-resources usage data and Open Access publishing, especially the role of institutional repositories in the development of the OA initiative and the phenomenon of “predatory publishers”. He works at Scientific and Technical Information...
-
Logic and Engineering of Natural Language Semantics
Conferences -
International Natural Language Generation Conference
Conferences -
Applications of Natural Language to Data Bases
Conferences -
European Natural Language Generation Workshop
Conferences -
International Conference on Intelligent Text Processing and Computational Linguistics
Conferences -
A new library for construction of automata
PublicationWe present a new library of functions that construct minimal, acyclic, deterministic, finite-state automata in the same format as the author's fsa package, and also accepted by the author's fadd library of functions that use finite-state automata as dictionaries in natural language processing.
-
Collaborative approach to WordNet and Wikipedia integration
PublicationIn this article we present a collaborative approach tocreating mappings between WordNet and Wikipedia. Wikipediaarticles have been first matched with WordNet synsets in anautomatic way. Then such associations have been evaluated andcomplemented in a collaborative way using a web application.We describe algorithms used for creating automatic mappingsas well as a system for their collaborative development. Theoutcome enables further...
-
From Sequential to Parallel Implementation of NLP Using the Actor Model
PublicationThe article focuses on presenting methods allowing easy parallelization of an existing, sequential Natural Language Processing (NLP) application within a multi-core system. The actor-based solution implemented with the Akka framework has been applied and compared to an application based on Task Parallel Library (TPL) and to the original sequential application. Architectures, data and control flows are described along with execution...
-
An efficient incremental DFA minimization algorithm
PublicationW tym artykule przedstawiamy nowy algorytm minimalizacji deterministycznego automatu skończonego. Algorytm jest przyrostowy - może być zatrzymany w dowolnym momencie, dając częściowo zminimalizowany automat. Wszystkie inne (znane) algorytmy minimalizacji dają wyniki pośrednie nieprzydatne dla częściowej minimalizacji. Ponieważ pierwszy algorytm jest łatwo zrozumiały ale mało wydajny, rozważamy trzy praktyczne, znaczące usprawnienia....
-
Marek Czachor prof. dr hab.
People -
Exact-match Based Wikipedia-WordNet Integration
PublicationAbility to link between WordNet synsets and Wikipedia articles allows usage of those resources by computers during natural language processing. A lot of work was done in this field, however most of the approaches focus on similarity between Wikipedia articles and WordNet synsets rather than creation of perfect matches. In this paper we proposed a set of methods for automatic perfect matching generation. The proposed methods were...
-
Time-domain prosodic modifications for text-to-speech synthesizer
PublicationAn application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
-
Rust QA: question answering dataset for "The Rust Programming Language" in SQuAD 2.0 format
Open Research DataRust QA is a dataset for training and evaluating QA systems. The dataset consists of 1068 questions to "The Rust Programming Language" book (https://doc.rust-lang.org/stable/book/) with the answers provided as text spans from the book. The dataset is released in SQuAD 2.0 format.
-
Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublicationThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
Video recordings of static hand gestures for gesture based interaction
Open Research DataThis data set contains video recording of selected simple hand gestures related to sign language. The purpose of the data set is to evaluate different computer algorithms design for hand gesture detection as well as for hand features and hand pose detection and identification. The data set contains 5 video recordings in mp4 format. Each recording is...
-
International Conference on Advanced Language Processing and Web Information Technology
Conferences -
A Model-Driven Solution for Development of Multimedia Stream Processing Applications
PublicationThis paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...
-
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
PublicationIn this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...
-
Personal adaptive tuning of mobile computer audio
PublicationAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....
-
Adaptive Personal Tuning of Sound in Mobile Computers
PublicationAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...
-
Semantic Analysis and Text Summarization in Socio-Technical Systems
PublicationIn this chapter the authors present the results of the development the methodology for increasing the reliability of the functioning of the Socio-Technical System. The existed methods and algorithms for processing unstructured (textual) information were studied. Taking into account noted above strengths and weaknesses of Discriminant and Probabilistic approaches of Latent Semantic Relations analysis in of the summarization projection...
-
Threat intelligence platform for the energy sector
PublicationIn recent years, critical infrastructures and power systems in particular have been subjected to sophisticated cyberthreats, including targeted attacks and advanced persistent threats. A promising response to this challenging situation is building up enhanced threat intelligence that interlinks information sharing and fine-grained situation awareness. In this paper a framework which integrates all levels of threat intelligence...
-
Elgold partial: Automotive blogs
Open Research DataThe dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
-
Elgold partial: Movie reviews
Open Research DataThe dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Job offers
Open Research DataThe dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
-
Elgold partial: Scientific papers' abstracts
Open Research DataThe dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
-
Elgold partial: Amazon product reviews
Open Research DataThe dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: History blogs
Open Research DataThe dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
English Language Learning Employing Developments in Multimedia IS
PublicationIn the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...
-
Application of Semantic Knowledge Management System in Selected Areas of Polish Public Administration
PublicationThis paper describes an application of semantic technologies and knowledge management systems in chosen areas of Polish public administration. Short analyses of crisis management and EU policy coordination processes are presented. An architecture of a knowledge management system with interfaces using controlled natural language is proposed. A lot of examples are shown that prove a usefulness of semantic knowledge management and...