Search results for: TEXT ANALYTICS, NATURAL LANGUAGE PROCESSING

Application of Text Analytics in Public Service Co-Creation: Literature Review and Research Framework

Publication

N. Rizun
A. Revina
N. Edelmann

- Year 2023

The public sector faces several challenges, such as a number of external and internal demands for change, citizens' dissatisfaction and frustration with public sector organizations, that need to be addressed. An alternative to the traditional top-down development of public services is co-creation of public services. Co-creation promotes collaboration between stakeholders with the aim to create better public services and achieve...

Full text available to download

Generating actionable evidence from free-text feedback to improve maternity and acute hospital experiences: A computational text analytics & predictive modelling approach

Publication

A. Ojo
N. Rizun
M. Isazad Mashinchi
G. Walsh
J. Gruda
M. N. Narayana
M. Venosa
C. Foley
D. Rohde
R. Flynn

- EUROPEAN JOURNAL OF PUBLIC HEALTH - Year 2023

Background Patient experience surveys are a key source of evidence for supporting decision-making and quality improvement in healthcare services. These surveys contain two main types of questions: closed and open-ended, asking about patients’ care experiences. Apart from the knowledge obtained from analysing closed-ended questions, invaluable insights can be gleaned from free-text data. Advanced analytics techniques are increasingly...

Full text to download in external service

Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience

Publication

- IEEE Access - Year 2019

Significant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...

Full text available to download

Text-mining Similarity Approximation Operators for Opinion Mining in BI tools

Publication

N. Rizun
P. Kapłański
Y. Taranenko
S. Alessandro

- Year 2016

The concept of the Text-mining Similarity Approximation Operators for Opinion Mining as extensions to Natural Language Interface Database is defined. The new operators: “keywords of” dimension; subsetting operator “about C is q”; aggregation operator “by similar C” are proposed. These operators are based on the Latent Semantic Analysis and Social Network Analysis

Full text available to download

Towards a Framework for Context Awareness Based on Textual Process Data

Publication

A. Revina
N. Rizun
A. Ünal

- Year 2023

Context awareness is critical for the successful execution of processes. In the abundance of business process management (BPM) research, frameworks exclusively devoted to extracting context from textual process data are scarce. With the deluge of textual data and its increasing value for organizations, it be-comes essential to employ relevant text analytics techniques to increase the awareness of business process (BP) workers,...

Full text to download in external service

Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions

Publication

- SENSORS - Year 2021

The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...

Full text available to download

Relation-based Wikipedia Search System for Factoid Questions Answering

Publication

- International Journal of Innovative Research in Computer and Communication Engineering - Year 2014

In this paper we propose an alternative keyword search mechanism for Wikipedia, designed as a prototype solution towards factoid questions answering. The method considers relations between articles for finding the best matching article. Unlike the standard Wikipedia search engine and also Google engine, which search the articles content independently, requiring the entire query to be satisfied by a single article, the proposed...

Full text available to download

Previous Opinions is All You Need - Legal Information Retrieval System

Publication

M. Osowski
K. Lorenc
P. Drozda
R. Scherer
K. Szałapak
K. Komar-Komarowski
J. Szymański
A. Sobecki

- Year 2023

We present a system for retrieving the most relevant legal opinions to a given legal case or question. To this end, we checked several state-of-the-art neural language models. As a training and testing data, we use tens of thousands of legal cases as question-opinion pairs. Text data has been subjected to advanced pre-processing adapted to the specifics of the legal domain. We empirically chose the BERT-based HerBERT model to perform...

Full text to download in external service

Evaluation of a company’s image on social media using the Net Sentiment Rate

Publication

A. Baj-Rogowska

- Year 2020

Vast amounts of new types of data are constantly being created as a result of dynamic digitization in all areas of our lives. One of the most important and valuable categories for business is data from social networks such as Facebook. Feedback resulting from the sharing of thoughts and emotions, expressed in comments on various products and services, is becoming the key factor on which modern business is based. This feedback is...

Full text to download in external service

Prioritising national healthcare service issues from free text feedback – A computational text analysis & predictive modelling approach

Publication

A. Ojo
N. Rizun
G. Walsh
M. I. Mashinchi
M. Venosa
M. N. Rao

- DECISION SUPPORT SYSTEMS - Year 2024

Patient experience surveys have become a key source of evidence for supporting decision-making and continuous quality improvement within healthcare services. To harness free-text feedback collected as part of these surveys for additional insights, text analytics methods are increasingly employed when the data collected is not amenable to traditional qualitative analysis due to volume. However, while text analytics techniques offer...

Full text available to download

Influence of YARN Schedulers on Power Consumption and Processing Time for Various Big Data Benchmarks

Publication

- TASK Quarterly - Year 2019

Climate change caused by human activities can influence the lives of everybody onthe planet. The environmental concerns must be taken into consideration by all fields of studyincludingICT. Green Computing aims to reduce negative effects of IT on the environment while,at the same time, maintaining all of the possible benefits it provides. Several Big Data platformslike Apache Spark orYARNhave become widely used in analytics and...

Full text available to download

What matters most to patients? On the Core Determinants of Patient Experience from Free Text Feedback

Publication

- Year 2021

Free-text feedback from patients is increasingly used for improving the quality of healthcare services and systems. A major reason for the growing interest in harnessing free-text feedback is the belief that it provides richer information about what patients want and care about. The use of computational approaches such as structural topic modelling for analysing large unstructured textual data such as free-text feedback from patients...

Full text available to download

A survey of automatic speech recognition deep models performance for Polish medical terms

Publication

- Year 2023

Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

Full text to download in external service

Towards Facts Extraction From Texts in Polish Language

Publication

- International Journal of Innovative Research in Computer and Communication Engineering - Year 2014

The Polish language differs from English in many ways. It has more complicated conjugation and declination. Because of that automatic facts extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed facts schemes that are searched for in the analyzed text....

Full text available to download

Assessing business process complexity based on textual data: Evidence from ITIL IT ticket processing

Publication

N. Rizun
A. Revina
V. Maister

- Business Process Management Journal - Year 2021

Purpose This study aims to draw the attention of business process management (BPM) research and practice to the textual data generated in the processes and the potential of meaningful insights extraction. The authors apply standard natural language processing (NLP) approaches to gain valuable knowledge in the form of business process (BP) complexity concept suggested in the study. It is built on the objective, subjective and meta-knowledge...

Full text available to download

SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM

Publication

- Journal of the Acoustical Society of America - Year 2023

The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

Full text available to download

Extracting concepts from the software requirements specification using natural language processing

Publication

- Year 2018

Extracting concepts from the software require¬ments is one of the first step on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....

Full text to download in external service

The image of the City on social media: A comparative study using “Big Data” and “Small Data” methods in the Tri-City Region in Poland

Publication

- LANDSCAPE AND URBAN PLANNING - Year 2021

“The Image of the City” by Kevin Lynch is a landmark planning theory of lasting influence; its scientific rigor and relevance in the digital age were in dispute. The rise of social media and other digital technologies offers new opportunities to study the perception of urban environments. Questions remain as to whether social media analytics can provide a reliable measure of perceived city images? If yes, what implication does...

Full text available to download

Towards facts extraction from text in Polish language

Publication

- Year 2017

Natural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of...

Full text available to download

Agile Commerce in the light of Text Mining

Publication

A. Baj-Rogowska

- Przedsiębiorczość i Zarządzanie - Year 2017

The survey conducted for this study reveals that more than 84% of respondents have never encountered the term “agile commerce” and do not understand its meaning. At the same time, they are active participants of this strategy. Using digital channels as customers more often than ever before, they have already been included in the agile philosophy. Based on the above, the purpose of the study is to analyse major text sets containing...

Full text available to download

Information Extraction from Polish Radiology Reports using Language Models

Publication

- Year 2023

Radiology reports are vital elements of directing patient care. They are usually delivered in free text form, which makes them prone to errors, such as omission in reporting radiological findings and using difficult-to-comprehend mental shortcuts. Although structured reporting is the recommended method, its adoption continues to be limited. Radiologists find structured reports too limiting and burdensome. In this paper, we propose...

Full text available to download

Understanding the Ukrainian Migrants Challenges in the EU: A Topic Modeling Approach

Publication

N. Khairova
N. Rizun
C. H. Alexopoulos
M. Ciesielska
A. Lukashevskyi
I. Redozub

- Year 2024

Confronted with the aggression against Ukraine in 2022, Europe faces one of the most important humanitarian challenges - the migration of war refugees from Ukraine, most of them women with children and the elderly. Both international institutions such as the European Union and the United Nations, but also national governments and, above all, local governments, which are the main providers of services and resources for refugees,...

Full text available to download

Words context analysis for improvement of information retrieval

Publication

J. Szymański

- Year 2012

In the article we present an approach to improvement of retrieval informationfrom large text collections using words context vectors. The vectorshave been created analyzing English Wikipedia with Hyperspace Analogue to Language model of words similarity. For test phrases we evaluate retrieval with direct user queries as well as retrieval with context vectors of these queries. The results indicate that the proposed method can not...

DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING

Publication

- Rocznik Naukowy Wydzialu Zarzadzania w Ciechanowie - Year 2017

The algorithm and the software for conducting the procedure of Preprocessing of the reviews of films in the Polish language were developed. This algorithm contains the following steps: Text Adaptation Procedure; Procedure of Tokenization; Procedure of Transforming Words into the Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Presentation of Documents in the Vector Form (Vector Space Model) Procedure; Forming...

Full text available to download

Modeling the Customer’s Contextual Expectations Based on Latent Semantic Analysis Algorithms

Publication

- Year 2017

Nowadays, in the age of Internet, access to open data detects the huge possibilities for information retrieval. More and more often we hear about the concept of open data which is unrestricted access, in addition to reuse and analysis by external institutions, organizations and people. It’s such information that can be freely processed, add another data (so-called remix) and then published. More and more data are available in text...

Full text available to download

Knowledge Base Suitable for Answering Questions in Natural Language

Publication

- Year 2014

This paper presents three knowledge bases widely used by researchers coping with natural language processing: OpenCyc, DBpedia and YAGO. They are characterized from the point of view of questions answering system. In this paper a short description of the aforementioned system implementation is also presented.

Full text to download in external service

Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning

Publication

A. Czyżewski

- Journal of the Acoustical Society of America - Year 2023

Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Full text available to download

Infedeltà nel trasferimento delle collocazioni nella traduzione dei romanzi di Michel Houellebecq dal francese all’italiano

Publication

P. Golda

- Italica Wratislaviensia - Year 2024

Building on my PhD project, this paper explores fidelity challenges in the transfer of verb-nominal collocations (VNC) in the Italian translations of seven of Michel Houellebecq’s novels. I examine various kinds of infidelity, such as omissions, errors, incongruence in constituent transmission, incoherence in recurrent VNC transmission, and infidelity at the level of phraseological coverage. The accurate transfer...

Full text available to download

An Analysis of Neural Word Representations for Wikipedia Articles Classification

Publication

J. Szymański
N. Kawalec

- CYBERNETICS AND SYSTEMS - Year 2019

One of the current popular methods of generating word representations is an approach based on the analysis of large document collections with neural networks. It creates so-called word-embeddings that attempt to learn relationships between words and encode this information in the form of a low-dimensional vector. The goal of this paper is to examine the differences between the most popular embedding models and the typical bag-of-words...

Full text to download in external service

Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary

Publication

- Year 2018

This paper presents the methodology of Textual Content Classification, which is based on a combination of algorithms: preliminary formation of a contextual framework for the texts in particular problem area; manual creation of the Hierarchical Sentiment Dictionary (HSD) on the basis of a topically-oriented Corpus; tonality texts recognition via using HSD for analysing the documents as a collection of topically completed fragments...

Full text available to download

Ontologies vs. Rules — Comparison of Methods of Knowledge Representation Based on the Example of IT Services Management

Publication

- Year 2013

This text provides a brief overview of selected structures aimed at knowledge representation in the form of ontologies based on description logic and aims at comparing them with their counterparts based on the rule-based approach. Due to the limitations on the length of the article, only elements associated with the representation of concepts could be shown, without including roles. The formalisms of the OWL language were used...

Full text to download in external service

Rozwijanie kreatywności ucznia w procesie kształtowania umiejętności językowych. Innowacja pedagogiczna z elementami neurodydaktyki w edukacji wczesnoszkolnej

Publication

B. Grobelna

- Języki Obce w Szkole - Year 2023

This text is a ready-to-use pedagogical innovation program combining teaching English and classes developing creativity in early childhood education. Classes developing creativity are a unique opportunity to implement innovative solutions and ideas to develop language competencies and key competencies, which can be difficult during a standard English lesson. The...

Methodology of Constructing and Analyzing the Hierarchical Contextually-Oriented Corpora

Publication

- Year 2018

Methodology of Constructing and Analyzing the Hierarchical structure of the Contextually-Oriented Corpora was developed. The methodology contains the following steps: Contextual Component of the Corpora’s Structure Building; Text Analysis of the Contextually-Oriented Hierarchical Corpus. Main contribution of this study is the following: hierarchical structure of the Corpus provides advanced possibilities for identification of the...

Full text available to download

Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech

Publication

D. Piotrowski
R. Korzeniowski
A. Falai
S. Cygert
K. Pokora
G. Tinchev
Z. Zhang
K. Yanagisawa

- Year 2023

In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...

Full text to download in external service

DBpedia and YAGO as Knowledge Base for Natural Language Based Question Answering—The Evaluation

Publication

- Advances in Intelligent Systems and Computing - Year 2018

The idea of automatic question answering system has a very long history. Despite constant improvement of the systems asking questions in the natural language requires very complex solutions. In this paper the DBpedia and YAGO are evaluated as a knowledge bases for simple class 1 and 2 question answering system. For this purpose a question answering system was designed and implemented. The proposed solution and the knowledge bases...

Full text available to download

Wieloznaczność w języku i tekście [Ambiguity in language and text]

Publication

K. Wojan

- PROGRESS. JOURNAL OF YOUNG RESEARCHERS - Year 2017

Full text to download in external service

Automatic prosodic modification in a Text-To-Speech synthesizer of Polish language

Publication

K. Łopatka
P. Suchomski
A. Czyżewski

- Elektronika : konstrukcje, technologie, zastosowania - Year 2011

Przedstawiono system syntezy mowy polskiej z funkcją automatycznej modyfikacji prozodii wypowiedzi. Opisane zostały metody automatycznego wyznaczania akcentu i intonacji wypowiedzi. Przedstawiono zastosowanie algorytmów przetwarzania sygnału mowy w procesie kształtowania prozodii. Omówiono wpływ zastosowanych modyfikacji na naturalność brzmienia syntezowanego sygnału. Zastosowana metoda oparta jest na algorytmie TD-PSOLA. Opracowany...

Towards Effective Processing of Large Text Collections

Publication

- Year 2012

In the article we describe the approach to parallelimplementation of elementary operations for textual data categorization.In the experiments we evaluate parallel computations ofsimilarity matrices and k-means algorithm. The test datasets havebeen prepared as graphs created from Wikipedia articles relatedwith links. When we create the clustering data packages, wecompute pairs of eigenvectors and eigenvalues for visualizationsof...

Natural language dictionaries implemented as finite automata

Publication

J. Daciuk
J. Piskorski
S. Ristov

- Year 2010

Rozdział przedstawia wykorzystanie automatów skończonych jako słowników języka naturalnego. Podane są podstawy teoretyczne. Omówione są zastosowania: realizacja doskonałej funkcji mieszającej, analizy i syntezy morfologicznej, poprawiania pisowni i dopisywania znaków diakrytycznych, wydobywanie informacji. Podano algorytmy tworzenia automatów oraz omówiono sposoby reprezentacji automatów z uwzględnieniem kompresji.

Full text to download in external service

Processing and structure–property relationships of natural rubber/wheat bran biocomposites

Publication

- CELLULOSE - Year 2016

In this work, wheat bran was used as cellulosic filler in biocomposites based on natural rubber. The impact of wheat bran content [ranging from 10 to 50 parts per hundred rubber (phr)] on processing, structure, dynamic mechanical properties, thermal properties, physico-mechanical properties and morphology of resulting biocomposites was investigated. For better characterization of interfacial interactions between natural rubber...

Full text available to download

Fluent Editor and Controlled Natural Language in Ontology Development

Publication

P. Weichbroth

- International Journal on Artificial Intelligence Tools - Year 2019

Full text to download in external service

Semantic rules representation in controlled natural language in FluentEditor

Publication

A. Wróblewska
P. Kapłański
P. Zarzycki
I. Ługowska

- IEEE Industrial Electronics Magazine - Year 2013

This paper presents a way of representation of semantic rules (SWRL) in controlled English in order to facilitate understanding the rules by humans interacting with a machine. This approach (implemented in FluentEditor) may be applied in many domains, where the understandability of the rules used to support a decision process is of great importance.

Full text to download in external service

Advanced Control With PLC—Code Generator for aMPC Controller Implementation and Cooperation With External Computational Server for Dealing With Multidimensionality, Constraints and LMI Based Robustness

Publication

- IEEE Access - Year 2022

The manufacturers of Programmable Logic Controllers (PLC) usually equip their products with extremely simple control algorithms, such as PID and on-off regulators. However, modern PLCs have much more efficient processors and extensive memory, which enables implementing more sophisticated controllers. The paper discusses issues related to the implementation of matrix operations, time limitations for code execution within one PLC...

Full text available to download

DBpedia and YAGO Based System for Answering Questions in Natural Language

Publication

- Year 2018

In this paper we propose a method for answering class 1 and class 2 questions (out of 5 classes defined by Moldovan for TREC conference) based on DBpedia and YAGO. Our method is based on generating dependency trees for the query. In the dependency tree we look for paths leading from the root to the named entity of interest. These paths (referenced further as fibers) are candidates for representation of actual user intention. The...

Full text available to download

Should we publish in Chinese? –answers exemplified by articles on OSH and electromagnetism indexed in selected databases

Publication

W. Sygocki
E. Korzeniewska

- Przegląd Elektrotechniczny - Year 2022

The article addresses the issues of scientific communication, including the indexing of articles in international databases (Web of ScienceCC, Scopus) and Chinese institutions, including technical universities. One of the important issues in assessing the quality of a scientist's work is the...

Full text available to download

Evaluation of Multimedia Stream Processing Modeling Language from the Perspective of Cognitive Dimensions

Publication

- Year 2011

W referacie zawarto opis zastosowania wymiarów poznawczych do oceny języka modelowania przetwarzania strumieni multimedialnych, nazwanego MSP-ML, w trakcie tworzenia tego języka. Poszczególne części referatu prezentują kontekst i motywacje oceny MSP-ML, metodę oceny, rezultaty oceny oraz porównanie rezultatów oceny z wynikami otrzymanymi za pomocą innych metod oceny języków modelowania wizualnego.

Full text to download in external service

Instytucje demokracji bezpośredniej, partycypacyjnej i deliberacyjnej w Gdańsku od 2010 roku

Publication

S. Andrzejewski

- Year 2023

Tematem tej pracy doktorskiej jest studium przypadku stanu demokracji w Gdańsku. Miasto Gdańsk jest uważane jako jedno z najbardziej demokratycznych miast w Polsce, jednak czy to założenie pokrywa się z faktami? Analiza Autora rozprawy doktorskiej jest skupiona na instytucjach demokratycznych na poziomie lokalnym, ze szczególnym uwzględnieniem obywatelskiej inicjatywy uchwałodawczej jako instrumentu...

Ontology-Aided Software Engineering

Publication

P. Kapłański

- Year 2012

This thesis is located between the fields of research on Artificial Intelligence (AI), Knowledge Representation and Reasoning (KRR), Computer-Aided Software Engineering (CASE) and Model Driven Engineering (MDE). The modern offspring of KRR - Description Logic (DL) [Baad03] is considered here as a formalization of the software engineering Methods & Tools. The bridge between the world of formal specification (governed by the mathematics)...

Full text to download in external service

Semantic OLAP with FluentEditor and Ontorion Semantic Excel Toolchain

Publication

D. Dobrowolski
P. Kapłański
A. Marciniak
Z. Łojewski

- Year 2015

Semantic technologies appear as a step on the way to creating systems capable of representing the physical world as real time computational processes. In this context, the paper presents a toolchain for an ontology based knowledge management system. It consists of the ontology editor, FluentEditor and the distributed knowledge representation system, Ontorion. FluentEditor is a comprehensive tool for editing and manipulating complex...

Full text to download in external service

A new library for construction of automata

Publication

J. Daciuk

- Year 2017

We present a new library of functions that construct minimal, acyclic, deterministic, finite-state automata in the same format as the author's fsa package, and also accepted by the author's fadd library of functions that use finite-state automata as dictionaries in natural language processing.

Search

Filters

Catalog

Category

Year

Options

Search results for: TEXT ANALYTICS, NATURAL LANGUAGE PROCESSING