Search results for: TEXT ANALYTICS, NATURAL LANGUAGE PROCESSING

total: 395

Intelligent information services 23/24
e-Learning Courses
- J. Szymański
Information retrieval Text categorization Natural language processing
Application of Text Analytics in Public Service Co-Creation: Literature Review and Research Framework
Publication
- N. Rizun
- A. Revina
- N. Edelmann
- Year 2023
The public sector faces several challenges, such as a number of external and internal demands for change, citizens' dissatisfaction and frustration with public sector organizations, that need to be addressed. An alternative to the traditional top-down development of public services is co-creation of public services. Co-creation promotes collaboration between stakeholders with the aim to create better public services and achieve...

Full text available to download
Tomasz Maria Boiński dr inż.

People

Department of Computer Architecture, IT Services Centre

I have been associated with the University since 2000, when I started my studies in Computer Science at the Faculty of Electronics, Telecommunications and Informatics. After graduating with honors in 2005, I applied for doctoral studies. During my studies and immediately afterward I was involved in cooperation with Hogart from Warsaw, in the implementation of business solutions at the Gdynia company Elektronika SA (Infor FMS SunSystems)...
Generating actionable evidence from free-text feedback to improve maternity and acute hospital experiences: A computational text analytics & predictive modelling approach
Publication
- A. Ojo
- N. Rizun
- M. Isazad Mashinchi
- G. Walsh
- J. Gruda
- M. N. Narayana
- M. Venosa
- C. Foley
- D. Rohde
- R. Flynn
- EUROPEAN JOURNAL OF PUBLIC HEALTH - Year 2023
Background: Patient experience surveys are a key source of evidence for supporting decision-making and quality improvement in healthcare services. These surveys contain two main types of questions: closed and open-ended, asking about patients’ care experiences. Apart from the knowledge obtained from analysing closed-ended questions, invaluable insights can be gleaned from free-text data. Advanced analytics techniques are increasingly...

Full text to download in external service
Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience
Publication
- A. Ojo
- N. Rizun
- IEEE Access - Year 2019
Significant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...

Full text available to download
Text-mining Similarity Approximation Operators for Opinion Mining in BI tools
Publication
- N. Rizun
- P. Kapłański
- Y. Taranenko
- S. Alessandro
- Year 2016
The concept of the Text-mining Similarity Approximation Operators for Opinion Mining as extensions to Natural Language Interface Database is defined. The new operators: “keywords of” dimension; subsetting operator “about C is q”; aggregation operator “by similar C” are proposed. These operators are based on the Latent Semantic Analysis and Social Network Analysis

Full text available to download
Towards a Framework for Context Awareness Based on Textual Process Data
Publication
- A. Revina
- N. Rizun
- A. Ünal
- Year 2023
Context awareness is critical for the successful execution of processes. In the abundance of business process management (BPM) research, frameworks exclusively devoted to extracting context from textual process data are scarce. With the deluge of textual data and its increasing value for organizations, it becomes essential to employ relevant text analytics techniques to increase the awareness of business process (BP) workers,...

Full text to download in external service
Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions
Publication
- SENSORS - Year 2021
The paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...

Full text available to download
Relation-based Wikipedia Search System for Factoid Questions Answering
Publication
- A. Brzeski
- T. M. Boiński
- International Journal of Innovative Research in Computer and Communication Engineering - Year 2014
In this paper we propose an alternative keyword search mechanism for Wikipedia, designed as a prototype solution towards factoid questions answering. The method considers relations between articles for finding the best matching article. Unlike the standard Wikipedia search engine and also Google engine, which search the articles content independently, requiring the entire query to be satisfied by a single article, the proposed...

Full text available to download
Previous Opinions is All You Need - Legal Information Retrieval System
Publication
- M. Osowski
- K. Lorenc
- P. Drozda
- R. Scherer
- K. Szałapak
- K. Komar-Komarowski
- J. Szymański
- A. Sobecki
- Year 2023
We present a system for retrieving the legal opinions most relevant to a given legal case or question. To this end, we checked several state-of-the-art neural language models. As training and testing data, we use tens of thousands of legal cases as question-opinion pairs. Text data has been subjected to advanced pre-processing adapted to the specifics of the legal domain. We empirically chose the BERT-based HerBERT model to perform...

Full text to download in external service
Evaluation of a company’s image on social media using the Net Sentiment Rate
Publication
- A. Baj-Rogowska
- Year 2020
Vast amounts of new types of data are constantly being created as a result of dynamic digitization in all areas of our lives. One of the most important and valuable categories for business is data from social networks such as Facebook. Feedback resulting from the sharing of thoughts and emotions, expressed in comments on various products and services, is becoming the key factor on which modern business is based. This feedback is...

Full text to download in external service
Prioritising national healthcare service issues from free text feedback – A computational text analysis & predictive modelling approach
Publication
- A. Ojo
- N. Rizun
- G. Walsh
- M. I. Mashinchi
- M. Venosa
- M. N. Rao
- DECISION SUPPORT SYSTEMS - Year 2024
Patient experience surveys have become a key source of evidence for supporting decision-making and continuous quality improvement within healthcare services. To harness free-text feedback collected as part of these surveys for additional insights, text analytics methods are increasingly employed when the data collected is not amenable to traditional qualitative analysis due to volume. However, while text analytics techniques offer...

Full text available to download
Influence of YARN Schedulers on Power Consumption and Processing Time for Various Big Data Benchmarks
Publication
- TASK Quarterly - Year 2019
Climate change caused by human activities can influence the lives of everybody on the planet. The environmental concerns must be taken into consideration by all fields of study including ICT. Green Computing aims to reduce negative effects of IT on the environment while, at the same time, maintaining all of the possible benefits it provides. Several Big Data platforms like Apache Spark or YARN have become widely used in analytics and...

Full text available to download
What matters most to patients? On the Core Determinants of Patient Experience from Free Text Feedback
Publication
- A. Ojo
- N. Rizun
- Year 2021
Free-text feedback from patients is increasingly used for improving the quality of healthcare services and systems. A major reason for the growing interest in harnessing free-text feedback is the belief that it provides richer information about what patients want and care about. The use of computational approaches such as structural topic modelling for analysing large unstructured textual data such as free-text feedback from patients...

Full text available to download
A survey of automatic speech recognition deep models performance for Polish medical terms
Publication
- Year 2023
Among the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....

Full text to download in external service
Towards Facts Extraction From Texts in Polish Language
Publication
- T. M. Boiński
- A. Brzeski
- International Journal of Innovative Research in Computer and Communication Engineering - Year 2014
The Polish language differs from English in many ways. It has more complicated conjugation and declension. Because of that, automatic extraction of facts from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed fact schemes that are searched for in the analyzed text....

Full text available to download
Assessing business process complexity based on textual data: Evidence from ITIL IT ticket processing
Publication
- N. Rizun
- A. Revina
- V. Maister
- Business Process Management Journal - Year 2021
Purpose: This study aims to draw the attention of business process management (BPM) research and practice to the textual data generated in the processes and the potential of meaningful insights extraction. The authors apply standard natural language processing (NLP) approaches to gain valuable knowledge in the form of business process (BP) complexity concept suggested in the study. It is built on the objective, subjective and meta-knowledge...

Full text available to download
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
Publication
- B. Kostek
- B. Szyca
- Journal of the Acoustical Society of America - Year 2023
The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

Full text available to download
Extracting concepts from the software requirements specification using natural language processing
Publication
- J. Kuchta
- P. Padhiyar
- Year 2018
Extracting concepts from the software requirements is one of the first steps on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....

Full text to download in external service
The image of the City on social media: A comparative study using “Big Data” and “Small Data” methods in the Tri-City Region in Poland
Publication
- J. Huang
- H. Obracht-Prondzyńska
- D. D. Kamrowska-Załuska
- Y. Sun
- L. Li
- LANDSCAPE AND URBAN PLANNING - Year 2021
“The Image of the City” by Kevin Lynch is a landmark planning theory of lasting influence; its scientific rigor and relevance in the digital age were in dispute. The rise of social media and other digital technologies offers new opportunities to study the perception of urban environments. Questions remain as to whether social media analytics can provide a reliable measure of perceived city images? If yes, what implication does...

Full text available to download
Towards facts extraction from text in Polish language
Publication
- T. M. Boiński
- A. Chojnowski
- Year 2017
Natural Language Processing (NLP) finds many uses in different fields of endeavor. Many tools exist for the analysis of the English language. For the Polish language the situation is different, as the language itself is more complicated. In this paper we show the differences between NLP for Polish and for English. Existing solutions are presented and the TEAMS software for facts extraction is described. The paper also shows an evaluation of...

Full text available to download
Agile Commerce in the light of Text Mining
Publication
- A. Baj-Rogowska
- Przedsiębiorczość i Zarządzanie - Year 2017
The survey conducted for this study reveals that more than 84% of respondents have never encountered the term “agile commerce” and do not understand its meaning. At the same time, they are active participants of this strategy. Using digital channels as customers more often than ever before, they have already been included in the agile philosophy. Based on the above, the purpose of the study is to analyse major text sets containing...

Full text available to download
Information Extraction from Polish Radiology Reports using Language Models
Publication
- Year 2023
Radiology reports are vital elements of directing patient care. They are usually delivered in free text form, which makes them prone to errors, such as omission in reporting radiological findings and using difficult-to-comprehend mental shortcuts. Although structured reporting is the recommended method, its adoption continues to be limited. Radiologists find structured reports too limiting and burdensome. In this paper, we propose...

Full text available to download
Understanding the Ukrainian Migrants Challenges in the EU: A Topic Modeling Approach
Publication
- N. Khairova
- N. Rizun
- C. H. Alexopoulos
- M. Ciesielska
- A. Lukashevskyi
- I. Redozub
- Year 2024
Confronted with the aggression against Ukraine in 2022, Europe faces one of the most important humanitarian challenges - the migration of war refugees from Ukraine, most of them women with children and the elderly. Both international institutions such as the European Union and the United Nations, but also national governments and, above all, local governments, which are the main providers of services and resources for refugees,...

Full text available to download
Words context analysis for improvement of information retrieval
Publication
- J. Szymański
- Year 2012
In the article we present an approach to improving information retrieval from large text collections using word context vectors. The vectors have been created by analyzing English Wikipedia with the Hyperspace Analogue to Language model of word similarity. For test phrases we evaluate retrieval with direct user queries as well as retrieval with context vectors of these queries. The results indicate that the proposed method can not...
DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING
Publication
- N. Rizun
- J. Taranenko
- Rocznik Naukowy Wydzialu Zarzadzania w Ciechanowie - Year 2017
The algorithm and the software for conducting the procedure of Preprocessing of the reviews of films in the Polish language were developed. This algorithm contains the following steps: Text Adaptation Procedure; Procedure of Tokenization; Procedure of Transforming Words into the Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Presentation of Documents in the Vector Form (Vector Space Model) Procedure; Forming...

Full text available to download
Modeling the Customer’s Contextual Expectations Based on Latent Semantic Analysis Algorithms
Publication
- Year 2017
Nowadays, in the age of the Internet, access to open data opens up huge possibilities for information retrieval. More and more often we hear about the concept of open data, which means unrestricted access as well as reuse and analysis by external institutions, organizations and people. Such information can be freely processed, combined with other data (a so-called remix) and then published. More and more data are available in text...

Full text available to download
Knowledge Base Suitable for Answering Questions in Natural Language
Publication
- Year 2014
This paper presents three knowledge bases widely used by researchers working on natural language processing: OpenCyc, DBpedia and YAGO. They are characterized from the point of view of a question answering system. A short description of the implementation of such a system is also presented.

Full text to download in external service
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
Publication
- A. Czyżewski
- Journal of the Acoustical Society of America - Year 2023
Text-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is limited because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...

Full text available to download
Infedeltà nel trasferimento delle collocazioni nella traduzione dei romanzi di Michel Houellebecq dal francese all’italiano
Publication
- P. Golda
- Italica Wratislaviensia - Year 2024
Building on my PhD project, this paper explores fidelity challenges in the transfer of verb-nominal collocations (VNC) in the Italian translations of seven of Michel Houellebecq’s novels. I examine various kinds of infidelity, such as omissions, errors, incongruence in constituent transmission, incoherence in recurrent VNC transmission, and infidelity at the level of phraseological coverage. The accurate transfer...

Full text available to download
An Analysis of Neural Word Representations for Wikipedia Articles Classification
Publication
- J. Szymański
- N. Kawalec
- CYBERNETICS AND SYSTEMS - Year 2019
One of the current popular methods of generating word representations is an approach based on the analysis of large document collections with neural networks. It creates so-called word-embeddings that attempt to learn relationships between words and encode this information in the form of a low-dimensional vector. The goal of this paper is to examine the differences between the most popular embedding models and the typical bag-of-words...

Full text to download in external service
Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary
Publication
- N. Rizun
- W. Waloszek
- Year 2018
This paper presents the methodology of Textual Content Classification, which is based on a combination of algorithms: preliminary formation of a contextual framework for the texts in particular problem area; manual creation of the Hierarchical Sentiment Dictionary (HSD) on the basis of a topically-oriented Corpus; tonality texts recognition via using HSD for analysing the documents as a collection of topically completed fragments...

Full text available to download
Nina Rizun dr

People

Department of Informatics in Management

Nina Rizun is an assistant professor at the Faculty of Management and Economics at the Gdańsk University of Technology. In October 1999 she obtained a PhD degree in technical sciences in the Faculty of Enterprise Economy and Production Organization, National Mining Academy, Dnipropetrovsk, Ukraine. PhD thesis title: Development of Complex Subsystem of the Organization and Planning of Mining and Transport Processes. In the years...
Ontologies vs. Rules — Comparison of Methods of Knowledge Representation Based on the Example of IT Services Management
Publication
- A. Czarnecki
- T. Sitek
- Year 2013
This text provides a brief overview of selected structures aimed at knowledge representation in the form of ontologies based on description logic and aims at comparing them with their counterparts based on the rule-based approach. Due to the limitations on the length of the article, only elements associated with the representation of concepts could be shown, without including roles. The formalisms of the OWL language were used...

Full text to download in external service
Rozwijanie kreatywności ucznia w procesie kształtowania umiejętności językowych. Innowacja pedagogiczna z elementami neurodydaktyki w edukacji wczesnoszkolnej
Publication
- B. Grobelna
- Języki Obce w Szkole - Year 2023
This text is a ready-to-use pedagogical innovation program combining teaching English and classes developing creativity in early childhood education. Classes developing creativity are a unique opportunity to implement innovative solutions and ideas to develop language competencies and key competencies, which can be difficult during a standard English lesson. The...
Methodology of Constructing and Analyzing the Hierarchical Contextually-Oriented Corpora
Publication
- N. Rizun
- J. Taranenko
- Year 2018
Methodology of Constructing and Analyzing the Hierarchical structure of the Contextually-Oriented Corpora was developed. The methodology contains the following steps: Contextual Component of the Corpora’s Structure Building; Text Analysis of the Contextually-Oriented Hierarchical Corpus. Main contribution of this study is the following: hierarchical structure of the Corpus provides advanced possibilities for identification of the...

Full text available to download
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
Publication
- D. Piotrowski
- R. Korzeniowski
- A. Falai
- S. Cygert
- K. Pokora
- G. Tinchev
- Z. Zhang
- K. Yanagisawa
- Year 2023
In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...

Full text to download in external service
DBpedia and YAGO as Knowledge Base for Natural Language Based Question Answering—The Evaluation
Publication
- T. M. Boiński
- A. Ambrożewicz
- Advances in Intelligent Systems and Computing - Year 2018
The idea of an automatic question answering system has a very long history. Despite constant improvement of such systems, asking questions in natural language requires very complex solutions. In this paper DBpedia and YAGO are evaluated as knowledge bases for a simple class 1 and 2 question answering system. For this purpose a question answering system was designed and implemented. The proposed solution and the knowledge bases...

Full text available to download
Wieloznaczność w języku i tekście [Ambiguity in language and text]
Publication
- K. Wojan
- PROGRESS. JOURNAL OF YOUNG RESEARCHERS - Year 2017
Full text to download in external service
Automatic prosodic modification in a Text-To-Speech synthesizer of Polish language
Publication
- K. Łopatka
- P. Suchomski
- A. Czyżewski
- Elektronika : konstrukcje, technologie, zastosowania - Year 2011
A Polish speech synthesis system with automatic modification of utterance prosody is presented. Methods for automatically determining the accent and intonation of an utterance are described. The use of speech signal processing algorithms in shaping prosody is presented. The influence of the applied modifications on the naturalness of the synthesized signal is discussed. The method used is based on the TD-PSOLA algorithm. The developed...
Towards Effective Processing of Large Text Collections
Publication
- J. Szymański
- H. Krawczyk
- Year 2012
In the article we describe the approach to parallel implementation of elementary operations for textual data categorization. In the experiments we evaluate parallel computations of similarity matrices and k-means algorithm. The test datasets have been prepared as graphs created from Wikipedia articles related with links. When we create the clustering data packages, we compute pairs of eigenvectors and eigenvalues for visualizations of...
Natural language dictionaries implemented as finite automata
Publication
- J. Daciuk
- J. Piskorski
- S. Ristov
- Year 2010
The chapter presents the use of finite automata as natural language dictionaries. The theoretical foundations are given. Applications are discussed: implementation of a perfect hash function, morphological analysis and synthesis, spelling correction and restoration of diacritics, and information extraction. Algorithms for constructing automata are given, and ways of representing automata, including compression, are discussed.

Full text to download in external service
Processing and structure–property relationships of natural rubber/wheat bran biocomposites
Publication
- K. Formela
- A. Hejna
- Ł. Piszczyk
- M. Saeb
- X. Colom
- CELLULOSE - Year 2016
In this work, wheat bran was used as cellulosic filler in biocomposites based on natural rubber. The impact of wheat bran content [ranging from 10 to 50 parts per hundred rubber (phr)] on processing, structure, dynamic mechanical properties, thermal properties, physico-mechanical properties and morphology of resulting biocomposites was investigated. For better characterization of interfacial interactions between natural rubber...

Full text available to download
Fluent Editor and Controlled Natural Language in Ontology Development
Publication
- P. Weichbroth
- International Journal on Artificial Intelligence Tools - Year 2019
Full text to download in external service
Semantic rules representation in controlled natural language in FluentEditor
Publication
- A. Wróblewska
- P. Kapłański
- P. Zarzycki
- I. Ługowska
- IEEE Industrial Electronics Magazine - Year 2013
This paper presents a way of representation of semantic rules (SWRL) in controlled English in order to facilitate understanding the rules by humans interacting with a machine. This approach (implemented in FluentEditor) may be applied in many domains, where the understandability of the rules used to support a decision process is of great importance.

Full text to download in external service
Advanced Control With PLC—Code Generator for aMPC Controller Implementation and Cooperation With External Computational Server for Dealing With Multidimensionality, Constraints and LMI Based Robustness
Publication
- IEEE Access - Year 2022
The manufacturers of Programmable Logic Controllers (PLC) usually equip their products with extremely simple control algorithms, such as PID and on-off regulators. However, modern PLCs have much more efficient processors and extensive memory, which enables implementing more sophisticated controllers. The paper discusses issues related to the implementation of matrix operations, time limitations for code execution within one PLC...

Full text available to download
Text Technology: A Journal of computer Text Processing

Journals

ISSN: 1496-0958
DBpedia and YAGO Based System for Answering Questions in Natural Language
Publication
- Year 2018
In this paper we propose a method for answering class 1 and class 2 questions (out of 5 classes defined by Moldovan for TREC conference) based on DBpedia and YAGO. Our method is based on generating dependency trees for the query. In the dependency tree we look for paths leading from the root to the named entity of interest. These paths (referenced further as fibers) are candidates for representation of actual user intention. The...

Full text available to download
Clinical situations text database for Polish language
Open Research Data
open access
- A. Czyżewski
- D. Szplit
- J. Bogdan
- B. Graff
- K. Narkiewicz
- K. Marciniuk
- A. Harasimiuk
- P. Odya
- series: ADMEDVOICE
Dataset contains a database of anonymized texts in Polish for the purposes of building a medical speech corpus, for clinical situations in the following areas: medical interview, interview and description of the result of an oncological examination, description of a radiological examination, description of a pathomorphological examination, description...
Should we publish in Chinese? –answers exemplified by articles on OSH and electromagnetism indexed in selected databases
Publication
- W. Sygocki
- E. Korzeniewska
- Przegląd Elektrotechniczny - Year 2022
The article addresses the issues of scientific communication, including the indexing of articles in international databases (Web of ScienceCC, Scopus) and Chinese institutions, including technical universities. One of the important issues in assessing the quality of a scientist's work is the...

Full text available to download
