Filtry
wszystkich: 166
wybranych: 127
-
Katalog
Filtry wybranego katalogu
Wyniki wyszukiwania dla: NATURAL LANGUAGE PROCESSING (NLP)
-
Extracting concepts from the software requirements specification using natural language processing
PublikacjaExtracting concepts from the software require¬ments is one of the first step on the way to automating the software development process. This task is difficult due to the ambiguity of the natural language used to express the requirements specification. The methods used so far consist mainly of statistical analysis of words and matching expressions with a specific ontology of the domain in which the planned software will be applicable....
-
Towards facts extraction from text in Polish language
PublikacjaNatural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of...
-
From Sequential to Parallel Implementation of NLP Using the Actor Model
PublikacjaThe article focuses on presenting methods allowing easy parallelization of an existing, sequential Natural Language Processing (NLP) application within a multi-core system. The actor-based solution implemented with the Akka framework has been applied and compared to an application based on Task Parallel Library (TPL) and to the original sequential application. Architectures, data and control flows are described along with execution...
-
OrphaGPT: An Adapted Large Language Model for Orphan Diseases Classification
PublikacjaOrphan diseases (OD) represent a category of rare conditions that affect only a relatively small number of individuals. These conditions are often neglected in research due to the challenges posed by their scarcity, making medical advancements difficult. Then, the ever-evolving medical research and diagnosis landscape calls for more attention and innovative approaches to address the complex challenges of rare diseases and OD. Pre-trained...
-
Assessing business process complexity based on textual data: Evidence from ITIL IT ticket processing
PublikacjaPurpose This study aims to draw the attention of business process management (BPM) research and practice to the textual data generated in the processes and the potential of meaningful insights extraction. The authors apply standard natural language processing (NLP) approaches to gain valuable knowledge in the form of business process (BP) complexity concept suggested in the study. It is built on the objective, subjective and meta-knowledge...
-
University Students’ Research on Artificial Intelligence and Knowledge Management. A Review and Report of Multi-case Studies
PublikacjaLeading technologies are very attractive for students preparing their theses as the completion of their studies. Such an orientation of students connected with professional experiences seems to be a crucial motivator in the research in the management and business areas where these technologies condition the development of professional activities. The goal of the paper is the analysis of students’ thesis topics defended in the last...
-
Evaluation of a company’s image on social media using the Net Sentiment Rate
PublikacjaVast amounts of new types of data are constantly being created as a result of dynamic digitization in all areas of our lives. One of the most important and valuable categories for business is data from social networks such as Facebook. Feedback resulting from the sharing of thoughts and emotions, expressed in comments on various products and services, is becoming the key factor on which modern business is based. This feedback is...
-
Knowledge Base Suitable for Answering Questions in Natural Language
PublikacjaThis paper presents three knowledge bases widely used by researchers coping with natural language processing: OpenCyc, DBpedia and YAGO. They are characterized from the point of view of questions answering system. In this paper a short description of the aforementioned system implementation is also presented.
-
Finite automata for compact representation of language models in NLP
PublikacjaPrzedstawiona zostaje technika reprezentacji modeli języka w przetwarzaniu języka naturalnego wymagająca mało pamięci. Po krótkim omówieniu przyczyn poszukiwania oszczędnej reprezentacji takich modeli języka, pokazane jest, jak automaty skończone mogą być użyte w tym celu. Technika może być postrzegana jako zastosowanie i rozszerzenie doskonałej funkcji mieszającej z wykorzystaniem automatów skończonych. Pierwsze doświadczenia...
-
DBpedia and YAGO as Knowledge Base for Natural Language Based Question Answering—The Evaluation
PublikacjaThe idea of automatic question answering system has a very long history. Despite constant improvement of the systems asking questions in the natural language requires very complex solutions. In this paper the DBpedia and YAGO are evaluated as a knowledge bases for simple class 1 and 2 question answering system. For this purpose a question answering system was designed and implemented. The proposed solution and the knowledge bases...
-
Natural language dictionaries implemented as finite automata
PublikacjaRozdział przedstawia wykorzystanie automatów skończonych jako słowników języka naturalnego. Podane są podstawy teoretyczne. Omówione są zastosowania: realizacja doskonałej funkcji mieszającej, analizy i syntezy morfologicznej, poprawiania pisowni i dopisywania znaków diakrytycznych, wydobywanie informacji. Podano algorytmy tworzenia automatów oraz omówiono sposoby reprezentacji automatów z uwzględnieniem kompresji.
-
Processing and structure–property relationships of natural rubber/wheat bran biocomposites
PublikacjaIn this work, wheat bran was used as cellulosic filler in biocomposites based on natural rubber. The impact of wheat bran content [ranging from 10 to 50 parts per hundred rubber (phr)] on processing, structure, dynamic mechanical properties, thermal properties, physico-mechanical properties and morphology of resulting biocomposites was investigated. For better characterization of interfacial interactions between natural rubber...
-
Fluent Editor and Controlled Natural Language in Ontology Development
Publikacja -
Semantic rules representation in controlled natural language in FluentEditor
PublikacjaThis paper presents a way of representation of semantic rules (SWRL) in controlled English in order to facilitate understanding the rules by humans interacting with a machine. This approach (implemented in FluentEditor) may be applied in many domains, where the understandability of the rules used to support a decision process is of great importance.
-
DBpedia and YAGO Based System for Answering Questions in Natural Language
PublikacjaIn this paper we propose a method for answering class 1 and class 2 questions (out of 5 classes defined by Moldovan for TREC conference) based on DBpedia and YAGO. Our method is based on generating dependency trees for the query. In the dependency tree we look for paths leading from the root to the named entity of interest. These paths (referenced further as fibers) are candidates for representation of actual user intention. The...
-
Evaluation of Multimedia Stream Processing Modeling Language from the Perspective of Cognitive Dimensions
PublikacjaW referacie zawarto opis zastosowania wymiarów poznawczych do oceny języka modelowania przetwarzania strumieni multimedialnych, nazwanego MSP-ML, w trakcie tworzenia tego języka. Poszczególne części referatu prezentują kontekst i motywacje oceny MSP-ML, metodę oceny, rezultaty oceny oraz porównanie rezultatów oceny z wynikami otrzymanymi za pomocą innych metod oceny języków modelowania wizualnego.
-
Ontology-Aided Software Engineering
PublikacjaThis thesis is located between the fields of research on Artificial Intelligence (AI), Knowledge Representation and Reasoning (KRR), Computer-Aided Software Engineering (CASE) and Model Driven Engineering (MDE). The modern offspring of KRR - Description Logic (DL) [Baad03] is considered here as a formalization of the software engineering Methods & Tools. The bridge between the world of formal specification (governed by the mathematics)...
-
Semantic OLAP with FluentEditor and Ontorion Semantic Excel Toolchain
PublikacjaSemantic technologies appear as a step on the way to creating systems capable of representing the physical world as real time computational processes. In this context, the paper presents a toolchain for an ontology based knowledge management system. It consists of the ontology editor, FluentEditor and the distributed knowledge representation system, Ontorion. FluentEditor is a comprehensive tool for editing and manipulating complex...
-
A new library for construction of automata
PublikacjaWe present a new library of functions that construct minimal, acyclic, deterministic, finite-state automata in the same format as the author's fsa package, and also accepted by the author's fadd library of functions that use finite-state automata as dictionaries in natural language processing.
-
Collaborative approach to WordNet and Wikipedia integration
PublikacjaIn this article we present a collaborative approach tocreating mappings between WordNet and Wikipedia. Wikipediaarticles have been first matched with WordNet synsets in anautomatic way. Then such associations have been evaluated andcomplemented in a collaborative way using a web application.We describe algorithms used for creating automatic mappingsas well as a system for their collaborative development. Theoutcome enables further...
-
An efficient incremental DFA minimization algorithm
PublikacjaW tym artykule przedstawiamy nowy algorytm minimalizacji deterministycznego automatu skończonego. Algorytm jest przyrostowy - może być zatrzymany w dowolnym momencie, dając częściowo zminimalizowany automat. Wszystkie inne (znane) algorytmy minimalizacji dają wyniki pośrednie nieprzydatne dla częściowej minimalizacji. Ponieważ pierwszy algorytm jest łatwo zrozumiały ale mało wydajny, rozważamy trzy praktyczne, znaczące usprawnienia....
-
Exact-match Based Wikipedia-WordNet Integration
PublikacjaAbility to link between WordNet synsets and Wikipedia articles allows usage of those resources by computers during natural language processing. A lot of work was done in this field, however most of the approaches focus on similarity between Wikipedia articles and WordNet synsets rather than creation of perfect matches. In this paper we proposed a set of methods for automatic perfect matching generation. The proposed methods were...
-
Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublikacjaThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
A Model-Driven Solution for Development of Multimedia Stream Processing Applications
PublikacjaThis paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...
-
Personal adaptive tuning of mobile computer audio
PublikacjaAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....
-
Text-mining Similarity Approximation Operators for Opinion Mining in BI tools
PublikacjaThe concept of the Text-mining Similarity Approximation Operators for Opinion Mining as extensions to Natural Language Interface Database is defined. The new operators: “keywords of” dimension; subsetting operator “about C is q”; aggregation operator “by similar C” are proposed. These operators are based on the Latent Semantic Analysis and Social Network Analysis
-
Adaptive Personal Tuning of Sound in Mobile Computers
PublikacjaAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...
-
Relation-based Wikipedia Search System for Factoid Questions Answering
PublikacjaIn this paper we propose an alternative keyword search mechanism for Wikipedia, designed as a prototype solution towards factoid questions answering. The method considers relations between articles for finding the best matching article. Unlike the standard Wikipedia search engine and also Google engine, which search the articles content independently, requiring the entire query to be satisfied by a single article, the proposed...
-
Threat intelligence platform for the energy sector
PublikacjaIn recent years, critical infrastructures and power systems in particular have been subjected to sophisticated cyberthreats, including targeted attacks and advanced persistent threats. A promising response to this challenging situation is building up enhanced threat intelligence that interlinks information sharing and fine-grained situation awareness. In this paper a framework which integrates all levels of threat intelligence...
-
English Language Learning Employing Developments in Multimedia IS
PublikacjaIn the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...
-
Application of Semantic Knowledge Management System in Selected Areas of Polish Public Administration
PublikacjaThis paper describes an application of semantic technologies and knowledge management systems in chosen areas of Polish public administration. Short analyses of crisis management and EU policy coordination processes are presented. An architecture of a knowledge management system with interfaces using controlled natural language is proposed. A lot of examples are shown that prove a usefulness of semantic knowledge management and...
-
Acoustic Sensing Analytics Applied to Speech in Reverberation Conditions
PublikacjaThe paper aims to discuss a case study of sensing analytics and technology in acoustics when applied to reverberation conditions. Reverberation is one of the issues that makes speech in indoor spaces challenging to understand. This problem is particularly critical in large spaces with few absorbing or diffusing surfaces. One of the natural remedies to improve speech intelligibility in such conditions may be achieved through speaking...
-
Scoreboard Architectural Pattern and Integration of Emotion Recognition Results
PublikacjaThis paper proposes a new design pattern, named Scoreboard , dedicated for applications solving complex, multi-stage, non-deterministic problems. The pattern provides a computational framework for the design and implementation of systems that integrate a large number of diverse specialized modules that may vary in accuracy, solution level, and modality. The Scoreboard is an extension of Blackboard design pattern and comes under...
-
Deep Learning-Based Cellular Nuclei Segmentation Using Transformer Model
PublikacjaAccurate segmentation of cellular nuclei is imperative for various biological and medical applications, such as cancer diagnosis and drug discovery. Histopathology, a discipline employing microscopic examination of bodily tissues, serves as a cornerstone for cancer diagnosis. Nonetheless, the conventional histopathological diagnosis process is frequently marred by time constraints and potential inaccuracies. Consequently, there...
-
GOOD HEALTH PRODUCTS AND POLYMER COMPOSITES OBTAINED WITH THE USE OF NATURAL ORIGIN MICRO- AND NANOFIBERS
PublikacjaThe aim of this work was focused on good health products and polymer composites obtained with the use of natural origin micro- and nanofibers. Many different type of the natural plants are used in the production of good health products and polymer processing like curcuma, cellulose, microalgae and others. Often are used as a antioxidants and fillers. In this work the effect of algae and cellulose as a microfillers on the chosen...
-
Ontology clustering by directions algorithm to expand ontology queries
PublikacjaThis paper concerns formulating ontology queries. It describes existing languages in which ontologies can be queried. It focuses on languages which are intended to be easily understood by users who are willing to retrieve information from ontologies. Such a language can be, for example, a type of controlled natural language (CNL). In this paper a novel algorithm called Ontology Clustering by Directions is presented. The algorithm...
-
The Principles of Model Building Concepts Which Are Applied to the Design Patterns for Smart Cities
PublikacjaThe involvement of citizens into decision-making processes is one of the main features of smart cities. Such commitment is reflected in the form of requirements towards the city, and the benefits which are expected from the city. Requirements and benefits are thus the primary language of communication between decision-makers and urban residents. To develop such a language, it becomes necessary to develop design patterns for Smart...
-
Automatic evaluation of information credibility in Semantic Web and Knowledge Grid
PublikacjaThis article presents a novel algorithm for automatic estimation of information credibility. It concerns information collected in Knowledge Grid and Semantic Web. Possibilities to evaluate the credibility of information in such structures are much greater than those available for WWW sites which use natural language. The rating system presented in this paper estimates credibility automatically on the basis of the following metrics:...
-
Application of the Chimney Cap as a Method of Improving the Effectiveness of Natural Ventilation in Buildings
PublikacjaAdequately designed natural ventilation is the cheapest and easiest way to effectively remove indoor pollutants and keep the air inside a building fresh. A prediction of the performance and effectiveness of ventilation in order to determine the design of a ventilation system can provide real and long-term cost savings. The worst time in terms of the efficiency of natural ventilation is the spring-autumn transition period [7]. In...
-
Ontologiczna inżynieria wiedzy
PublikacjaOntologiczna inżynieria wiedzy jest dobrą podstawą metodologiczną, a ontologie dziedzin przedmiotowych ważnym elementem konstrukcyjnym semantycznych systemów reprezentacji wiedzy. W artykule omówiono budowanie ontologii w oparciu o edytor ontologii FluentEditor i język CNL (Controlled Natural Language). Przykładową ontologię dotyczącą fragmentu procesu produkcji rolniczej wykorzystano do budowy semantycznej bazy wiedzy. W tym celu...
-
Previous Opinions is All You Need - Legal Information Retrieval System
PublikacjaWe present a system for retrieving the most relevant legal opinions to a given legal case or question. To this end, we checked several state-of-the-art neural language models. As a training and testing data, we use tens of thousands of legal cases as question-opinion pairs. Text data has been subjected to advanced pre-processing adapted to the specifics of the legal domain. We empirically chose the BERT-based HerBERT model to perform...
-
A universal IT system architecture for servicing, collecting, storing, processing and presenting data from wireless devices
PublikacjaIn the article we present a universal IT system architecture, which allows one to develop, based on mobile and multiplatform JAVA language, applications capable of working with many different wireless systems in an easy and effective way. Modular system architecture supports efficient data processing and enables convenient presentation of chosen parameters. Additionally, proposed IT system architecture provides easy adoption to...
-
The Effect of Polyurethane Glycolysate on the Structure and Properties of Natural Rubber/Carbon Black Composites
PublikacjaIn this work the use of polyurethane chemical recycling product (i.e. glycolysis of polyurethane waste realized with the mass excess of polymer) as a plasticizer for natural rubber-based composites was proposed. The effect of plasticizer type (napthenic oil and polyurethane foam glycolysate) and amount (2, 4, 6 or 8 parts per 100 parts of natural rubber) on the processing properties of rubber mixtures and chemical structure, swelling,...
-
Introduction to the special issue on machine learning in acoustics
PublikacjaWhen we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...
-
Mushroom Toxins
PublikacjaToxins and Other Harmful Compounds in Foods provides information on the contents, distribution, chemical properties, and biological activity of toxins and other harmful compounds in foods that are natural components of the raw materials, accumulated due to microbial actions and environmental pollution, or are generated due to processing.
-
SMAQ - A Semantic Model for Analitical Queries
PublikacjaWhile the Self-Service Business Intelligence (BI) becomes an important part of organizational BI solutions there is a great need for new tools allowing to construct ad-hoc queries by users with various responsibilities and skills. The paper presents a Semantic Model for Analytical Queries – SMAQ allowing to construct queries by users familiar with business events and terms, but being unaware of database or data warehouse concepts...
-
Geometric Algebra Model of Distributed Representations
PublikacjaFormalism based on GA is an alternative to distributed representation models developed so far-Smolensky's tensor product, Holographic Reduced Representations (HRR) and Binary Spatter Code (BSC). Convolutions are replaced by geometric products, interpretable in terms of geometry which seems to be the most natural language for visualization of higher concepts. This paper recalls the main ideas behind the GA model and investigates...
-
Performance Analysis of the OpenCL Environment on Mobile Platforms
PublikacjaToday’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...
-
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
PublikacjaThe main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...
-
A Parallel Corpus-Based Approach to the Crime Event Extraction for Low-Resource Languages
PublikacjaThese days, a lot of crime-related events take place all over the world. Most of them are reported in news portals and social media. Crime-related event extraction from the published texts can allow monitoring, analysis, and comparison of police or criminal activities in different countries or regions. Existing approaches to event extraction mainly suggest processing texts in English, French, Chinese, and some other resource-rich...