Filters
total: 395
filtered: 180
Search results for: TEXT ANALYTICS, NATURAL LANGUAGE PROCESSING
-
Collaborative approach to WordNet and Wikipedia integration
PublicationIn this article we present a collaborative approach tocreating mappings between WordNet and Wikipedia. Wikipediaarticles have been first matched with WordNet synsets in anautomatic way. Then such associations have been evaluated andcomplemented in a collaborative way using a web application.We describe algorithms used for creating automatic mappingsas well as a system for their collaborative development. Theoutcome enables further...
-
From Sequential to Parallel Implementation of NLP Using the Actor Model
PublicationThe article focuses on presenting methods allowing easy parallelization of an existing, sequential Natural Language Processing (NLP) application within a multi-core system. The actor-based solution implemented with the Akka framework has been applied and compared to an application based on Task Parallel Library (TPL) and to the original sequential application. Architectures, data and control flows are described along with execution...
-
An efficient incremental DFA minimization algorithm
PublicationW tym artykule przedstawiamy nowy algorytm minimalizacji deterministycznego automatu skończonego. Algorytm jest przyrostowy - może być zatrzymany w dowolnym momencie, dając częściowo zminimalizowany automat. Wszystkie inne (znane) algorytmy minimalizacji dają wyniki pośrednie nieprzydatne dla częściowej minimalizacji. Ponieważ pierwszy algorytm jest łatwo zrozumiały ale mało wydajny, rozważamy trzy praktyczne, znaczące usprawnienia....
-
Exact-match Based Wikipedia-WordNet Integration
PublicationAbility to link between WordNet synsets and Wikipedia articles allows usage of those resources by computers during natural language processing. A lot of work was done in this field, however most of the approaches focus on similarity between Wikipedia articles and WordNet synsets rather than creation of perfect matches. In this paper we proposed a set of methods for automatic perfect matching generation. The proposed methods were...
-
Time-domain prosodic modifications for text-to-speech synthesizer
PublicationAn application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
-
Language material for English audiovisual speech recognition system developmen . Materiał językowy do wykorzystania w systemie audiowizualnego rozpoznawania mowy angielskiej
PublicationThe bi-modal speech recognition system requires a 2-sample language input for training and for testing algorithms which precisely depicts natural English speech. For the purposes of the audio-visual recordings, a training data base of 264 sentences (1730 words without repetitions; 5685 sounds) has been created. The language sample reflects vowel and consonant frequencies in natural speech. The recording material reflects both the...
-
A Model-Driven Solution for Development of Multimedia Stream Processing Applications
PublicationThis paper presents results of action research related to model-driven solutions in the area of multimedia stream processing. The practical problem to be solved was the need to support application developers who make their multimedia stream processing applications in a supercomputer environment. The solution consists of a domain-specific visual language for composing complex services from simple services called Multimedia Stream...
-
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
PublicationIn this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...
-
Personal adaptive tuning of mobile computer audio
PublicationAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....
-
Adaptive Personal Tuning of Sound in Mobile Computers
PublicationAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...
-
Threat intelligence platform for the energy sector
PublicationIn recent years, critical infrastructures and power systems in particular have been subjected to sophisticated cyberthreats, including targeted attacks and advanced persistent threats. A promising response to this challenging situation is building up enhanced threat intelligence that interlinks information sharing and fine-grained situation awareness. In this paper a framework which integrates all levels of threat intelligence...
-
Semantic Analysis and Text Summarization in Socio-Technical Systems
PublicationIn this chapter the authors present the results of the development the methodology for increasing the reliability of the functioning of the Socio-Technical System. The existed methods and algorithms for processing unstructured (textual) information were studied. Taking into account noted above strengths and weaknesses of Discriminant and Probabilistic approaches of Latent Semantic Relations analysis in of the summarization projection...
-
English Language Learning Employing Developments in Multimedia IS
PublicationIn the realm of the development of information systems related to education, integrating multimedia technologies offers novel ways to enhance foreign language learning. This study investigates audio-video processing methods that leverage real-time speech rate adjustment and dynamic captioning to support English language acquisition. Through a mixed-methods analysis involving participants from a language school, we explore the impact...
-
Application of Semantic Knowledge Management System in Selected Areas of Polish Public Administration
PublicationThis paper describes an application of semantic technologies and knowledge management systems in chosen areas of Polish public administration. Short analyses of crisis management and EU policy coordination processes are presented. An architecture of a knowledge management system with interfaces using controlled natural language is proposed. A lot of examples are shown that prove a usefulness of semantic knowledge management and...
-
OrphaGPT: An Adapted Large Language Model for Orphan Diseases Classification
PublicationOrphan diseases (OD) represent a category of rare conditions that affect only a relatively small number of individuals. These conditions are often neglected in research due to the challenges posed by their scarcity, making medical advancements difficult. Then, the ever-evolving medical research and diagnosis landscape calls for more attention and innovative approaches to address the complex challenges of rare diseases and OD. Pre-trained...
-
Scoreboard Architectural Pattern and Integration of Emotion Recognition Results
PublicationThis paper proposes a new design pattern, named Scoreboard , dedicated for applications solving complex, multi-stage, non-deterministic problems. The pattern provides a computational framework for the design and implementation of systems that integrate a large number of diverse specialized modules that may vary in accuracy, solution level, and modality. The Scoreboard is an extension of Blackboard design pattern and comes under...
-
Deep Learning-Based Cellular Nuclei Segmentation Using Transformer Model
PublicationAccurate segmentation of cellular nuclei is imperative for various biological and medical applications, such as cancer diagnosis and drug discovery. Histopathology, a discipline employing microscopic examination of bodily tissues, serves as a cornerstone for cancer diagnosis. Nonetheless, the conventional histopathological diagnosis process is frequently marred by time constraints and potential inaccuracies. Consequently, there...
-
GOOD HEALTH PRODUCTS AND POLYMER COMPOSITES OBTAINED WITH THE USE OF NATURAL ORIGIN MICRO- AND NANOFIBERS
PublicationThe aim of this work was focused on good health products and polymer composites obtained with the use of natural origin micro- and nanofibers. Many different type of the natural plants are used in the production of good health products and polymer processing like curcuma, cellulose, microalgae and others. Often are used as a antioxidants and fillers. In this work the effect of algae and cellulose as a microfillers on the chosen...
-
Ontology clustering by directions algorithm to expand ontology queries
PublicationThis paper concerns formulating ontology queries. It describes existing languages in which ontologies can be queried. It focuses on languages which are intended to be easily understood by users who are willing to retrieve information from ontologies. Such a language can be, for example, a type of controlled natural language (CNL). In this paper a novel algorithm called Ontology Clustering by Directions is presented. The algorithm...
-
Comparative Analysis of Text Representation Methods Using Classification
PublicationIn our work, we review and empirically evaluate five different raw methods of text representation that allow automatic processing of Wikipedia articles. The main contribution of the article—evaluation of approaches to text representation for machine learning tasks—indicates that the text representation is fundamental for achieving good categorization results. The analysis of the representation methods creates a baseline that cannot...
-
The Principles of Model Building Concepts Which Are Applied to the Design Patterns for Smart Cities
PublicationThe involvement of citizens into decision-making processes is one of the main features of smart cities. Such commitment is reflected in the form of requirements towards the city, and the benefits which are expected from the city. Requirements and benefits are thus the primary language of communication between decision-makers and urban residents. To develop such a language, it becomes necessary to develop design patterns for Smart...
-
Automatic evaluation of information credibility in Semantic Web and Knowledge Grid
PublicationThis article presents a novel algorithm for automatic estimation of information credibility. It concerns information collected in Knowledge Grid and Semantic Web. Possibilities to evaluate the credibility of information in such structures are much greater than those available for WWW sites which use natural language. The rating system presented in this paper estimates credibility automatically on the basis of the following metrics:...
-
Selection of Relevant Features for Text Classification with K-NN
PublicationIn this paper, we describe five features selection techniques used for a text classification. An information gain, independent significance feature test, chi-squared test, odds ratio test, and frequency filtering have been compared according to the text benchmarks based on Wikipedia. For each method we present the results of classification quality obtained on the test datasets using K-NN based approach. A main advantage of evaluated...
-
Application of the Chimney Cap as a Method of Improving the Effectiveness of Natural Ventilation in Buildings
PublicationAdequately designed natural ventilation is the cheapest and easiest way to effectively remove indoor pollutants and keep the air inside a building fresh. A prediction of the performance and effectiveness of ventilation in order to determine the design of a ventilation system can provide real and long-term cost savings. The worst time in terms of the efficiency of natural ventilation is the spring-autumn transition period [7]. In...
-
Ontologiczna inżynieria wiedzy
PublicationOntologiczna inżynieria wiedzy jest dobrą podstawą metodologiczną, a ontologie dziedzin przedmiotowych ważnym elementem konstrukcyjnym semantycznych systemów reprezentacji wiedzy. W artykule omówiono budowanie ontologii w oparciu o edytor ontologii FluentEditor i język CNL (Controlled Natural Language). Przykładową ontologię dotyczącą fragmentu procesu produkcji rolniczej wykorzystano do budowy semantycznej bazy wiedzy. W tym celu...
-
A universal IT system architecture for servicing, collecting, storing, processing and presenting data from wireless devices
PublicationIn the article we present a universal IT system architecture, which allows one to develop, based on mobile and multiplatform JAVA language, applications capable of working with many different wireless systems in an easy and effective way. Modular system architecture supports efficient data processing and enables convenient presentation of chosen parameters. Additionally, proposed IT system architecture provides easy adoption to...
-
University Students’ Research on Artificial Intelligence and Knowledge Management. A Review and Report of Multi-case Studies
PublicationLeading technologies are very attractive for students preparing their theses as the completion of their studies. Such an orientation of students connected with professional experiences seems to be a crucial motivator in the research in the management and business areas where these technologies condition the development of professional activities. The goal of the paper is the analysis of students’ thesis topics defended in the last...
-
The Effect of Polyurethane Glycolysate on the Structure and Properties of Natural Rubber/Carbon Black Composites
PublicationIn this work the use of polyurethane chemical recycling product (i.e. glycolysis of polyurethane waste realized with the mass excess of polymer) as a plasticizer for natural rubber-based composites was proposed. The effect of plasticizer type (napthenic oil and polyurethane foam glycolysate) and amount (2, 4, 6 or 8 parts per 100 parts of natural rubber) on the processing properties of rubber mixtures and chemical structure, swelling,...
-
Introduction to the special issue on machine learning in acoustics
PublicationWhen we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...
-
Blockchain based Secure Data Exchange between Cloud Networks and Smart Hand-held Devices for use in Smart Cities
PublicationIn relation to smart city planning and management, processing huge amounts of generated data and execution of non-lightweight cryptographic algorithms on resource constraint devices at disposal, is the primary focus of researchers today. To enable secure exchange of data between cloud networks and mobile devices, in particular smart hand held devices, this paper presents Blockchain based approach that disperses a public/free key...
-
Mushroom Toxins
PublicationToxins and Other Harmful Compounds in Foods provides information on the contents, distribution, chemical properties, and biological activity of toxins and other harmful compounds in foods that are natural components of the raw materials, accumulated due to microbial actions and environmental pollution, or are generated due to processing.
-
SMAQ - A Semantic Model for Analitical Queries
PublicationWhile the Self-Service Business Intelligence (BI) becomes an important part of organizational BI solutions there is a great need for new tools allowing to construct ad-hoc queries by users with various responsibilities and skills. The paper presents a Semantic Model for Analytical Queries – SMAQ allowing to construct queries by users familiar with business events and terms, but being unaware of database or data warehouse concepts...
-
Geometric Algebra Model of Distributed Representations
PublicationFormalism based on GA is an alternative to distributed representation models developed so far-Smolensky's tensor product, Holographic Reduced Representations (HRR) and Binary Spatter Code (BSC). Convolutions are replaced by geometric products, interpretable in terms of geometry which seems to be the most natural language for visualization of higher concepts. This paper recalls the main ideas behind the GA model and investigates...
-
Radar with rotary head
PublicationNowadays usage of radars is no longer reserved only for the military purpose. It finds many applications in various areas of science and industry. It may be used in order to obtain extended information about the state of critical infrastructure, like shipyards or petrochemical plants. Furthermore, it may be applied in vision denied environments. The aim of this project...
-
Performance Analysis of the OpenCL Environment on Mobile Platforms
PublicationToday’s smartphones have more and more features that so far were only assigned to personal computers. Every year these devices are composed of better and more efficient components. Everything indicates that modern smartphones are replacing ordinary computers in various activities. High computing power is required for tasks such as image processing, speech recognition and object detection. This paper analyses the performance of...
-
Developing Methods for Building Intelligent Systems of Information Resources Processing Using an Ontological Approach
PublicationThe problem of developing methods of information resource processing is investigated. A formal procedure description of processing text content is developed. A new ontological approach to the implementation of business processes is proposed. Consider that the aim of our work is to develop methods and tools for building intelligent systems of information resource processing, the core of knowledge bases of which are ontology’s, and...
-
SEMANTIC ANALYSIS ALGORITHMS FOR KNOWLEDGE WORKERS SUPPORT
PublicationThe paper examines various aspects of text analysis application for knowledge worker’s activity realization. Conclusions are drawn about the relevance and importance of processing the non-structured textual information in order to increase knowledge worker’s efficiency, as well as their awareness in different branches of science. The paper considers the existing algorithms of texts semantic analysis as the sphere of documents topical...
-
A Parallel Corpus-Based Approach to the Crime Event Extraction for Low-Resource Languages
PublicationThese days, a lot of crime-related events take place all over the world. Most of them are reported in news portals and social media. Crime-related event extraction from the published texts can allow monitoring, analysis, and comparison of police or criminal activities in different countries or regions. Existing approaches to event extraction mainly suggest processing texts in English, French, Chinese, and some other resource-rich...
-
rigid polyurethane foams modified with selected nanofillers
PublicationThe nanofillers: natural montmoryllonite (MMT) - Bentonite, natural MMT modified with a quaternary ammonium salt - Cloisite30B, has been used in rigid polyurethane foams (PUFs). The influence of fillers amounts on processing parameters, physical-mechanical properties (density, water absorption, brittleness and compression strength) and thermal properties (thermal stability, fire behaviours) of such foams has been analysed. The...
-
Human verbal memory encoding is hierarchically distributed in a continuous processing stream
PublicationProcessing of memory is supported by coordinated activity in a network of sensory, association, and motor brain regions. It remains a major challenge to determine where memory is encoded for later retrieval. Here we used direct intracranial brain recordings from epilepsy patients performing free recall tasks to determine the temporal pattern and anatomical distribution of verbal memory encoding across the entire human cortex. High...
-
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication -
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
PublicationThis paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponential ly weighted least squares algo- rithm. Detection of noise pulses an d model-based interpolation of the irrevocably distorted sampl es is realized using an adaptive, variable-order...
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
PublicationIn this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor pr ocess priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bi- gram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of -grams with a topic model,...
-
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
PublicationIn this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...
-
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
PublicationSymbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of an- alyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch struc- ture. These models are formulated as linear or log-linear interpo- lations of up to fi ve sub-models, each of which is...
-
The Role of Proteins in Food
PublicationThis chapter describes the effect of proteins on the sensory attributes and the biological value and safety of foods. The role of proteins depends on their amino acid composition and structure, on changes due to storage and processing, as well as on interactions with other food components. The effect on the sensory quality of foods is brought about by hydrophobicity, solubility, water holding capacity, gelling, film formation,...
-
Architektura a dekonstrukcja. Przypadek Petera Eisenmana i Bernarda Tschumiego
PublicationArchitecture and Deconstruction Case of Peter Eisenman and Bernard Tschumi Introduction Towards deconstruction in architecture Intensive relations between philosophical deconstruction and architecture, which were present in the late 1980s and early 1990s, belong to the past and therefore may be described from a greater than...
-
Preparation and characterization of natural rubber composites highly filled with brewers' spent grain/ground tire rubber hybrid reinforcement
PublicationBrewers' spent grain (BSG) and ground tire rubber (GTR) were applied as low-cost hybrid reinforcement natural rubber (NR). The impact of BSG/GTR ratio (in range: 100/0, 75/25, 50/50, 25/75 and 0/100 phr) on processing and performance properties of highly filled natural rubber composites was evaluated by oscillating disc rheometer, Fourier-transform infrared spectroscopy, thermogravimetric analysis, scanning electron microscopy,...
-
Polyurethanes modified with natural polymers for medical application. I. Polyurethanes/ Chitosan and polyurethane/collagen.
PublicationFor over three decades polyurethanes (PUR or PU) have been reported for application in a variety of medical devices. These polymers consist of hard and soft segments, which allow for more subtle control of their structure and properties. By varying the composition of the different segments, properties of PURcan be tuned up for use in many areas of medicine. Recently there is a great interest in modification of biomedical PUR with...
-
Dynamic Semantic Visual Information Management
PublicationDominant Internet search engines use keywords and therefore are not suited for exploration of new domains of knowledge, when the user does not know specific vocabulary. Browsing through articles in a large encyclopedia, each presenting a small fragment of knowledge, it is hard to map the whole domain, see relevant concepts and their relations. In Wikipedia for example some highly relevant articles are not linked with each other....