Search results for: NATURAL LANGUAGE PROCESSING, LARGE LANGUAGE MODELS, DATA MINING, QUANTUM PHYSICS
-
Dynamic Semantic Visual Information Management
PublicationDominant Internet search engines use keywords and therefore are not suited for exploration of new domains of knowledge, when the user does not know specific vocabulary. Browsing through articles in a large encyclopedia, each presenting a small fragment of knowledge, it is hard to map the whole domain, see relevant concepts and their relations. In Wikipedia for example some highly relevant articles are not linked with each other....
-
Modeling the Customer’s Contextual Expectations Based on Latent Semantic Analysis Algorithms
PublicationNowadays, in the age of the Internet, access to open data opens up huge possibilities for information retrieval. More and more often we hear about the concept of open data, which means unrestricted access along with reuse and analysis by external institutions, organizations and people. Such information can be freely processed, combined with other data (a so-called remix) and then published. More and more data are available in text...
-
Waste management in the mining industry of metals ores, coal, oil and natural gas - A review
PublicationWaste generated due to mining activity poses a serious issue due to the large amounts generated, even up to 65 billion tons per year, and is often associated with the risk posed by its storage and environmental management. This work aims to review waste management in the mining industry of metals ores, coal, oil and natural gas. It includes an analysis and discussion on the possibilities for reuse of certain types of wastes generated...
-
Monitoring of the Process of System Information Broadcasting in Time
PublicationOne of the problems of quantum physics is how a measurement turns quantum, noncopyable data, towards copyable classical knowledge. We use the quantum state discrimination in a central system model to show how its evolution leads to the broadcasting of the information, and how orthogonalization and decoherence factors allow us to monitor the distance of the state in question to the one perfectly broadcasting information, in any...
-
Personal adaptive tuning of mobile computer audio
PublicationAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of the acoustic track to the changing conditions and to the user's individual preferences. Original signal processing algorithms are introduced, which concern: linearization of frequency response, dialogue intelligibility enhancement and dynamics processing tuned up to the user's preferences....
-
Harmony Search for Data Mining with Big Data
PublicationIn this paper, some harmony search algorithms have been proposed for data mining with big data. Three areas of big data processing have been studied to apply new metaheuristics. The first problem is related to MapReduce architecture that can be supported by a team of harmony search agents in grid infrastructure. The second dilemma involves development of harmony search in preprocessing of data series before data mining. Moreover,...
-
Adaptive Personal Tuning of Sound in Mobile Computers
PublicationAn integrated methodology for enhancing audio quality in mobile computers is presented. The key features are adaptation of the characteristics of their acoustic track to changing acoustic conditions of the environment and to users’ individual preferences. Signal processing algorithms are introduced that concern: linearization of frequency response, dialogue intelligibility enhancement, and dynamics processing tuned up to the users’...
-
Reduction of measurement data before Digital Terrain Model generation vs. DTM generalisation
PublicationModern data acquisition technologies provide large datasets that are not always necessary in their entirety to properly accomplish the goal of the study. In addition, such datasets are often cumbersome for rational processing, and their processing is time and labour consuming. Therefore, methods that make it possible to reduce the size of the measurement dataset, such as the generalization of the Digital Terrain Model (DTM) or the reduction...
-
Individual Resources and Intercultural Interactions
PublicationThe work environment in multinational corporations (MNCs) is specific and demanding including intercultural interactions with co-workers and clients and using a foreign language. Some individual resources can help in dealing with these circumstances. Individual resources refer to personal dispositions, competencies and prior experiences. With regard to previous studies, a caravan of personal resources, namely Psychological Capital...
-
Automatic evaluation of information credibility in Semantic Web and Knowledge Grid
PublicationThis article presents a novel algorithm for automatic estimation of information credibility. It concerns information collected in Knowledge Grid and Semantic Web. Possibilities to evaluate the credibility of information in such structures are much greater than those available for WWW sites which use natural language. The rating system presented in this paper estimates credibility automatically on the basis of the following metrics:...
-
Exploring the preferences of Polish EFL teachers towards the accents of English
PublicationThis language attitudes study investigates the preferences of EFL (English as a foreign language) teachers from Poland towards the accents of English they speak and teach. Despite the substantial amount of research on EFL learners, little has been done to investigate the impact of preferences of Polish teachers for different variations of English language on their...
-
Modeling energy consumption of parallel applications
PublicationThe paper presents modeling and simulation of energy consumption of two types of parallel applications: geometric Single Program Multiple Data (SPMD) and divide-and-conquer (DAC). Simulation is performed in a new MERPSYS environment. Model of an application uses the Java language with extension representing message exchange between processes working in parallel. Simulation is performed by running threads representing distinct process...
-
Local hidden–variable models for entangled quantum states
PublicationWhile entanglement and violation of Bell inequalities were initially thought to be equivalent quantum phenomena, we now have different examples of entangled states whose correlations can be described by local hidden-variable models and, therefore, do not violate any of the Bell inequalities. We provide an up-to-date overview of the existing literature regarding local hidden-variable models for entangled quantum states, in both...
-
Automatic Classification of Polish Sign Language Words
PublicationIn the article we present an approach to automatic recognition of hand gestures using the eGlove device. We present the research results of a system for detection and classification of static and dynamic words of Polish language. The results indicate that using eGlove makes it possible to achieve good recognition quality, which can be further improved with additional data sources such as RGB cameras.
-
Systemy Smart Cities - studium przypadku (Smart Cities Systems: A Case Study)
PublicationThe paper presents the architecture of an enterprise service bus used in the construction of information systems processing large amounts of data for decision-making needs at the City Hall in Gdańsk. The key concept of processes of bus development involves installation of developing environment, database connection, flow mechanisms and data presentation. The issue was supported by models such as KPI (Key Processes Identifier) and...
-
University Students’ Research on Artificial Intelligence and Knowledge Management. A Review and Report of Multi-case Studies
PublicationLeading technologies are very attractive for students preparing their theses as the completion of their studies. Such an orientation of students connected with professional experiences seems to be a crucial motivator in the research in the management and business areas where these technologies condition the development of professional activities. The goal of the paper is the analysis of students’ thesis topics defended in the last...
-
Relation-based Wikipedia Search System for Factoid Questions Answering
PublicationIn this paper we propose an alternative keyword search mechanism for Wikipedia, designed as a prototype solution towards factoid questions answering. The method considers relations between articles for finding the best matching article. Unlike the standard Wikipedia search engine and also Google engine, which search the articles content independently, requiring the entire query to be satisfied by a single article, the proposed...
-
Words context analysis for improvement of information retrieval
PublicationIn the article we present an approach to improving information retrieval from large text collections using word context vectors. The vectors have been created by analyzing English Wikipedia with the Hyperspace Analogue to Language model of word similarity. For test phrases we evaluate retrieval with direct user queries as well as retrieval with context vectors of these queries. The results indicate that the proposed method can not...
-
Ontologiczna inżynieria wiedzy (Ontological Knowledge Engineering)
PublicationOntological knowledge engineering is a sound methodological basis, and domain ontologies are an important building block of semantic knowledge representation systems. The article discusses building ontologies using the FluentEditor ontology editor and the CNL (Controlled Natural Language) language. An example ontology covering a fragment of the agricultural production process was used to build a semantic knowledge base. To this end...
-
A Parallel Corpus-Based Approach to the Crime Event Extraction for Low-Resource Languages
PublicationThese days, a lot of crime-related events take place all over the world. Most of them are reported in news portals and social media. Crime-related event extraction from the published texts can allow monitoring, analysis, and comparison of police or criminal activities in different countries or regions. Existing approaches to event extraction mainly suggest processing texts in English, French, Chinese, and some other resource-rich...
-
What is in a name: Defining “high entropy” oxides
PublicationHigh entropy oxides are emerging as an exciting new avenue to design highly tailored functional behaviors that have no traditional counterparts. Study and application of these materials are bringing together scientists and engineers from physics, chemistry, and materials science. The diversity of each of these disciplines comes with perspectives and jargon that may be confusing to those outside of the individual fields,...
-
Assessing business process complexity based on textual data: Evidence from ITIL IT ticket processing
PublicationPurpose: This study aims to draw the attention of business process management (BPM) research and practice to the textual data generated in the processes and the potential of meaningful insights extraction. The authors apply standard natural language processing (NLP) approaches to gain valuable knowledge in the form of the business process (BP) complexity concept suggested in the study. It is built on the objective, subjective and meta-knowledge...
-
Fundamentals of Physics-Based Surrogate Modeling
PublicationChapter 1 was focused on data-driven (or approximation-based) modeling methods. The second major class of surrogates are physics-based models outlined in this chapter. Although they are not as popular, their importance is growing because of the challenges related to construction and handling of approximation surrogates for many real-world problems. The high cost of evaluating computational models, nonlinearity of system responses,...
-
Relativity of arithmetic as a fundamental symmetry of physics
PublicationArithmetic operations can be defined in various ways, even if one assumes commutativity and associativity of addition and multiplication, and distributivity of multiplication with respect to addition. In consequence, whenever one encounters ‘plus’ or ‘times’ one has certain freedom of interpreting this operation. This leads to some freedom in definitions of derivatives, integrals and, thus, practically all equations occurring in...
-
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
PublicationIn this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...
-
Methodology for Processing of 3D Multibeam Sonar Big Data for Comparative Navigation
PublicationAutonomous navigation is an important task for unmanned vehicles operating both on the surface and underwater. A sophisticated solution for autonomous non-global navigational satellite system navigation is comparative (terrain reference) navigation. We present a method for fast processing of 3D multibeam sonar data to make depth area comparable with depth areas from bathymetric electronic navigational charts as source maps during...
-
A survey of automatic speech recognition deep models performance for Polish medical terms
PublicationAmong the numerous applications of speech-to-text technology is the support of documentation created by medical personnel. There are many available speech recognition systems for doctors. Their effectiveness in languages such as Polish should be verified. In connection with our project in this field, we decided to check how well the popular speech recognition systems work, employing models trained for the general Polish language....
-
DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING
PublicationThe algorithm and the software for conducting the procedure of Preprocessing of the reviews of films in the Polish language were developed. This algorithm contains the following steps: Text Adaptation Procedure; Procedure of Tokenization; Procedure of Transforming Words into the Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Presentation of Documents in the Vector Form (Vector Space Model) Procedure; Forming...
-
Remote Sensing Methods In the Study of the Impact of Long-Term Process of Sulphur Mining on Environmental Changes of the Carpathian Foreland
PublicationThe paper presents research on the extent of impact of sulphur mining process and post-mining activities upon properties of selected elements of the environment, as well as the assessment of the influence of indirect effects resulting from many years' process of exploitation of sulphur deposits in the areas of the Carpathian Foreland (south-east Poland). Within the scope of research conducted, the assessment of the extent of...
-
Web-based 3D processing and dissemination of multibeam sonar data
PublicationThe continuous detailed surveys of the various water bodies over time produce a large and ever-increasing volume and density of underwater sounding data. Three-dimensional data, such as those obtained by multibeam sonar systems, are quite complex to manage, and thus their growing numbers increase the pressure on development of new solutions dedicated to processing them. This paper presents a concept system for web-based dissemination...
-
Application of Web-GIS for Dissemination and 3D Visualization of Large-Volume LiDAR Data
PublicationThe increasing number of digital data sources, which allow for semi-automatic collection and storage of information regarding various aspects of life has recently granted a considerable rise in popularity to the term “Big data”. As far as geospatial data is concerned, one of the major sources of Big data are Light Detection And Ranging (LiDAR) scanners, which produce high resolution three-dimensional data on a local scale. The...
-
Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine
PublicationIn order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability...
-
Towards automation of IT systems repairs
PublicationMonitoring and repair are two sides of the on-the-fly maintenance of IT systems. Monitoring is well supported by automatic tools. In contrast, repairs involve much higher human intervention, which negatively affects reliability and efficiency. The paper introduces a method of automating repairs of IT systems which can be integrated with any of the existing monitoring mechanisms. The method is described as a collection of models and...
-
Evaluating experimental molecular physics studies of radiation damage in DNA*
PublicationThe field of Atomic and Molecular Physics (AMP) is a mature field exploring the spectroscopy, excitation, ionisation of atoms and molecules in all three phases. Understanding of the spectroscopy and collisional dynamics of AMP has been fundamental to the development and application of quantum mechanics and is applied across a broad range of disparate disciplines including atmospheric sciences, astrochemistry, combustion and environmental...
-
Introduction to the special issue on machine learning in acoustics
PublicationWhen we started our Call for Papers for a Special Issue on “Machine Learning in Acoustics” in the Journal of the Acoustical Society of America, our ambition was to invite papers in which machine learning was applied to all acoustics areas. They were listed, but not limited to, as follows: • Music and synthesis analysis • Music sentiment analysis • Music perception • Intelligent music recognition • Musical source separation • Singing...
-
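Paradygmat jakościowy w analizie interakcji międzykulturowych – interpretacja na bazie wybranych teorii psychologicznych (The Qualitative Paradigm in the Analysis of Intercultural Interactions: An Interpretation Based on Selected Psychological Theories)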
PublicationIntercultural interactions in a multicultural work environment are a peculiar type of social interactions. The results of prior research on the effects of interactions in such environment are inconclusive. The majority of the previous studies have emphasized problems, applied a quantitative methodology and interpreted the results with regard to social identity and categorization theory, information-processing theory and intergroup contact...
-
Modelling of the High Speed Multi-Pole Synchronous Generator for Application in More Electric Aircraft Power Systems
PublicationIn this paper different models of the synchronous generator are presented. The simulation results compared with the measurements are shown. Certain physical phenomena are included in the described models for the purpose of adequate analysis of the more electric aircraft power system. For different modelling levels, such as functional level or behavioural level, different physical phenomena have been included. Simulation results for...
-
Robustness in Compressed Neural Networks for Object Detection
PublicationModel compression techniques make it possible to significantly reduce the computational cost associated with data processing by deep neural networks with only a minor decrease in average accuracy. Simultaneously, reducing the model size may have a large effect on noisy cases or objects belonging to less frequent classes. It is a crucial problem from the perspective of the models' safety, especially for object detection in the autonomous driving...
-
Terminological and Assertional Queries in KQL Knowledge Access Language
PublicationOne of the directions of development of information systems in recent years is the evolution of data-based systems into knowledge-based systems. As a part of this process there is ongoing work on a whole range of languages for accessing knowledge bases. They can be used in a variety of applications; however, their main drawback is the lack of a clearly defined algebra representing a theoretical basis for them. For instance, such...
-
The particle method for simulation of self-organization phenomena
PublicationThe aim of the work was to design, implement, and use, in a number of experiments, an abstract software environment (an artificial world) suitable for modelling systems consisting of many moving and interacting objects distributed in space. The environment, named DigiHive, is directed towards modeling of complex systems manifested by processes of self-organization, self-reproduction and self-modification. The environment is mainly...
-
Optimizing Medical Personnel Speech Recognition Models Using Speech Synthesis and Reinforcement Learning
PublicationText-to-Speech synthesis (TTS) can be used to generate training data for building Automatic Speech Recognition models (ASR). Access to medical speech data is limited because it is sensitive data that is difficult to obtain for privacy reasons; TTS can help expand the data set. Speech can be synthesized by mimicking different accents, dialects, and speaking styles that may occur in a medical language. Reinforcement Learning (RL), in the...
-
Application of the Chimney Cap as a Method of Improving the Effectiveness of Natural Ventilation in Buildings
PublicationAdequately designed natural ventilation is the cheapest and easiest way to effectively remove indoor pollutants and keep the air inside a building fresh. A prediction of the performance and effectiveness of ventilation in order to determine the design of a ventilation system can provide real and long-term cost savings. The worst time in terms of the efficiency of natural ventilation is the spring-autumn transition period [7]. In...
-
Estimation of the short-term predictor parameters of speech under noisy conditions
Publication -
Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
PublicationSymbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of analyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the “horizontal” and the “vertical” pitch structure. These models are formulated as linear or log-linear interpolations of up to five sub-models, each of which is...
-
Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
PublicationIn this application-oriented paper we consider the problem of elimination of impulsive disturbances, such as clicks, pops and record scratches, from archive audio recordings. The proposed approach is based on bidirectional processing—noise pulses are localized by combining the results of forward-time and backward-time signal analysis. Based on the results of specially designed empirical tests (rather than on the results of theoretical analysis),...
-
Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
PublicationIn this work we present a new Bayesian topic model: latent hierarchical Pitman-Yor process allocation (LHPYA), which uses hierarchical Pitman-Yor process priors for both word and topic distributions, and generalizes a few of the existing topic models, including the latent Dirichlet allocation (LDA), the bigram topic model and the hierarchical Pitman-Yor topic model. Using such priors allows for integration of n-grams with a topic model,...
-
Elimination of Impulsive Disturbances From Stereo Audio Recordings Using Vector Autoregressive Modeling and Variable-order Kalman Filtering
PublicationThis paper presents a new approach to elimination of impulsive disturbances from stereo audio recordings. The proposed solution is based on vector autoregressive modeling of audio signals. Online tracking of signal model parameters is performed using the exponentially weighted least squares algorithm. Detection of noise pulses and model-based interpolation of the irrevocably distorted samples is realized using an adaptive, variable-order...
-
Polish FDI in Central Asian Countries
PublicationSince gaining independence, Kazakhstan, Kyrgyzstan, Tajikistan, Turkmenistan and Uzbekistan gradually opened their markets to foreign investors. Before Poland’s accession to the European Union, the activities of Polish investors in Kazakhstan, Kyrgyzstan, Tajikistan, Turkmenistan and Uzbekistan were based on bilateral treaties concluded by Poland with those countries. Later, except Turkmenistan, they were governed by the partnership...
-
Experimental study on models of cylindrical steel tanks under mining tremors and moderate earthquakes
PublicationThe aim of the study is to show the results of complex shaking table experimental investigation focused on the response of two models of cylindrical steel tanks under mining tremors and moderate earthquakes, including the aspects of diagnosis of structural damage. Firstly, the impact and the sweep-sine tests have been carried out, so as to determine the dynamic properties of models filled with different levels of liquid. Then,...
-
Processing of Satellite Data in the Cloud
PublicationThe dynamic development of digital technologies, especially those dedicated to devices generating large data streams, such as all kinds of measurement equipment (temperature and humidity sensors, cameras, radio-telescopes and satellites – Internet of Things) enables more in-depth analysis of the surrounding reality, including better understanding of various natural phenomena, starting from atomic level reactions, through macroscopic...