displaying 1000 best results Help
Search results for: SEMI-STRUCTURED TEXT
-
Evaluation of Path Based Methods for Conceptual Representation of the Text
PublicationTypical text clustering methods use the bag of words (BoW) representation to describe content of documents. However, this method is known to have several limitations. Employing Wikipedia as the lexical knowledge base has shown an improvement of the text representation for data-mining purposes. Promising extensions of that trend employ hierarchical organization of Wikipedia category system. In this paper we propose three path-based...
-
Interactive Information Search in Text Data Collections
PublicationThis article presents a new idea for retrieving in text repositories, as well as it describes general infrastructure of a system created to implement and test those ideas. The implemented system differs from today’s standard search engine by introducing process of interactive search with users and data clustering. We present the basic algorithms behind our system and measures we used for results evaluation. The achieved results...
-
X-RAY DIFFRACTION STUDY OF BISMUTH LAYER-STRUCTURED MULTIFERROIC CERAMICS
PublicationGoal of the present research was to apply a solid state reaction route to fabricate bismuth layer-structured multiferroic ceramics described with the formula Bi5FeTi3O15 and reveal the influence of processing conditions on its crystal structure and phase composition. Simple oxide powders Bi2O3, TiO2 and Fe2O3 were used to fabricate Aurivillius-type bismuth layer-structured ferroelectrics. Pressureless sintering in ambient air was...
-
Selection of Relevant Features for Text Classification with K-NN
PublicationIn this paper, we describe five features selection techniques used for a text classification. An information gain, independent significance feature test, chi-squared test, odds ratio test, and frequency filtering have been compared according to the text benchmarks based on Wikipedia. For each method we present the results of classification quality obtained on the test datasets using K-NN based approach. A main advantage of evaluated...
-
Text
Journals -
Electrochemicalsynthesis of 3D nano-/micro-structured porous polypyrrole
PublicationIn this work, electrosynthesis of electroactive, 3D nano-/micro-structured porous polypyrrole film is presented. The PPy film was synthesized potentiostatically in a one-step process from aqueous solution of pyrrole and lithium perchlorate. The growth mechanism of such structure included: the formation of typical globular PPy film, followed by the formation of the PPy fibers, which then took part in the formation of 3D highly porous...
-
Structured deformation of granular material in the state of active earth pressure
PublicationThe paper focuses on the ability of granular materials to undergo structured deformation by analysing the data from the retaining wall model tests and discrete element simulations. The structured deformation means the movement of a granular material which produces a stable, regular pattern of multiple shear bands. The paper's primary purpose is to study this kind of deformation for the selected data representing the state of active...
-
Study of Statistical Text Representation Methods for Performance Improvement of a Hierarchical Attention Network
PublicationTo effectively process textual data, many approaches have been proposed to create text representations. The transformation of a text into a form of numbers that can be computed using computers is crucial for further applications in downstream tasks such as document classification, document summarization, and so forth. In our work, we study the quality of text representations using statistical methods and compare them to approaches...
-
Two Stage SVM and kNN Text Documents Classifier
PublicationThe paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
-
Anna Baj-Rogowska dr
PeopleAnna Baj-Rogowska is employed as an assistant professor at the Department of Informatics in Management at the Faculty of Management and Economics, Gdańsk University of Technology. Her higher education is connected with the University of Gdańsk, where she graduated from a master's degree in business informatics, doctoral studies and then obtained a PhD degree in economics in management science (Department of Business Informatics...
-
The Method of a Two-Level Text-Meaning Similarity Approximation of the Customers’ Opinions
PublicationThe method of two-level text-meaning similarity approximation, consisting in the implementation of the classification of the stages of text opinions of customers and identifying their rank quality level was developed. Proposed and proved the significance of major hypotheses, put as the basis of the developed methodology, notably about the significance of suggestions about the existence of analogies between mathematical bases of...
-
Semi-definite programming and quantum information
PublicationThis paper presents a comprehensive exploration of semi-definite programming (SDP) techniques within the context of quantum information. It examines the mathematical foundations of convex optimization, duality, and SDP formulations, providing a solid theoretical framework for addressing optimization challenges in quantum systems. By leveraging these tools, researchers and practitioners can characterize classical and quantum correlations,...
-
Relationship between semi- and fully-device-independent protocols
PublicationWe study the relation between semi and fully device independent protocols. As a tool, we use the correspondence between Bell inequalities and dimension witnesses. We present a method for converting the former into the latter and vice versa. This relation provides us with interesting results for both scenarios. First, we find new random number generation protocols with higher bit rates for both the semi and fully device independent...
-
Thresholding Strategies for Large Scale Multi-Label Text Classifier
PublicationThis article presents an overview of thresholding methods for labeling objects given a list of candidate classes’ scores. These methods are essential to multi-label classification tasks, especially when there are a lot of classes which are organized in a hierarchy. Presented techniques are evaluated using the state-of-the-art dedicated classifier on medium scale text corpora extracted from Wikipedia. Obtained results show that the...
-
Nina Rizun dr
PeopleNina Rizun is an Assistant Professor at the Faculty of Management and Economics at the Gdańsk University of Technology. In October 1999, she obtained a PhD in Technical Sciences from the Faculty of Enterprise Economy and Production Organization, National Mining Academy, Dnipropetrovsk, Ukraine. PhD thesis title: Development of Complex Subsystem of the Organization and Planning of Mining and Transport Processes. From 1993-2000,...
-
Text-mining Similarity Approximation Operators for Opinion Mining in BI tools
PublicationThe concept of the Text-mining Similarity Approximation Operators for Opinion Mining as extensions to Natural Language Interface Database is defined. The new operators: “keywords of” dimension; subsetting operator “about C is q”; aggregation operator “by similar C” are proposed. These operators are based on the Latent Semantic Analysis and Social Network Analysis
-
TEORIA DECYZYJNYCH PROCESÓW SEMI-MARKOWA I JEJ ZASTOSOWANIE W PROJEKTOWANIU I EKSPLOATACJI OKRĘTOWYCH SILNIKÓW GŁÓWNYCH I INNYCH URZĄDZEŃ SIŁOWNI OKRĘTOWYCH
PublicationW referacie zaprezentowano znaczenie teorii procesów semi-Markowa w naukach technicznych, zwłaszcza w teorii niezawodności urządzeń technicznych, teorii bezpieczeństwa ich działania oraz statystycznej teorii podejmowania decyzji eksploatacyjnych. W referacie wyeksponowano także przydatność teorii procesów semi-Markowa w teorii i praktyce eksploatacji wspomnianych urządzeń technicznych na przykładzie tak istotnych urządzeń w transporcie...
-
New polish catalogue of typical flexible and semi-rigid pavements
PublicationThe paper covers the following topics important for the development of the new Polish Catalogue of typical flexible and semi-rigid pavements: reasons for preparing the new issue of the Catalogue of typical flexible and semi-rigid pavements, items introduced in the new issue, organise the terminology related to pavements, design traffic calculations and new equivalent axle load factors,...
-
Physiologically structured populations with diffusion
PublicationRozważamy strukturalny model populacyjny z dyfuzją wraz z dołączonymi warunkami brzegowymi Wentzella. Badamy przypadek o stałych współczynnikach. Dowodzimy nieujemności rozwiązań oraz badamy ich asymptotyczne zachowanie poprzez zasadę maksimum. Symulacje numeryczne potwierdzają nasze teoretyczne rozważania.
-
Electrochemical behavior of a composite material containing 3D-structured diatom biosilica
Publication3D-structured diatom biosilica mixed with conducting carbon black was investigated as an active electrode material for lithium-ion batteries. Diatom biosilica was obtained by cultivation of the selected diatom species under laboratory conditions. Several instrumental techniques (XRD, FTIR, Raman, SEM-EDX, TGA) were used to characterize the physicochemical properties of applied biosilica. It was evidenced that the prepared new composite...
-
Sharp bounds for the complexity of semi-equitable coloring of cubic and subcubic graphs
PublicationIn this paper we consider the complexity of semi-equitable k-coloring of the vertices of a cubic or subcubic graph. We show that, given n-vertex subcubic graph G, a semi-equitable k-coloring of G is NP-hard if s >= 7n/20 and polynomially solvable if s <= 7n/21, where s is the size of maximum color class of the coloring.
-
What matters most to patients? On the Core Determinants of Patient Experience from Free Text Feedback
PublicationFree-text feedback from patients is increasingly used for improving the quality of healthcare services and systems. A major reason for the growing interest in harnessing free-text feedback is the belief that it provides richer information about what patients want and care about. The use of computational approaches such as structural topic modelling for analysing large unstructured textual data such as free-text feedback from patients...
-
Tight bounds on the complexity of semi-equitable coloring of cubic and subcubic graphs
PublicationWe consider the complexity of semi-equitable k-coloring, k>3, of the vertices of a cubic or subcubic graph G. In particular, we show that, given a n-vertex subcubic graph G, it is NP-complete to obtain a semi-equitable k-coloring of G whose non-equitable color class is of size s if s>n/3, and it is polynomially solvable if s, n/3.
-
Applications of semi-definite optimization in quantum information protocols
PublicationThis work is concerned with the issue of applications of the semi-definite programming (SDP) in the field of quantum information sci- ence. Our results of the analysis of certain quantum information protocols using this optimization technique are presented, and an implementation of a relevant numerical tool is introduced. The key method used is NPA discovered by Navascues et al. [Phys. Rev. Lett. 98, 010401 (2007)]. In chapter...
-
Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary
PublicationThis paper presents the methodology of Textual Content Classification, which is based on a combination of algorithms: preliminary formation of a contextual framework for the texts in particular problem area; manual creation of the Hierarchical Sentiment Dictionary (HSD) on the basis of a topically-oriented Corpus; tonality texts recognition via using HSD for analysing the documents as a collection of topically completed fragments...
-
Equitable and semi-equitable coloring of cubic graphs and its application in batch scheduling
PublicationIn the paper we consider the problems of equitable and semi-equitable coloring of vertices of cubic graphs. We show that in contrast to the equitable coloring, which is easy, the problem of semi-equitable coloring is NP- complete within a broad spectrum of graph parameters. This affects the complexity of batch scheduling of unit-length jobs with cubic incompatibility graph on three uniform processors to minimize...
-
Nitrogen mass balance and mathematical model of a Structured- Bed Reactor (SBRRIA)
PublicationAn up-flow structured-bed reactor subjected to recirculation and intermittent aeration (SBRRIA) was operated under varying influent conditions. The influent COD/N ratio was adjusted by maintaining a stable organic loading rate (average of 1.07 kgCOD m-3 d-1) and changing the nitrogen loading rate from 0.1 to 0.4 kg N m-3. Therefore, three different COD/N ratios were tested: 9.7±1 (Scenario 1); 7.6±1 (Scenario 2) and 2.9±1 (Scenario...
-
Text (new tilte Text and Talk)
Journals -
Exploring the Usability and User Experience of Social Media Apps through a Text Mining Approach
PublicationThis study aims to evaluate the applicability of a text mining approach for extracting UUX-related issues from a dataset of user comments and not to evaluate the Instagram (IG) app. This study analyses textual data mined from reviews in English written by IG mobile application users. The article’s authors used text mining (based on the LDA algorithm) to identify the main UUX-related topics. Next, they mapped the identified topics...
-
Application of Text Analytics in Public Service Co-Creation: Literature Review and Research Framework
PublicationThe public sector faces several challenges, such as a number of external and internal demands for change, citizens' dissatisfaction and frustration with public sector organizations, that need to be addressed. An alternative to the traditional top-down development of public services is co-creation of public services. Co-creation promotes collaboration between stakeholders with the aim to create better public services and achieve...
-
Generating molecular entities as structured data
Publication -
Third Text
Journals -
Social Text
Journals -
Word and Text
Journals -
Text & Talk
Journals -
Text classifiers for automatic articles categorization
PublicationThe article concerns the problem of automatic classification of textual content. We present selected methods for generation of documents representation and we evaluate them in classification tasks. The experiments have been performed on Wikipedia articles classified automatically to their categories made by Wikipedia editors.
-
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
PublicationThe main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...
-
Text Mining Algorithms for Extracting Brand Knowledge; The fashion Industry Case
PublicationBrand knowledge is determined by customer knowledge. The opportunity to develop brands based on customer knowledge management has never been greater. Social media as a set of leading communication platforms enable peer to peer interplays between customers and brands. A large stream of such interactions is a great source of information which, when thoroughly analyzed, can become a source of innovation and lead to competitive advantage....
-
Bio-based semi-aromatic polyesters for coating applications
PublicationLinear and branched bio-based semi-aromatic (co)polyesters were evaluated as resins for solvent-basedand powder coatings. Dimethyl-2,5-furandicarboxylate (DMF), 2,3-butanediol and various multifunc-tional comonomers were used to synthesize amorphous hydroxyl-end-capped (co)polyesters. The resinswere cross-linked using the -caprolactam blocked trimer of isophorone diisocyanate. Both the solvent-based and powder coatings proved to...
-
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
PublicationIn this paper, the authors, who specialise in part in neo-Latin studies and the his-tory of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...
-
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
PublicationIn this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...
-
Structural design and sensitivity analysis of semi-rigid pavement of a motorway
PublicationThis paper presents application of mechanistic-empirical methods in design of semi-rigid pavement for a section of a motorway in Poland. The stage construction was assumed. Three fatigue criteria were applied in the design. For asphalt fatigue cracking and subgrade soil the criteria from the Asphalt Institute (USA) were applied. For fatigue cracking of cement stabilized bases the Dempsey (USA) and De Beer (South Africa) criteria...
-
Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience
PublicationSignificant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...
-
Age-structured population model of cell survival
Publication -
Text Technology: A Journal of computer Text Processing
Journals -
Towards Effective Processing of Large Text Collections
PublicationIn the article we describe the approach to parallelimplementation of elementary operations for textual data categorization.In the experiments we evaluate parallel computations ofsimilarity matrices and k-means algorithm. The test datasets havebeen prepared as graphs created from Wikipedia articles relatedwith links. When we create the clustering data packages, wecompute pairs of eigenvectors and eigenvalues for visualizationsof...
-
Text Documents Classification with Support Vector Machines
Publication -
Parallel Computations of Text Similarities for Categorization Task
PublicationIn this chapter we describe the approach to parallel implementation of similarities in high dimensional spaces. The similarities computation have been used for textual data categorization. A test datasets we create from Wikipedia articles that with their hyper references formed a graph used in our experiments. The similarities based on Euclidean distance and Cosine measure have been used to process the data using k-means algorithm....
-
Machine Learning and Text Analysis in an Artificial Intelligent System for the Training of Air Traffic Controllers
PublicationThis chapter presents the application of new information technology in education for the training of air traffic controllers (ATCs). Machine learning, multi-criteria decision analysis, and text analysis as the methods of artificial intelligence for ATCs training have been described. The authors have made an analysis of the International Civil Aviation Organization documents for modern principles of ATCs education. The prototype...
-
Application of semi-Markov processes for evaluation of diesel engines reliability with regards to diagnostics
PublicationThe paper presents semi-Markov models of technical state transitions for diesel engines, useful for determination of their reliability, as a result of the conducted statistical empirical studies. Interpretation of technical states provided for this sort of engines refers to ship main engines, i.e. engines employed in propulsion systems of sea-going ships. The considerations recognize diesel engine as a diagnosed system (SDN), of...