Wyniki wyszukiwania dla: TEXT DOCUMENTS CATEGORIZATION

Wyniki wyszukiwania dla: TEXT DOCUMENTS CATEGORIZATION

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 1210

wyczyść wszystkie filtry niedostępne

wyświetlamy 1000 najlepszych wyników Pomoc

Text classifiers for automatic articles categorization
Publikacja
- Rok 2012
The article concerns the problem of automatic classification of textual content. We present selected methods for generation of documents representation and we evaluate them in classification tasks. The experiments have been performed on Wikipedia articles classified automatically to their categories made by Wikipedia editors.
Text Categorization Improvement via User Interaction
Publikacja
- J. Atroszko
- J. Szymański
- D. Gil
- H. Mora
- Rok 2018
In this paper, we propose an approach to improvement of text categorization using interaction with the user. The quality of categorization has been defined in terms of a distribution of objects related to the classes and projected on the self-organizing maps. For the experiments, we use the articles and categories from the subset of Simple Wikipedia. We test three different approaches for text representation. As a baseline we use...

Pełny tekst do pobrania w serwisie zewnętrznym
Parallel Computations of Text Similarities for Categorization Task
Publikacja
- J. Szymański
- Rok 2013
In this chapter we describe the approach to parallel implementation of similarities in high dimensional spaces. The similarities computation have been used for textual data categorization. A test datasets we create from Wikipedia articles that with their hyper references formed a graph used in our experiments. The similarities based on Euclidean distance and Cosine measure have been used to process the data using k-means algorithm....
Text categorization with semantic commonsense knowledge: First results
Publikacja
- P. Majewski
- J. Szymański
- Rok 2008
Do przetwarzania tekstów typowo wykorzystuje się reprezentacjeBOW. Podejście takie nie daje jednak dobrych rezultatów w sytuacjigdy podobne dokumenty nie współdzielą ze sobą słów.W artykule zaprezentowano podejście do konstrukcji funkcjijądra dla klasyfikatorów SVM opartego na zewnętrznej bazie wiedzyo pojęciach językowych.
Text Documents Classification with Support Vector Machines
Publikacja
- P. Majewski
- Rok 2008
Two Stage SVM and kNN Text Documents Classifier
Publikacja
- M. Kępa
- J. Szymański
- Rok 2015
The paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...
External Validation Measures for Nested Clustering of Text Documents
Publikacja
- K. Draszawka
- J. Szymański
- Rok 2011
Abstract. This article handles the problem of validating the results of nested (as opposed to "flat") clusterings. It shows that standard external validation indices used for partitioning clustering validation, like Rand statistics, Hubert Γ statistic or F-measure are not applicable in nested clustering cases. Additionally to the work, where F-measure was adopted to hierarchical classification as hF-measure, here some methods to...
Representation of hypertext documents based on terms, Links and text compressibility
Publikacja
- J. Szymański
- W. Duch
- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2010
Opisano metody reprezentacji dokumentów tekstowych oparte na słowach, wzajemnych powiązaniach i metodach kompresji. Dokonano ich oceny w oparciu o klasyfikator SVM.
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Dane Badawcze
open access
The SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...
Evaluation of Path Based Methods for Conceptual Representation of the Text
Publikacja
- Ł. Kucharczyk
- J. Szymański
- Rok 2014
Typical text clustering methods use the bag of words (BoW) representation to describe content of documents. However, this method is known to have several limitations. Employing Wikipedia as the lexical knowledge base has shown an improvement of the text representation for data-mining purposes. Promising extensions of that trend employ hierarchical organization of Wikipedia category system. In this paper we propose three path-based...

Pełny tekst do pobrania w serwisie zewnętrznym
Path-based methods on categorical structures for conceptual representation of wikipedia articles
Publikacja
- Ł. Kucharczyk
- J. Szymański
- JOURNAL OF INTELLIGENT INFORMATION SYSTEMS - Rok 2017
Machine learning algorithms applied to text categorization mostly employ the Bag of Words (BoW) representation to describe the content of the documents. This method has been successfully used in many applications, but it is known to have several limitations. One way of improving text representation is usage of Wikipedia as the lexical knowledge base – an approach that has already shown promising results in many research studies....

Pełny tekst do pobrania w portalu
Improving css-KNN Classification Performance by Shifts in Training Data
Publikacja
- K. Draszawka
- J. Szymański
- F. Guerra
- Rok 2015
This paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
Development and Research of the Text Messages Semantic Clustering Methodology
Publikacja
- N. Rizun
- P. Kapłański
- Y. Taranenko
- Rok 2016
The methodology of semantic clustering analysis of customer’s text-opinions collection is developed. The author's version of the mathematical models of formalization and practical realization of short textual messages semantic clustering procedure is proposed, based on the customer’s text-opinions collection Latent Semantic Analysis knowledge extracting method. An algorithm for semantic clustering of the text-opinions is developed,...

Pełny tekst do pobrania w portalu
Agile Commerce in the light of Text Mining
Publikacja
- A. Baj-Rogowska
- Przedsiębiorczość i Zarządzanie - Rok 2017
The survey conducted for this study reveals that more than 84% of respondents have never encountered the term “agile commerce” and do not understand its meaning. At the same time, they are active participants of this strategy. Using digital channels as customers more often than ever before, they have already been included in the agile philosophy. Based on the above, the purpose of the study is to analyse major text sets containing...

Pełny tekst do pobrania w portalu
Comparative Analysis of Text Representation Methods Using Classification
Publikacja
- J. Szymański
- CYBERNETICS AND SYSTEMS - Rok 2014
In our work, we review and empirically evaluate five different raw methods of text representation that allow automatic processing of Wikipedia articles. The main contribution of the article—evaluation of approaches to text representation for machine learning tasks—indicates that the text representation is fundamental for achieving good categorization results. The analysis of the representation methods creates a baseline that cannot...

Pełny tekst do pobrania w serwisie zewnętrznym
Just look at to open it up: A biometric verification facility for password autofill to protect electronic documents
Publikacja
- M. Smiatacz
- B. Wiszniewski
- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2021
Electronic documents constitute specific units of information, and protecting them against unauthorized access is a challenging task. This is because a password protected document may be stolen from its host computer or intercepted while on transfer and exposed to unlimited offline attacks. The key issue is, therefore, making document passwords hard to crack. We propose to augment a common text password authentication interface...

Pełny tekst do pobrania w portalu
Extraction of information from born-digital PDF documents for reproducible research
Publikacja
- B. Wiszniewski
- J. Siciarek
- Journal of Advanced Management - Rok 2016
Born-digital PDF electronic documents might reasonably be expected to preserve useful data units of their source originals that suffice to produce executable papers for reproducible research. Unfortunately, developers of authoring tools may adopt arbitrary PDF generation strategies, producing a plethora of internal data representations. Such common information units as text paragraphs, tables, function graphs and flow diagrams,...

Pełny tekst do pobrania w portalu
Semantic Analysis and Text Summarization in Socio-Technical Systems
Publikacja
- N. Rizun
- Rok 2018
In this chapter the authors present the results of the development the methodology for increasing the reliability of the functioning of the Socio-Technical System. The existed methods and algorithms for processing unstructured (textual) information were studied. Taking into account noted above strengths and weaknesses of Discriminant and Probabilistic approaches of Latent Semantic Relations analysis in of the summarization projection...

Pełny tekst do pobrania w serwisie zewnętrznym
Wikipedia Articles Representation with Matrix'u
Publikacja
- J. Szymański
- Rok 2013
In the article we evaluate different text representation methods used for a task of Wikipedia articles categorization. We present the Matrix’u application used for creating computational datasets ofWikipedia articles. The representations have been evaluated with SVM classifiers used for reconstruction human made categories.

Pełny tekst do pobrania w serwisie zewnętrznym
Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary
Publikacja
- N. Rizun
- W. Waloszek
- Rok 2018
This paper presents the methodology of Textual Content Classification, which is based on a combination of algorithms: preliminary formation of a contextual framework for the texts in particular problem area; manual creation of the Hierarchical Sentiment Dictionary (HSD) on the basis of a topically-oriented Corpus; tonality texts recognition via using HSD for analysing the documents as a collection of topically completed fragments...

Pełny tekst do pobrania w portalu
Spectral Clustering Wikipedia Keyword-Based search Results
Publikacja
- J. Szymański
- T. Dziubich
- FRONTIERS IN ROBOTICS AND AI - Rok 2017
The paper summarizes our research in the area of unsupervised categorization of Wikipedia articles. As a practical result of our research, we present an application of spectral clustering algorithm used for grouping Wikipedia search results. The main contribution of the paper is a representation method for Wikipedia articles that has been based on combination of words and links and used for categoriation of search results in this...

Pełny tekst do pobrania w portalu
DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING
Publikacja
- N. Rizun
- J. Taranenko
- Rocznik Naukowy Wydzialu Zarzadzania w Ciechanowie - Rok 2017
The algorithm and the software for conducting the procedure of Preprocessing of the reviews of films in the Polish language were developed. This algorithm contains the following steps: Text Adaptation Procedure; Procedure of Tokenization; Procedure of Transforming Words into the Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Presentation of Documents in the Vector Form (Vector Space Model) Procedure; Forming...

Pełny tekst do pobrania w portalu
Review on Wikification methods
Publikacja
- J. Szymański
- M. Naruszewicz
- AI COMMUNICATIONS - Rok 2019
The paper reviews methods on automatic annotation of texts with Wikipedia entries. The process, called Wikification aims at building references between concepts identified in the text and Wikipedia articles. Wikification finds many applications, especially in text representation, where it enables one to capture the semantic similarity of the documents. Also, it can be considered as automatic tagging of the text. We describe typical...

Pełny tekst do pobrania w serwisie zewnętrznym
Machine Learning and Text Analysis in an Artificial Intelligent System for the Training of Air Traffic Controllers
Publikacja
- T. Shmelova
- Y. Sikirda
- N. Rizun
- V. Lazorenko
- V. Kharchenko
- Rok 2020
This chapter presents the application of new information technology in education for the training of air traffic controllers (ATCs). Machine learning, multi-criteria decision analysis, and text analysis as the methods of artificial intelligence for ATCs training have been described. The authors have made an analysis of the International Civil Aviation Organization documents for modern principles of ATCs education. The prototype...

Pełny tekst do pobrania w portalu
Contextual ontology for tonality assessment
Publikacja
- W. Waloszek
- N. Rizun
- Procedia Computer Science - Rok 2020
classification tasks. The discussion focuses on two important research hypotheses: (1) whether it is possible to construct such an ontology from a corpus of textual document, and (2) whether it is possible and beneficial to use inferencing from this ontology to support the process of sentiment classification. To support the first hypothesis we present a method of extraction of hierarchy of contexts from a set of textual documents...

Pełny tekst do pobrania w portalu
System of specific grants for local government units in Poland
Publikacja
- A. Sekuła
- Rok 2009
The article analyses the system of specific grants in local governments in Poland. First, main revenue sources of local self-governments are presented. Their presentation is based upon the consideration of one of the basic important principles in democratic states today, i.e. decentralization. The text then, in more details, describes specific grants with respect to the European Charter of Local Self-Government. Subsequently, the...
Methodology of Selecting the Hadoop Ecosystem Configuration in Order to Improve the Performance of a Plagiarism Detection System
Publikacja
- A. Sobecki
- M. Kępa
- Rok 2018
The plagiarism detection problem involves finding patterns in unstructured text documents. Similarity of documents in this approach means that the documents contain some identical phrases with defined minimal length. The typical methods used to find similar documents in dig- ital libraries are not suitable for this task (plagiarism detection) because found documents may contain similar content and we have not any war- ranty that...

Pełny tekst do pobrania w serwisie zewnętrznym
Information Retrieval with the Use of Music Clustering by Directions Algorithm
Publikacja
- A. Kaczmarek
- Rok 2013
This paper introduces the Music Clustering by Directions (MCBD) algorithm. The algorithm is designed to support users of query by humming systems in formulating queries. This kind of systems makes it possible to retrieve songs and tunes on the basis of a melody recorded by the user. The Music Clustering by Directions algorithm is a kind of an interactive query expansion method. On the basis of query, the algorithm provides suggestions...

Pełny tekst do pobrania w serwisie zewnętrznym
Retrieval with Semantic Sieve
Publikacja
- Rok 2013
The article presents an algorithm we called Semantic Sieve applied for refining search results in text documents repository. The algorithm calculates socalled conceptual directions that enables interaction with the user and allows to narrow the set of results to the most relevant ones. We present the system where the algorithm has been implemented. The system also offers in the presentation layer clustering of the results into...

Pełny tekst do pobrania w serwisie zewnętrznym
SEMANTIC ANALYSIS ALGORITHMS FOR KNOWLEDGE WORKERS SUPPORT
Publikacja
- N. Rizun
- M. Rizun
- J. Taranenko
- Rok 2017
The paper examines various aspects of text analysis application for knowledge worker’s activity realization. Conclusions are drawn about the relevance and importance of processing the non-structured textual information in order to increase knowledge worker’s efficiency, as well as their awareness in different branches of science. The paper considers the existing algorithms of texts semantic analysis as the sphere of documents topical...

Pełny tekst do pobrania w portalu
Passing from requirements specification to class model using application domain ontology
Publikacja
- J. Kuchta
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2010
The quality of a classic software engineering process depends on the completeness of project documents and on the inter-phase consistency. In this paper, a method for passing from the requirement specification to the class model is proposed. First, a developer browses the text of the requirements, extracts the word sequences, and places them as terms into the glossary. Next, the internal ontology logic for the glossary needs to...
Improving the Accuracy in Sentiment Classification in the Light of Modelling the Latent Semantic Relations
Publikacja
- N. Rizun
- W. Waloszek
- Y. Taranenko
- Information - Rok 2018
The research presents the methodology of improving the accuracy in sentiment classification in the light of modelling the latent semantic relations (LSR). The objective of this methodology is to find ways of eliminating the limitations of the discriminant and probabilistic methods for LSR revealing and customizing the sentiment classification process (SCP) to the more accurate recognition of text tonality. This objective was achieved...

Pełny tekst do pobrania w portalu
China and the Chinese in the modern world. An interdisciplinary study
Publikacja
- I. Szpotakowski
- Z. Kopania
- Rok 2020
This monograph is a collection of chapters devoted to modern China on various approaches. There is no future without a past and a modern China is a country that skillfully combines the new with the old and the authors have attempted to present this phenomenon in this book. It brings to light issues such as a honorificativity in Chinese administrative and legal documents, a comparison of Chinese and...

Pełny tekst do pobrania w portalu
Categorization of Cloud Workload Types with Clustering
Publikacja
- Rok 2017
The paper presents a new classification schema of IaaS cloud workloads types, based on the functional characteristics. We show the results of an experiment of automatic categorization performed with different benchmarks that represent particular workload types. Monitoring of resource utilization allowed us to construct workload models that can be processed with machine learning algorithms. The direct connection between the functional...

Pełny tekst do pobrania w serwisie zewnętrznym
Linking music data in executable documents
Publikacja
- A. Kaczmarek
- Rok 2022
This paper presents the application of Interactive Open Document Architecture (IODA) to music and video data. This architecture was design to create multilayer documents which consist of many files. The paper shows the method of creating media documents on the basis of IODA. These kind of documents were called IODA Media Documents (IMD). IMD have links that connect many different kinds of files containing music and video data....

Pełny tekst do pobrania w serwisie zewnętrznym
Knowledge management implementation in small and micro KIBS : A categorization
Publikacja
- E. Bolisani
- E. Scarso
- R. Ceccato
- M. Zięba
- Knowledge and Process Management - Rok 2023
he main goal of the paper is to provide a statistical categorization of small and micro knowledge-intensive business service (KIBS) companies, based on their knowledge management (KM) attitude. Since knowledge is the main production factor and output of these companies, it is essential to achieve a better understanding of how they manage this resource. A questionnaire-based survey was conducted on a sample of Polish small and micro...

Pełny tekst do pobrania w portalu
Augmenting digital documents with negotiation capability
Publikacja
- B. Wiszniewski
- J. Kaczorek
- Rok 2013
Active digital documents are not only capable of performing various operations using their internal functionality and external services, accessible in the environment in which they operate, but can also migrate on their own over a network of mobile devices that provide dynamically changing execution contexts. They may imply conflicts between preferences of the active document and the device the former wishes to execute on. In the...

Pełny tekst do pobrania w portalu
Semantic Driven Table Understanding in Born-Digital Documents
Publikacja
- J. Siciarek
- Rok 2014
This paper presents a new approach to table understanding, suitable for born-digital PDF documents. Advance beyond the current state of the art in table understanding is provided by the proposed reverse MVC method, which takes advantage of only partial logic structure loss (degradation) in born-digital PDF documents, as opposed to unrecoverable loss (deterioration) taking place in scan based PDF documents.

Pełny tekst do pobrania w serwisie zewnętrznym
The potential of computational methods for the categorization of architectural objects on the example of media architecture
Publikacja
- K. Życzkowska
- M. Życzkowski
- TASK Quarterly - Rok 2022
The paper presents an example of the categorization of architectural objects and assessment of the characteristics of urban space, based on the analysis of specific features of architectural objects and urban landscape. The conducted analysis refers to media architecture and is presented in the complex context of the development of media solutions. The field of influence of IT on architecture is also stressed, both on the architect’s...

Pełny tekst do pobrania w portalu
Deep learning for recommending subscription-limited documents
Publikacja
- G. Chłodziński
- K. Woźniak
- Rok 2020
Documents recommendation for a commercial, subscription-based online platform is important due to the difficulty in navigation through a large volume and diversity of content available to clients. However, this is also a challenging task due to the number of new documents added every day and decreasing relevance of older contents. To solve this problem, we propose deep neural network architecture that combines autoencoder with...

Pełny tekst do pobrania w portalu
For Your Eyes Only – Biometric Protection of PDF Documents
Publikacja
- Rok 2013
The paper introduces a concept of a digital document content encryption/decryption with facial biometric data coming from a legitimate user. Access to the document content is simple and straightforward, especially during collaborative work with mobile devices equipped with cameras. Various contexts of document exchange are presented with regard to the next generation pro-active digital documents proposed by authors. An important...

Pełny tekst do pobrania w portalu
Prioritising national healthcare service issues from free text feedback – A computational text analysis & predictive modelling approach
Publikacja
- A. Ojo
- N. Rizun
- G. Walsh
- M. I. Mashinchi
- M. Venosa
- M. N. Rao
- DECISION SUPPORT SYSTEMS - Rok 2024
Patient experience surveys have become a key source of evidence for supporting decision-making and continuous quality improvement within healthcare services. To harness free-text feedback collected as part of these surveys for additional insights, text analytics methods are increasingly employed when the data collected is not amenable to traditional qualitative analysis due to volume. However, while text analytics techniques offer...

Pełny tekst do pobrania w portalu
Time-domain prosodic modifications for text-to-speech synthesizer
Publikacja
- J. Łopatka
- P. Suchomski
- A. Czyżewski
- Rok 2010
An application of prosodic speech processing algorithms to Text-To-Speech synthesis is presented. Prosodic modifications that improve the naturalness of the synthesized signal are discussed. The applied method is based on the TD-PSOLA algorithm. The developed Text-To-Speech Synthesizer is used in applications employing multimodal computer interfaces.
Facial data registration facility for biometric protection of electronic documents
Publikacja
- Rok 2014
In modern world, information is crucial, and its leakage may lead to serious losses. Documents as the main medium of information must be therefore highly protected. Nowadays, the most common way of protecting data is using passwords, however it seems inconvenient to type complex passwords, when it is needed many times a day. For that reason a significant research has been conducted on biometric authentication...
Generating actionable evidence from free-text feedback to improve maternity and acute hospital experiences: A computational text analytics & predictive modelling approach
Publikacja
- A. Ojo
- N. Rizun
- M. Isazad Mashinchi
- G. Walsh
- J. Gruda
- M. N. Narayana
- M. Venosa
- C. Foley
- D. Rohde
- R. Flynn
- EUROPEAN JOURNAL OF PUBLIC HEALTH - Rok 2023
Background Patient experience surveys are a key source of evidence for supporting decision-making and quality improvement in healthcare services. These surveys contain two main types of questions: closed and open-ended, asking about patients’ care experiences. Apart from the knowledge obtained from analysing closed-ended questions, invaluable insights can be gleaned from free-text data. Advanced analytics techniques are increasingly...

Pełny tekst do pobrania w serwisie zewnętrznym
Categorization of Wikipedia articles with spectral clustering
Publikacja
- J. Szymański
- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2011
Abstract. The article reports application of clustering algorithms for creating hierarchical groups withinWikipedia articles.We evaluate three spectral clustering algorithms based on datasets constructed with usage ofWikipedia categories. Selected algorithm has been implemented in the system that categorize Wikipedia search results in the fly.
Interactive Information Search in Text Data Collections
Publikacja
- Rok 2013
This article presents a new idea for retrieving in text repositories, as well as it describes general infrastructure of a system created to implement and test those ideas. The implemented system differs from today’s standard search engine by introducing process of interactive search with users and data clustering. We present the basic algorithms behind our system and measures we used for results evaluation. The achieved results...

Pełny tekst do pobrania w serwisie zewnętrznym
Selection of Relevant Features for Text Classification with K-NN
Publikacja
- Rok 2013
In this paper, we describe five features selection techniques used for a text classification. An information gain, independent significance feature test, chi-squared test, odds ratio test, and frequency filtering have been compared according to the text benchmarks based on Wikipedia. For each method we present the results of classification quality obtained on the test datasets using K-NN based approach. A main advantage of evaluated...

Pełny tekst do pobrania w serwisie zewnętrznym
Text

Czasopisma

eISSN: 1327-9556
LEVEL OF DETAIL CATEGORIZATION FOR THE APPLICATION IN URBAN DESIGN
Publikacja
- J. Cudzik
- B. Güler,
- M. Aydoğan
- Przestrzeń i Forma - Rok 2023
Urban planning and urban design involve complex processes that require detailed information about the visual information of a place at various scales. Different graphic tools, such as game engines, are evolving to use urban representation fields. The concept of "level of detail" (LOD) has been used to categorize the level of detail in AEC applications such as BIM and GML for urban representation models. However, there is a need...

Pełny tekst do pobrania w portalu
Study of Statistical Text Representation Methods for Performance Improvement of a Hierarchical Attention Network
Publikacja
- A. Wawrzyński
- J. Szymański
- Applied Sciences-Basel - Rok 2021
To effectively process textual data, many approaches have been proposed to create text representations. The transformation of a text into a form of numbers that can be computed using computers is crucial for further applications in downstream tasks such as document classification, document summarization, and so forth. In our work, we study the quality of text representations using statistical methods and compare them to approaches...

Pełny tekst do pobrania w portalu
Schema mining in XML documents.
Publikacja
- K. Goczyła
- W. Waloszek
- Rok 2004
W artykule przedstawiono algorytm COBWEB S+T służący do wywodzenia schematów z kolekcji dokumentów XML. Algorytm wykorzystuje model danych semistrukturalnych oraz alorytm COBWEB służący do grupowania koncepcyjnego. W artykule zaprezentowano również wyniki testów działania algorytmu.
The Method of a Two-Level Text-Meaning Similarity Approximation of the Customers’ Opinions
Publikacja
- N. Rizun
- P. Kapłański
- Y. Taranenko
- Studia Ekonomiczne. Zeszyty Naukowe Uniwersytetu Ekonomicznego w Katowicach - Rok 2016
The method of two-level text-meaning similarity approximation, consisting in the implementation of the classification of the stages of text opinions of customers and identifying their rank quality level was developed. Proposed and proved the significance of major hypotheses, put as the basis of the developed methodology, notably about the significance of suggestions about the existence of analogies between mathematical bases of...

Pełny tekst do pobrania w portalu
Thresholding Strategies for Large Scale Multi-Label Text Classifier
Publikacja
- K. Draszawka
- J. Szymański
- Rok 2013
This article presents an overview of thresholding methods for labeling objects given a list of candidate classes’ scores. These methods are essential to multi-label classiﬁcation tasks, especially when there are a lot of classes which are organized in a hierarchy. Presented techniques are evaluated using the state-of-the-art dedicated classiﬁer on medium scale text corpora extracted from Wikipedia. Obtained results show that the...

Pełny tekst do pobrania w serwisie zewnętrznym
Text-mining Similarity Approximation Operators for Opinion Mining in BI tools
Publikacja
- N. Rizun
- P. Kapłański
- Y. Taranenko
- S. Alessandro
- Rok 2016
The concept of the Text-mining Similarity Approximation Operators for Opinion Mining as extensions to Natural Language Interface Database is defined. The new operators: “keywords of” dimension; subsetting operator “about C is q”; aggregation operator “by similar C” are proposed. These operators are based on the Latent Semantic Analysis and Social Network Analysis

Pełny tekst do pobrania w portalu
Documents d'archéologie méridionale

Czasopisma

ISSN: 0184-1068
Application of dynamic time warping and cepstrograms to text-dependent speaker verification
Publikacja
- A. Kaczmarek
- M. Staworko
- Rok 2009
This work provides a description of an automatic speaker verification (ASV) system. In particular, it documents the evolution of all individual stages of the proposed ASV system design from the phase of preprocessing to an operational decision making system. The aim of this research was to achieve the system of the best safety and ease of use in view of users. The objective estimation of this target has been accomplished by assessing...
What matters most to patients? On the Core Determinants of Patient Experience from Free Text Feedback
Publikacja
- A. Ojo
- N. Rizun
- Rok 2021
Free-text feedback from patients is increasingly used for improving the quality of healthcare services and systems. A major reason for the growing interest in harnessing free-text feedback is the belief that it provides richer information about what patients want and care about. The use of computational approaches such as structural topic modelling for analysing large unstructured textual data such as free-text feedback from patients...

Pełny tekst do pobrania w portalu
Green energy in municipal planning documents
Publikacja
- A. Bazan-Krzywoszańska
- M. Skiba
- M. Mrówczyńska
- M. Sztubecka
- D. Bazuń
- M. Kwiatkowski
- E3S Web of Conferences - Rok 2018
Pełny tekst do pobrania w serwisie zewnętrznym
Ruling lines removal in handwritten documents
Publikacja
- S. Seifzadeh
- Rok 2013
Querying the digital database of musical documents
Publikacja
- A. Sobociński
- M. Smiatacz
- Rok 2007
W rozdziale zaprezentowano program Melody Explorer służący do formułowania zapytań dla bazy danych dokumentów muzycznych. Przedstawiono problemy związane z konwersją informacji wprowadzanych przez użytkownika na zapis nutowy. Zaproponowano ulepszenia istniejących rozwiązań mające na celu poprawę dokładności i stabilności systemu. Oprócz cyfrowego zapisu dźwięku również podany przez użytkownika rytm melodii wykorzystywany jest do...
Nina Rizun dr

Osoby

Katedra Informatyki w Zarządzaniu

Nina Rizun jest adiunktem na Wydziale Zarządzania i Ekonomii Politechniki Gdańskiej. W październiku 1999 r. uzyskała stopień doktora nauk technicznych za specjalizacją Gospodarka przedsiębiorstwa i organizacja produkcji. W latach 1993–2000 pracowała na Wydziale Informatyki Ekonomicznej w Akademji Metalurgicznej, Dnipro, Ukraina. W latach 2000–2016 – na Wydziale Cybernetyki Ekonomicznej i Metod Matematycznych na Uniwersytecie Alfreda...
The categorization of the tourist services quality perception determinants in hierarchical conception
Publikacja
- G. Zieliński
- Rok 2011
W niniejszym rozdziale zaprezentowano kategoryzację determinant percepcji jakości usług turystycznych. Omówione zostały główne grupy oraz determinanty elementarne z wykorzystaniem ujęcia modeli hierarchicznych.
Categorization of emotions in dog behavior based on the deep neural network
Publikacja
- COMPUTATIONAL INTELLIGENCE - Rok 2022
The aim of this article is to present a neural system based on stock architecture for recognizing emotional behavior in dogs. Our considerations are inspired by the original work of Franzoni et al. on recognizing dog emotions. An appropriate set of photographic data has been compiled taking into account five classes of emotional behavior in dogs of one breed, including joy, anger, licking, yawning, and sleeping. Focusing on a particular...

Pełny tekst do pobrania w portalu
Exploring the Usability and User Experience of Social Media Apps through a Text Mining Approach
Publikacja
- A. Baj-Rogowska
- M. Sikorski
- Engineering Management in Production and Services - Rok 2023
This study aims to evaluate the applicability of a text mining approach for extracting UUX-related issues from a dataset of user comments and not to evaluate the Instagram (IG) app. This study analyses textual data mined from reviews in English written by IG mobile application users. The article’s authors used text mining (based on the LDA algorithm) to identify the main UUX-related topics. Next, they mapped the identified topics...

Pełny tekst do pobrania w portalu
Application of Text Analytics in Public Service Co-Creation: Literature Review and Research Framework
Publikacja
- N. Rizun
- A. Revina
- N. Edelmann
- Rok 2023
The public sector faces several challenges, such as a number of external and internal demands for change, citizens' dissatisfaction and frustration with public sector organizations, that need to be addressed. An alternative to the traditional top-down development of public services is co-creation of public services. Co-creation promotes collaboration between stakeholders with the aim to create better public services and achieve...

Pełny tekst do pobrania w portalu
Text (new tilte Text and Talk)

Czasopisma

ISSN: 0165-4888
Third Text

Czasopisma

ISSN: 0952-8822 , eISSN: 1475-5297
Social Text

Czasopisma

ISSN: 0164-2472 , eISSN: 1527-1951
Word and Text

Czasopisma

ISSN: 2069-9271
Text & Talk

Czasopisma

ISSN: 1860-7330 , eISSN: 1860-7349
SYNTHESIZING MEDICAL TERMS – QUALITY AND NATURALNESS OF THE DEEP TEXT-TO-SPEECH ALGORITHM
Publikacja
- B. Kostek
- B. Szyca
- Journal of the Acoustical Society of America - Rok 2023
The main purpose of this study is to develop a deep text-to-speech (TTS) algorithm designated for an embedded system device. First, a critical literature review of state-of-the-art speech synthesis deep models is provided. The algorithm implementation covers both hardware and algorithmic solutions. The algorithm is designed for use with the Raspberry Pi 4 board. 80 synthesized sentences were prepared based on medical and everyday...

Pełny tekst do pobrania w portalu
Intelligent system for editing and analysis of examination documents
Publikacja
- Ł. Karpowicz
- W. Malina
- M. Smiatacz
- Rok 2006
Opisano ogólną koncepcję systemu IATE - systemu do edycji i automatycznej analizy testów egzaminacyjnych. Edytor systemu umożliwia generację 4 typów testów o dowolnej liczbie pytań (do 8 stron tekstu), różnej formie udzielania odpowiedzi oraz możliwością tworzenia wariantów testu. Bardziej szczegółowo opisano wybrane fragmenty systemu: analizę nagłówka testu, edycję i organizację segmentu tworzenia wariantów testu oraz organizację...
Planning documents and sustainable development of a commune in Poland
Publikacja
- A. Stacherzak
- M. Hełdak
- B. Raszka
- Rok 2012
Pełny tekst do pobrania w serwisie zewnętrznym
Computer analysis of multiple-choice examination documents
Publikacja
- Ł. Karpowicz
- W. Malina
- M. Smiatacz
- Rok 2004
Opisany system AATE wyposażony jest w edytor testów, za pomocą którego egzaminator przygotowuje test egzaminacyjny. Utworzony test ze swoimi parametrami jest pamiętany w bazie danych i następnie może być wydrukowany. Po przeprowadzeniu egzaminu wypełnione formularze za pomocą skanera z podajnikiem wprowadza się do komputera. W komputerze system analizuje formularze i odczytane odpowiedzi porównuje się z wzorcami przechowywanymi...
Anna Baj-Rogowska dr

Osoby

Katedra Informatyki w Zarządzaniu

Anna Baj-Rogowska zatrudniona jest na stanowisku adiunkta w Katedrze Informatyki w Zarządzaniu (Politechnika Gdańska, Wydział Zarządzania i Ekonomii). Jej wyższa edukacja związana jest z Uniwersytetem Gdańskim, gdzie ukończyła magisterskie studia informatyczne, studia doktoranckie i następnie uzyskała stopień naukowy doktora nauk ekonomicznych w zakresie nauk o zarządzaniu (Katedra Informatyki Ekonomicznej na Wydziale Zarządzania...
Text Mining Algorithms for Extracting Brand Knowledge; The fashion Industry Case
Publikacja
- N. Rizun
- W. Kucharska
- Rok 2018
Brand knowledge is determined by customer knowledge. The opportunity to develop brands based on customer knowledge management has never been greater. Social media as a set of leading communication platforms enable peer to peer interplays between customers and brands. A large stream of such interactions is a great source of information which, when thoroughly analyzed, can become a source of innovation and lead to competitive advantage....

Pełny tekst do pobrania w portalu
A Text as a Set of Research Data. A Number of Aspects of Data Acquisition and Creation of Datasets in Neo-Latin Studies
Publikacja
- Rok 2022
In this paper, the authors, who specialise in part in neo-Latin studies and the his-tory of early modern education, share their experiences of collecting sources for Open Research Data sets under the Bridge of Data project. On the basis of inscription texts from St. Mary’s Church in Gdańsk, they created 29 Open Research Data sets. In turn, the text of the lectures of the Gdańsk scholar Michael Christoph Hanow, Praecepta de arte...

Pełny tekst do pobrania w portalu
Endoscopic Videos Deinterlacing and On-Screen Text and Light Flashes Removal and Its Influence on Image Analysis Algorithms' Efficiency
Publikacja
- International Journal of Image Processing and Visual Communication - Rok 2013
In this article, deinterlacing and removing on- screen text and light flashes methods on endoscopic video images are discussed. The research is intended to improve disease recognition algorithms' performance. In the article, four configurations of deinterlacing methods and another four configurations of text and flashes removal methods are described and examined. The efficiency of endoscopic video analysis algorithms is measured...

Pełny tekst do pobrania w serwisie zewnętrznym
Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience
Publikacja
- A. Ojo
- N. Rizun
- IEEE Access - Rok 2019
Significant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...

Pełny tekst do pobrania w portalu
Documents d Analisi Geografica

Czasopisma

ISSN: 0212-1573
Towards Effective Processing of Large Text Collections
Publikacja
- J. Szymański
- H. Krawczyk
- Rok 2012
In the article we describe the approach to parallelimplementation of elementary operations for textual data categorization.In the experiments we evaluate parallel computations ofsimilarity matrices and k-means algorithm. The test datasets havebeen prepared as graphs created from Wikipedia articles relatedwith links. When we create the clustering data packages, wecompute pairs of eigenvectors and eigenvalues for visualizationsof...
Agent System for Managing Distributed Mobile Interactive Documents
Publikacja
- M. Godlewska
- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2010
The MIND architecture of distributed mobile interactive document is a new processing model defined for facilitate informed decision-making in non-algorithmic decision-making processes carried out by knowledge-based organizations. The aim of this architecture is to change the static document to mobile agents, which are designed to implement the structure of the organization through autonomous migration between knowledge workers...
Text Technology: A Journal of computer Text Processing

Czasopisma

ISSN: 1496-0958
Improving the Workflow for Creation of Textual Versions of Polish Historical Documents
Publikacja
- A. Dudczak
- M. Kmieciak
- C. Mazurek
- M. Stroiński
- M. Werla
- J. Węglarz
- Rok 2013
Pełny tekst do pobrania w serwisie zewnętrznym
Visual GQM approach to quality driven development of electronic documents.
Publikacja
- H. Krawczyk
- B. Wiszniewski
- Rok 2003
Jednym z celów projektu europejskiego MEORIAL jest opracowanie nowej technologii wytwarzania webowych systemów informacyjnych wykorzystujących interaktywne dokumenty cyfrowe wytworzone z papierowych oryginałów z zastosowaniem zaawansowanych technik przetwarzania i rozpoznania obrazów. Wieloelementowy model cyklu życia dokumentu cyfrowego przedstawiony w artykule stanowi postawę opracowanej technologii.

Pełny tekst do pobrania w portalu
Towards facts extraction from text in Polish language
Publikacja
- T. M. Boiński
- A. Chojnowski
- Rok 2017
Natural Language Processing (NLP) finds many usages in different fields of endeavor. Many tools exists allowing analysis of English language. For Polish language the situation is different as the language itself is more complicated. In this paper we show differences between NLP of Polish and English language. Existing solutions are presented and TEAMS software for facts extraction is described. The paper shows also evaluation of...

Pełny tekst do pobrania w portalu
Evaluation and Irony in Text in the Light of Speech Act Theory
Publikacja
- K. Kukowicz-Zarska
- Forum Filologiczne Ateneum - Rok 2020
Pełny tekst do pobrania w serwisie zewnętrznym
Foundation text of St. Mary's Church in Gdańsk
Dane Badawcze
open access
- E. Starek
- G. Kotłowski
The data set concerns epigraphy. It refers to the medieval foundation preserved on the wall above the sacristy entrance in St. Mary’s Church in Gdańsk, which confirms that the foundation stone of the temple was laid on 28th of March 1343. The data set contains one general photo of the foundation text, transcription of its text in Latin and its Polish...
Kryteria wytrzymałości gruntu na ścinanie w zagadnieniach geotechniki
Publikacja
- M. Cudny
- K. Binder
- Rok 2005
Przedstawiono wpływ zastosowania różnych kryteriów wytrzymałości gruntu na ścinanie w symulacjach numerycznych prostych praktycznych zagadnień geotechnicznych. Obliczenia wykonano metodą elementów skończonych w płaskim oraz osiowosymetrycznym stanie odkształcenia. Wyniki obliczeń porównano oraz poddano krytycznej dyskusji.
Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech
Publikacja
- D. Piotrowski
- R. Korzeniowski
- A. Falai
- S. Cygert
- K. Pokora
- G. Tinchev
- Z. Zhang
- K. Yanagisawa
- Rok 2023
In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and a downstream Text-To-Speech (TTS) model. The proposed framework consists of 4 stages. In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker. In the third stage, the converted data is combined with the linguistic features and durations...

Pełny tekst do pobrania w serwisie zewnętrznym
Text und Kontext

Czasopisma

ISSN: 0105-7014
Post-Colonial Text

Czasopisma

ISSN: 1705-9100
Text & Performance Quarterly

Czasopisma

ISSN: 1046-2937 , eISSN: 1479-5760
Text: Kritische Beiträge

Czasopisma

ISSN: 1420-1496
Text und Kritik

Czasopisma

ISSN: 0040-5329
English Text Construction

Czasopisma

ISSN: 1874-8767 , eISSN: 1874-8775
Law Text Culture

Czasopisma

ISSN: 1322-9060 , eISSN: 2200-7121
Instytucje demokracji bezpośredniej, partycypacyjnej i deliberacyjnej w Gdańsku od 2010 roku
Publikacja
- S. Andrzejewski
- Rok 2023
Tematem tej pracy doktorskiej jest studium przypadku stanu demokracji w Gdańsku. Miasto Gdańsk jest uważane jako jedno z najbardziej demokratycznych miast w Polsce, jednak czy to założenie pokrywa się z faktami? Analiza Autora rozprawy doktorskiej jest skupiona na instytucjach demokratycznych na poziomie lokalnym, ze szczególnym uwzględnieniem obywatelskiej inicjatywy uchwałodawczej jako instrumentu...
Distributed MIND - A New Processing Model Based on Mobile Interactive Documents
Publikacja
- M. Godlewska
- B. Wiszniewski
- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2010
Obliczenia w trybie zespołowym pozwalają na integrację działań ludzi i agentów systemowych w otwartym środowisku rozproszonym w celu rozwiązywania problemów formułowanych dynamicznie w trakcie pracy systemu. Problemy te najczęściej nie mają charakteru algorytmicznego, tzn. generowane rozwiązania nie mogłyby zostać wyliczone w skończonej liczbie kroków na podstawie danych charakteryzujących uczestników obliczeń. Autorzy proponują...

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: TEXT DOCUMENTS CATEGORIZATION

Nina Rizun dr

Anna Baj-Rogowska dr