Search results for: INFORMATION RETRIEVAL - MOST Wiedzy

  • Evaluation of Path Based Methods for Conceptual Representation of the Text

    Publication

    Typical text clustering methods use the bag-of-words (BoW) representation to describe the content of documents. However, this method is known to have several limitations. Employing Wikipedia as a lexical knowledge base has been shown to improve text representation for data-mining purposes. Promising extensions of that trend employ the hierarchical organization of the Wikipedia category system. In this paper we propose three path-based...

    Full text available for download from an external service

  • Towards Increasing Density of Relations in Category Graphs

    Publication

    In this chapter we propose methods for identifying new associations between Wikipedia categories. The first method is based on the bag-of-words (BoW) representation of Wikipedia articles. The similarity of articles belonging to different categories is used to compute the similarity between those categories. The second method is based on the average scores given to categories while categorizing documents by our dedicated score-based...

    Full text available for download from an external service

  • Comparative Analysis of Text Representation Methods Using Classification

    Publication

    In our work, we review and empirically evaluate five different raw methods of text representation that allow automatic processing of Wikipedia articles. The main contribution of the article, an evaluation of approaches to text representation for machine learning tasks, indicates that text representation is fundamental for achieving good categorization results. The analysis of the representation methods creates a baseline that cannot...

    Full text available for download from an external service

  • Hanna Gaweł

    People

    Hanna Gaweł is a doctoral student at the Doctoral School in the Social Sciences, in the discipline of Social Communication and Media, at Jagiellonian University. Hanna’s research focuses on knowledge, information management, and the influence on society of well-served information in different formats. She is currently writing a PhD thesis about how information pollutants affect information regarding air quality in Polish metropolises....