Łukasz Kucharczyk, MSc Eng.
Employment
Contact
- lukkucha@student.pg.edu.pl
Selected publications
- Path-based methods on categorical structures for conceptual representation of Wikipedia articles
Machine learning algorithms applied to text categorization mostly employ the Bag of Words (BoW) representation to describe the content of documents. This method has been used successfully in many applications, but it is known to have several limitations. One way of improving text representation is to use Wikipedia as a lexical knowledge base, an approach that has already shown promising results in many research studies....
- Evaluation of Path Based Methods for Conceptual Representation of the Text
Typical text clustering methods use the bag of words (BoW) representation to describe the content of documents. However, this method is known to have several limitations. Employing Wikipedia as a lexical knowledge base has been shown to improve text representation for data-mining purposes. Promising extensions of that trend employ the hierarchical organization of the Wikipedia category system. In this paper we propose three path-based...