Abstract
In this chapter we propose methods for identifying new associations between Wikipedia categories. The first method is based on a Bag-of-Words (BOW) representation of Wikipedia articles: the similarity of articles belonging to different categories is used to compute the similarity between those categories. The second method is based on the average scores assigned to categories when documents are categorized by our dedicated score-based classifier. Applying these methods yields weighted category graphs that extend the original relations between Wikipedia categories. We also propose a method for selecting the weight value used to cut off less important relations. A preliminary examination of the quality of the new relations supports our procedure.
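The first method and the cut-off step can be illustrated with a minimal sketch. It is not the chapter's implementation: the abstract does not say how article similarities are aggregated, so the code assumes the category weight is the mean pairwise cosine similarity between raw term-count vectors. The function names, the toy corpus, and the hand-picked threshold of 0.2 are all hypothetical; the chapter proposes its own procedure for selecting the cut-off weight, which is not reproduced here.

```python
import math
from collections import Counter
from itertools import combinations


def bow(text):
    """Crude Bag-of-Words vector: lowercased whitespace tokens with counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def category_graph(categories):
    """Build a weighted category graph.

    categories maps a category name to a list of article texts.
    ASSUMPTION: the edge weight is the mean pairwise article similarity.
    """
    vectors = {c: [bow(t) for t in texts] for c, texts in categories.items()}
    graph = {}
    for c1, c2 in combinations(sorted(vectors), 2):
        sims = [cosine(u, v) for u in vectors[c1] for v in vectors[c2]]
        graph[(c1, c2)] = sum(sims) / len(sims) if sims else 0.0
    return graph


def cut_off(graph, threshold):
    """Keep only edges whose weight reaches the threshold."""
    return {edge: w for edge, w in graph.items() if w >= threshold}


# Hypothetical toy data, not taken from Wikipedia.
toy = {
    "Graph theory": ["graph nodes edges paths", "weighted graph edges"],
    "Network science": ["network nodes edges degree"],
    "Cooking": ["bread flour oven"],
}
graph = category_graph(toy)
kept = cut_off(graph, threshold=0.2)  # threshold chosen by hand for the demo
```

On this toy input only the edge between "Graph theory" and "Network science" survives the cut-off, because the "Cooking" article shares no terms with the other categories.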
Citations
- CrossRef: 0
- Web of Science: 0
- Scopus: 1
Authors (3)
Full text: not available in the portal
Details
- Category: Conference activity
- Type: publication in a peer-reviewed collective volume (including conference proceedings)
- Title of issue: Intelligent Tools for Building a Scientific Information Platform: From Research to Implementation, pages 51-60
- Language: English
- Publication year: 2014
- Bibliographic description: Draszawka K., Szymański J., Krawczyk H.: Towards Increasing Density of Relations in Category Graphs // Intelligent Tools for Building a Scientific Information Platform: From Research to Implementation / ed. Robert Bembenik, Łukasz Skonieczny, Henryk Rybiński, Marzena Kryszkiewicz, Marek Niezgódka : Springer International Publishing, 2014, pp. 51-60
- DOI: 10.1007/978-3-319-04714-0_4
- Verified by: Gdańsk University of Technology