Information Retrieval in Wikipedia with Conceptual Directions

Julian Szymański

doi:10.1007/978-3-319-14977-6_42

Abstract

This paper describes our algorithm for retrieving textual information from Wikipedia. Experiments show that the algorithm improves typical evaluation measures of retrieval quality. The improvement is achieved with a two-phase approach. In the first phase, the algorithm extends the set of content indexed by the specified keywords, which increases Recall. In the second phase, the user is presented with so-called Conceptual Directions, and this interaction is used to purify the search results, which increases Precision. A preliminary evaluation on multi-sense test phrases indicates that the algorithm can increase Precision within the result set without loss of Recall. We also describe an additional method for extending the result set, based on creating cluster prototypes and finding the most similar content in the text repository that has not yet been retrieved. In our demo implementation, a web portal, clustering is used to present the search results organized in thematic groups instead of a ranked list.
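
The effect of the two phases on Precision and Recall can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the article titles, the ambiguous query "jaguar", the way the result set is extended, and the mapping from articles to directions are hypothetical toy data. Only the definitions of Precision and Recall are standard.

```python
def precision(retrieved: set, relevant: set) -> float:
    """Fraction of retrieved items that are relevant."""
    return len(retrieved & relevant) / len(retrieved) if retrieved else 0.0


def recall(retrieved: set, relevant: set) -> float:
    """Fraction of relevant items that were retrieved."""
    return len(retrieved & relevant) / len(relevant) if relevant else 0.0


# Hypothetical ground truth: the user typed "jaguar" and means the animal.
relevant = {"Jaguar", "Panthera", "Big cat", "Felidae"}

# Baseline: articles indexed by the keyword itself.
keyword_hits = {"Jaguar", "Jaguar Cars", "Jaguar (band)"}

# Phase 1 (assumed mechanism): extend the result set with related articles.
# This raises Recall but also pulls in content from other senses.
expanded = keyword_hits | {
    "Panthera", "Big cat", "Felidae", "Jaguar E-Type", "British Leyland",
}

# Phase 2 (assumed mechanism): every article belongs to one direction,
# and the user picks the direction that matches the intended sense.
direction_of = {
    "Jaguar": "Animals",
    "Panthera": "Animals",
    "Big cat": "Animals",
    "Felidae": "Animals",
    "Jaguar Cars": "Automobiles",
    "Jaguar E-Type": "Automobiles",
    "British Leyland": "Automobiles",
    "Jaguar (band)": "Music",
}
chosen_direction = "Animals"
purified = {a for a in expanded if direction_of[a] == chosen_direction}

for name, result in [
    ("keyword", keyword_hits),
    ("expanded", expanded),
    ("purified", purified),
]:
    p, r = precision(result, relevant), recall(result, relevant)
    print(f"{name:9s} precision={p:.2f} recall={r:.2f}")
```

In this toy setting the extension step raises Recall from 0.25 to 1.00, and the direction filter then raises Precision from 0.50 to 1.00 while Recall stays unchanged, which is the qualitative behaviour the abstract reports.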

Author (1)

Julian Szymański dr hab. inż.

Details

Category: Conference activity
Type: conference proceedings indexed in Web of Science
Published in: Distributed Computing and Internet Technology, pages 391-402
Language: English
Publication year: 2015
Bibliographic description: Szymański J.: Information Retrieval in Wikipedia with Conceptual Directions, in: Distributed Computing and Internet Technology, 2015, Volume 8956 of the series Lecture Notes in Computer Science, pp. 391-402.
DOI: 10.1007/978-3-319-14977-6_42
Verified by: Politechnika Gdańska