Self–Organizing Map representation for clustering Wikipedia search results

Julian Szymański

doi:10.1007/978-3-642-20042-7_15

Self–Organizing Map representation for clustering Wikipedia search results

Abstrakt

The article presents an approach to automated organization of textual data. The experiments have been performed on selected sub-set of Wikipedia. The Vector Space Model representation based on terms has been used to build groups of similar articles extracted from Kohonen Self-Organizing Maps with DBSCAN clustering. To warrant efficiency of the data processing, we performed linear dimensionality reduction of raw data using Principal Component Analysis. We introduce hierarchical organization of the categorized articles changing the granularity of SOM network. The categorization method has been used in implementation of the system that clusters results of keyword-based search in Polish Wikipedia.

Cytowania

1 0

CrossRef
0

Web of Science
1 4

Scopus

Autor (1)

Julian Szymański dr hab. inż.

Cytuj jako

Pełna treść

pełna treść publikacji nie jest dostępna w portalu

pełna treść artykułu zobacz w serwisie zewnętrznym otwiera się w nowej karcie

Słowa kluczowe

Informacje szczegółowe

Kategoria:: Aktywność konferencyjna
Typ:: materiały konferencyjne indeksowane w Web of Science
Tytuł wydania:: INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2011, PT II strony 140 - 149
ISSN:: 0302-9743
Język:: angielski
Rok wydania:: 2011
Opis bibliograficzny:: Szymański J..: Self–Organizing Map representation for clustering Wikipedia search results , W: INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2011, PT II, 2011, Springer-Verlag Berlin Heidelberg,.
DOI:: Cyfrowy identyfikator dokumentu elektronicznego (otwiera się w nowej karcie) 10.1007/978-3-642-20042-7_15
Weryfikacja:: Politechnika Gdańska