Abstract
In the article we describe the approach to parallelimplementation of elementary operations for textual data categorization.In the experiments we evaluate parallel computations ofsimilarity matrices and k-means algorithm. The test datasets havebeen prepared as graphs created from Wikipedia articles relatedwith links. When we create the clustering data packages, wecompute pairs of eigenvectors and eigenvalues for visualizationsof the datasets. We describe the method used for evaluation ofthe clustering quality. Finally we discuss achieved results, pointsome improvements and perspectives for future development.
Citations
-
0
CrossRef
-
0
Web of Science
-
0
Scopus
Authors (2)
Cite as
Full text
full text is not available in portal
Keywords
Details
- Category:
- Conference activity
- Type:
- materiały konferencyjne indeksowane w Web of Science
- Title of issue:
- 2nd International Conference on Innovative Computing Technology (INTECH) strony 293 - 298
- Language:
- English
- Publication year:
- 2012
- Bibliographic description:
- Szymański J., Krawczyk H..: Towards Effective Processing of Large Text Collections, W: 2nd International Conference on Innovative Computing Technology (INTECH), 2012, ,.
- DOI:
- Digital Object Identifier (open in new tab) 10.1109/intech.2012.6457784
- Verified by:
- Gdańsk University of Technology
seen 79 times