Improving css-KNN Classification Performance by Shifts in Training Data

Karol Draszawka; Julian Szymański; Francesco Guerra

doi:10.1007/978-3-319-27932-9_5

Improving css-KNN Classification Performance by Shifts in Training Data

Abstrakt

This paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose a training data preprocessing phase that tries to alleviate the lack of learning. The idea is to compute training data modifications, such that class representative instances are optimized before the actual k-NN algorithm is employed. The empirical text classification experiments using mid-size Wikipedia data sets show that carefully cross-validated settings of such preprocessing yields significant improvements in k-NN performance compared to classification without this step. The proposed approach can be useful for improving the effectivenes of other classifiers as well as it can find applications in domain of recommendation systems and keyword-based search.

Cytowania

4

CrossRef
0

Web of Science
2

Scopus

Autorzy (3)

Karol Draszawka mgr inż.
Julian Szymański dr hab. inż.
Francesco Guerra
- Universita’ di Modena e Reggio Emilia, Modena .

Cytuj jako

Pełna treść

pełna treść publikacji nie jest dostępna w portalu

Słowa kluczowe

Informacje szczegółowe

Kategoria:: Aktywność konferencyjna
Typ:: materiały konferencyjne indeksowane w Web of Science
Tytuł wydania:: 1st International KEYSTONE Conference (IKC) strony 51 - 63
Język:: angielski
Rok wydania:: 2015
Opis bibliograficzny:: Draszawka K., Szymański J., Guerra F..: Improving css-KNN Classification Performance by Shifts in Training Data, W: 1st International KEYSTONE Conference (IKC), 2015, ,.
DOI:: Cyfrowy identyfikator dokumentu elektronicznego (otwiera się w nowej karcie) 10.1007/978-3-319-27932-9_5
Weryfikacja:: Politechnika Gdańska