Wyniki wyszukiwania dla: WIKIPEDIA

Parallel Computations of Text Similarities for Categorization Task

Publikacja

J. Szymański

- Rok 2013

In this chapter we describe the approach to parallel implementation of similarities in high dimensional spaces. The similarities computation have been used for textual data categorization. A test datasets we create from Wikipedia articles that with their hyper references formed a graph used in our experiments. The similarities based on Euclidean distance and Cosine measure have been used to process the data using k-means algorithm....

Selection of Relevant Features for Text Classification with K-NN

Publikacja

- Rok 2013

In this paper, we describe five features selection techniques used for a text classification. An information gain, independent significance feature test, chi-squared test, odds ratio test, and frequency filtering have been compared according to the text benchmarks based on Wikipedia. For each method we present the results of classification quality obtained on the test datasets using K-NN based approach. A main advantage of evaluated...

Pełny tekst do pobrania w serwisie zewnętrznym

Text Categorization Improvement via User Interaction

Publikacja

J. Atroszko
J. Szymański
D. Gil
H. Mora

- Rok 2018

In this paper, we propose an approach to improvement of text categorization using interaction with the user. The quality of categorization has been defined in terms of a distribution of objects related to the classes and projected on the self-organizing maps. For the experiments, we use the articles and categories from the subset of Simple Wikipedia. We test three different approaches for text representation. As a baseline we use...

Pełny tekst do pobrania w serwisie zewnętrznym

How Specific Can We Be with k-NN Classifier?

Publikacja

- Rok 2014

This paper discusses the possibility of designing a two stage classifier for large-scale hierarchical and multilabel text classification task, that will be a compromise between two common approaches to this task. First of it is called big-bang, where there is only one classifier that aims to do all the job at once. Top-down approach is the second popular option, in which at each node of categories’ hierarchy, there is a flat classifier...

Pełny tekst do pobrania w serwisie zewnętrznym

Improving css-KNN Classification Performance by Shifts in Training Data

Publikacja

- Rok 2015

This paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...

Follow the Light. Where to search for useful research information

Publikacja

K. Zielińska-Dąbkowska

- ARC Lighting In Architecture - Rok 2019

Architectural Lighting Design (ALD) has never been a standalone professional discipline. Rather, it has existed as the combination of art and the science of light. Today, third generation lighting professionals are already creatively intertwining these fields, and the acceleration in scientific, technological and societal studies has only increased the need for reliable multidisciplinary information. Therefore, a thorough re-examination...

Pełny tekst do pobrania w portalu

Identification of category associations using a multilabel classifier

Publikacja

- EXPERT SYSTEMS WITH APPLICATIONS - Rok 2016

Description of the data using categories allows one to describe it on a higher abstraction level. In this way, we can operate on aggregated groups of the information, allowing one to see relationships that do not appear explicit when we analyze the individual objects separately. In this paper we present automatic identification of the associations between categories used for organization of the textual data. As experimental data...

Pełny tekst do pobrania w serwisie zewnętrznym

Filtry

Katalog

Kategoria

Rok

Opcje

Parallel Computations of Text Similarities for Categorization Task

Selection of Relevant Features for Text Classification with K-NN

Text Categorization Improvement via User Interaction

How Specific Can We Be with k-NN Classifier?

Improving css-KNN Classification Performance by Shifts in Training Data

Follow the Light. Where to search for useful research information

Identification of category associations using a multilabel classifier

Wyszukiwarka

Filtry

Katalog

Kategoria

Rok

Opcje