Search results for: INFORMATION RETRIEVAL
-
Improvement of Imperfect String Matching Based on Asymmetric n-Grams
Publication: Typical approaches to string comparison treat strings as either identical or different, without taking into account the possibility that a word is misspelled. In this article we present an approach to improving imperfect string matching that allows one to reconstruct potential string distortions. The proposed method increases the quality of imperfect string matching, allowing the lookup of misspelled words without significant...
-
Selection of Relevant Features for Text Classification with K-NN
Publication: In this paper, we describe five feature selection techniques used for text classification. Information gain, the independent significance feature test, the chi-squared test, the odds ratio test, and frequency filtering are compared on text benchmarks based on Wikipedia. For each method we present the classification quality obtained on the test datasets using a K-NN based approach. A main advantage of evaluated...
-
Interactive Information Search in Text Data Collections
Publication: This article presents a new idea for retrieval in text repositories and describes the general infrastructure of a system created to implement and test it. The implemented system differs from today’s standard search engines by introducing a process of interactive search with users and data clustering. We present the basic algorithms behind our system and the measures we used to evaluate the results. The achieved results...
-
Hanna Gaweł
People: Hanna Gaweł is a Doctoral Student at the Doctoral School in the Social Sciences, in the discipline of Social Communication and Media, at Jagiellonian University. Hanna’s research focuses on knowledge, information management, and the influence of well-served information in different formats on society. She is currently writing a PhD thesis about how information pollutants affect information regarding air quality in Polish metropolises....