DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING

Nina Rizun; Jurij Taranenko

Abstract

An algorithm and accompanying software were developed for preprocessing film reviews written in Polish. The algorithm consists of the following steps: Text Adaptation Procedure; Tokenization Procedure; Procedure of Transforming Words into Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Procedure of Presenting Documents in Vector Form (Vector Space Model); Procedure of Forming the Document Models Database. Experiments applying the algorithm to a test sample of reviews were performed, and the main conclusion was formulated.
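
The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the authors' software: the function names, the sample reviews, and the choice of raw term counts as vector weights are all assumptions made for illustration. Part-of-speech tagging and stemming / lemmatization are left out because they need a Polish morphological resource that this record does not name.

```python
import re
from collections import Counter

# Hypothetical helper names; none of them come from the original paper.

def adapt(text):
    """Text adaptation: lowercase, then replace punctuation, digits and underscores with spaces."""
    return re.sub(r"[^\w\s]|\d|_", " ", text.lower())

def tokenize(text):
    """Tokenization: split the adapted text on whitespace."""
    return text.split()

def to_bytes(tokens):
    """Byte format: encode each token as UTF-8 so Polish diacritics are stored unambiguously."""
    return [token.encode("utf-8") for token in tokens]

def build_vocabulary(tokenized_docs):
    """Collect the sorted set of distinct tokens over all documents."""
    return sorted({token for doc in tokenized_docs for token in doc})

def vectorize(tokens, vocabulary):
    """Vector Space Model: one raw term count per vocabulary entry (an assumed weighting)."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]

# Made-up sample reviews, used only to exercise the functions.
reviews = ["Świetny film, świetna obsada!", "Nudny film."]
tokenized = [tokenize(adapt(review)) for review in reviews]
vocabulary = build_vocabulary(tokenized)
vectors = [vectorize(doc, vocabulary) for doc in tokenized]

# A simple in-memory stand-in for the "document models database" step.
document_models = dict(enumerate(vectors))
```

Without lemmatization, inflected forms such as "świetny" and "świetna" occupy separate vector dimensions, which suggests why a stemming / lemmatization step matters for a highly inflected language like Polish.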

Authors (2)

Dr Nina Rizun
Jurij Taranenko
- Alfred Nobel University, Dnipro Department of Applied Linguistics and Methods of Teaching Foreign Languages

Publication version: Accepted or Published Version
License: Copyright (Wydział Zarządzania w Ciechanowie (WSM w Warszawie))

Details

Category: Journal publication
Type: articles in peer-reviewed journals and other serial publications
Published in: Rocznik Naukowy Wydzialu Zarzadzania w Ciechanowie, pages 167-188
ISSN: 1897-4716
Language: English
Year of publication: 2017
Bibliographic description: Rizun N., Taranenko J.: DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING // Rocznik Naukowy Wydzialu Zarzadzania w Ciechanowie. -., no. 1-4 (XI) (2017), pp. 167-188
Verified by: Politechnika Gdańska