Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary - Publikacja - MOST Wiedzy

Wyszukiwarka

Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary

Abstrakt

This paper presents the methodology of Textual Content Classification, which is based on a combination of algorithms: preliminary formation of a contextual framework for the texts in particular problem area; manual creation of the Hierarchical Sentiment Dictionary (HSD) on the basis of a topically-oriented Corpus; tonality texts recognition via using HSD for analysing the documents as a collection of topically completed fragments (paragraphs). For verification of the proposed methodology, a case study of Polish-language film reviews Corpora was used. The main scientific contributions of this research are: writing style of the analyzed text determines the possibility of adaptation of the Texts Classification algorithms; Hierarchically-oriented Structure of the HSD allows customizing the classification process to qualitative recognition of text tonality in the context of individual paragraphs topics; texts of Persuasive style most often are initially empowered by authors with a certain tonality. The tone, expressed in the author's opinion, effects the qualitative indicators of sentiment recognition. Negative emotions of the author usually reduce the level of vocabulary variability as well as the variety of topics raised in the document but simultaneously increase the level of unpredictability of words contextually used with both positive and negative emotional coloring

Cytowania

  • 4

    CrossRef

  • 0

    Web of Science

  • 4

    Scopus

Cytuj jako

Pełna treść

pobierz publikację
pobrano 84 razy
Wersja publikacji
Accepted albo Published Version
Licencja
Copyright (2018 by SCITEPRESS – Science and Technology Publications, Lda)

Słowa kluczowe

Informacje szczegółowe

Kategoria:
Aktywność konferencyjna
Typ:
publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Tytuł wydania:
Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management strony 1 - 9
Język:
angielski
Rok wydania:
2018
Opis bibliograficzny:
Rizun N., Waloszek W.: Methodology for Text Classification using Manually Created Corpora-based Sentiment Dictionary// Proceedings of the 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management/ 10th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management : , 2018, s.1-9
DOI:
Cyfrowy identyfikator dokumentu elektronicznego (otwiera się w nowej karcie) 10.5220/0006932602120220
Weryfikacja:
Politechnika Gdańska

wyświetlono 126 razy

Publikacje, które mogą cię zainteresować

Meta Tagi