Abstract
This paper presents an approach to modeling latent semantic relations that combines the advantages of two computational methods: Latent Semantic Analysis and Latent Dirichlet Allocation. It raises the scientific question of whether the influence of these methods' limitations on the quality of latent semantic relations analysis results can be reduced. A case study on building a two-level hierarchical contextual framework of textual corpora was performed. The main scientific contributions of this research are: using paragraphs as topically complete textual messages can guarantee that each message is centered on a single topic; collecting the topics within a corpus by identifying them in each document separately is a means of preventing the model size from increasing; and film reviews, as a specific type of textual document, have an approximately similar writing style only within corpora of the same semantic tonality.
Citations
- CrossRef: 4
- Web of Science: 0
- Scopus: 5
Full text
- Publication version: Accepted or Published Version
- License: Copyright (2017, IEEE)
Details
- Category: Conference activity
- Type: publication in a peer-reviewed collective volume (including conference proceedings)
- Title of issue: Intelligent Computing and Information Systems (ICICIS), 2017 Eighth International Conference on, pages 366-372
- Language: English
- Publication year: 2018
- Bibliographic description: Rizun N., Waloszek W.: The algorithm of building the hierarchical contextual framework of textual corpora // Intelligent Computing and Information Systems (ICICIS), 2017 Eighth International Conference on, 2018, pp. 366-372
- DOI: 10.1109/intelcis.2017.8260064
- Verified by: Gdańsk University of Technology