Wyniki wyszukiwania dla: TEXT REPRESENTATION DOCUMENTS CATEGORIZATION INFORMATION RETRIEVAL

Wyniki wyszukiwania dla: TEXT REPRESENTATION DOCUMENTS CATEGORIZATION INFORMATION RETRIEVAL

wyników na stronę:
osadź ten widok na swojej stronie

Filtry

wszystkich: 445

wyczyść wszystkie filtry niedostępne

Anna Baj-Rogowska dr

Osoby

Katedra Informatyki w Zarządzaniu

Anna Baj-Rogowska zatrudniona jest na stanowisku adiunkta w Katedrze Informatyki w Zarządzaniu (Politechnika Gdańska, Wydział Zarządzania i Ekonomii). Jej wyższa edukacja związana jest z Uniwersytetem Gdańskim, gdzie ukończyła magisterskie studia informatyczne, studia doktoranckie i następnie uzyskała stopień naukowy doktora nauk ekonomicznych w zakresie nauk o zarządzaniu (Katedra Informatyki Ekonomicznej na Wydziale Zarządzania...
Improving css-KNN Classification Performance by Shifts in Training Data
Publikacja
- K. Draszawka
- J. Szymański
- F. Guerra
- Rok 2015
This paper presents a new approach to improve the performance of a css-k-NN classifier for categorization of text documents. The css-k-NN classifier (i.e., a threshold-based variation of a standard k-NN classifier we proposed in [1]) is a lazy-learning instance-based classifier. It does not have parameters associated with features and/or classes of objects, that would be optimized during off-line learning. In this paper we propose...
An Analysis of Neural Word Representations for Wikipedia Articles Classification
Publikacja
- J. Szymański
- N. Kawalec
- CYBERNETICS AND SYSTEMS - Rok 2019
One of the current popular methods of generating word representations is an approach based on the analysis of large document collections with neural networks. It creates so-called word-embeddings that attempt to learn relationships between words and encode this information in the form of a low-dimensional vector. The goal of this paper is to examine the differences between the most popular embedding models and the typical bag-of-words...

Pełny tekst do pobrania w serwisie zewnętrznym
Improving the Accuracy in Sentiment Classification in the Light of Modelling the Latent Semantic Relations
Publikacja
- N. Rizun
- W. Waloszek
- Y. Taranenko
- Information - Rok 2018
The research presents the methodology of improving the accuracy in sentiment classification in the light of modelling the latent semantic relations (LSR). The objective of this methodology is to find ways of eliminating the limitations of the discriminant and probabilistic methods for LSR revealing and customizing the sentiment classification process (SCP) to the more accurate recognition of text tonality. This objective was achieved...

Pełny tekst do pobrania w portalu
Concept description vectors and the 20 question game
Publikacja
- J. Szymański
- T. Sarnatowicz
- W. Duch
- Rok 2005
Knowledge of properties that are applicable to a given object is a necessary prerequisite to formulate intelligent question. Concept description vectors provide simplest representation of this knowledge, storing for each object information about the values of its properties. Experiments with automatic creation of concept description vectors from various sources, including ontologies, dictionaries, encyclopedias and unstructured...

Pełny tekst do pobrania w serwisie zewnętrznym
Development and Research of the Text Messages Semantic Clustering Methodology
Publikacja
- N. Rizun
- P. Kapłański
- Y. Taranenko
- Rok 2016
The methodology of semantic clustering analysis of customer’s text-opinions collection is developed. The author's version of the mathematical models of formalization and practical realization of short textual messages semantic clustering procedure is proposed, based on the customer’s text-opinions collection Latent Semantic Analysis knowledge extracting method. An algorithm for semantic clustering of the text-opinions is developed,...

Pełny tekst do pobrania w portalu
Fusion-based Representation Learning Model for Multimode User-generated Social Network Content
Publikacja
- A. M. Soomar
- ACM Journal of Data and Information Quality - Rok 2023
As mobile networks and APPs are developed, user-generated content (UGC), which includes multi-source heterogeneous data like user reviews, tags, scores, images, and videos, has become an essential basis for improving the quality of personalized services. Due to the multi-source heterogeneous nature of the data, big data fusion offers both promise and drawbacks. With the rise of mobile networks and applications, UGC, which includes...

Pełny tekst do pobrania w serwisie zewnętrznym
Just look at to open it up: A biometric verification facility for password autofill to protect electronic documents
Publikacja
- M. Smiatacz
- B. Wiszniewski
- MULTIMEDIA TOOLS AND APPLICATIONS - Rok 2021
Electronic documents constitute specific units of information, and protecting them against unauthorized access is a challenging task. This is because a password protected document may be stolen from its host computer or intercepted while on transfer and exposed to unlimited offline attacks. The key issue is, therefore, making document passwords hard to crack. We propose to augment a common text password authentication interface...

Pełny tekst do pobrania w portalu
Agile Commerce in the light of Text Mining
Publikacja
- A. Baj-Rogowska
- Przedsiębiorczość i Zarządzanie - Rok 2017
The survey conducted for this study reveals that more than 84% of respondents have never encountered the term “agile commerce” and do not understand its meaning. At the same time, they are active participants of this strategy. Using digital channels as customers more often than ever before, they have already been included in the agile philosophy. Based on the above, the purpose of the study is to analyse major text sets containing...

Pełny tekst do pobrania w portalu
Ontologies vs. Rules — Comparison of Methods of Knowledge Representation Based on the Example of IT Services Management
Publikacja
- A. Czarnecki
- T. Sitek
- Rok 2013
This text provides a brief overview of selected structures aimed at knowledge representation in the form of ontologies based on description logic and aims at comparing them with their counterparts based on the rule-based approach. Due to the limitations on the length of the article, only elements associated with the representation of concepts could be shown, without including roles. The formalisms of the OWL language were used...

Pełny tekst do pobrania w serwisie zewnętrznym
Semantic Analysis and Text Summarization in Socio-Technical Systems
Publikacja
- N. Rizun
- Rok 2018
In this chapter the authors present the results of the development the methodology for increasing the reliability of the functioning of the Socio-Technical System. The existed methods and algorithms for processing unstructured (textual) information were studied. Taking into account noted above strengths and weaknesses of Discriminant and Probabilistic approaches of Latent Semantic Relations analysis in of the summarization projection...

Pełny tekst do pobrania w serwisie zewnętrznym
Methodology of Selecting the Hadoop Ecosystem Configuration in Order to Improve the Performance of a Plagiarism Detection System
Publikacja
- A. Sobecki
- M. Kępa
- Rok 2018
The plagiarism detection problem involves finding patterns in unstructured text documents. Similarity of documents in this approach means that the documents contain some identical phrases with defined minimal length. The typical methods used to find similar documents in dig- ital libraries are not suitable for this task (plagiarism detection) because found documents may contain similar content and we have not any war- ranty that...

Pełny tekst do pobrania w serwisie zewnętrznym
Context Search Algorithm for Lexical Knowledge Acquisition
Publikacja
- J. Szymański
- W. Duch
- CONTROL AND CYBERNETICS - Rok 2012
A Context Search algorithm used for lexical knowledge acquisition is presented. Knowledge representation based on psycholinguistic theories of cognitive processes allows for implementation of a computational model of semantic memory in the form of semantic network. A knowledge acquisition using supervised dialog templates have been performed in a word game designed to guess the concept a human user is thinking about. The game,...
SEMANTIC ANALYSIS ALGORITHMS FOR KNOWLEDGE WORKERS SUPPORT
Publikacja
- N. Rizun
- M. Rizun
- J. Taranenko
- Rok 2017
The paper examines various aspects of text analysis application for knowledge worker’s activity realization. Conclusions are drawn about the relevance and importance of processing the non-structured textual information in order to increase knowledge worker’s efficiency, as well as their awareness in different branches of science. The paper considers the existing algorithms of texts semantic analysis as the sphere of documents topical...

Pełny tekst do pobrania w portalu
Information Retrieval Facility Conference

Konferencje
Asia Information Retrieval Symposium

Konferencje
European Conference on Information Retrieval

Konferencje
SIGIR workshop: Stylistic Analysis of Text For Information Access

Konferencje
DEVELOPMENT OF THE ALGORITHM OF POLISH LANGUAGE FILM REVIEWS PREPROCESSING
Publikacja
- N. Rizun
- J. Taranenko
- Rocznik Naukowy Wydzialu Zarzadzania w Ciechanowie - Rok 2017
The algorithm and the software for conducting the procedure of Preprocessing of the reviews of films in the Polish language were developed. This algorithm contains the following steps: Text Adaptation Procedure; Procedure of Tokenization; Procedure of Transforming Words into the Byte Format; Part-of-Speech Tagging; Stemming / Lemmatization Procedure; Presentation of Documents in the Vector Form (Vector Space Model) Procedure; Forming...

Pełny tekst do pobrania w portalu
Music Mood Visualization Using Self-Organizing Maps
Publikacja
- M. Piotrowska
- B. Kostek
- Archives of Acoustics - Rok 2015
Due to an increasing amount of music being made available in digital form in the Internet, an automatic organization of music is sought. The paper presents an approach to graphical representation of mood of songs based on Self-Organizing Maps. Parameters describing mood of music are proposed and calculated and then analyzed employing correlation with mood dimensions based on the Multidimensional Scaling. A map is created in which...

Pełny tekst do pobrania w portalu
Ontologie vs. reguły — porównanie metod reprezentacji wiedzy na przykładzie dziedziny zarządzania usługami informatycznymi
Publikacja
- A. Czarnecki
- T. Sitek
- Ekonomiczne Problemy Usług - Rok 2013
Tekst stanowi krótki przegląd wybranych konstrukcji służących reprezentacji wiedzy w postaci ontologii opartych na logice opisowej i porównanie ich z odpowiednikami opartymi na zapisie regułowym. Z powodu ograniczonej liczby stron pokazano tylko elementy związane z reprezentacją konceptów, bez uwzględniania ról. Do zapisu ontologii wykorzystano formalizmy języka OWL, zaś reguły wyrażono w Prologu. Dla lepszego zilustrowania tych...

Pełny tekst do pobrania w portalu
Krystyna Dziubich mgr inż.

Osoby

Katedra Architektury Systemów Komputerowych

1996 r ukończone jednolite dzienne studia magisterskie na WETI, kierunek Informatyka; Specjalność: Informatyczne zarządzanie przedsiębiorstwem (WETI); 1996-2005 zatrudnienie z przemyśle, w zawodzie informatyk jako specjalista analityk w Departamencie Rozwoju Systemów Zarządzania; od roku 2005 - asystent, a następnie wykładowca PG WETI KASK; Wieloletnie zaangażowanie w opracowywanie i prowadzenie zajęć dydaktycznych na Studiach...
Internal legal acts of technical and medical universities in Poland regulating classes conducted in-person during the Covid-19 pandemic
Dane Badawcze
open access
- K. Górak-Sosnowska
- L. Tomaszewska
A database of legal acts and other internal documents of medical and technical universities in Poland regulating the way of organizing in-person or hybrid classes during the COVID-19 pandemic from the summer semester 2019/2020 to the winter semester 2020/2021.Documents were encoded in two separate coding systems using the MAXQDA program for qualitative...
Speech Analytics Based on Machine Learning
Publikacja
- Rok 2019
In this chapter, the process of speech data preparation for machine learning is discussed in detail. Examples of speech analytics methods applied to phonemes and allophones are shown. Further, an approach to automatic phoneme recognition involving optimized parametrization and a classifier belonging to machine learning algorithms is discussed. Feature vectors are built on the basis of descriptors coming from the music information...

Pełny tekst do pobrania w serwisie zewnętrznym
Towards Increasing Density of Relations in Category Graphs
Publikacja
- Rok 2014
In the chapter we propose methods for identifying new associations between Wikipedia categories. The first method is based on Bag-of-Words (BOW) representation of Wikipedia articles. Using similarity of the articles belonging to different categories allows to calculate the information about categories similarity. The second method is based on average scores given to categories while categorizing documents by our dedicated score-based...

Pełny tekst do pobrania w serwisie zewnętrznym
Retrieval of Heterogeneus Sevices in C2NIWA Repository
Publikacja
- J. Szymański
- TASK Quarterly - Rok 2015
The paper reviews the methods used for retrieval of information and services. The selected approaches presented in the review inspired us to build retrieval mechanisms in a system for searching the resources stored in the C2NIWA repository. We describe the architecture of the system, its functions and the surrounding subsystems to which it is related. For retrieval of C2NIWA sevices we propos three approaches based on: keyword...

Pełny tekst do pobrania w portalu
Context-Aware Indexing and Retrieval for Cognitive Systems Using SOEKS and DDNA
Publikacja
- C. De Silva Oliveira
- C. Sanin
- E. Szczerbicki
- Advances in Intelligent Systems and Computing - Rok 2019
Visual content searching, browsing and retrieval tools have been a focus area of interest as they are required by systems from many different domains. Context-based, Content-Based, and Semantic-based are different approaches utilized for indexing/retrieving, but have their drawbacks when applied to systems that aim to mimic the human capabilities. Such systems, also known as Cognitive Systems, are still limited in terms of processing...

Pełny tekst do pobrania w portalu
Marek Czachor prof. dr hab.

Osoby

Instytut Fizyki i Informatyki Stosowanej
International Conference on the Theory of Information Retrieval (The 3rd ACM International Conference on the Theory of Information Retrieval)

Konferencje
CAD. Integrated Architectural Design, MSc Arch (2022/2023)
Kursy Online
- D. Cyparski
The programme will provide students with a solid grounding in BIM (Building Information Modelling) using Autodesks Revit Architecture. Students will review the advanced features of Revit for Architecture, a tool to support BIM (Building Information Modelling) and delivery of 3D digital models and related documentation. The lesson plans will specifically introduce students to common workflows and problem-solving skills while creating...
CAD. Integrated Architectural Design, BSc Arch (2023-24)
Kursy Online
- D. Cyparski
The programme will provide students with a solid grounding in BIM (Building Information Modelling) using Autodesks Revit Architecture. Students will review the advanced features of Revit for Architecture, a tool to support BIM (Building Information Modelling) and delivery of 3D digital models and related documentation. The lesson plans will specifically introduce students to common workflows and problem-solving skills while creating...
DBpedia and YAGO Based System for Answering Questions in Natural Language
Publikacja
- Rok 2018
In this paper we propose a method for answering class 1 and class 2 questions (out of 5 classes defined by Moldovan for TREC conference) based on DBpedia and YAGO. Our method is based on generating dependency trees for the query. In the dependency tree we look for paths leading from the root to the named entity of interest. These paths (referenced further as fibers) are candidates for representation of actual user intention. The...

Pełny tekst do pobrania w portalu
Contextual ontology for tonality assessment
Publikacja
- W. Waloszek
- N. Rizun
- Procedia Computer Science - Rok 2020
classification tasks. The discussion focuses on two important research hypotheses: (1) whether it is possible to construct such an ontology from a corpus of textual document, and (2) whether it is possible and beneficial to use inferencing from this ontology to support the process of sentiment classification. To support the first hypothesis we present a method of extraction of hierarchy of contexts from a set of textual documents...

Pełny tekst do pobrania w portalu
Semantic Memory for Avatars in Cyberspace
Publikacja
- J. Szymański
- T. Sarnatowicz
- W. Duch
- Rok 2005
Avatars that show intelligent behavior should have an access to general knowledge about the world, knowledge that humans store in their semantic memories. The simplest knowledge representation for semantic memory is based on the Concept Description Vectors (CDVs) that store, for each concept, an information whether a given property can be applied to this concept or not. Unfortunately large-scale semantic memories are not available....
Next Generation Digital
Publikacja
- B. Wiszniewski
- Pan European Networks: Science & Technology - Rok 2013
The paper outlines the major objectives of the MENAID research project, eimed at novel architectures of digital documents. Such documents will enable reduction of information overflow and strain, a major threat to the growth of a digital society. They will be forward compatible, technology neutral and lightweight, allowing workers of network organizations to use personal devices of any type.

Pełny tekst do pobrania w serwisie zewnętrznym
Modeling the Customer’s Contextual Expectations Based on Latent Semantic Analysis Algorithms
Publikacja
- Rok 2017
Nowadays, in the age of Internet, access to open data detects the huge possibilities for information retrieval. More and more often we hear about the concept of open data which is unrestricted access, in addition to reuse and analysis by external institutions, organizations and people. It’s such information that can be freely processed, add another data (so-called remix) and then published. More and more data are available in text...

Pełny tekst do pobrania w portalu
Towards Healthcare Cloud Computing
Publikacja
- Rok 2016
In this paper we present construction of a software platform for supporting medical research teams, in the area of impedance cardiography, called IPMed. Using the platform, research tasks will be performed by the teams through computer-supported cooperative work. The platform enables secure medical data storing, access to the data for research group members, cooperative analysis of medical data and provide analysis supporting tools...

Pełny tekst do pobrania w serwisie zewnętrznym
Machine Learning and Text Analysis in an Artificial Intelligent System for the Training of Air Traffic Controllers
Publikacja
- T. Shmelova
- Y. Sikirda
- N. Rizun
- V. Lazorenko
- V. Kharchenko
- Rok 2020
This chapter presents the application of new information technology in education for the training of air traffic controllers (ATCs). Machine learning, multi-criteria decision analysis, and text analysis as the methods of artificial intelligence for ATCs training have been described. The authors have made an analysis of the International Civil Aviation Organization documents for modern principles of ATCs education. The prototype...

Pełny tekst do pobrania w portalu
Workflow patterns applicable to virtual knowledge-based organizations
Publikacja
- M. Godlewska
- Zeszyty Naukowe Wydziału ETI Politechniki Gdańskiej. Technologie Informacyjne - Rok 2010
Workflow is a term specifying how to automate a business process, in whole or part during which documents, information or tasks are passed from one participant to another for action, according to a set of procedural rules. Workflow is therefore directly applicable in virtual knowledge-based organizations, where information is exchanged via electronic documents. In the literature, is presented a complete list of workflow control-flow...
System of specific grants for local government units in Poland
Publikacja
- A. Sekuła
- Rok 2009
The article analyses the system of specific grants in local governments in Poland. First, main revenue sources of local self-governments are presented. Their presentation is based upon the consideration of one of the basic important principles in democratic states today, i.e. decentralization. The text then, in more details, describes specific grants with respect to the European Charter of Local Self-Government. Subsequently, the...
Gaining knowledge through experience: developing decisional DNA applications in robotics
Publikacja
- H. Zhang
- C. Sanin
- E. Szczerbicki
- CYBERNETICS AND SYSTEMS - Rok 2010
Omówiono nowatorskie podejscie do zastosowania wiedzy opartej na doświadczeniu i budowie decyzyjnego DNA w obszarach związanych z robotyką.In this article, we explore an approach that integrates Decisional DNA, a domain-independent, flexible, and standard knowledge representation structure, with robots in order to test the usability and suitability of this novel knowledge representation structure. Core issues in using this Decisional...

Pełny tekst do pobrania w serwisie zewnętrznym
Facial data registration facility for biometric protection of electronic documents
Publikacja
- Rok 2014
In modern world, information is crucial, and its leakage may lead to serious losses. Documents as the main medium of information must be therefore highly protected. Nowadays, the most common way of protecting data is using passwords, however it seems inconvenient to type complex passwords, when it is needed many times a day. For that reason a significant research has been conducted on biometric authentication...
ACM SIGIR Workshop on XML and Information Retrieval

Konferencje
International Symposium on String Processing and Information Retrieval

Konferencje
Magdalena Szuflita-Żurawska

Osoby

Politechnika Gdańska, Sekcja Informacji Naukowo-Technicznej, Biblioteka PG

Magdalena Szuflita-Żurawska jest kierownikiem Sekcji Informacji Naukowo-Technicznej na Politechnice Gdańskiej oraz Liderem Centrum Kompetencji Otwartej Nauki przy Bibliotece Politechniki Gdańskiej. Jej główne zainteresowania badawcze koncentrują się w obszarze komunikacji naukowej oraz otwartych danych badawczych, a także motywacji i produktywności naukowej. Jest odpowiedzialna między innymi za prowadzenie szkoleń dla pracowników...
Manufacturing Data Analysis in Internet of Things/Internet of Data (IoT/IoD) Scenario
Publikacja
- E. Szczerbicki
- S. I. Shafiq
- C. Sanin
- CYBERNETICS AND SYSTEMS - Rok 2018
Computer integrated manufacturing (CIM) has enormous benefits as it increases the rate of production, reduces errors and production waste, and streamlines manufacturing sub-systems. However, there are some new challenges related to CIM operating in the Internet of Things/Internet of Data (IoT/IoD) scenarios associated with Industry 4.0 and cyber-physical systems. The main challenge is to deal with the massive volume of data flowing...

Pełny tekst do pobrania w portalu
Self-Organizing Map representation for clustering Wikipedia search results
Publikacja
- J. Szymański
- LECTURE NOTES IN COMPUTER SCIENCE - Rok 2011
The article presents an approach to automated organization of textual data. The experiments have been performed on selected sub-set of Wikipedia. The Vector Space Model representation based on terms has been used to build groups of similar articles extracted from Kohonen Self-Organizing Maps with DBSCAN clustering. To warrant efficiency of the data processing, we performed linear dimensionality reduction of raw data using Principal...
Self–Organizing Map representation for clustering Wikipedia search results
Publikacja
- J. Szymański
- Rok 2011
The article presents an approach to automated organization of textual data. The experiments have been performed on selected sub-set of Wikipedia. The Vector Space Model representation based on terms has been used to build groups of similar articles extracted from Kohonen Self-Organizing Maps with DBSCAN clustering. To warrant efficiency of the data processing, we performed linear dimensionality reduction of raw data using Principal...

Pełny tekst do pobrania w serwisie zewnętrznym
ACM International Conference on Research and Development in Information Retrieval

Konferencje
Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives
Publikacja
- T. Souza
- E. Demidova
- T. Risse
- H. Holzmann
- G. Gossen
- J. Szymański
- Rok 2015
Long-term Web archives comprise Web documents gathered over longer time periods and can easily reach hundreds of terabytes in size. Semantic annotations such as named entities can facilitate intelligent access to the Web archive data. However, the annotation of the entire archive content on this scale is often infeasible. The most efficient way to access the documents within Web archives is provided through their URLs, which are...

Pełny tekst do pobrania w serwisie zewnętrznym

Wyszukiwarka

Filtry

Katalog

Wyniki wyszukiwania dla: TEXT REPRESENTATION DOCUMENTS CATEGORIZATION INFORMATION RETRIEVAL

Anna Baj-Rogowska dr

Krystyna Dziubich mgr inż.

Marek Czachor prof. dr hab.