Search results for: DOCUMENTS CATEGORIZATION - Bridge of Knowledge

Search

Search results for: DOCUMENTS CATEGORIZATION

Search results for: DOCUMENTS CATEGORIZATION

  • Text classifiers for automatic articles categorization

    Publication

    The article concerns the problem of automatic classification of textual content. We present selected methods for generation of documents representation and we evaluate them in classification tasks. The experiments have been performed on Wikipedia articles classified automatically to their categories made by Wikipedia editors.

  • Categorization of Wikipedia articles with spectral clustering

    Abstract. The article reports application of clustering algorithms for creating hierarchical groups withinWikipedia articles.We evaluate three spectral clustering algorithms based on datasets constructed with usage ofWikipedia categories. Selected algorithm has been implemented in the system that categorize Wikipedia search results in the fly.

  • Categorization of Cloud Workload Types with Clustering

    The paper presents a new classification schema of IaaS cloud workloads types, based on the functional characteristics. We show the results of an experiment of automatic categorization performed with different benchmarks that represent particular workload types. Monitoring of resource utilization allowed us to construct workload models that can be processed with machine learning algorithms. The direct connection between the functional...

    Full text to download in external service

  • LEVEL OF DETAIL CATEGORIZATION FOR THE APPLICATION IN URBAN DESIGN

    Publication

    - Przestrzeń i Forma - Year 2023

    Urban planning and urban design involve complex processes that require detailed information about the visual information of a place at various scales. Different graphic tools, such as game engines, are evolving to use urban representation fields. The concept of "level of detail" (LOD) has been used to categorize the level of detail in AEC applications such as BIM and GML for urban representation models. However, there is a need...

    Full text available to download

  • Parallel Computations of Text Similarities for Categorization Task

    Publication

    - Year 2013

    In this chapter we describe the approach to parallel implementation of similarities in high dimensional spaces. The similarities computation have been used for textual data categorization. A test datasets we create from Wikipedia articles that with their hyper references formed a graph used in our experiments. The similarities based on Euclidean distance and Cosine measure have been used to process the data using k-means algorithm....

  • Text Categorization Improvement via User Interaction

    Publication

    - Year 2018

    In this paper, we propose an approach to improvement of text categorization using interaction with the user. The quality of categorization has been defined in terms of a distribution of objects related to the classes and projected on the self-organizing maps. For the experiments, we use the articles and categories from the subset of Simple Wikipedia. We test three different approaches for text representation. As a baseline we use...

    Full text to download in external service

  • Schema mining in XML documents.

    Publication

    - Year 2004

    W artykule przedstawiono algorytm COBWEB S+T służący do wywodzenia schematów z kolekcji dokumentów XML. Algorytm wykorzystuje model danych semistrukturalnych oraz alorytm COBWEB służący do grupowania koncepcyjnego. W artykule zaprezentowano również wyniki testów działania algorytmu.

  • Knowledge management implementation in small and micro KIBS : A categorization

    Publication

    - Knowledge and Process Management - Year 2023

    he main goal of the paper is to provide a statistical categorization of small and micro knowledge-intensive business service (KIBS) companies, based on their knowledge management (KM) attitude. Since knowledge is the main production factor and output of these companies, it is essential to achieve a better understanding of how they manage this resource. A questionnaire-based survey was conducted on a sample of Polish small and micro...

    Full text available to download

  • Text categorization with semantic commonsense knowledge: First results

    Publication

    - Year 2008

    Do przetwarzania tekstów typowo wykorzystuje się reprezentacjeBOW. Podejście takie nie daje jednak dobrych rezultatów w sytuacjigdy podobne dokumenty nie współdzielą ze sobą słów.W artykule zaprezentowano podejście do konstrukcji funkcjijądra dla klasyfikatorów SVM opartego na zewnętrznej bazie wiedzyo pojęciach językowych.

  • Linking music data in executable documents

    Publication

    - Year 2022

    This paper presents the application of Interactive Open Document Architecture (IODA) to music and video data. This architecture was design to create multilayer documents which consist of many files. The paper shows the method of creating media documents on the basis of IODA. These kind of documents were called IODA Media Documents (IMD). IMD have links that connect many different kinds of files containing music and video data....

    Full text to download in external service

  • Ruling lines removal in handwritten documents

    Publication

    - Year 2013

  • Querying the digital database of musical documents

    Publication

    W rozdziale zaprezentowano program Melody Explorer służący do formułowania zapytań dla bazy danych dokumentów muzycznych. Przedstawiono problemy związane z konwersją informacji wprowadzanych przez użytkownika na zapis nutowy. Zaproponowano ulepszenia istniejących rozwiązań mające na celu poprawę dokładności i stabilności systemu. Oprócz cyfrowego zapisu dźwięku również podany przez użytkownika rytm melodii wykorzystywany jest do...

  • Green energy in municipal planning documents

    Publication
    • A. Bazan-Krzywoszańska
    • M. Skiba
    • M. Mrówczyńska
    • M. Sztubecka
    • D. Bazuń
    • M. Kwiatkowski

    - E3S Web of Conferences - Year 2018

    Full text to download in external service

  • Augmenting digital documents with negotiation capability

    Publication

    Active digital documents are not only capable of performing various operations using their internal functionality and external services, accessible in the environment in which they operate, but can also migrate on their own over a network of mobile devices that provide dynamically changing execution contexts. They may imply conflicts between preferences of the active document and the device the former wishes to execute on. In the...

    Full text available to download

  • Categorization of emotions in dog behavior based on the deep neural network

    The aim of this article is to present a neural system based on stock architecture for recognizing emotional behavior in dogs. Our considerations are inspired by the original work of Franzoni et al. on recognizing dog emotions. An appropriate set of photographic data has been compiled taking into account five classes of emotional behavior in dogs of one breed, including joy, anger, licking, yawning, and sleeping. Focusing on a particular...

    Full text available to download

  • The categorization of the tourist services quality perception determinants in hierarchical conception

    Publication

    - Year 2011

    W niniejszym rozdziale zaprezentowano kategoryzację determinant percepcji jakości usług turystycznych. Omówione zostały główne grupy oraz determinanty elementarne z wykorzystaniem ujęcia modeli hierarchicznych.

  • Documenta Mathematica

    Journals

    ISSN: 1431-0643

  • DOCUMENTA OPHTHALMOLOGICA

    Journals

    ISSN: 0012-4486 , eISSN: 1573-2622

  • Documenta Pragensia

    Journals

    ISSN: 0231-7443

  • Documenta Praehistorica

    Journals

    ISSN: 1408-967X

  • Documenti Geografici

    Journals

    ISSN: 2035-8792 , eISSN: 2281-7549

  • Deep learning for recommending subscription-limited documents

    Publication

    Documents recommendation for a commercial, subscription-based online platform is important due to the difficulty in navigation through a large volume and diversity of content available to clients. However, this is also a challenging task due to the number of new documents added every day and decreasing relevance of older contents. To solve this problem, we propose deep neural network architecture that combines autoencoder with...

    Full text available to download

  • Text Documents Classification with Support Vector Machines

    Publication
    • P. Majewski

    - Year 2008

  • Planning documents and sustainable development of a commune in Poland

    Publication
    • A. Stacherzak
    • M. Hełdak
    • B. Raszka

    - Year 2012

    Full text to download in external service

  • Intelligent system for editing and analysis of examination documents

    Publication

    - Year 2006

    Opisano ogólną koncepcję systemu IATE - systemu do edycji i automatycznej analizy testów egzaminacyjnych. Edytor systemu umożliwia generację 4 typów testów o dowolnej liczbie pytań (do 8 stron tekstu), różnej formie udzielania odpowiedzi oraz możliwością tworzenia wariantów testu. Bardziej szczegółowo opisano wybrane fragmenty systemu: analizę nagłówka testu, edycję i organizację segmentu tworzenia wariantów testu oraz organizację...

  • Computer analysis of multiple-choice examination documents

    Publication

    - Year 2004

    Opisany system AATE wyposażony jest w edytor testów, za pomocą którego egzaminator przygotowuje test egzaminacyjny. Utworzony test ze swoimi parametrami jest pamiętany w bazie danych i następnie może być wydrukowany. Po przeprowadzeniu egzaminu wypełnione formularze za pomocą skanera z podajnikiem wprowadza się do komputera. W komputerze system analizuje formularze i odczytane odpowiedzi porównuje się z wzorcami przechowywanymi...

  • Documents d'archéologie méridionale

    Journals

    ISSN: 0184-1068

  • Document Numerique

    Journals

    ISSN: 1279-5127

  • The potential of computational methods for the categorization of architectural objects on the example of media architecture

    Publication

    The paper presents an example of the categorization of architectural objects and assessment of the characteristics of urban space, based on the analysis of specific features of architectural objects and urban landscape. The conducted analysis refers to media architecture and is presented in the complex context of the development of media solutions. The field of influence of IT on architecture is also stressed, both on the architect’s...

    Full text available to download

  • Document centric knowledge processes

    Publication

    - Year 2022

    .

    Full text to download in external service

  • For Your Eyes Only – Biometric Protection of PDF Documents

    Publication

    The paper introduces a concept of a digital document content encryption/decryption with facial biometric data coming from a legitimate user. Access to the document content is simple and straightforward, especially during collaborative work with mobile devices equipped with cameras. Various contexts of document exchange are presented with regard to the next generation pro-active digital documents proposed by authors. An important...

    Full text available to download

  • Agent System for Managing Distributed Mobile Interactive Documents

    The MIND architecture of distributed mobile interactive document is a new processing model defined for facilitate informed decision-making in non-algorithmic decision-making processes carried out by knowledge-based organizations. The aim of this architecture is to change the static document to mobile agents, which are designed to implement the structure of the organization through autonomous migration between knowledge workers...

    Full text available to download

  • External Validation Measures for Nested Clustering of Text Documents

    Publication

    Abstract. This article handles the problem of validating the results of nested (as opposed to "flat") clusterings. It shows that standard external validation indices used for partitioning clustering validation, like Rand statistics, Hubert Γ statistic or F-measure are not applicable in nested clustering cases. Additionally to the work, where F-measure was adopted to hierarchical classification as hF-measure, here some methods to...

  • Two Stage SVM and kNN Text Documents Classifier

    Publication

    - Year 2015

    The paper presents an approach to the large scale text documents classification problem in parallel environments. A two stage classifier is proposed, based on a combination of k-nearest neighbors and support vector machines classification methods. The details of the classifier and the parallelisation of classification, learning and prediction phases are described. The classifier makes use of our method named one-vs-near. It is...

  • Semantic Driven Table Understanding in Born-Digital Documents

    Publication

    - Year 2014

    This paper presents a new approach to table understanding, suitable for born-digital PDF documents. Advance beyond the current state of the art in table understanding is provided by the proposed reverse MVC method, which takes advantage of only partial logic structure loss (degradation) in born-digital PDF documents, as opposed to unrecoverable loss (deterioration) taking place in scan based PDF documents.

    Full text to download in external service

  • Digital document life cycle development

    Publication

    Przedstawiono model DDLC wytwarzania interaktywnych dokumentów cyfrowych z ich pierwowzorów papierowych. Model DDLC opracowany w ramach 5 PR UE IST-2002-33441 MEMORIAL wyróżnia 6 faz i odpowiednie grupy funkcjonalności narzędzi do ich realizacji. Cykl wytwarzanie realizuje politykę całkowitej kontroli jakości, wykorzystującej specjalnie opracowaną metodę Visual GQM.

  • Document Agents with the Intelligent Negotiations Capability

    Publication

    The paper focus is on augmenting proactive document-agents with built -in intelligence to enable them to recognize execution context provided by devices visited durning the business process, and to reach collaboration agreement despite of their conflicting requirements. We propose a solution based on neural networks to improve simple multi-issue negotiation between the document and the device, practically with no excessive cost...

  • Dokumenty Cyfrowe Przyszłosci

    Publication

    - Year 2013

    W referacie przedstawiono nowe modele architektur dokumentów elektronicznych, które pozwolą zracjonalizować wewnętrzny obieg informacji w organizacjach opartych na wiedzy i zredukować koszty ich funkcjonowania.

  • Improving the Workflow for Creation of Textual Versions of Polish Historical Documents

    Publication
    • A. Dudczak
    • M. Kmieciak
    • C. Mazurek
    • M. Stroiński
    • M. Werla
    • J. Węglarz

    - Year 2013

    Full text to download in external service

  • Representation of hypertext documents based on terms, Links and text compressibility

    Publication

    Opisano metody reprezentacji dokumentów tekstowych oparte na słowach, wzajemnych powiązaniach i metodach kompresji. Dokonano ich oceny w oparciu o klasyfikator SVM.

  • Visual GQM approach to quality driven development of electronic documents.

    Publication

    Jednym z celów projektu europejskiego MEORIAL jest opracowanie nowej technologii wytwarzania webowych systemów informacyjnych wykorzystujących interaktywne dokumenty cyfrowe wytworzone z papierowych oryginałów z zastosowaniem zaawansowanych technik przetwarzania i rozpoznania obrazów. Wieloelementowy model cyklu życia dokumentu cyfrowego przedstawiony w artykule stanowi postawę opracowanej technologii.

    Full text available to download

  • Facial data registration facility for biometric protection of electronic documents

    Publication

    In modern world, information is crucial, and its leakage may lead to serious losses. Documents as the main medium of information must be therefore highly protected. Nowadays, the most common way of protecting data is using passwords, however it seems inconvenient to type complex passwords, when it is needed many times a day. For that reason a significant research has been conducted on biometric authentication...

  • Kryteria wytrzymałości gruntu na ścinanie w zagadnieniach geotechniki

    Publication

    - Year 2005

    Przedstawiono wpływ zastosowania różnych kryteriów wytrzymałości gruntu na ścinanie w symulacjach numerycznych prostych praktycznych zagadnień geotechnicznych. Obliczenia wykonano metodą elementów skończonych w płaskim oraz osiowosymetrycznym stanie odkształcenia. Wyniki obliczeń porównano oraz poddano krytycznej dyskusji.

  • Documents d Analisi Geografica

    Journals

    ISSN: 0212-1573

  • Documenta et Instrumenta

    Journals

    ISSN: 1697-4328 , eISSN: 1697-3798

  • Studia et Documenta

    Journals

    ISSN: 1970-4879

  • Cine Documental

    Journals

    ISSN: 1852-4699

  • The Application of the IODA Document Architecture to Music Data

    Publication

    - Year 2014

    This paper is concerned with storing music data with the use of document architecture called Interactive Open Document Architecture (IODA). This architecture makes it possible to create documents which are executable, mobile, interactive and intelligent. Such documents consist of many files that are semantically related to each other. Semantic links are defined in XML files which are a part of a document. IODA documents with music...

  • Document transformations for data processing in information systems

    Publication

    - Year 2007

    Atrykuł przedstawia podejście do automatyzacji transformacjidokumentów użytkownika bazujące na technologii XML. W artykuleprzedstawiony został system Endoscopy Recommender System.ERS wykorzystuje dedykowane transformacje XML Schema do Java, Java dodokumentów XML. Dzięki tym transformacjom procesy pobierania iprzechowywania danych zostały w pełni zautomatyzowane.Zaimplementowane podejście XML data binding umożliwia walidacjępodstawowych...

  • A document-centric processing paradigm for collaborative computing

    Publication

    - Year 2012

    Klasyczne modele przetwarzania rozproszonego zakładają, że dokumenty są biernymi obiektami, które rozsyła się w formie komunikatów lub pobiera z serwerów do przetwarzania jako pliki.W artykule przedstawiono koncepcję dokumentu jako aktywnego obiektu, zdolnego do samodzielnej migracji miedzy węzłami sieci i interakcji z użytkownikami w ich lokalnym środowisku. Takie podejście jest szczególnie przydatne do realizacji procesów biznesowych...