Search results for: WIKIPEDIA, WIKIFICATION, NAME ENTITY RECOGNITION, DISAMBIGUATION, CONCEPTS IDENTIFICATION - Bridge of Knowledge

Search

Search results for: WIKIPEDIA, WIKIFICATION, NAME ENTITY RECOGNITION, DISAMBIGUATION, CONCEPTS IDENTIFICATION

Search results for: WIKIPEDIA, WIKIFICATION, NAME ENTITY RECOGNITION, DISAMBIGUATION, CONCEPTS IDENTIFICATION

  • Elgold: gold standard, multi-genre dataset for named entity recognition and linking

    Open Research Data

    The dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Review on Wikification methods

    Publication

    - AI COMMUNICATIONS - Year 2019

    The paper reviews methods on automatic annotation of texts with Wikipedia entries. The process, called Wikification aims at building references between concepts identified in the text and Wikipedia articles. Wikification finds many applications, especially in text representation, where it enables one to capture the semantic similarity of the documents. Also, it can be considered as automatic tagging of the text. We describe typical...

    Full text to download in external service

  • Stable nanoconjugates of transferrin with alloyed quaternary nanocrystals Ag–In–Zn–S as a biological entity for tumor recognition

    Publication

    - NANOSCALE - Year 2018

    One way to limit the negative effects of anti-tumor drugs on healthy cells is targeted therapy employing functionalized drug carriers. Here we present a biocompatible and stable nanoconjugate of transferrin anchored to Ag-In-Zn-S quantum dots modified with 11-mercaptoundecanoic acid (Tf-QD) as a drug carrier versus typical anticancer drug, doxorubicin. Detailed investigations of Tf-QD nanoconjugates without and with doxorubicin...

    Full text available to download

  • Artur Gańcza dr inż.

    I received the M.Sc. degree from the Gdańsk University of Technology (GUT), Gdańsk, Poland, in 2019. I am currently a Ph.D. student at GUT, with the Department of Automatic Control, Faculty of Electronics, Telecommunications and Informatics. My professional interests include speech recognition, system identification, adaptive signal processing and linear algebra.

  • Automatic Watercraft Recognition and Identification on Water Areas Covered by Video Monitoring as Extension for Sea and River Traffic Supervision Systems

    Publication

    - Polish Maritime Research - Year 2018

    The article presents the watercraft recognition and identification system as an extension for the presently used visual water area monitoring systems, such as VTS (Vessel Traffic Service) or RIS (River Information Service). The watercraft identification systems (AIS - Automatic Identification Systems) which are presently used in both sea and inland navigation require purchase and installation of relatively expensive transceivers...

    Full text to download in external service

  • Gait Recognition: A Challenging Task for MEMS Signal Identification

    Publication
    • S. Głowiński
    • A. Błażejewski
    • T. Królikowski
    • R. Knitter
    • S. Glowinski

    - Year 2019

    Full text to download in external service

  • Elgold partial: News

    Open Research Data

    The dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...

  • Elgold intermediate: annotated raw

    Open Research Data

    The dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.

  • Elgold partial: Automotive blogs

    Open Research Data

    The dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...

  • Elgold partial: Movie reviews

    Open Research Data

    The dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold partial: Job offers

    Open Research Data

    The dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...

  • Elgold partial: Scientific papers' abstracts

    Open Research Data

    The dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.

  • Elgold partial: Amazon product reviews

    Open Research Data

    The dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold partial: History blogs

    Open Research Data

    The dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold intermediate: verified by verification team

    Open Research Data

    The dataset contains the texts from Elgold intermediate: annotated raw additionaly verified by the five-person verification team.  arly 25% of the mentions were corrected in some aspect.

  • Chromatographic and Spectroscopic Identification and Recognition of Natural Dyes, Uncommon Dyestuff Components, and Mordants: Case Study of a 16th Century Carpet with Chintamani Motifs

    Publication

    A multi-tool analytical practice was used for the characterisation of a 16th century carpet manufactured in Cairo. A mild extraction method with hydrofluoric acid has been evaluated in order to isolate intact flavonoids and their glycosides, anthraquinones, tannins, and indigoids from fibre samples. High-performance liquid chromatography coupled to spectroscopic and mass spectrometric detectors was used for the identification of...

    Full text available to download

  • Fast Approximate String Search for Wikification

    Publication

    The paper presents a novel method for fast approximate string search based on neural distance metrics embeddings. Our research is focused primarily on applying the proposed method for entity retrieval in the Wikification process, which is similar to edit distance-based similarity search on the typical dictionary. The proposed method has been compared with symmetric delete spelling correction algorithm and proven to be more efficient...

    Full text available to download

  • Towards Facts Extraction From Texts in Polish Language

    The Polish language differs from English in many ways. It has more complicated conjugation and declination. Because of that automatic facts extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed facts schemes that are searched for in the analyzed text....

    Full text available to download

  • Wikipedia and WordNet integration based on words co-occurrences

    Publication

    - Year 2009

    The article presents a method for automatic integration of two lexical resources: semantic dictionary WordNet and electronic encyclopaedia Wikipedia. Our goal is to add automatically an semantic tags - a WordNet synset identifier to the title of the Wikipedia article. We've analyze several different ap-proaches to these problem and implement our own solution, based on word occurrences in synsets descriptions and the article body....

  • OntoValidate: OntoNotes 5.0 NER validation dataset

    Open Research Data
    open access

    OntoValidate dataset consists of 603 randomly chosen raw textsfrom the original OntoNote 5.0 dataset (3637 raw texts in total).

  • Dynamic Semantic Visual Information Management

    Publication

    - Year 2010

    Dominant Internet search engines use keywords and therefore are not suited for exploration of new domains of knowledge, when the user does not know specific vocabulary. Browsing through articles in a large encyclopedia, each presenting a small fragment of knowledge, it is hard to map the whole domain, see relevant concepts and their relations. In Wikipedia for example some highly relevant articles are not linked with each other....

    Full text to download in external service

  • Semantic Integration of Heterogeneous Recognition Systems

    Publication

    - LECTURE NOTES IN COMPUTER SCIENCE - Year 2011

    Computer perception of real-life situations is performed using a variety of recognition techniques, including video-based computer vision, biometric systems, RFID devices and others. The proliferation of recognition modules enables development of complex systems by integration of existing components, analogously to the Service Oriented Architecture technology. In the paper, we propose a method that enables integration of information...

  • Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives

    Publication
    • T. Souza
    • E. Demidova
    • T. Risse
    • H. Holzmann
    • G. Gossen
    • J. Szymański

    - Year 2015

    Long-term Web archives comprise Web documents gathered over longer time periods and can easily reach hundreds of terabytes in size. Semantic annotations such as named entities can facilitate intelligent access to the Web archive data. However, the annotation of the entire archive content on this scale is often infeasible. The most efficient way to access the documents within Web archives is provided through their URLs, which are...

    Full text to download in external service

  • Szymon Olewniczak mgr inż.

    People

    I've been a part of the Gdansk University of Technology since 2013, when I started my bachelor's degree in computer science at the Faculty of Electronics, Telecommunications and Informatics. After receiving my master's degree in 2019, I've been working as an assistant at the Department of Computer Architecture. Since 2024, I am also the deputy head of my department. My research interests revolve around various NLP related topics,...

  • Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience

    Publication

    - IEEE Access - Year 2019

    Significant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...

    Full text available to download

  • A video monitoring system using ontology-driven identification of threats

    Publication

    In this paper, we present a video monitoring systemthat leverages image recognition and ontological reasoningabout threats. In the solution, an image processing subsystemuses video recording of a monitored area and recognizesknown concepts in scenes. Then, a reasoning subsystem uses anontological description of security conditions and informationfrom image recognition to check if a violation of a conditionhas occurred. If a threat...

    Full text to download in external service

  • Development and Research of the Text Messages Semantic Clustering Methodology

    Publication

    - Year 2016

    The methodology of semantic clustering analysis of customer’s text-opinions collection is developed. The author's version of the mathematical models of formalization and practical realization of short textual messages semantic clustering procedure is proposed, based on the customer’s text-opinions collection Latent Semantic Analysis knowledge extracting method. An algorithm for semantic clustering of the text-opinions is developed,...

    Full text available to download

  • Geometric Algebra Model of Distributed Representations

    Publication

    - Year 2010

    Formalism based on GA is an alternative to distributed representation models developed so far-Smolensky's tensor product, Holographic Reduced Representations (HRR) and Binary Spatter Code (BSC). Convolutions are replaced by geometric products, interpretable in terms of geometry which seems to be the most natural language for visualization of higher concepts. This paper recalls the main ideas behind the GA model and investigates...

  • Tomasz Korol dr hab. inż.

    Education Gdańsk University of Technology, Faculty of Management and Economics (2001) University of Applied Sciences Stralsund (1999) Degree / scientific title Habilitation – Gdańsk University of Technology, Faculty of Management and Economics (2015) Ph.D. – Gdańsk University of Technology, Faculty of Management and Economics (2004) Employment Gdańsk University of Technology - associate professor (since 2017); assistant professor...

  • Mining inconsistent emotion recognition results with the multidimensional model

    Publication

    - IEEE Access - Year 2021

    The paper deals with the challenge of inconsistency in multichannel emotion recognition. The focus of the paper is to explore factors that might influence the inconsistency. The paper reports an experiment that used multi-camera facial expression analysis with multiple recognition systems. The data were analyzed using a multidimensional approach and data mining techniques. The study allowed us to explore camera location, occlusions...

    Full text available to download

  • Interactions with recognized patients using smart glasses

    Publication

    - Year 2015

    Recently, different smart glasses solutions have been proposed on the market. The rapid development of this wearable technology has led to several research projects related to applications of smart glasses in healthcare. In this paper we propose a general architecture of the system enabling data integration for the recognized person. In the proposed system smart glasses integrates data obtained for the recognized patient from health...

    Full text to download in external service

  • Information retrieval with semantic memory model

    Publication

    Psycholinguistic theories of semantic memory form the basis of understanding of natural language concepts. These theories are used here as an inspiration for implementing a computational model of semantic memory in the form of semantic network. Combining this network with a vector-based object-relation-feature value representation of concepts that includes also weights for confidence and support, allows for recognition of concepts...

    Full text to download in external service

  • Sensors integration in the smart home environment - a proposal to solve the problem with user identification

    In this preliminary study we, investigate the possibility of user recognition techniques suitable on smart home devices like chairs, beds, aiming for low–power, high accuracy and quick response time. We propose the two well know technique: voice speaker recognition and accelerometer signal from device mounted on the chair, and the third one optical system basing on IR LED transmitter/receiver circuit. The preliminary results proved...

    Full text to download in external service

  • Numerical simulations of novel GFRP sandwich footbridge

    In the following paper some aspects of numerical analysis and designing stages of a footbridge made of composite materials are elucidated. Because of the used materials the design process in this case is not ordinary and requires the development of concepts, material selections, identification of material properties, numerical simulations, strength calculations, serviceability and durability analyses. This contribution presents...

  • Odpowiedzialność kierownicza w biznesie – zagadnienia procesowe i rodzajowe

    Celem podjętych prac było przeprowadzenie rozważań nad zagadnieniami powstawania, ponoszenia, podejmowania oraz pociągania do odpowiedzialności kierowniczej. Kontynuacja badań nad odpowiedzialnością kierowniczą została oparta na sformułowanej wcześniej definicji. Umożliwiło to procesowe ujęcie przedmiotu eksploracji. W procesie tym wyróżniono fazy: przywołania, przyjęcia oraz poniesienia odpowiedzialności kierowniczej. Przeprowadzone...

    Full text available to download

  • The Innovative Faculty for Innovative Technologies

    A leaflet describing Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology. Multimedia Systems Department described laboratories and prototypes of: Auditory-visual attention stimulator, Automatic video event detection, Object re-identification application for multi-camera surveillance systems, Object Tracking and Automatic Master-Slave PTZ Camera Positioning System, Passive Acoustic Radar,...

    Full text to download in external service

  • Human carnosinases: A brief history, medicinal relevance, and in silico analyses

    Publication

    - DRUG DISCOVERY TODAY - Year 2024

    Carnosine, an endogenous dipeptide, has been found to have a plethora of medicinal properties, such as antioxidant, antiageing, and chelating effects, but with one downside: a short half-life. Carnosinases and two hydrolytic enzymes, which remain enigmatic, are responsible for these features. Hence, here we emphasize why research is valuable for better understanding crucial concepts like ageing, neurodegradation, and cancerogenesis,...

    Full text available to download

  • Wykorzystanie sztucznych sieci neuronowych do wykrywania i rozpoznawania tablic rejestracyjnych na zdjęciach pojazdów

    W artykule przedstawiono koncepcję algorytmu wykrywania i rozpoznawania tablic rejestracyjnych (AWiRTR) na obrazach cyfrowych pojazdów. Detekcja i lokalizacja tablic rejestracyjnych oraz wyodrębnienie z obrazu tablicy rejestracyjnej poszczególnych znaków odbywa się z wykorzystaniem podstawowych technik przetwarzania obrazu (przekształcenia morfologiczne, wykrywanie krawędzi) jak i podstawowych danych statystycznych obiektów wykrytych...

    Full text available to download

  • Scoreboard Architectural Pattern and Integration of Emotion Recognition Results

    Publication

    This paper proposes a new design pattern, named Scoreboard , dedicated for applications solving complex, multi-stage, non-deterministic problems. The pattern provides a computational framework for the design and implementation of systems that integrate a large number of diverse specialized modules that may vary in accuracy, solution level, and modality. The Scoreboard is an extension of Blackboard design pattern and comes under...

    Full text available to download

  • Features extraction from the electrocatalytic gas sensor responses

    One of the types of gas sensors used for detection and identification of toxic-air pollutant is an electrocatalytic gas sensor. The electrocatalytic sensors are working in cyclic voltammetry mode, enable detection of various gases. Their response are in the form of I-V curves which contain information about the type and the concentration of measured volatile compound. However,...

    Full text to download in external service

  • Voice command recognition using hybrid genetic algorithm

    Publication

    Abstract: Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...

    Full text available to download

  • Quantum structure in competing lizard communities

    Publication

    - ECOLOGICAL MODELLING - Year 2014

    Almost two decades of research on applications of the mathematical formalism of quantum theory as a modeling tool in domains different from the micro-world has given rise to many successful applications in situations related to human behavior and thought, more specifically in cognitive processes of decision-making and the ways concepts are combined into sentences. In this article, we extend this approach to animal behavior, showing...

    Full text available to download

  • Potential and Use of the Googlenet Ann for the Purposes of Inland Water Ships Classification

    Publication

    - Polish Maritime Research - Year 2020

    This article presents an analysis of the possibilities of using the pre-degraded GoogLeNet artificial neural network to classify inland vessels. Inland water authorities monitor the intensity of the vessels via CCTV. Such classification seems to be an improvement in their statutory tasks. The automatic classification of the inland vessels from video recording is a one of the main objectives of the Automatic Ship Recognition and...

    Full text available to download

  • Discovering Rule-Based Learning Systems for the Purpose of Music Analysis

    Publication

    Music analysis and processing aims at understanding information retrieved from music (Music Information Retrieval). For the purpose of music data mining, machine learning (ML) methods or statistical approach are employed. Their primary task is recognition of musical instrument sounds, music genre or emotion contained in music, identification of audio, assessment of audio content, etc. In terms of computational approach, music databases...

    Full text available to download

  • Follow the Light. Where to search for useful research information

    Architectural Lighting Design (ALD) has never been a standalone professional discipline. Rather, it has existed as the combination of art and the science of light. Today, third generation lighting professionals are already creatively intertwining these fields, and the acceleration in scientific, technological and societal studies has only increased the need for reliable multidisciplinary information. Therefore, a thorough re-examination...

    Full text available to download

  • Local Texture Pattern Selection for Efficient Face Recognition and Tracking

    This paper describes the research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...

    Full text to download in external service

  • A new look at the statistical identification of nonstationary systems

    Publication

    The paper presents a new, two-stage approach to identification of linear time-varying stochastic systems, based on the concepts of preestimation and postfiltering. The proposed preestimated parameter trajectories are unbiased but have large variability. Hence, to obtain reliable estimates of system parameters, the preestimated trajectories must be further filtered (postfiltered). It is shown how one can design and optimize such...

    Full text available to download

  • Joanna Czerska dr inż.

    I am a woman whose mission and passion is the development of people and organizations. My motto is: "There is no such fantasy that human will and reason cannot transform into reality." William Shakespeare In my life I am guided by the values ​​of respect, teamwork and a positive attitude. They define me and decide what kind of person I am. My adventure with Lean began when I was writing my diploma thesis during my studies at WZiE...

  • Quality of graphical markers for the needs of eyewear devices

    Publication

    - Year 2015

    in this paper we propose to cast the problem of identification of people, objects or places into an application for smart glasses that decodes information from graphical markers. We focus on analyzing different factors that can have influence on the processes of the automatic recognition of information from a code. The research we present aims at reviewing recognition performances in function of: size of a marker, distance from/to...

    Full text to download in external service

  • Biometryczna kontrola dostępu

    Opisano szczegółowo algorytm detekcji oraz identyfikacji człowieka na podstawie punktów nodalnych twarzy. Zdefiniowano pojęcia: biometria, proces pomiaru biometrycznego, metody biometrycznej identyfikacji oraz kontrola dostępu. Przedstawiono opis opracowanego systemu biometrycznej identyfikacji wykorzystującego sztuczne sieci neuronowe. Podano wyniki badań oraz przeprowadzono ich wnikliwą dyskusję.Biometrics is the study of automated...

    Full text available to download