Search results for: WIKIPEDIA, WIKIFICATION, NAME ENTITY RECOGNITION, DISAMBIGUATION, CONCEPTS IDENTIFICATION
-
Elgold: gold standard, multi-genre dataset for named entity recognition and linking
Open Research Data: The dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Review on Wikification methods
Publication: The paper reviews methods for automatic annotation of texts with Wikipedia entries. The process, called Wikification, aims at building references between concepts identified in the text and Wikipedia articles. Wikification finds many applications, especially in text representation, where it enables one to capture the semantic similarity of documents. It can also be considered automatic tagging of the text. We describe typical...
-
Stable nanoconjugates of transferrin with alloyed quaternary nanocrystals Ag–In–Zn–S as a biological entity for tumor recognition
Publication: One way to limit the negative effects of anti-tumor drugs on healthy cells is targeted therapy employing functionalized drug carriers. Here we present a biocompatible and stable nanoconjugate of transferrin anchored to Ag-In-Zn-S quantum dots modified with 11-mercaptoundecanoic acid (Tf-QD) as a drug carrier versus typical anticancer drug, doxorubicin. Detailed investigations of Tf-QD nanoconjugates without and with doxorubicin...
-
Artur Gańcza dr inż.
People: I received the M.Sc. degree from the Gdańsk University of Technology (GUT), Gdańsk, Poland, in 2019. I am currently a Ph.D. student at GUT, with the Department of Automatic Control, Faculty of Electronics, Telecommunications and Informatics. My professional interests include speech recognition, system identification, adaptive signal processing and linear algebra.
-
Automatic Watercraft Recognition and Identification on Water Areas Covered by Video Monitoring as Extension for Sea and River Traffic Supervision Systems
Publication: The article presents the watercraft recognition and identification system as an extension for the presently used visual water area monitoring systems, such as VTS (Vessel Traffic Service) or RIS (River Information Service). The watercraft identification systems (AIS - Automatic Identification Systems) which are presently used in both sea and inland navigation require purchase and installation of relatively expensive transceivers...
-
Gait Recognition: A Challenging Task for MEMS Signal Identification
Publication
-
Elgold partial: News
Open Research Data: The dataset contains 37 English texts scraped from news websites. In each text, the named entities are marked. Each named entity is linked to the corresponding Wikipedia article if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...
-
Elgold intermediate: annotated raw
Open Research Data: The dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.
-
Elgold partial: Scientific papers' abstracts
Open Research Data: The dataset contains 87 abstracts of scientific papers in English, randomly chosen from the following scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
-
Elgold partial: Amazon product reviews
Open Research Data: The dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each named entity is linked to the corresponding Wikipedia article if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Automotive blogs
Open Research Data: The dataset contains 34 English texts scraped from automotive blogs. In each text, the named entities are marked. Each named entity is linked to the corresponding Wikipedia article if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
-
Elgold partial: Movie reviews
Open Research Data: The dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each named entity is linked to the corresponding Wikipedia article if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Job offers
Open Research Data: The dataset contains 34 English texts scraped from web portals with job offers. In each text, the named entities are marked. Each named entity is linked to the corresponding Wikipedia article if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
-
Elgold partial: History blogs
Open Research Data: The dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each named entity is linked to the corresponding Wikipedia article if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold intermediate: verified by the authors
Open Research Data: The dataset contains the texts from Elgold intermediate: verified by verification team, additionally verified by the dataset authors but before the final validation step with the elgold toolset.
-
Elgold intermediate: verified by verification team
Open Research Data: The dataset contains the texts from Elgold intermediate: annotated raw, additionally verified by the five-person verification team. Nearly 25% of the mentions were corrected in some aspect.
-
Chromatographic and Spectroscopic Identification and Recognition of Natural Dyes, Uncommon Dyestuff Components, and Mordants: Case Study of a 16th Century Carpet with Chintamani Motifs
Publication: A multi-tool analytical practice was used for the characterisation of a 16th century carpet manufactured in Cairo. A mild extraction method with hydrofluoric acid has been evaluated in order to isolate intact flavonoids and their glycosides, anthraquinones, tannins, and indigoids from fibre samples. High-performance liquid chromatography coupled to spectroscopic and mass spectrometric detectors was used for the identification of...
-
Fast Approximate String Search for Wikification
Publication: The paper presents a novel method for fast approximate string search based on neural distance metrics embeddings. Our research is focused primarily on applying the proposed method for entity retrieval in the Wikification process, which is similar to edit distance-based similarity search on the typical dictionary. The proposed method has been compared with the symmetric delete spelling correction algorithm and proven to be more efficient...
-
Towards Facts Extraction From Texts in Polish Language
Publication: The Polish language differs from English in many ways. It has more complicated conjugation and declension. Because of that, automatic fact extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed fact schemes that are searched for in the analyzed text....
-
Wikipedia and WordNet integration based on words co-occurrences
Publication: The article presents a method for automatic integration of two lexical resources: the semantic dictionary WordNet and the electronic encyclopaedia Wikipedia. Our goal is to automatically add a semantic tag, a WordNet synset identifier, to the title of the Wikipedia article. We analyzed several different approaches to this problem and implemented our own solution, based on word occurrences in synset descriptions and the article body....
-
OntoValidate: OntoNotes 5.0 NER validation dataset
Open Research Data: The OntoValidate dataset consists of 603 randomly chosen raw texts from the original OntoNotes 5.0 dataset (3637 raw texts in total).
-
Dynamic Semantic Visual Information Management
Publication: Dominant Internet search engines use keywords and therefore are not suited for exploration of new domains of knowledge, when the user does not know specific vocabulary. Browsing through articles in a large encyclopedia, each presenting a small fragment of knowledge, it is hard to map the whole domain, see relevant concepts and their relations. In Wikipedia for example some highly relevant articles are not linked with each other....
-
Semantic Integration of Heterogeneous Recognition Systems
Publication: Computer perception of real-life situations is performed using a variety of recognition techniques, including video-based computer vision, biometric systems, RFID devices and others. The proliferation of recognition modules enables development of complex systems by integration of existing components, analogously to the Service Oriented Architecture technology. In the paper, we propose a method that enables integration of information...
-
Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives
Publication: Long-term Web archives comprise Web documents gathered over longer time periods and can easily reach hundreds of terabytes in size. Semantic annotations such as named entities can facilitate intelligent access to the Web archive data. However, the annotation of the entire archive content on this scale is often infeasible. The most efficient way to access the documents within Web archives is provided through their URLs, which are...
-
Szymon Olewniczak mgr inż.
People: I've been a part of the Gdansk University of Technology since 2013, when I started my bachelor's degree in computer science at the Faculty of Electronics, Telecommunications and Informatics. After receiving my master's degree in 2019, I've been working as an assistant at the Department of Computer Architecture. Since 2024, I have also been the deputy head of my department. My research interests revolve around various NLP-related topics,...
-
Enabling Deeper Linguistic-based Text Analytics – Construct Development for the Criticality of Negative Service Experience
Publication: Significant progress has been made in linguistic-based text analytics particularly with the increasing availability of data and deep learning computational models for more accurate opinion analysis and domain-specific entity recognition. In understanding customer service experience from texts, analysis of sentiments associated with different stages of the service lifecycle is a useful starting point. However, when richer insights...
-
A video monitoring system using ontology-driven identification of threats
Publication: In this paper, we present a video monitoring system that leverages image recognition and ontological reasoning about threats. In the solution, an image processing subsystem uses video recording of a monitored area and recognizes known concepts in scenes. Then, a reasoning subsystem uses an ontological description of security conditions and information from image recognition to check if a violation of a condition has occurred. If a threat...
-
Development and Research of the Text Messages Semantic Clustering Methodology
Publication: The methodology of semantic clustering analysis of customer’s text-opinions collection is developed. The author's version of the mathematical models of formalization and practical realization of short textual messages semantic clustering procedure is proposed, based on the customer’s text-opinions collection Latent Semantic Analysis knowledge extracting method. An algorithm for semantic clustering of the text-opinions is developed,...
-
Geometric Algebra Model of Distributed Representations
Publication: Formalism based on GA is an alternative to the distributed representation models developed so far: Smolensky's tensor product, Holographic Reduced Representations (HRR) and Binary Spatter Code (BSC). Convolutions are replaced by geometric products, interpretable in terms of geometry which seems to be the most natural language for visualization of higher concepts. This paper recalls the main ideas behind the GA model and investigates...
-
Tomasz Korol dr hab. inż.
People: Education: Gdańsk University of Technology, Faculty of Management and Economics (2001); University of Applied Sciences Stralsund (1999). Degree / scientific title: Habilitation – Gdańsk University of Technology, Faculty of Management and Economics (2015); Ph.D. – Gdańsk University of Technology, Faculty of Management and Economics (2004). Employment: Gdańsk University of Technology - associate professor (since 2017); assistant professor...
-
Mining inconsistent emotion recognition results with the multidimensional model
Publication: The paper deals with the challenge of inconsistency in multichannel emotion recognition. The focus of the paper is to explore factors that might influence the inconsistency. The paper reports an experiment that used multi-camera facial expression analysis with multiple recognition systems. The data were analyzed using a multidimensional approach and data mining techniques. The study allowed us to explore camera location, occlusions...
-
Interactions with recognized patients using smart glasses
Publication: Recently, different smart glasses solutions have been proposed on the market. The rapid development of this wearable technology has led to several research projects related to applications of smart glasses in healthcare. In this paper we propose a general architecture of the system enabling data integration for the recognized person. In the proposed system smart glasses integrate data obtained for the recognized patient from health...
-
Information retrieval with semantic memory model
Publication: Psycholinguistic theories of semantic memory form the basis of understanding of natural language concepts. These theories are used here as an inspiration for implementing a computational model of semantic memory in the form of a semantic network. Combining this network with a vector-based object-relation-feature value representation of concepts that also includes weights for confidence and support allows for recognition of concepts...
-
Sensors integration in the smart home environment - a proposal to solve the problem with user identification
Publication: In this preliminary study, we investigate user recognition techniques suitable for smart home devices such as chairs and beds, aiming for low power, high accuracy and quick response time. We propose two well-known techniques, voice speaker recognition and an accelerometer signal from a device mounted on the chair, and a third one, an optical system based on an IR LED transmitter/receiver circuit. The preliminary results proved...
-
Numerical simulations of novel GFRP sandwich footbridge
Publication: In the following paper some aspects of numerical analysis and designing stages of a footbridge made of composite materials are elucidated. Because of the used materials the design process in this case is not ordinary and requires the development of concepts, material selections, identification of material properties, numerical simulations, strength calculations, serviceability and durability analyses. This contribution presents...
-
Odpowiedzialność kierownicza w biznesie – zagadnienia procesowe i rodzajowe
Publication: The aim of the work was to consider how managerial responsibility arises, is borne, is assumed, and is enforced. The continued research on managerial responsibility was based on a previously formulated definition. This made it possible to treat the subject of study as a process. Within this process, the following phases were distinguished: invoking, accepting, and bearing managerial responsibility. The conducted...
-
The Innovative Faculty for Innovative Technologies
Publication: A leaflet describing Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology. Multimedia Systems Department described laboratories and prototypes of: Auditory-visual attention stimulator, Automatic video event detection, Object re-identification application for multi-camera surveillance systems, Object Tracking and Automatic Master-Slave PTZ Camera Positioning System, Passive Acoustic Radar,...
-
Human carnosinases: A brief history, medicinal relevance, and in silico analyses
Publication: Carnosine, an endogenous dipeptide, has been found to have a plethora of medicinal properties, such as antioxidant, antiageing, and chelating effects, but with one downside: a short half-life. Carnosinases and two hydrolytic enzymes, which remain enigmatic, are responsible for these features. Hence, here we emphasize why research is valuable for better understanding crucial concepts like ageing, neurodegradation, and cancerogenesis,...
-
Wykorzystanie sztucznych sieci neuronowych do wykrywania i rozpoznawania tablic rejestracyjnych na zdjęciach pojazdów
Publication: The article presents the concept of an algorithm for detecting and recognizing license plates (AWiRTR) in digital images of vehicles. Detection and localization of license plates, and extraction of individual characters from the plate image, are performed using basic image processing techniques (morphological transformations, edge detection) as well as basic statistical data of the objects detected...
-
Scoreboard Architectural Pattern and Integration of Emotion Recognition Results
Publication: This paper proposes a new design pattern, named Scoreboard, dedicated for applications solving complex, multi-stage, non-deterministic problems. The pattern provides a computational framework for the design and implementation of systems that integrate a large number of diverse specialized modules that may vary in accuracy, solution level, and modality. The Scoreboard is an extension of the Blackboard design pattern and comes under...
-
Features extraction from the electrocatalytic gas sensor responses
Publication: One of the types of gas sensors used for detection and identification of toxic air pollutants is an electrocatalytic gas sensor. Electrocatalytic sensors work in cyclic voltammetry mode, which enables detection of various gases. Their responses are in the form of I-V curves which contain information about the type and the concentration of the measured volatile compound. However,...
-
Voice command recognition using hybrid genetic algorithm
Publication: Speech recognition is a process of converting the acoustic signal into a set of words, whereas voice command recognition consists in the correct identification of voice commands, usually single words. Voice command recognition systems are widely used in the military, control systems, electronic devices, such as cellular phones, or by people with disabilities (e.g., for controlling a wheelchair or operating a computer...
-
Quantum structure in competing lizard communities
Publication: Almost two decades of research on applications of the mathematical formalism of quantum theory as a modeling tool in domains different from the micro-world has given rise to many successful applications in situations related to human behavior and thought, more specifically in cognitive processes of decision-making and the ways concepts are combined into sentences. In this article, we extend this approach to animal behavior, showing...
-
Follow the Light. Where to search for useful research information
Publication: Architectural Lighting Design (ALD) has never been a standalone professional discipline. Rather, it has existed as the combination of art and the science of light. Today, third generation lighting professionals are already creatively intertwining these fields, and the acceleration in scientific, technological and societal studies has only increased the need for reliable multidisciplinary information. Therefore, a thorough re-examination...
-
Potential and Use of the Googlenet Ann for the Purposes of Inland Water Ships Classification
Publication: This article presents an analysis of the possibilities of using the pre-degraded GoogLeNet artificial neural network to classify inland vessels. Inland water authorities monitor the intensity of the vessels via CCTV. Such classification seems to be an improvement in their statutory tasks. The automatic classification of the inland vessels from video recording is one of the main objectives of the Automatic Ship Recognition and...
-
Discovering Rule-Based Learning Systems for the Purpose of Music Analysis
Publication: Music analysis and processing aims at understanding information retrieved from music (Music Information Retrieval). For the purpose of music data mining, machine learning (ML) methods or statistical approaches are employed. Their primary task is recognition of musical instrument sounds, music genre or emotion contained in music, identification of audio, assessment of audio content, etc. In terms of computational approach, music databases...
-
Local Texture Pattern Selection for Efficient Face Recognition and Tracking
Publication: This paper describes the research aimed at finding the optimal configuration of the face recognition algorithm based on local texture descriptors (binary and ternary patterns). Since the identification module was supposed to be a part of the face tracking system developed for an interactive wearable computer, proper feature selection, allowing for real-time operation, became particularly important. Our experiments showed that it is...
-
Joanna Czerska dr inż.
People: I am a woman whose mission and passion is the development of people and organizations. My motto is: "There is no such fantasy that human will and reason cannot transform into reality." (William Shakespeare). In my life I am guided by the values of respect, teamwork and a positive attitude. They define me and decide what kind of person I am. My adventure with Lean began when I was writing my diploma thesis during my studies at WZiE...
-
A new look at the statistical identification of nonstationary systems
Publication: The paper presents a new, two-stage approach to identification of linear time-varying stochastic systems, based on the concepts of preestimation and postfiltering. The proposed preestimated parameter trajectories are unbiased but have large variability. Hence, to obtain reliable estimates of system parameters, the preestimated trajectories must be further filtered (postfiltered). It is shown how one can design and optimize such...
-
Quality of graphical markers for the needs of eyewear devices
Publication: In this paper we propose to cast the problem of identification of people, objects or places into an application for smart glasses that decodes information from graphical markers. We focus on analyzing different factors that can influence the automatic recognition of information from a code. The research we present aims at reviewing recognition performance as a function of: size of a marker, distance from/to...