Wyniki wyszukiwania dla: named entity disambiguation - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: named entity disambiguation

Wyniki wyszukiwania dla: named entity disambiguation

  • Elgold: gold standard, multi-genre dataset for named entity recognition and linking

    Dane Badawcze
    wersja 1.0 open access

    The dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold intermediate: annotated raw

    Dane Badawcze

    The dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.

  • Elgold partial: News

    Dane Badawcze

    The dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...

  • Elgold intermediate: verified by the authors

    Dane Badawcze

    The dataset contains the texts from Elgold intermediate: verified by verification team additionaly verified by the dataset authors but before the final validation step with the elgold toolset.

  • Elgold intermediate: verified by verification team

    Dane Badawcze

    The dataset contains the texts from Elgold intermediate: annotated raw additionaly verified by the five-person verification team.  arly 25% of the mentions were corrected in some aspect.

  • Elgold partial: Scientific papers' abstracts

    Dane Badawcze

    The dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.

  • Elgold partial: Amazon product reviews

    Dane Badawcze

    The dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold partial: Automotive blogs

    Dane Badawcze

    The dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...

  • Elgold partial: Movie reviews

    Dane Badawcze

    The dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold partial: Job offers

    Dane Badawcze

    The dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...

  • Elgold partial: History blogs

    Dane Badawcze

    The dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives

    Publikacja
    • T. Souza
    • E. Demidova
    • T. Risse
    • H. Holzmann
    • G. Gossen
    • J. Szymański

    - Rok 2015

    Long-term Web archives comprise Web documents gathered over longer time periods and can easily reach hundreds of terabytes in size. Semantic annotations such as named entities can facilitate intelligent access to the Web archive data. However, the annotation of the entire archive content on this scale is often infeasible. The most efficient way to access the documents within Web archives is provided through their URLs, which are...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • OntoValidate: OntoNotes 5.0 NER validation dataset

    Dane Badawcze
    wersja 1.2 open access

    OntoValidate dataset consists of 603 randomly chosen raw textsfrom the original OntoNote 5.0 dataset (3637 raw texts in total).

  • Szymon Olewniczak mgr inż.

    Osoby

    Jestem związany z Politechniką Gdańską od 2013 roku, kiedy to rozpocząłem studia inżynierskie na kierunku informatyka na Wydziale Elektroniki, Telekomunikacji i Informatyki. Po uzyskaniu tytułu magistra w 2019 roku podjąłem pracę jako asystent w Katedrze Architektury Systemów Komputerowych. Od 2024 roku pełnię również funkcję zastępcy kierownika katedry. Moje zainteresowania badawcze koncentrują się wokół tematów związanych z przetwarzaniem...

  • Towards Facts Extraction From Texts in Polish Language

    The Polish language differs from English in many ways. It has more complicated conjugation and declination. Because of that automatic facts extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed facts schemes that are searched for in the analyzed text....

    Pełny tekst do pobrania w portalu

  • DBpedia and YAGO Based System for Answering Questions in Natural Language

    In this paper we propose a method for answering class 1 and class 2 questions (out of 5 classes defined by Moldovan for TREC conference) based on DBpedia and YAGO. Our method is based on generating dependency trees for the query. In the dependency tree we look for paths leading from the root to the named entity of interest. These paths (referenced further as fibers) are candidates for representation of actual user intention. The...

    Pełny tekst do pobrania w portalu

  • Named Property Graphs

    Publikacja

    - Rok 2018

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Revitalized Mill Island in Bydgoszcz - the identity of the place created by the Brda River and its tributary named Młynowka

    Publikacja

    - Rok 2011

    Mill Island in Bydgoszcz, Poland is an example of downtown public space where a meander of the Młynówka creates the identity of the area.Before 2005, the only outstanding local feature was the fact that its south and west ends resembled the Venetian canals. The way the other parts of Mill Island were managed was inappropriate for a downtown.A comprehensive revitalization programme is returning Mill Island public space to residents,...

  • Euroregion as an Entity Stimulating the Sustainable Development of the Cross-Border Market for Cultural Services in a City Divided by a Border

    Publikacja

    - Rok 2019

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Euroregion as an Entity Stimulating the Sustainable Development of the Cross-Border Market for Cultural Services in a City Divided by a Border

    Publikacja

    - Sustainability - Rok 2019

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Stable nanoconjugates of transferrin with alloyed quaternary nanocrystals Ag–In–Zn–S as a biological entity for tumor recognition

    Publikacja

    - NANOSCALE - Rok 2018

    One way to limit the negative effects of anti-tumor drugs on healthy cells is targeted therapy employing functionalized drug carriers. Here we present a biocompatible and stable nanoconjugate of transferrin anchored to Ag-In-Zn-S quantum dots modified with 11-mercaptoundecanoic acid (Tf-QD) as a drug carrier versus typical anticancer drug, doxorubicin. Detailed investigations of Tf-QD nanoconjugates without and with doxorubicin...

    Pełny tekst do pobrania w portalu

  • Sylwester Kaczmarek dr hab. inż.

    Sylwester Kaczmarek ukończył studia w 1972 roku jako mgr inż. Elektroniki, a doktorat i habilitację uzyskał z technik komutacyjnych i inżynierii ruchu telekomunikacyjnego w 1981 i 1994 roku na Politechnice Gdańskiej. Jego zainteresowania badawcze ukierunkowane są na: sieci IP QoS, sieci GMPLS, sieci SDN, komutację, ruting QoS, inżynierię ruchu telekomunikacyjnego, usługi multimedialne i jakość usług. Aktualnie jego badania skupiają...

  • ITL International Journal of Applied Linguistics (formerly named ITL Review of Applied Linguistics)

    Czasopisma

    ISSN: 0019-0810

  • Annotating Words Using WordNet Semantic Glosses

    Publikacja

    - Rok 2012

    An approach to the word sense disambiguation (WSD) relaying onthe WordNet synsets is proposed. The method uses semantically tagged glosses to perform a process similar to the spreading activation in semantic network, creating ranking of the most probable meanings for word annotation. Preliminary evaluation shows quite promising results. Comparison with the state-of-theart WSD methods indicates that the use of WordNet relations...

  • Grzegorz Zieliński dr inż.

    Autor ponad 100 publikacji naukowych (zarówno w języku polskim, jak i angielskim) z zakresu zarządzania działalnością usługową, doskonalenia podmiotów, w tym podmiotów leczniczych. Zainteresowania naukowo-badawcze obejmują obszary związane z dojrzałością i doskonałością przedsiębiorstw w różnych aspektach ich działalności. Uczestniczył w projektach badawczych Narodowego Centrum Nauki oraz projektach realizowanych przez międzynarodowe...

  • Towards semantic-rich word embeddings

    Publikacja

    - Annals of Computer Science and Information Systems - Rok 2019

    In recent years, word embeddings have been shown to improve the performance in NLP tasks such as syntactic parsing or sentiment analysis. While useful, they are problematic in representing ambiguous words with multiple meanings, since they keep a single representation for each word in the vocabulary. Constructing separate embeddings for meanings of ambiguous words could be useful for solving the Word Sense Disambiguation (WSD)...

    Pełny tekst do pobrania w portalu

  • Implementation of Business Intelligence in an IT organization - the concept of an evaluation model

    Publikacja

    - Foundations of Management - Rok 2013

    This paper presents the issue of assessing the validity and effectiveness of implementing a Business Intelligence system in an IT Support Organization. This entity provides IT services to external clients involving, in particular, the storage and processing of large amounts of data. The vast amount of realized projects and also incidents reported in connection with those projects prevented effective decisions from being made without...

    Pełny tekst do pobrania w portalu

  • Karol Daliga dr inż.

    W 2005 roku ukończył klasę o profilu matematyczno - fizycznym i zdał maturę w I Liceum Ogólnokształcącym im. Władysława Gebika d. polskie gimnazjum w Kwidzynie.  W latach 2005-2010 odbył studia magisterskie na Wydziale Fizyki Technicznej i Matematyki Stosowanej Politechniki Gdanskiej, a w latach 2008 - 2012 odbył studia inżynierskie na Wydziale Inżynierii Lądowej i Środowiska Politechniki Gdańskiej.  31 marca 2021 r. obronił pracę...

  • Angelica Pegani mgr

    Osoby

    Absolwentka Wydziału Zarządzania i Ekonomii Politechniki Gdańskiej. Ukończyła menedżerskie studia podyplomowe oraz Program Przedsiębiorczości na Massachusetts Institute of Technology. Rozpoczęła studia doktoranckie i napisała rozprawę doktorską w dziedzinie nauk społecznych. Posiada liczne certyfikaty potwierdzające znajomość języka angielskiego, m. in z British Council oraz University of Cambridge. Posiada 12-letnie doświadczenie...

  • Wikipedia and WordNet integration based on words co-occurrences

    Publikacja

    - Rok 2009

    The article presents a method for automatic integration of two lexical resources: semantic dictionary WordNet and electronic encyclopaedia Wikipedia. Our goal is to add automatically an semantic tags - a WordNet synset identifier to the title of the Wikipedia article. We've analyze several different ap-proaches to these problem and implement our own solution, based on word occurrences in synsets descriptions and the article body....

  • MODEL FOR MEASUREMENT OF FLOW INSTALLATION TIME IN SDN SWITCH

    SDN is the approach in telecommunication networks that separates control plane from data forwarding plane by specifying a single network entity as a controller that defines rules (called flows) of traffic forwarding for the switches connected to it. The time that is required for installation of these rules might be a hindrance for the overall performance of SDN network. In the paper, a model for testing and evaluating the influence...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Linking music data in executable documents

    Publikacja

    - Rok 2022

    This paper presents the application of Interactive Open Document Architecture (IODA) to music and video data. This architecture was design to create multilayer documents which consist of many files. The paper shows the method of creating media documents on the basis of IODA. These kind of documents were called IODA Media Documents (IMD). IMD have links that connect many different kinds of files containing music and video data....

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Fast Approximate String Search for Wikification

    Publikacja

    The paper presents a novel method for fast approximate string search based on neural distance metrics embeddings. Our research is focused primarily on applying the proposed method for entity retrieval in the Wikification process, which is similar to edit distance-based similarity search on the typical dictionary. The proposed method has been compared with symmetric delete spelling correction algorithm and proven to be more efficient...

    Pełny tekst do pobrania w portalu

  • Elgold intermediate: raw texts

    Dane Badawcze

    The dataset contains raw texts scrapped from various internet sources which were used for creating the Elgold dataset.

  • Social networks as a context for small business? A new look at an enterprise in the context of a smallness and newness liability syndrome

    Publikacja

    - Rok 2011

    In this paper we aim to propose and outline key ingredients to a small enterprise success, emerging from the social capital of small business owner-managers and their business networks. We employ resource based view of an organization as well as an embeddedness perspective along with new approach transaction costs to outline the pillars of an advantage of a small business entity. The analysis of survey data leads us to conclusion,...

  • Is it all about networking? Building a sustainable value of a small enterprise in Polish context

    Publikacja

    - Rok 2010

    In this paper we aim to propose and outline key ingredients to a small enterprise success, emerging from the social capital of small business owner-managers and their business networks. We employ resource based view of an organization as well as an embeddedness perspective along with new approach transaction costs to outline the pillars of an advantage of a small business entity. The analysis of survey data leads us to conclusion,...

  • Using Decisional DNA to Enhance Industrial and Manufacturing Design: Conceptual Approach

    Publikacja

    - Rok 2013

    During recent years, manufacturing organizations are facing market changes such as the need for short product life cycles, technological advancement, intense pressure from competitors and the continuous customers’ expectation for high quality products at lower costs. In this scenario, knowledge and its associated engineering/management of every stage involved in the industrial design has become increasingly important for manufacturing...

  • Towards the 4th industrial revolution: networks, virtuality, experience based collective computational intelligence, and deep learning

    Publikacja

    - Rok 2016

    Quo vadis, Intelligent Enterprise? Where are you going? The authors of this paper aim at providing some answers to this fascinating question addressing emerging challenges related to the concept of semantically enhanced knowledge-based cyber-physical systems – the fourth industrial revolution named Industry 4.0.

  • Costs of privatization of the banking sector in Poland in 1997-2000

    Dane Badawcze
    open access

    On June 14, 1996, a special law was passed on the merger and grouping of certain banks in the form of joint-stock companies. Pursuant to these regulations, the PeKaO S.A. banking group was established, which was the only entity of this type established in this way. Additionally, by the end of 1996, four out of nine regional banks were sold, i.e. Wielkopolski...

  • Hedging Strategies of Derivatives Instruments for Commodity Trading Entities

    Publikacja

    - Rok 2015

    Hedging as an outcome of risk management arises to account several questions. Mentioned aspect of size of the hedging is one of them. Latter questioning refers to whether producer of manufacturer are willing to secure entire exposure, when the hedging should start, now or later in the future, what is the vision on market like direction of market, time of interest, magnitude of exposure, what would be the preferred instruments of...

  • Sensorless Control of Induction Machine Supplied by Current Source Inverter

    Publikacja

    The paper describes the voltage control technique of induction machines supplied by a current source inverter. The control system is based on proposed new multi-scalar variables, which are named “r.” The control system contains the output filter capacitor's model. In the sensorless control system the Z type backstepping speed observer was applied. The mathematical dependences are confirmed by simulation and experimental research.

    Pełny tekst do pobrania w serwisie zewnętrznym

  • Virtual touchpad - video-based multimodal interface

    A new computer interface named Virtual-Touchpad (VTP) is presented. The Virtual-Touchpad provides a multimodal interface which enables controlling computer applications by hand gestures captured with a typical webcam. The video stream is processed in the software layer of the interface. Hitherto existing video-based interfaces analyzing frames of hand gestures are presented. Then, the hardware configuration and software features...

  • ArchBGal32cB 441Glu mutein gene analysis dataset

    Dane Badawcze
    open access

     

  • List of public benefit organizations that in 2020 received 1% of the tax due for 2019

    Dane Badawcze
    open access

    The possibility of transferring 1% of personal income tax was introduced by the Act on Public Benefit and Volunteer Work in 2003, and specific provisions specifying who and how can transfer 1% of tax are included in the Personal Income Tax Act. In order to be able to accept 1% of income tax, first of all, the organization (or other authorized entity)...

  • Activated Sludge Process Development

    Publikacja

    - Rok 2014

    This paper summarizes the most significant steps in the activated sludge process development and recognizes key contributors. Recognition of the roles of oxygen and living organisms was the first step (1882-1914). Ardern and Lockett (1914) named the accumulated olids "activated sludge". The process was rapidly accepted and applied in the period 1914-1930. The most dramatic changes in the activated sludge process understanding and...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • XRD-TiO2 and SiO2

    Dane Badawcze
    open access

    Data contain results from XRD measurements of amorphous silica and TiO2 of antase and rutile phases. The commercial TiO2 named as P25 produced by Evonik was also analyzed.

  • Potential of Polish R&D industry in the context of prototyping, design, development and control of a dedicated national satellite SAR system for marine ecosystem monitoring. Technical paper - preliminary study

    Publikacja

    pace technology is currently one of the most important elements in the advance of information societies and knowledge-based economies all over the world. The European Space Agency (ESA) is in the focal point of European space activities, while the European Union provides strong financial support for the development of space technologies and applications in its flagship programs. In a domestic scope, the Polish Space Agency (POLSA)...

    Pełny tekst do pobrania w portalu

  • ECONOMICAL AND SAFE METHOD OF GRANULAR MATERIAL STORAGE IN SILOS IN OFFSHORE PORT TERMINALS

    Publikacja

    - Polish Maritime Research - Rok 2018

    The article discusses issues related with storage of granular materials in silos made of corrugated sheets and reinforced with vertical ribs. Advantages and disadvantages of these structures are named, and typical technological solutions used by largest silo producers are presented. Moreover, basic assumptions of Eurocode 3 are discussed in the context of determining the buckling load capacity of a ribbed jacket. Alternative methods...

    Pełny tekst do pobrania w portalu

  • QoS Resource Reservation Mechanisms for Switched Optical Networks

    Publikacja

    The paper regards the problem of resource reservation mechanisms for Quality of Service support in switched optical networks. The authors propose modifications and extensions for resources reservation strategy algorithms with resources pools, link capacity threshold and adaptive advance reservation approach. They examine proposed solutions in Automatically Switched Optical Network with Generalized Multi-Protocol Label Switching...

    Pełny tekst do pobrania w serwisie zewnętrznym

  • New type T-Source inverter

    Publikacja

    - Rok 2009

    This paper presents different topologies of voltage inverters with alternative input LC networks. The basic topology is known in the literature as a Z-source inverter (ZSI). Alternative passive networks were named by the authors as T-sources. T-source inverter has fewer reactive components in comparison to conventional Z-source inverter. The most significant advantage of the T-source inverter (TSI) is its use of a common voltage...

    Pełny tekst do pobrania w serwisie zewnętrznym