Wyniki wyszukiwania dla: entity linking
-
Elgold: gold standard, multi-genre dataset for named entity recognition and linking
Dane BadawczeThe dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: News
Dane BadawczeThe dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...
-
Elgold partial: Scientific papers' abstracts
Dane BadawczeThe dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
-
Elgold partial: Amazon product reviews
Dane BadawczeThe dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Automotive blogs
Dane BadawczeThe dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
-
Elgold partial: Movie reviews
Dane BadawczeThe dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Job offers
Dane BadawczeThe dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
-
Elgold partial: History blogs
Dane BadawczeThe dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Szymon Olewniczak mgr inż.
OsobyJestem związany z Politechniką Gdańską od 2013 roku, kiedy to rozpocząłem studia inżynierskie na kierunku informatyka na Wydziale Elektroniki, Telekomunikacji i Informatyki. Po uzyskaniu tytułu magistra w 2019 roku podjąłem pracę jako asystent w Katedrze Architektury Systemów Komputerowych. Od 2024 roku pełnię również funkcję zastępcy kierownika katedry. Moje zainteresowania badawcze koncentrują się wokół tematów związanych z przetwarzaniem...
-
RDF dataset profiling - a survey of features, methods, vocabularies and applications
PublikacjaThe Web of Data, and in particular Linked Data, has seen tremendous growth over the past years. However, reuse and take-up of these rich data sources is often limited and focused on a few well-known and established RDF datasets. This can be partially attributed to the lack of reliable and up-to-date information about the characteristics of available datasets. While RDF datasets vary heavily with respect to the features related...
-
Linking music data in executable documents
PublikacjaThis paper presents the application of Interactive Open Document Architecture (IODA) to music and video data. This architecture was design to create multilayer documents which consist of many files. The paper shows the method of creating media documents on the basis of IODA. These kind of documents were called IODA Media Documents (IMD). IMD have links that connect many different kinds of files containing music and video data....
-
Elgold intermediate: verified by the authors
Dane BadawczeThe dataset contains the texts from Elgold intermediate: verified by verification team additionaly verified by the dataset authors but before the final validation step with the elgold toolset.
-
Elgold intermediate: raw texts
Dane BadawczeThe dataset contains raw texts scrapped from various internet sources which were used for creating the Elgold dataset.
-
Elgold intermediate: verified by verification team
Dane BadawczeThe dataset contains the texts from Elgold intermediate: annotated raw additionaly verified by the five-person verification team. arly 25% of the mentions were corrected in some aspect.
-
Elgold intermediate: annotated raw
Dane BadawczeThe dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.
-
Marcin Żmuda
OsobyMarcin Żmuda - specjalista marketingu internetowego w Google oraz założyciel agencji Embasy. Autor Update Time - wiadomościa z obszaru SEO. Pierwsze kampanie SEO przygotowywałem już w 2007 roku. Aktualnie jestem Head of SEO w firmie Orion Media Group zajmującym się zagadnieniami związanymi z pozycjonowaniem w wyszukiwarkach internetowych oraz rozbudową grupy serwisów własnych. Moja agencja, Embasy specjalizuje się przede wszystkim...