Wyniki wyszukiwania dla: bridges

Wyniki wyszukiwania dla: bridges

wyników na stronę:
osadź ten widok na swojej stronie

Elgold: gold standard, multi-genre dataset for named entity recognition and linking
Dane Badawcze
wersja 1.0 open access
- S. Olewniczak
- J. Szymański
The dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
Elgold intermediate: verified by the authors
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold intermediate
The dataset contains the texts from Elgold intermediate: verified by verification team additionaly verified by the dataset authors but before the final validation step with the elgold toolset.
Elgold intermediate: verified by verification team
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold intermediate
The dataset contains the texts from Elgold intermediate: annotated raw additionaly verified by the five-person verification team. arly 25% of the mentions were corrected in some aspect.
Elgold intermediate: annotated raw
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold - partial
The dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.
Elgold partial: News
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold - partial
The dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...
Elgold partial: Automotive blogs
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold - partial
The dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
Elgold partial: Movie reviews
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold - partial
The dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
Elgold partial: Job offers
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold - partial
The dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
Elgold partial: Scientific papers' abstracts
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold - partial
The dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
Elgold partial: Amazon product reviews
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold - partial
The dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
Elgold partial: History blogs
Dane Badawcze
open access
- S. Olewniczak
- J. Szymański
- seria: Elgold - partial
The dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

Filtry

Katalog

Rok publikacji

Dziedzina

Jednostka administracyjna

Model otwartości

Źródło danych

Elgold: gold standard, multi-genre dataset for named entity recognition and linking

Elgold intermediate: verified by the authors

Elgold intermediate: verified by verification team

Elgold intermediate: annotated raw

Elgold partial: News

Elgold partial: Automotive blogs

Elgold partial: Movie reviews

Elgold partial: Job offers

Elgold partial: Scientific papers' abstracts

Elgold partial: Amazon product reviews

Elgold partial: History blogs

Wyszukiwarka

Filtry

Katalog

Rok publikacji

Dziedzina

Jednostka administracyjna

Model otwartości

Źródło danych

Wyniki wyszukiwania dla: bridges