Wyniki wyszukiwania dla: ner - MOST Wiedzy

Wyszukiwarka

Wyniki wyszukiwania dla: ner

Filtry

wszystkich: 163
wybranych: 8

wyczyść wszystkie filtry


Filtry wybranego katalogu

wyczyść Filtry wybranego katalogu

Wyniki wyszukiwania dla: ner

  • OntoValidate: OntoNotes 5.0 NER validation dataset

    Dane Badawcze
    open access

    OntoValidate dataset consists of 603 randomly chosen raw textsfrom the original OntoNote 5.0 dataset (3637 raw texts in total).

  • Elgold partial: News

    Dane Badawcze

    The dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...

  • Elgold partial: Scientific papers' abstracts

    Dane Badawcze

    The dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.

  • Elgold partial: Amazon product reviews

    Dane Badawcze

    The dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold partial: Automotive blogs

    Dane Badawcze

    The dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...

  • Elgold partial: Movie reviews

    Dane Badawcze

    The dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.

  • Elgold partial: Job offers

    Dane Badawcze

    The dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...

  • Elgold partial: History blogs

    Dane Badawcze

    The dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.