Skip to search
Skip to content
Go to help page
Go to accessibility statement
Go to sitemap page

Search results for: wikipedia - Bridge of Knowledge

Change context for scientist
Change context for business
polski

Your account

login

Search

wyszukiwana fraza

search everywhere
search publications
search journals
search conferences
search publishing houses
search people
search inventions
search projects
search laboratories
search research teams
search research equipment
search e-learning courses
search events
search offers
search open research data

Main Page
Search

Search results for: wikipedia

results on page:
embed this view on your website

Filters

total: 76
filtered: 14

Catalog
- Publications 62 available results
- Open Research Data 14 available results

clear all filters

Chosen catalog filters

Year of publication
- 2024
- 2023
- 2019
Discipline
- Natural sciences
- Engineering and Technology
Administrative Unit

Administrative Unit
- Gdańsk University of Technology
  Expand the catalog: Gdańsk University of Technology
  - Faculty of Architecture
    Expand the catalog: Faculty of Architecture
    
    Department of Urban Architecture an...
  - Faculty of Electronics, Telecommunicat...
    Expand the catalog: Faculty of Electronics, Telecommunications and Informatics
    
    Department of Computer Architecture
    
    Department of Microwave and Antenna...
Open model
- open access
- restricted access
- embargo
Data source
- MOST DANYCH
- Virtual Microscope

clear Chosen catalog filters

Search results for: wikipedia

Automatically created and partially veriffied Wikipedia - WordNet mappings
Open Research Data
open access
- T. Boiński
- J. Szymański
Mapping between Wikipedia articles and WordNet synsets. The mappings between Wikipedia articles and WordNet synsets were obtained automatically using 4 algorithms of data processing. The automatically generated mappings were than a subject of verification by a group of volunteers using crowdsourcing approach through so called Games with a Purpose. The...
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Open Research Data
open access
The SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...
WikiPrefs: human preferences dataset build from text edits
Open Research Data
open access
- J. Majkutewicz
- J. Szymański
The WikiPrefs dataset is a human preferences dataset for Large Language Models alignment. It was built using the EditPrefs method from historical edits of Wikipedia featured articles
Elgold: gold standard, multi-genre dataset for named entity recognition and linking
Open Research Data
version 1.0 open access
- S. Olewniczak
- J. Szymański
The dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
Elgold intermediate: annotated raw
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold - partial
The dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.
Elgold partial: News
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold - partial
The dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...
Elgold partial: Automotive blogs
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold - partial
The dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
Elgold partial: Movie reviews
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold - partial
The dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
Elgold partial: Job offers
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold - partial
The dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
Elgold partial: Scientific papers' abstracts
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold - partial
The dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
Elgold partial: Amazon product reviews
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold - partial
The dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
Elgold partial: History blogs
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold - partial
The dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
Elgold intermediate: verified by the authors
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold intermediate
The dataset contains the texts from Elgold intermediate: verified by verification team additionaly verified by the dataset authors but before the final validation step with the elgold toolset.
Elgold intermediate: verified by verification team
Open Research Data
open access
- S. Olewniczak
- J. Szymański
- series: Elgold intermediate
The dataset contains the texts from Elgold intermediate: annotated raw additionaly verified by the five-person verification team. arly 25% of the mentions were corrected in some aspect.

Embed on your site

Put script on your page

Put script at the end of <body> section of your page. You only need to do this once per page.

<script src="https://mostwiedzy.pl/js/embed.js"
        data-base-url="https://mostwiedzy.pl"
        data-language="en">
</script>

Place widget on your page

Paste code below into right spot on your page

<div class="most-widget" data-widget="Search" data-query="wikipedia" data-limit="50"
     data-filter-cat='openResearchData' data-filter-source='["most"]'>
</div>

Additional information

Bridge of Data in a nutshell open in new tab
Bridge of Knowledge in a nutshell open in new tab
About MOST project open in new tab
Open Access
ORD Catalog Policy and Information
Terms of Service
Privacy policy
Site Map
RDF/JSON-LD
API
Help
Accessibility statement
Report problem
Contact

Copyright © 2024
Gdansk University of Technology IT Service Centre open in new tab

Report problem

Thank you!
Your application has been sent. It will be considered by us as soon as possible.

Choose type of problem

Describe problem

First name

Last name

E-mail address

Prove you are not a robot