Open Research Data
Filters
total: 91
Open Research Data
-
Remus: Polish-Kashubian parallel translation corpus
Open Research DataThe dataset contains 10,825 sentences from the Kashubian book "Life and Adventures of Remus" (Żëcé i przigòdë Remùsa) with parallel Polish translations. Aleksander Majkowski's book is considered the most important book in Kashubian literature, making it a valuable source of high-quality translation data.
-
Polish-Kashubian parallel translation corpus
Open Research DataThe dataset contains Polish words and sentences and their translations into Kashubian. The dataset consists of train and test subsets. The train subset contains about 100,000 parallel translations. It was created using two types of sources. The first one is the online dictionaries:
-
OntoValidate: OntoNotes 5.0 NER validation dataset
Open Research DataOntoValidate dataset consists of 603 randomly chosen raw textsfrom the original OntoNote 5.0 dataset (3637 raw texts in total).
-
Elgold intermediate: verified by verification team
Open Research DataThe dataset contains the texts from Elgold intermediate: annotated raw additionaly verified by the five-person verification team. arly 25% of the mentions were corrected in some aspect.
-
Elgold intermediate: annotated raw
Open Research DataThe dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.
-
The American Sign Language alphabet
Open Research DataThe American Sign Language dataset contains all static letters of the American alphabet, meaning those that do not require movement to perform (the entire alphabet except for the letters 'J' and 'Z', which are dynamic and require hand movement).
-
Elgold intermediate: raw texts
Open Research DataThe dataset contains raw texts scrapped from various internet sources which were used for creating the Elgold dataset.
-
Elgold partial: History blogs
Open Research DataThe dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Scientific papers' abstracts
Open Research DataThe dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
-
Elgold partial: Amazon product reviews
Open Research DataThe dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Automotive blogs
Open Research DataThe dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
-
Elgold partial: Movie reviews
Open Research DataThe dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: News
Open Research DataThe dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...
-
Elgold partial: Job offers
Open Research DataThe dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
-
Rust QA: question answering dataset for "The Rust Programming Language" in SQuAD 2.0 format
Open Research DataRust QA is a dataset for training and evaluating QA systems. The dataset consists of 1068 questions to "The Rust Programming Language" book (https://doc.rust-lang.org/stable/book/) with the answers provided as text spans from the book. The dataset is released in SQuAD 2.0 format.
-
SWM for porous hierarchical nanocarbons composites
Open Research DataRaw Raman spectra and XRD diffractograms (background subtracted) of 13 among which carbon-derived secondary waste and reference materials.
-
Elgold: gold standard, multi-genre dataset for named entity recognition and linking
Open Research DataThe dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
LEGO bricks for training classification network
Open Research DataThe data set contains images of 447 different classes of LEGO bricks used for training LEGO bricks classification network. The dataset contains two types of images: photos (10%) and renders (90%) aggregated into respective directories. Each directory (photos and renders) contains 447 directories labeled as the official brick type number. The images...
-
Images of LEGO bricks
Open Research DataThe set contains images of LEGO bricks (from multiple categories). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks part 2
Open Research DataThe data set conatins tagged images conatining LEGO bricks used for traning LEGO bricks detecting network. The images contain random number of the same LEGO bricks on white background. Only the whole bricks are labeled.
-
Tagged images with LEGO bricks
Open Research DataThe data set conatins tagged images conatining LEGO bricks used for traning LEGO bricks detecting network. The dataset contains two types of images:
-
LDRAW based renders of LEGO bricks moving on a conveyor belt with extracted models
Open Research DataThe set contains renders of LEGO bricks moving on a white conveyor belt. The images were prepared for training neural network for recognition of LEGO bricks. For each brick starting position, alignment and color was selected (simulating the brick falling down on the conveyour belt) and than 10 images was created while the brick was moved across the...
-
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Open Research DataThe SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...
-
Thermographic imaging of electrochemical double layer capacitors during cycling charging - discharging 0 - 2,7 V at 240 mA. Sample 103.
Open Research DataDataset contains thermal images of prototype electrochemical double layer capacitor taken during cyclic charging - discharging. The sample was charged to 2,7 V and discharged to 10 mV by constant current 240 mA. Sample 103. The images were taken with thermographic camera VigoCAM V50. The sample was covered by black graphite paint to ensure uniform...
-
Thermographic imaging of electrochemical double layer capacitors during cycling charging - discharging 0 - 2,7 V at 360 mA. Sample 103.
Open Research DataDataset contains thermal images of prototype electrochemical double layer capacitor taken during cyclic charging - discharging. The sample was charged to 2,7 V and discharged to 10 mV by constant current 360 mA. Sample 103. The images were taken with thermographic camera VigoCAM V50. The sample was covered by black graphite paint to ensure uniform...
-
Thermographic imaging of electrochemical double layer capacitors during cycling charging - discharging 0 - 2,7 V at 300 mA. Sample 103.
Open Research DataDataset contains thermal images of prototype electrochemical double layer capacitor taken during cyclic charging - discharging. The sample was charged to 2,7 V and discharged to 10 mV by constant current 300 mA. Sample 103. The images were taken with thermographic camera VigoCAM V50. The sample was covered by black graphite paint to ensure uniform...
-
Thermographic imaging of electrochemical double layer capacitors during cycling charging - discharging 0 - 2,7 V at 180 mA. Sample 103.
Open Research DataDataset contains thermal images of prototype electrochemical double layer capacitor taken during cyclic charging - discharging. The sample was charged to 2,7 V and discharged to 10 mV by constant current 180 mA. Sample 103. The images were taken with thermographic camera VigoCAM V50. The sample was covered by black graphite paint to ensure uniform...
-
Distance measurement with the low coherent interferometer with silver mirror (the source wavelegth 1310 nm) - 70 um (serie 2)
Open Research DataThe obtained data was acquired by the interferometric fiber-optic sensor of distance. The setup was constructed of a broadband light source working at the central wavelength of 1310 nm, an optical spectrum analyzer, and a fiber-optic 2x1 coupler (with the power split 50:50). All elements were connected by standard single-mode optical fibers. The measurement...
-
Distance measurement with the low coherent interferometer with silver mirror (the source wavelegth 1310 nm) - 60 um (serie 2)
Open Research DataThe obtained data was acquired by the interferometric fiber-optic sensor of distance. The setup was constructed of a broadband light source working at the central wavelength of 1310 nm, an optical spectrum analyzer, and a fiber-optic 2x1 coupler (with the power split 50:50). All elements were connected by standard single-mode optical fibers. The measurement...
-
Distance measurement with the low coherent interferometer with silver mirror (the source wavelegth 1310 nm) - 50 um (serie 2)
Open Research DataThe obtained data was acquired by the interferometric fiber-optic sensor of distance. The setup was constructed of a broadband light source working at the central wavelength of 1310 nm, an optical spectrum analyzer, and a fiber-optic 2x1 coupler (with the power split 50:50). All elements were connected by standard single-mode optical fibers. The measurement...
-
Distance measurement with the low coherent interferometer with silver mirror (the source wavelegth 1310 nm) - 40 um (serie 2)
Open Research DataThe obtained data was acquired by the interferometric fiber-optic sensor of distance. The setup was constructed of a broadband light source working at the central wavelength of 1310 nm, an optical spectrum analyzer, and a fiber-optic 2x1 coupler (with the power split 50:50). All elements were connected by standard single-mode optical fibers. The measurement...
-
Distance measurement with the low coherent interferometer with silver mirror (the source wavelegth 1310 nm) - 30 um (serie 2)
Open Research DataThe obtained data was acquired by the interferometric fiber-optic sensor of distance. The setup was constructed of a broadband light source working at the central wavelength of 1310 nm, an optical spectrum analyzer, and a fiber-optic 2x1 coupler (with the power split 50:50). All elements were connected by standard single-mode optical fibers. The measurement...
-
Distance measurement with the low coherent interferometer with silver mirror (the source wavelegth 1310 nm) - 20 um (serie 2)
Open Research DataThe obtained data was acquired by the interferometric fiber-optic sensor of distance. The setup was constructed of a broadband light source working at the central wavelength of 1310 nm, an optical spectrum analyzer, and a fiber-optic 2x1 coupler (with the power split 50:50). All elements were connected by standard single-mode optical fibers. The measurement...
-
Distance measurement with the low coherent interferometer with silver mirror (the source wavelegth 1310 nm) - 10 um (serie 2)
Open Research DataThe obtained data was acquired by the interferometric fiber-optic sensor of distance. The setup was constructed of a broadband light source working at the central wavelength of 1310 nm, an optical spectrum analyzer, and a fiber-optic 2x1 coupler (with the power split 50:50). All elements were connected by standard single-mode optical fibers. The measurement...
-
Distance measurement with the low coherent interferometer with silver mirror (the source wavelegth 1310 nm) - 0 um (serie 2)
Open Research DataThe obtained data was acquired by the interferometric fiber-optic sensor of distance. The setup was constructed of a broadband light source working at the central wavelength of 1310 nm, an optical spectrum analyzer, and a fiber-optic 2x1 coupler (with the power split 50:50). All elements were connected by standard single-mode optical fibers. The measurement...
-
Tagged images with LEGO bricks - Technic Gears
Open Research DataThe set contains images of LEGO bricks (from Technic Gears category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Technic Steering Suspension and Engine
Open Research DataThe set contains images of LEGO bricks (from Technic Steering Suspension and Engine category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Technic Pins
Open Research DataThe set contains images of LEGO bricks (from Technic Pins category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Technic Connectors
Open Research DataThe set contains images of LEGO bricks (from Technic Connectors category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Technic Beams
Open Research DataThe set contains images of LEGO bricks (from Technic Beams category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Plates Round Curved and Dishes
Open Research DataThe set contains images of LEGO bricks (from Plates Round Curved and Dishes category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Bricks Sloped
Open Research DataThe set contains images of LEGO bricks (from Bricks Sloped category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Bricks
Open Research DataThe set contains images of LEGO bricks (from Bricks category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Windscreens and Fuselage
Open Research DataThe set contains images of LEGO bricks (from Windscreens and Fuselage category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Panels
Open Research DataThe set contains images of LEGO bricks (from Panels category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Technic Beams Special
Open Research DataThe set contains images of LEGO bricks (from Technic Beams Special category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Hinges Arms and Turntables
Open Research DataThe set contains images of LEGO bricks (from Hinges Arms and Turntables category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Bricks Wedged
Open Research DataThe set contains images of LEGO bricks (from Bricks Wedged category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Plates Special
Open Research DataThe set contains images of LEGO bricks (from Plates Special category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.
-
Tagged images with LEGO bricks - Bricks Special
Open Research DataThe set contains images of LEGO bricks (from Bricks Special category). The images were prepared for training neural network for recognition and labeling of LEGO bricks. The images contain one brick each. The images were taken from different sides by handheld camera hovering over the bricks lying on a white, non reflective surface.