Filters
total: 6308
filtered: 68
Search results for: TEXTUAL CLASSIFICATION
-
LEGO bricks for training classification network
Open Research DataThe data set contains images of 447 different classes of LEGO bricks used for training LEGO bricks classification network. The dataset contains two types of images: photos (10%) and renders (90%) aggregated into respective directories. Each directory (photos and renders) contains 447 directories labeled as the official brick type number. The images...
-
Microscopic examination of the texture of paper products
Open Research DataAtomic force microscopy (AFM) can be used to study the state of the paper fibers with the aim of providing qualitative and semi-quantitative information on degradation and aging. The work [1] reports the results of tests of various paper products subjected to deliberate aging processes under the influence of various factors. Chemical and biological...
-
The effect of the flaxseed addition on the texture of wheat bread
Open Research DataThe dataset contains the results of flaxseed addition on the texture of toasted bread. The following bread variants were tested: control bread, bread with 8% and 12% linseed addition and competitive bread. Measurements were made immediately after baking and after 4 days. On the basis of data the following parameters were determined: hardness, elasticity,...
-
Effect of Jerusalem artichoke addition on texture profile of bread
Open Research DataThe dataset contains the results of Jerusalem artichoke addition on the texture of wheat bread. The following bread variants were tested: control bread, bread with 15% and 30% Jerusalem artichoke addition. Measurements were made immediately after baking. On the basis of data, the following parameters were determined: hardness, elasticity, cohesion and...
-
Rust QA: question answering dataset for "The Rust Programming Language" in SQuAD 2.0 format
Open Research DataRust QA is a dataset for training and evaluating QA systems. The dataset consists of 1068 questions to "The Rust Programming Language" book (https://doc.rust-lang.org/stable/book/) with the answers provided as text spans from the book. The dataset is released in SQuAD 2.0 format.
-
Elgold: gold standard, multi-genre dataset for named entity recognition and linking
Open Research DataThe dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold intermediate: verified by verification team
Open Research DataThe dataset contains the texts from Elgold intermediate: annotated raw additionaly verified by the five-person verification team. arly 25% of the mentions were corrected in some aspect.
-
Elgold intermediate: annotated raw
Open Research DataThe dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.
-
Elgold partial: News
Open Research DataThe dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...
-
Elgold partial: Scientific papers' abstracts
Open Research DataThe dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
-
Elgold partial: Amazon product reviews
Open Research DataThe dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Automotive blogs
Open Research DataThe dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
-
Elgold partial: Movie reviews
Open Research DataThe dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Job offers
Open Research DataThe dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
-
Elgold partial: History blogs
Open Research DataThe dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Collective action against same-sex marriages_online research in Poland
Open Research DataThe study was conducted in the context of people opposing the equality of sexual minorities. At the beginning of the survey, each respondent was asked to mark their support for the equality of rights of heterosexual and homosexual persons, on a scale from -3 (definitely not) to 3 (definitely yes). The subsequent questions of the survey were formulated...
-
2 Latin letters by Georg Pauli (b.1586-d.1650) - transcription, translation and photographs
Open Research DataThe data set contains two Latin letters by Georg Pauli (b. 1586 – d. 1650) to his brother Adrian (d. 1622) in photographs, transcriptions, and translations into Polish and English. The first letter was sent by Georg from Gdańsk (formerly Danzig) in 1604 when he was still a student at the local Academic Gymnasium. The second one, in turn, was written...
-
Vident-real: an intra-oral video dataset for multi-task learning
Open Research DataWe introduce Vident-real, a large dataset of 100 video sequences of intra-oral scenes from real conservative dental treatments performed at the Medical University of Gdańsk, Poland. The dataset can be used for multi-task learning methods including:
-
A study of nighttime vehicle detection algorithms
Open Research DataThis dataset is from my master's thesis "A study of nighttime vehicle detection algorithms". It contains both raw data and preprocessed dataset ready to use. In the pictures below you can see how images were annotated.
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - All accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Bias mitigation benchmark that includes two datasets
Open Research DataISIC-2020 is the largest skin lesion dataset divided into two classes -- benign and malignant. It contains 33126 dermoscopic images from over 2000 patients. The diagnoses were confirmed either by histopathology, expert agreement or longitudinal follow-up. The dataset was gathered by The International Skin Imaging Collaboration (ISIC) from several medical...
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Pedestrian accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: Pedestrians. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Young drivers accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: young driver offender. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Motorcycle and moped accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: motorcyclists and mopeds. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Head-on accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, type of accidents: head-on. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Side-impact accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, type of accidents: Side-impact. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Run off road accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, type of accidents: Run off road. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Elderly people accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: elderly people (65+) - drivers, passengers and . vulnerable road user. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Cyclist accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: Cyclists. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Night accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, time of accidents: Night. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Excessive speed accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, cause of accidents: Excessive speed accidents. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Child accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: children - drivers, passengers and . vulnerable road user.. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Alcohol and drug accidents
Open Research DataData contain risk classification on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019, risk group: Offenders under influence of alcohol or drug - driver or pedestrian. Measures used to assess the level of risk are (5 classes low, low to medium, medium, medium to high, high):
-
SkinDepth - synthetic 3D skin lesion database
Open Research DataSkinDepth is the first synthetic 3D skin lesion database. The release of SkinDepth dataset intends to contribute to the development of algorithms for:
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2017-2019 - Medium to High and high road sections
Open Research DataData contain road sections with the highest number of accidents and victims on regional roads (voivodeship roads) in pomorskie voivodeship in 2017-2019. Measures used to assess the level of risk is: minimum 4 accidents or 4 seriously injured or fatalities per one kilometer (5 classes: low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2019 - Municipality areas
Open Research DataData contain the number of accidents, victims, accident costs divided on municipality areas (119 areas) on regional roads (voivodeship roads) in pomorskie voivodeship in 2019. Measures used to assess the level of social risk are (5 classes: low, low to medium, medium, medium to high, high):
-
Accidents, victims and risk levels on regional roads in pomorskie voivodeship, 2019 - Poviat areas
Open Research DataData contain the number of accidents, victims, accident costs divided on poviat areas (16 areas) on regional roads (voivodeship roads) in pomorskie voivodeship in 2019. Measures used to assess the level of social risk are (5 classes low, low to medium, medium, medium to high, high):
-
Clinical situations text database for Polish language
Open Research DataDataset contains a database of anonymized texts in Polish for the purposes of building a medical speech corpus, for clinical situations in the following areas: medical interview, interview and description of the result of an oncological examination, description of a radiological examination, description of a pathomorphological examination, description...
-
Surface EMG-based signal acquisition for decoding hand movements
Open Research DataBiosignal processing plays a crucial role in modern hand prosthetics. The challenge is to restore functionality of a lost limb based on the signals acquired from the surface of the stump. The number of sensors (emg channels) used for signal acquisition influence the quality of a prosthetic hand. Modern algorithms (including neural networks) can significantly...
-
The surface of the sensor used in the analysis of odorous substances
Open Research DataHuman industrial activity usually leads to smaller or larger interference with the ecosystem, contributing to changes affecting the quality of life. An example may be the emission of gaseous substances, not necessarily toxic, but due to their intense smell, they can cause discomfort to people exposed to their inhalation. The problem is so important...
-
Tensile curve of E grade steel for shipbuilding
Open Research DataIn the shipbuilding industry, the risk of brittle fractures developing in constructions is limited by employing certified materials of specific impact strength, determined using the Charpy method (for a given design temperature) and by exercising control over the welding processes (technology qualification, supervision of production, tests of non-destructive...
-
Simulation of ship turning circle test for ballast and full load conditions
Open Research DataThe data show the results of the turning circle spiral test for the simplified ship model, taking into account two states of loading: ballast and full load. During the circulation test, the manoeuvrability of the vessel is tested.
-
Angular welding distortion - one sided fillet weld
Open Research DataWelding is the basic method of joining ship hull elements during its construction. However, this method of joining structural elements generates shrinks. Shrinks causes deformation of the entire welded structure, both linear and angular. In the shipbuilding industry, there is a tendency to oversize fillet welds, at the design as well as manufacturing...
-
The aggregation of objects representing Gdańsk district buildings - scale 1:10000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
The aggregation of objects representing buildings in the Kartuzy district - scale 1:10000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
Herbarium of Division of Marine Biology and Ecology University of Gdańsk (DMBE)
Open Research DataHerbarium of Division of Marine Biology and Ecology University of Gdańsk (DMBE) is a research herbarium encompassing specimens of vascular plants and algae hosted by the Laboratory of Marine Plant Ecology at University of Gdańsk, Poland. The aim of Herbarium is to preserve marine plant and algae collections mostly from the Gulf of Gdańsk, but the herbarium...
-
The aggregation of objects representing buildings in the Kartuzy district - scale 1:25000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
The aggregation of objects representing Gdańsk district buildings - scale 1:25000
Open Research DataThe process of automatic generalization is one of the elements of spatial data preparation for the purpose of creating digital cartographic studies. The presented data include a part of the process of generalization of building groups obtained from the national geodesy and cartography resource from BDOT10k (10k topographic database) [1].
-
SYNAT Music Genre Parameters PCA 19
Open Research DataThe dataset contains feature vector after Principal Component Analysis (PCA) performing, so there are 11 music genres and 19-element vector derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier research studies carried out by the team of authors [1-6]. A collection of 52532 music excerpts described...
-
SYNAT_PCA_48
Open Research DataThere is a series of datasets containing feature vectors derived from music tracks. The dataset contains 51582 music tracks (22 music genres) and feature vector after Principal Component Analysis (PCA) performing, so there are 48-element vectors derived from music excerpts. Originally, a feature vector containing 173 elements was conceived in earlier...