Filters
total: 1290
filtered: 514
Search results for: TEXT DOCUMENTS CATEGORIZATION
-
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Open Research DataThe SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...
-
Foundation text of St. Mary's Church in Gdańsk
Open Research DataThe data set concerns epigraphy. It refers to the medieval foundation preserved on the wall above the sacristy entrance in St. Mary’s Church in Gdańsk, which confirms that the foundation stone of the temple was laid on 28th of March 1343. The data set contains one general photo of the foundation text, transcription of its text in Latin and its Polish...
-
Clinical situations text database for Polish language
Open Research DataDataset contains a database of anonymized texts in Polish for the purposes of building a medical speech corpus, for clinical situations in the following areas: medical interview, interview and description of the result of an oncological examination, description of a radiological examination, description of a pathomorphological examination, description...
-
WikiPrefs: human preferences dataset build from text edits
Open Research DataThe WikiPrefs dataset is a human preferences dataset for Large Language Models alignment. It was built using the EditPrefs method from historical edits of Wikipedia featured articles
-
3D knee model G with reduced thickness of articular cartilage - input text file for computation
Open Research DataThe finite element method was used to simulate the stance phase of the gait cycle. An intact knee model was prepared based on magnetic resonance scans of the left knee joint of a healthy volunteer. In the model G articular cartilage thickness was reduced in specific areas to simulate degenerative changes in the medial knee osteoarthritis. The file was...
-
3D knee model M with decreased material parameters of the cartilage and menisci - input text file for computation
Open Research DataThe finite element method was used to simulate the stance phase of the gait cycle. An intact knee model was prepared based on magnetic resonance scans of the left knee joint of a healthy volunteer. In the model M the material parameters of cartilage and menisci were reduced to simulate degenerative changes in the medial knee osteoarthritis. The file...
-
3D model of osteoarthritic (OA) knee joint for analysis of the medial meniscus biomechanics - input text file for computation
Open Research DataThe finite element method was used to simulate the stance phase of the gait cycle. An intact knee model was prepared based on magnetic resonance scans of the left knee joint of a healthy volunteer. In the OA model thickness of articular cartilage and material parameters of the cartilage and menisci were reduced to simulate degenerative changes in the...
-
3D intact knee model used in analysis of the medial meniscus biomechanics in the osteoarthritic knee joint - input text file for computation
Open Research DataThe finite element method was used to simulate the stance phase of the gait cycle. An intact knee model with original geometry and material parametetrs was prepared based on magnetic resonance scans of the left knee joint of a healthy volunteer. The file was created in Abaqus 6.14-2, but can be read in a text editor.
-
Internal legal acts of technical and medical universities in Poland regulating classes conducted in-person during the Covid-19 pandemic
Open Research DataA database of legal acts and other internal documents of medical and technical universities in Poland regulating the way of organizing in-person or hybrid classes during the COVID-19 pandemic from the summer semester 2019/2020 to the winter semester 2020/2021.Documents were encoded in two separate coding systems using the MAXQDA program for qualitative...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Silesian University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles.The most common definition of research productivity...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Łódź University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Poznań University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Warsaw University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (West Pomeranian University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Dataset of bibliometric data for a research study on scientific productivity of Polish economic universities (Warsaw School of Economics 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish economic universities. The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Lublin University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Białystok University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Gdańsk University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Technical University Radom 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Wrocław University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Kielce University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Technical University Częstochowa 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Koszalin University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Cracow University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles.The most common definition of research productivity...
-
Dataset of bibliometric data for a research study on scientific productivity of Polish economic universities (University of Economics in Katowice 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish economic universities. The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Opole University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Dataset of bibliometric data for a research study on scientific productivity of Polish economic universities (Cracow University of Economics 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish economic universities. The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (Rzeszów University of Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Bibliometric data for a research study on scientific productivity of Polish technical universities (AGH University of Science & Technology 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish technical universities.The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Dataset of bibliometric data for a research study on scientific productivity of Polish economic universities (Poznań University of Economics & Business 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish economic universities. The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Dataset of bibliometric data for a research study on scientific productivity of Polish economic universities (Wrocław University of Economics & Business 2016-2020) retrieved by InCites benchmarking tool.
Open Research DataThis dataset was created for the purpose of research on scientific productivity at Polish economic universities. The raw data was retrieved in July 2021 by the InCites benchmarking tool in xlsx format and will be used to create the research profiles of the universities and underlying data of journals articles. The most common definition of research...
-
Number of work permits issued in 2008-2018 by migrant's country of origin
Open Research DataThe scale of the influx of economic migrants to Poland can be proved by data referring to documents enabling legal work, including work permits. In 2018, the most frequently applied for work permit for citizens of eight countries presented in the dataset. The table contains data on the number of permits issued in the years 2008-2018.
-
Elgold partial: News
Open Research DataThe dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...
-
Elgold partial: Automotive blogs
Open Research DataThe dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
-
Elgold partial: Movie reviews
Open Research DataThe dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Job offers
Open Research DataThe dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
-
Elgold partial: Scientific papers' abstracts
Open Research DataThe dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
-
Elgold partial: Amazon product reviews
Open Research DataThe dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: History blogs
Open Research DataThe dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Rust QA: question answering dataset for "The Rust Programming Language" in SQuAD 2.0 format
Open Research DataRust QA is a dataset for training and evaluating QA systems. The dataset consists of 1068 questions to "The Rust Programming Language" book (https://doc.rust-lang.org/stable/book/) with the answers provided as text spans from the book. The dataset is released in SQuAD 2.0 format.
-
Elgold: gold standard, multi-genre dataset for named entity recognition and linking
Open Research DataThe dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold intermediate: verified by the authors
Open Research DataThe dataset contains the texts from Elgold intermediate: verified by verification team additionaly verified by the dataset authors but before the final validation step with the elgold toolset.
-
Elgold intermediate: verified by verification team
Open Research DataThe dataset contains the texts from Elgold intermediate: annotated raw additionaly verified by the five-person verification team. arly 25% of the mentions were corrected in some aspect.
-
Elgold intermediate: annotated raw
Open Research DataThe dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.
-
Epitaph of Bartholomew Wagner in St. Mary's Church in Gdańsk
Open Research DataThe data set concerns epigraphy. It refers to the epitaph placed in St. Mary’s Church in Gdańsk, that is dedicated to Bartholomew Wagner, city physician from 1562, who came to Gdańsk from Konigsberg. Data set contains one general photo of the epitaph, transcription of its text in Latin, its translation in Polish, and the biography of the deceased, also...
-
Epitaph of John Schroeder in St. Mary's Church in Gdańsk
Open Research DataThe data set concerns epigraphy. It refers to the epitaph placed in St. Mary’s Church in Gdańsk, that is dedicated to John Schröder, son of Simon. John, as he was a single, erected this epitaph for himself. Data set contains one general photo of the epitaph, transcription of its text in Latin, its translation in Polish, and the biography of the deceased,...
-
Epitaph of Henry Giese in St. Mary's Church in Gdańsk
Open Research DataThe data set concerns epigraphy. It refers to the epitaph placed in St. Mary’s Church in Gdańsk, that is dedicated to Henry Giese, a member of merchant family that came from Germany and settled down in Gdańsk in XVI century. Data set contains one general photo of the epitaph, transcription of its text in Latin, its translation in Polish, and the biography...
-
Epitaph of Oehm family in St. Mary's Church in Gdańsk
Open Research DataThe data set concerns epigraphy. It refers to the epitaph placed in St. Mary’s Church in Gdańsk, that is dedicated to Öhm family, rather unknown to Gdańsk. The epitaph was erected by two latest representatives of family – John and Andrew. Data set contains one general photo of the epitaph, transcription of its text in Latin, its translation in Polish,...
-
Mechanical properties of single V2O5 nanocrystal - nanoindentation measurement in control of the max-load
Open Research DataThe DataSet contains the nanoindentation curves (indentation force Fn vs penetrationPd) for a single V2O5 nanocrystal supported on a substrate. The measurements were performed in control of the maximum load of Berkovich indenter force from 2 to 50 mN.
-
Mechanical properties of single V2O5 nanocrystal - nanoindentation measurement in control of the max-depth
Open Research DataThe DataSet contains the nanoindentation curves (indentation force Fn vs penetrationPd) for a single V2O5 nanocrystal supported on a substrate. The measurements were performed in control of the maximum depth of Berkovich indenter penetration: 60, 70, and 100 nm.