Filters
total: 1502
filtered: 187
-
Catalog
Chosen catalog filters
Search results for: English Language Teaching
-
MODALITY corpus - SPEAKER 33 - SEQUENCE S6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 35 - COMMANDS C6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 32 - COMMANDS C5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 35 - COMMANDS C5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - COMMANDS C4
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - SEQUENCE S3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - COMMANDS C3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 33 - SEQUENCE S5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 27 - SEQUENCE S2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S1
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S4
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 17 - SEQUENCE S6
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
Rhetoric at school - a selection of the syllabi from the Academic Gymnasium in Gdańsk - transcription and photographs
Open Research DataThe following data set comprises a transcription (in txt and docx formats) and photographs (in jpg format) of the selected records from the Latin-language teaching syllabi termed 'Typus' or 'Catalogus lectionum' (an abbreviated title). Rhetoric is the main thematic criterion behind this choice. Various topics related to rhetoric have been taught not...
-
The American Sign Language alphabet
Open Research DataThe American Sign Language dataset contains all static letters of the American alphabet, meaning those that do not require movement to perform (the entire alphabet except for the letters 'J' and 'Z', which are dynamic and require hand movement).
-
Rust QA: question answering dataset for "The Rust Programming Language" in SQuAD 2.0 format
Open Research DataRust QA is a dataset for training and evaluating QA systems. The dataset consists of 1068 questions to "The Rust Programming Language" book (https://doc.rust-lang.org/stable/book/) with the answers provided as text spans from the book. The dataset is released in SQuAD 2.0 format.
-
Clinical situations text database for Polish language
Open Research DataDataset contains a database of anonymized texts in Polish for the purposes of building a medical speech corpus, for clinical situations in the following areas: medical interview, interview and description of the result of an oncological examination, description of a radiological examination, description of a pathomorphological examination, description...
-
TF-IDF weighted bag-of-words preprocessed text documents from Simple English Wikipedia
Open Research DataThe SimpleWiki2K-scores dataset contains TF-IDF weighted bag-of-words preprocessed text documents (raw strings are not available) [feature matrix] and their multi-label assignments [label-matrix]. Label scores for each document are also provided for an enhanced multi-label KNN [1] and LEML [2] classifiers. The aim of the dataset is to establish a benchmark...
-
Revenues from operating activities of non-public universities in Poland in 2010 (PLN thousand)
Open Research DataIn 2010, in non-public universities, 90.2% of revenues from operating activities were revenues from teaching activities. Revenues from research activity accounted for 2.8%, and other types of activity accounted for 6.9% of total operating revenues.Income from teaching activities of universities may be obtained from budget subsidies, funds to local governments...
-
Scientific development of staff at selected private universities in Gdańsk, Sopot and Gdynia in 2010
Open Research DataOne of the important elements influencing the raising of the university's level is the development of the academic staff. According to the available data, there is a large discrepancy in the number of research and teaching staff employed between individual universities. Universities that employ more teaching and research staff choose a different form...
-
The structure of revenues of non-public universities in 2010 from didactic activity by sources of financing (in%)
Open Research DataIncome from teaching activities of non-public universities accounted for almost 1/5 of income from teaching activities of all types of universities. Non-public universities generated the highest revenues from fees for teaching classes, which accounted for over half of the revenues from this title in relation to all universities.The average cost of education...
-
Structure of own costs of private universities in 2010
Open Research DataThe biggest costs are generated by basic teaching activities. However, in the context of the competitiveness and quality of education in private universities, research expenditure is a cause for concern. Dataset shows that these universities spend less than 4.0% on research compared to other activities. For comparison, in public universities research...
-
Results of calibration of the piezoelectric scanner using the probe TGQ1
Open Research DataTeaching file. Results of calibration of the piezoelectric scanner using the probe TGQ1. Scanning in contact mode. NTEGRA Prima (NT-MDT) device. CSG probe 10.
-
Number of students per one lecturer in the academic year 2010/2011 at Polish univeristies
Open Research DataAs at the end of December 2010, 103.5 thousand academic teachers worked in universities (full-time and part-time equivalent to full-time employment), including 1.9 thous. foreigners. Teachers working in public schools accounted for almost 82.7% of the total number of employees in higher education, and lecturers from non-public universities - 17.3%....
-
ALOFON corpus
Open Research DataThe ALOFON corpus is one of the multimodal database of word recordings in English, available at http://www.modality-corpus.org/. The ALOFON corpus is oriented towards the recording of the speech equivalence variants. For this purpose, a total of 7 people who are or speak English with native speaker fluency and a variety of Standard Southern British...
-
Automatically created and partially veriffied Wikipedia - WordNet mappings
Open Research DataMapping between Wikipedia articles and WordNet synsets. The mappings between Wikipedia articles and WordNet synsets were obtained automatically using 4 algorithms of data processing. The automatically generated mappings were than a subject of verification by a group of volunteers using crowdsourcing approach through so called Games with a Purpose. The...
-
Video recordings of static hand gestures for gesture based interaction
Open Research DataThis data set contains video recording of selected simple hand gestures related to sign language. The purpose of the data set is to evaluate different computer algorithms design for hand gesture detection as well as for hand features and hand pose detection and identification. The data set contains 5 video recordings in mp4 format. Each recording is...
-
Elgold partial: News
Open Research DataThe dataset contains 37 English texts scrapped from news websites. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking...
-
An facile Fortran-95 algorithm to simulate complex instabilities in three-dimensional hyperbolic systems
Open Research DataIt is well know that the simulation of fractional systems is a difficult task from all points of view. In particular, the computer implementation of numerical algorithms to simulate fractional systems of partial differential equations in three dimensions is a hard task which has no been solved satisfactorily. Here, we provide a Fortran-95 code to solve...
-
Auditory Brainstem Responses recorded employing Audio ABR device
Open Research DataThe dataset consists of ABR measurements employing click, burst and speech stimuli. Parameters of the particular stimuli were as follows:
-
Elgold intermediate: annotated raw
Open Research DataThe dataset contains a subset of texts from Elgold intermediate: raw texts with named entities marked and linked to corresponding Wikipedia articles. The texts were annotated by 31 participants during the 1.5-hour session.
-
Elgold: gold standard, multi-genre dataset for named entity recognition and linking
Open Research DataThe dataset contains 276 multi-genre texts with marked named entities, which are linked to corresponding Wikipedia articles if available. Each entity was manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Assessment of Poland's competitiveness as a location for BPO/SSC projects
Open Research DataIn compiled by A.T. Kearney in subsequent editions of the ranking (2009, 2011, 2016, 2019), the first three positions were taken by the same countries. India received the highest marks with a clear advantage. The country is a world leader in terms of attractiveness for locating business service centers. India can offer English-speaking skilled labor...
-
2 Latin letters by Georg Pauli (b.1586-d.1650) - transcription, translation and photographs
Open Research DataThe data set contains two Latin letters by Georg Pauli (b. 1586 – d. 1650) to his brother Adrian (d. 1622) in photographs, transcriptions, and translations into Polish and English. The first letter was sent by Georg from Gdańsk (formerly Danzig) in 1604 when he was still a student at the local Academic Gymnasium. The second one, in turn, was written...
-
Wernsdorf - a biography of an 18th century scholar - transcription, photographs and partial translation
Open Research DataThe data set contains the transcription of a large passage of the Latin biography of the eighteenth-century philologist and theologian Gottlieb Wernsdorf (1717-1774), as well as photos of the entire print. A translation from Latin into English and Polish was added to the data set as well (that is, excerpts on the education of both Gottlieb himself,...