Filters
total: 3961
filtered: 307
-
Catalog
- Publications 2931 available results
- Journals 439 available results
- Conferences 78 available results
- Publishing Houses 1 available results
- People 136 available results
- Projects 4 available results
- e-Learning Courses 62 available results
- Events 3 available results
- Open Research Data 307 available results
Chosen catalog filters
Search results for: natural language processing
-
Vident-lab: a dataset for multi-task video processing of phantom dental scenes
Open Research DataWe introduce a new, asymmetrically annotated dataset of natural teeth in phantom scenes for multi-task video processing: restoration, teeth segmentation, and inter-frame homography estimation. Pairs of frames were acquired with a beam splitter. The dataset constitutes a low-quality frame, its high-quality counterpart, a teeth segmentation mask, and...
-
Video recordings of static hand gestures for gesture based interaction
Open Research DataThis data set contains video recording of selected simple hand gestures related to sign language. The purpose of the data set is to evaluate different computer algorithms design for hand gesture detection as well as for hand features and hand pose detection and identification. The data set contains 5 video recordings in mp4 format. Each recording is...
-
Elgold partial: Automotive blogs
Open Research DataThe dataset contains 34 English texts scrapped from automotive blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and...
-
Elgold partial: Movie reviews
Open Research DataThe dataset contains 37 English texts with movie reviews. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: Job offers
Open Research DataThe dataset contains 34 English texts scrapped from the web portals offering job offers. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity...
-
Elgold partial: Scientific papers' abstracts
Open Research DataThe dataset contains 87 Scientific papers' abstracts in English randomly chosen from the folowing scientific disciplines: Biomedicine, Life Sciences, Mathematics, Medicine, Science, Humanities, Social Science.
-
Elgold partial: Amazon product reviews
Open Research DataThe dataset contains 34 Amazon product reviews in English. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Elgold partial: History blogs
Open Research DataThe dataset contains 13 texts from English history blogs. In each text, the named entities are marked. Each name entity is linked to the corresponding Wikipedia if possible. All entities were manually verified by at least three people, which makes the dataset a high-quality gold standard for the evaluation of named entity recognition and linking algorithms.
-
Rust QA: question answering dataset for "The Rust Programming Language" in SQuAD 2.0 format
Open Research DataRust QA is a dataset for training and evaluating QA systems. The dataset consists of 1068 questions to "The Rust Programming Language" book (https://doc.rust-lang.org/stable/book/) with the answers provided as text spans from the book. The dataset is released in SQuAD 2.0 format.
-
Vident-synth: a synthetic intra-oral video dataset for optical flow estimation
Open Research DataWe introduce Vident-synth, a large dataset of synthetic dental videos with corresponding ground truth forward and backward optical flows and occlusion masks. It can be used for:
-
Sieve analysis of natural and magnetite aggregate
Open Research DataSieve analysis of natural river sand and magnetite aggregate used for concrete production
-
Tribological test for evaluation of Natural PEEK
Open Research DataTest of PEEK natural (beige color) samples with sliding speed up to 1,2 m/s and up to 8 MPa of nominal load
-
The American Sign Language alphabet
Open Research DataThe American Sign Language dataset contains all static letters of the American alphabet, meaning those that do not require movement to perform (the entire alphabet except for the letters 'J' and 'Z', which are dynamic and require hand movement).
-
Raman spectra for pyrolized natural compounds
Open Research DataThe presented data showcases the results of Raman spectroscopy analysis conducted on pyrolyzed natural compounds both with and without the inclusion of graphene. The study delved into four specific compounds: methylcellulose with lysine (ML), methylcellulose with lysine-graphene composite (MLG), algae (A), and algae-graphene composite (AG). Raman spectra...
-
Data accompanying the paper: Zarach and Parteka (2023), Export diversification and dependence on natural resources , Economic Modelling (Elsevier), 126 (2023) 106436.
Open Research DataThe folder contains the data and codes used in the analysis described in the paper: Zarach Z.H., Parteka A. (2023). Export diversification and dependence on natural resources. Economic Modelling Volume 126, September 2023, 106436.
-
Data accompanying the paper: "Productivity effects of trade in natural resources – comparison with mechanisms of technological specialization" (Zarach and Parteka, The World Economy, 2023)
Open Research DataThe folder contains the data and codes used in the analysis described in the paper: Zarach Z.H., Parteka A. (2023). Productivity effects of trade in natural resources – comparison with mechanisms of technological specialization. The World Economy, 00, 1–23. https://doi.org/10.1111/twec.13456
-
Clinical situations text database for Polish language
Open Research DataDataset contains a database of anonymized texts in Polish for the purposes of building a medical speech corpus, for clinical situations in the following areas: medical interview, interview and description of the result of an oncological examination, description of a radiological examination, description of a pathomorphological examination, description...
-
Twenty years' (1996-2016) tropospheric parameters for selected EPN stations derived from GPS double-difference processing
Open Research DataPropagation of global navigation satellite systems (GNSS) signals through the Earth’s atmosphere is affected by its physical properties. Both hydrostatic and wet part of the atmosphere (mostly related with troposphere) causes delays of GNSS signal, which usually are expressed in the zenith direction (zenith hydrostatic delay - ZHD, and zenith wet delay...
-
The fertility rate (TFR) in selected EU countries in 2015
Open Research DataThe main reasons for the negative consequences of demographic changes are: natural increase in the life span of the population, decline in fertility and emigration of unusual dimensions.
-
Data obtained via parametrization of differently mixed audio signals
Open Research DataDataset consists of audio samples and the results of their parametrization. The extraction of music parameters was performed using MIRToolbox. Information extracted from the samples was used as a database for master's thesis titled 'The influence of audio signal processing chain in mixing on the emotional state of a music piece'.
-
X-ray images of Baltic herring
Open Research DataA methodology for studying the geometric shape of Baltic herring swimbladders including the optimal way of catching, transporting and storing fish, the X-ray measurements and the X-ray image analysis, that does not change the natural shape of the fish swimbladder was developed. Fish for research was obtained in the area of the Polish coastal zone...
-
Historical Wreck Inventory
Open Research DataThe measurement solution results in a point cloud obtained from the Leica P30 laser scanner. Another element is the processing of photos into point clouds with the Zenmuse P1 camera of the unmanned Matrice 300 RTK aircraft. The measurement provides complete geometrical information about the wreck. The measurement took place as part of the Photogrammetry...
-
Ambient vibrations - footbridge over the Kolibkowski Stream in Gdynia, span P2
Open Research DataAmbient vibration tests carried out on the P2 span of the footbridge over the Kolibkowski Stream. The research was carried out using a set of 14 acceleration sensors - MEMS accelerometers TE 4332M3-002 and TE 4312M3-002, which allowed for simultaneous measurement and recording of 20 acceleration channels. These sensors are characterized by a natural...
-
Ambient vibrations - footbridge over the Kolibkowski Stream in Gdynia, span P3
Open Research DataAmbient vibration tests carried out on the P3 span of the footbridge over the Kolibkowski Stream. The research was carried out using a set of 14 acceleration sensors - MEMS accelerometers TE 4332M3-002 and TE 4312M3-002, which allowed for simultaneous measurement and recording of 20 acceleration channels. These sensors are characterized by a natural...
-
PIT revenues in 2012-2021 in PLN billion
Open Research DataTaxes are the primary source of income for the state budget and local government units. Tax issues play an important role in the economy of each country, its citizens and economic entities that operate on the market. Taxes are the main instrument of the state's influence on the economy, as they cover almost all natural and legal persons operating on...
-
A novel method for drop in drop edible oils encapsulation with chitosan using a coaxial technique
Open Research DataThe dataset present a novel one step method for oil encapsulation. In this coaxial system the oil constitutes the core of the capsule, while the chitosan solution is the polymer shell surrounding the core to provide separation of the core from the external environment. In comparison to other encapsulation methods, the presented technique is much simpler...
-
WikiPrefs: human preferences dataset build from text edits
Open Research DataThe WikiPrefs dataset is a human preferences dataset for Large Language Models alignment. It was built using the EditPrefs method from historical edits of Wikipedia featured articles
-
Automatically created and partially veriffied Wikipedia - WordNet mappings
Open Research DataMapping between Wikipedia articles and WordNet synsets. The mappings between Wikipedia articles and WordNet synsets were obtained automatically using 4 algorithms of data processing. The automatically generated mappings were than a subject of verification by a group of volunteers using crowdsourcing approach through so called Games with a Purpose. The...
-
Cities that obtain the highest revenues from PIT in 2021 in PLN billion
Open Research DataTax issues play an important role in the economy of each country, its citizens and economic entities that operate on the market. Taxes are the main instrument of the state's influence on the economy, as they cover almost all natural and legal persons that operate on the market.
-
Fourier transform infrared spectroscopy (FTIR) of pre- PXBS (0 h) and PXBS during the crosslinking process (24 h–288 h)
Open Research DataThe goal of this research was developing biodegradable and biocompatibile xylitol-based copolymers with improved mechanical properties, and investigating the change in their thermal and chemical properties withprogress of the cross-linking process. Using a raw material of natural origin such as xylitol, a prepolymer wasobtained by esterification and...
-
Number of selected entities (companies) of the national economy in 2008 - 2017
Open Research DataCurrently, most enterprises in Poland are run as sole proprietorships. In addition, natural persons running a business may run an enterprise as part of civil partnerships, which are a relatively simple form of business operation by more than one person (at least two partners). In addition, there is a growing interest in establishing commercial law companies,...
-
Forecast of basic fuel prices in imports to Poland (constant prices in USD in 2007)
Open Research DataThis dataset presents price growth forecasts for conventional energy sources. It should be noted that the Ministry of Economy forecasts a more than two-fold increase in oil prices (although these forecasts may be greatly underestimated) over 23 years, an almost two-fold increase in natural gas prices and a 40% increase in coal prices.
-
Poland’s energy dependence - economic context
Open Research DataPoland does not have vast resources of non-renewable energy and no nuclear power plant, therefore the issue of the energy dependence of the state, which affects the level of energy security of the country, is an extremely important factor. It depends on both the volume of imports of energy raw materials and the policy of diversification of sources of...
-
Income obtained according to particular rates only by taxpayers conducting non-agricultural business activity in 2016
Open Research DataA special form of income taxation addressed to the SME sector is the Lump sum on registered income, which is a simplified form of income tax payment for natural persons conducting business activity.The choice of this form of taxation is optional. In 2016, the tax in this form could be paid by taxpayers who in 2015 obtained income from non-agricultural...
-
3D point cloud as a representation of silo / tank
Open Research DataThe product presents a point cloud in the set of coordinates X Y Z. The data was obtained by terrestrial laser scanning and its processing for the analysis of tanks geometry. The development process indicates the possibility to obtain the reliable results useful for the evaluation of the tank side surfaces geometry.
-
Input files for the Floodsar software
Open Research DataInput files for the Floodsar softwareAuthor: Tomasz Berezowski, Gdansk University of Technology, tomberez@eti.pg.edu.pl
-
Correction of far-field measurements obtained in non-anechoic test site
Open Research DataThe dataset contains raw and processed measurements of radiation pattern characteristics performed in non-anechoic regime for two geometrically small antenna structures: a spline-parameterized Vivaldi structure and a compact spline-based monopole. The responses have been obtained at the selected frequencies of interest as a function of mentioned structures...
-
Transmission measurements between two geometrically small Vivaldi antennas performed in non-anechoic propagation conditions
Open Research DataThe dataset contains unprocessed measurements of complex transmission (and reflection) characteristics obtained in non-anechoic regime for a geometrically small, broadband spline-parameterized Vivaldi structure. The measurement setup comprises two Vivaldi antennas with the same topology where one is used as a reference structure, and another one as...
-
Structural investigations of the LTO:Cu thin films
Open Research DataLithium titanate (Li1+xTi2-xO4) doped with Cu2+ ions was synthesized by sol-gel processing method. The structure was characterized by X-ray Diffraction (XRD). All samples revealed presence of LTO spinel phase. X-ray pattern of undoped LTO was free of any impurities and other crystal phases. Similarly, samples with low amount of copper dopant (x = 0.05...
-
The number of active enterprises in Poland in 1997-2014
Open Research DataAfter Poland joined the group of countries associated in the European Union and through participation in numerous economic and political organizations (including the World Trade Organization, OECD), as well as the commencement of trade exchange with virtually all countries of the world, Polish entrepreneurs and managers struggle with completely problems...
-
Share of gross value added generated by enterprises in GDP
Open Research DataAfter Poland joined the group of countries associated in the European Union and through participation in numerous economic and political organizations (including the World Trade Organization, OECD), as well as the commencement of trade exchange with virtually all countries of the world, Polish entrepreneurs and managers struggle with completely problems...
-
Voltage fluctuations on the main switchgear of the industrial power system supplying the rolling mill motors
Open Research DataThe dataset presents the voltage waveforms on the bus bars of the main switchgear of the industrial power network for the supply of rolling mills. The data was recorded during an experiment whose purpose was to determine a level of short-term and long-term flicker caused by voltage fluctuations. In the virtual application of flickermeter, a hardware...
-
TEM and EDX study of the Al2O3 ultra thin films
Open Research DataThe ultra-thin layers of Al2O3 were deposited on a silicon substrates. The method of atomic layer deposition (Beneq TFS 200 ALD system) was chosen as the proper method of dielectric layer deposition. This method provides precise thickness control down to a single atomic layer. The precursors used were trimethylaluminum (Sigma-Aldrich) and purified water....
-
Radiation pattern measurements of geometrically small antennas performed in non-anechoic environments
Open Research DataThe dataset contains unprocessed measurements of radiation pattern characteristics performed in non-anechoic regime for three geometrically small antenna structures: a spline-parameterized Vivaldi structure, a compact spline-based monopole, and a quasi-Yagi geometry with enhanced bandwidth. The responses have been obtained over broad frequency ranges...
-
Measurements of electrically small antenna radiation patterns in non-anechoic environments using TGM
Open Research DataThe dataset contains raw and processed measurements of radiation pattern characteristics performed in non-anechoic regime for four antenna structures: a spline-parameterized Vivaldi structure, a compact spline-based monopole, super-ultrawideband antenna, and a quasi-Yagi component. The responses have been obtained at the selected frequencies of interest...
-
MODALITY corpus - SPEAKER 10 - SEQUENCE S1
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 10 - SEQUENCE S5
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 10 - SEQUENCE S3
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 10 - SEQUENCE S2
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...
-
MODALITY corpus - SPEAKER 10 - SEQUENCE S4
Open Research DataThe MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system,...