Remus: Polish-Kashubian parallel translation corpus - Open Research Data - Bridge of Knowledge

Remus: Polish-Kashubian parallel translation corpus

Description

The dataset contains 10,825 sentences from Aleksander Majkowski's Kashubian novel "Life and Adventures of Remus" (Żëcé i przigòdë Remùsa), together with their parallel Polish translations. The novel is considered the most important work of Kashubian literature, which makes it a valuable source of high-quality translation data.

Dataset file

Remus: Polish-Kashubian parallel translation corpus.zip
648.2 kB, S3 ETag 0f148fb45c79dd1dd9bc75fe3c1f646a-1
The file hash is calculated with the formula
hexmd5(md5(part1)+md5(part2)+...)-{parts_count}, where each part of the file is 512 MB in size and the final part may be shorter.

Example script for calculation:
https://github.com/antespi/s3md5
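
The hash formula above can be sketched in Python as follows. This is a minimal illustration, not the linked script: the function name `s3_multipart_etag` is hypothetical, 512 MB is assumed to mean 512 × 1024 × 1024 bytes, and the per-part MD5 values are assumed to be concatenated as raw binary digests before the outer MD5 is taken.

```python
import hashlib
import io

# Assumption: "512 MB" on this page means 512 * 1024 * 1024 bytes.
PART_SIZE = 512 * 1024 * 1024


def s3_multipart_etag(stream, part_size=PART_SIZE, block_size=1024 * 1024):
    """Return hexmd5(md5(part1)+md5(part2)+...)-{parts_count} for a binary stream.

    Hypothetical helper: each part is hashed in small blocks so that a
    whole 512 MB part never has to be held in memory.
    """
    part_digests = []
    while True:
        part = hashlib.md5()
        remaining = part_size
        while remaining > 0:
            block = stream.read(min(block_size, remaining))
            if not block:
                break  # end of stream
            part.update(block)
            remaining -= len(block)
        if remaining == part_size:
            break  # no bytes were read for this part
        part_digests.append(part.digest())  # raw 16-byte digest (assumption)
        if remaining > 0:
            break  # a short part is the last part
    combined = hashlib.md5(b"".join(part_digests)).hexdigest()
    return f"{combined}-{len(part_digests)}"


if __name__ == "__main__":
    # Tiny in-memory demonstration with an artificially small part size.
    print(s3_multipart_etag(io.BytesIO(b"abcdefghij"), part_size=4))
```

For a file smaller than one part, the result ends in `-1`, which matches the form of the ETag listed for this dataset file. To check a downloaded copy, open it in binary mode, pass the file object to the function, and compare the result with the listed ETag.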

File details

License:
Restricted access
Please contact the authors.

Details

Year of publication:
2025
Verification date:
2025-02-03
Dataset language:
Polish
Fields of science:
  • information and communication technology (Engineering and Technology)
DOI:
10.34808/ws91-gj05
Verified by:
Gdańsk University of Technology
