Description
The dataset contains structures of DNA G-quadruplexes (G4s), obtained through steered molecular dynamics simulations, for 128 oligonucleotides capable of forming 3- and 2-tetrad G4s. The general sequence of the oligonucleotides is G_x T_i G_x T_j G_x T_k G_x, where each subscript gives the number of repeated nucleotides: x = 3 or 2 for 3- and 2-tetrad G4s, respectively, while i, j, and k vary independently from 1 to 4 and set the lengths of the three loop regions. Each sequence was folded into 26 G4 topologies, and for each topology, all possible tetrad polarity patterns were considered (8 or 4 such patterns for 3- and 2-tetrad G4s, respectively). Lists of all obtained structures, specifying whether each structure is properly folded, can be found in the files ‘List_of_three-tetrad_structures.csv’ and ‘List_of_two-tetrad_structures.csv’ for 3- and 2-tetrad G4s, respectively. Additionally, pictures showing all obtained structures and movies visualizing the folding simulation for example G4s are attached.
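As a quick check of the sequence count, the oligonucleotide set described above can be enumerated directly. This is a minimal sketch, assuming the sequences are exactly four G-tracts of length x separated by three T-loops of lengths i, j, and k; the function name `g4_sequences` is illustrative and not part of the dataset.

```python
from itertools import product


def g4_sequences(tetrads):
    """Return all sequences G_x T_i G_x T_j G_x T_k G_x with loop lengths 1..4.

    `tetrads` is x, the length of each G-tract (3 or 2 in the dataset).
    """
    g_tract = "G" * tetrads
    sequences = []
    # i, j, k vary independently from 1 to 4, giving 4**3 = 64 combinations.
    for i, j, k in product(range(1, 5), repeat=3):
        sequences.append(
            g_tract + "T" * i + g_tract + "T" * j + g_tract + "T" * k + g_tract
        )
    return sequences


three_tetrad = g4_sequences(3)
two_tetrad = g4_sequences(2)
print(len(three_tetrad) + len(two_tetrad))  # 128
```

The 64 loop combinations for each of the two G-tract lengths account for the 128 oligonucleotides stated in the description.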
Dataset file
The file checksum is calculated as:
hexmd5(md5(part1)+md5(part2)+...)-{parts_count}
where a single part of the file is 512 MB in size.
Example script for calculation:
https://github.com/antespi/s3md5
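The checksum formula can also be sketched in Python. The example below is an illustration and not the linked script: it assumes that "512 MB" means 512 × 1024 × 1024 bytes, that the part digests are concatenated as raw bytes before the final MD5, and that the last part may be shorter than the others. The function names are hypothetical.

```python
import hashlib
import io

# Assumption: "512 MB" is interpreted as 512 MiB.
PART_SIZE = 512 * 1024 * 1024


def multipart_md5_stream(stream, part_size=PART_SIZE, chunk_size=1024 * 1024):
    """Compute hexmd5(md5(part1)+md5(part2)+...)-{parts_count} for a binary stream."""
    part_digests = []
    while True:
        part_hash = hashlib.md5()
        remaining = part_size
        # Read one part in small chunks to keep memory use low.
        while remaining > 0:
            chunk = stream.read(min(chunk_size, remaining))
            if not chunk:
                break
            part_hash.update(chunk)
            remaining -= len(chunk)
        if remaining == part_size:
            break  # nothing was read: end of stream
        part_digests.append(part_hash.digest())
        if remaining > 0:
            break  # short final part: end of stream
    combined = hashlib.md5(b"".join(part_digests)).hexdigest()
    return f"{combined}-{len(part_digests)}"


def multipart_md5_file(path, part_size=PART_SIZE):
    """Convenience wrapper for a file on disk."""
    with open(path, "rb") as handle:
        return multipart_md5_stream(handle, part_size)


# Tiny in-memory demonstration with a 4-byte part size (two parts).
print(multipart_md5_stream(io.BytesIO(b"abcdef"), part_size=4))
```

For a downloaded dataset file, `multipart_md5_file("path/to/file")` would be compared against the checksum shown on the dataset page; the path here is a placeholder.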
File details
- License:
- CC BY (Attribution)
- Software:
- VMD, PyMol
Details
- Year of publication:
- 2024
- Verification date:
- 2024-12-03
- Dataset language:
- English
- Fields of science:
- chemical sciences (Natural sciences)
- DOI:
- DOI ID 10.34808/fcyz-w866
- Funding:
- Verified by:
- Gdańsk University of Technology