Load OQMD into PyG #186
-
Hello,
Thanks for your help. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
Hello Kshitij, Thanks for reaching out with your questions! Our lmdb dataset for OQMD has a total of 1,022,595 samples which was the amount available at the time of creating our dataset. To load the lmdb dataset with the PyG backend, you can use something like this: from matsciml.datasets import OQMDDataset
from matsciml.datasets.transforms import PointCloudToGraphTransform
oqmd_data = OQMDDataset("/path/to/oqmd/data/",transforms=[PointCloudToGraphTransform(backend="pyg")])
sample = oqmd_data.__getitem__(0)
sample_graph = sample['graph'] If you'd like to get started quickly without downloading the full dataset, you may load directly from the provided "devset" in our repo, which will load in 200 samples of data: oqmd_data = OQMDDataset.from_devset(transforms=[PointCloudToGraphTransform(backend="pyg")]) Have a look at our oqmd example for how to get started with training a model using this data. Let us know if there are any more questions. |
Beta Was this translation helpful? Give feedback.
It seems like you may have an old version of the dataset. Please redownload from the latest zenodo release(v2 from 3/4/2024) and try the same code snippet again.