Add ability to load and inspect individual datasets #261

Open
npatki opened this issue Nov 13, 2023 · 1 comment
Labels
feature request Request for a new feature

npatki commented Nov 13, 2023

Problem Description

The SDGym library currently lets you list the datasets available for benchmarking, but it offers no way to inspect them. Users may want to see what the columns, data types, or values look like before including a dataset in a benchmarking run.

Expected behavior

Add a download_demo method that is similar to the one in the SDV library. This method would return the data and metadata so that SDGym users can inspect the dataset.

Workaround

The SDV library is a dependency of SDGym, so as a workaround you can load the demo datasets through SDV directly.

from sdv.datasets.demo import download_demo

# Download the 'adult' single-table demo dataset along with its metadata
data, metadata = download_demo(
    modality='single_table',
    dataset_name='adult'
)
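
Once a dataset is loaded this way, a quick per-column overview covers the columns, data types, and values mentioned above. The sketch below is only an illustration: `summarize_columns` is a hypothetical helper, not part of SDV or SDGym, and the small DataFrame is a made-up stand-in for the `data` returned by `download_demo`, so the snippet runs without a download.

```python
import pandas as pd


def summarize_columns(data: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: one row per column with dtype, counts, and an example value."""
    rows = []
    for name in data.columns:
        column = data[name]
        non_null = column.dropna()
        rows.append({
            'column': name,
            'dtype': str(column.dtype),
            'missing': int(column.isna().sum()),
            'unique': int(column.nunique()),  # missing values are not counted
            'example': non_null.iloc[0] if len(non_null) else None,
        })
    return pd.DataFrame(rows).set_index('column')


# Stand-in for the DataFrame returned by download_demo (values are made up)
data = pd.DataFrame({
    'age': [39, 50, 38],
    'workclass': ['State-gov', None, 'Private'],
})

summary = summarize_columns(data)
print(summary)
```

With the workaround, passing the downloaded `data` to the helper in place of the stand-in should give the same kind of overview for the real dataset.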

npatki added the feature request label Nov 13, 2023
npatki commented Nov 13, 2023

For a related discussion, see #253
