SNOW-902662: DF.to_pandas_batches() batch size parameter #1027

Open
ghost opened this issue Aug 29, 2023 · 2 comments

ghost commented Aug 29, 2023

Current behaviour

DataFrame.to_pandas_batches() returns an iterator of pandas DataFrames, but the number of rows in each DataFrame is unpredictable and appears arbitrary from the caller's side.

Desired behaviour

I would like to_pandas_batches() to accept a parameter that fixes the number of rows in each pandas DataFrame it generates.

How would this improve snowflake-snowpark-python?

This would let users control the size of the chunks they process and ensure that their processes are not overloaded by whatever number of rows the Snowflake back end decides to return per batch.
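Until such a parameter exists, the desired behaviour can be approximated on the client side. The following is only a minimal sketch under stated assumptions, not part of snowflake-snowpark-python: `rebatch` is a hypothetical helper name, it relies only on pandas, and it accepts any iterable of pandas DataFrames. The `uneven` list stands in for the unevenly sized batches described above.

```python
from typing import Iterable, Iterator

import pandas as pd


def rebatch(frames: Iterable[pd.DataFrame], batch_size: int) -> Iterator[pd.DataFrame]:
    """Hypothetical helper: yield DataFrames of exactly batch_size rows.

    Only the final DataFrame may contain fewer rows.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be a positive integer")
    buffer = []    # slices waiting to be emitted as one batch
    buffered = 0   # total number of rows currently held in buffer
    for frame in frames:
        start = 0
        while start < len(frame):
            # Take only as many rows as are needed to fill the current batch.
            take = min(batch_size - buffered, len(frame) - start)
            buffer.append(frame.iloc[start:start + take])
            buffered += take
            start += take
            if buffered == batch_size:
                yield pd.concat(buffer, ignore_index=True)
                buffer, buffered = [], 0
    if buffer:
        # Emit the remaining rows as a final, smaller batch.
        yield pd.concat(buffer, ignore_index=True)


# Stand-in for an iterator that yields unevenly sized batches (7, 2 and 13 rows).
uneven = [
    pd.DataFrame({"x": range(0, 7)}),
    pd.DataFrame({"x": range(7, 9)}),
    pd.DataFrame({"x": range(9, 22)}),
]
batches = list(rebatch(uneven, batch_size=5))
sizes = [len(batch) for batch in batches]
```

With a Snowpark DataFrame `df`, the same helper could presumably be applied as `rebatch(df.to_pandas_batches(), batch_size=10_000)`. Note that this only reshapes the batches after they have been fetched; it does not change how many rows the back end sends per chunk, so peak memory is roughly one incoming chunk plus one outgoing batch.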

@ghost ghost added the feature New feature or request label Aug 29, 2023
@github-actions github-actions bot changed the title DF.to_pandas_batches() batch size parameter SNOW-902662: DF.to_pandas_batches() batch size parameter Aug 29, 2023
@RahulDubey391

Hi @MarcoFreitas0, I would like to take a look at this issue!

@stong1108

I am also interested in this feature.

@sfc-gh-ashahi sfc-gh-ashahi self-assigned this Mar 20, 2024
@sfc-gh-ashahi sfc-gh-ashahi added the status-triage_done Initial triage done, will be further handled by the driver team label May 1, 2024