Release v1.12.1 #1243
Conversation
* Added fix for integer column with None values
* Rename test name
* Added changelog

Co-authored-by: Frederik Steiner <[email protected]>
* use split_blocks=True by default
* changelog updates
* more details in changelog
* fix grammar
* fix changelog
* revert to minimal changelog
CHANGELOG.md (Outdated)

### Bug Fixes

- Fixed a bug in `DataFrame.to_pandas` that caused an error when evaluating on a dataframe with an `IntegerType` column with null values.
dataframe -> DataFrame
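For context on why `IntegerType` columns with null values are tricky during the Arrow-to-pandas conversion, here is a minimal, hypothetical sketch (not the code changed in this PR): NumPy integers cannot represent missing values, so a plain conversion upcasts to `float64`, while pandas' nullable `Int64` dtype preserves integer semantics.

```python
# Hypothetical illustration only -- not the actual Snowpark fix. It shows why
# an Arrow integer column containing nulls needs care when converted to pandas.
import pyarrow as pa
import pandas as pd

table = pa.table({"id": pa.array([1, None, 3], type=pa.int64())})

# Default conversion: the column becomes float64 (1.0, NaN, 3.0) because
# NumPy's int64 has no missing-value representation.
print(table.to_pandas()["id"].dtype)  # float64

# Mapping Arrow int64 to pandas' nullable Int64 keeps the values as integers.
df = table.to_pandas(types_mapper={pa.int64(): pd.Int64Dtype()}.get)
print(df["id"].dtype)  # Int64
```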
CHANGELOG.md (Outdated)

### New Features

- Use `split_blocks=True` by default during `to_pandas` conversion for optimal memory allocation. This parameter is passed to `pyarrow.Table.to_pandas` that enables `PyArrow` to split the memory allocation into smaller, more manageable blocks instead of allocating a single contiguous block thus giving better memory management when dealing with larger datasets.
I suggest changing this:
- Use `split_blocks=True` by default during `to_pandas` conversion for optimal memory allocation. This parameter is passed to `pyarrow.Table.to_pandas` that enables `PyArrow` to split the memory allocation into smaller, more manageable blocks instead of allocating a single contiguous block thus giving better memory management when dealing with larger datasets.
to this:
- Use `split_blocks=True` by default during `to_pandas` conversion, for optimal memory allocation. This parameter is passed to `pyarrow.Table.to_pandas`, which enables `PyArrow` to split the memory allocation into smaller, more manageable blocks instead of allocating a single contiguous block. This results in better memory management when dealing with larger datasets.
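As a hedged illustration of what this changelog entry describes (a sketch against `pyarrow` directly, not the Snowpark code path): `split_blocks=True` asks `pyarrow.Table.to_pandas` to produce one internal block per column instead of consolidating same-typed columns into a single contiguous allocation.

```python
# Minimal sketch of the split_blocks behavior, assuming pyarrow and pandas
# are installed; this is not the Snowpark implementation.
import pyarrow as pa

table = pa.table({
    "a": [1, 2, 3],
    "b": [4, 5, 6],
    "c": [7.0, 8.0, 9.0],
})

# Default: pandas may consolidate same-dtype columns into one contiguous block.
df_default = table.to_pandas()

# split_blocks=True keeps one block per column, so no single large contiguous
# allocation is needed (helpful for wide/large results, especially together
# with self_destruct=True).
df_split = table.to_pandas(split_blocks=True)

# _mgr is a pandas internal; used here only to make the difference visible.
print(df_default._mgr.nblocks, df_split._mgr.nblocks)
```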
I made 2 small suggestions. Thank you.
@@ -1,5 +1,15 @@
# Release History

## 1.12.1 (2024-02-08)

### New Features
We don't expose new APIs / features to users so "Improvements" might be better than "New Features".
Manually ran merge gates here: https://github.com/snowflakedb/snowpark-python/actions/runs/7822307495
Please answer these questions before submitting your pull request. Thanks!
What GitHub issue is this PR addressing? Make sure that there is an accompanying issue to your PR.
Fixes #NNNN
Fill out the following pre-review checklist:
Please write a short description of how your code change solves the related issue.