Skip to content

Commit

Permalink
Update _posts/2024-07-16-17.0.0-release.md
Browse files Browse the repository at this point in the history
  • Loading branch information
amoeba authored Jul 18, 2024
1 parent 1a2adf9 commit ed2a2a5
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions _posts/2024-07-16-17.0.0-release.md
Original file line number Diff line number Diff line change
Expand Up @@ -191,6 +191,7 @@ Thanks for your contributions and participation in the project!
## R notes

* R functions that users write that use functions that Arrow supports in dataset queries now can be used in queries too. Previously, only functions that used arithmetic operators worked. For example, `time_hours <- function(mins) mins / 60` worked, but `time_hours_rounded <- function(mins) round(mins / 60)` did not; now both work. These are automatic translations rather than true user-defined functions (UDFs); for UDFs, see `register_scalar_function()`. [GH-41223](https://github.com/apache/arrow/issues/41223)
* `mutate()` expressions can now include aggregations, such as `x - mean(x)`. [GH-41350](https://github.com/apache/arrow/pull/41350)
* `summarize()` supports more complex expressions, and correctly handles cases where column names are reused in expressions. [GH-41323](https://github.com/apache/arrow/issues/41323)
* The `na_matches` argument to the `dplyr::*_join()` functions is now supported. This argument controls whether `NA` values are considered equal when joining. [GH-41223](https://github.com/apache/arrow/issues/41358)
* R metadata, stored in the Arrow schema to support round-tripping data between R and Arrow/Parquet, is now serialized and deserialized more strictly. This makes it safer to load data from files from unknown sources into R data.frames. [GH-41223](https://github.com/apache/arrow/issues/41969)
Expand Down

0 comments on commit ed2a2a5

Please sign in to comment.