Version 5.0.0 #796
jpivarski
started this conversation in
New releases
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Uproot version 5.0.0
Uproot version 5 has a few major new features, one removal (
uproot.lazy
), and is based on Awkward Array version 2 instead of version 1.uproot.lazy → uproot.dask
@kkothari2001 upgraded Uproot from Awkward version 1 to version 2, the major part of which was replacing
uproot.lazy
, which is based on Awkward 1's virtual and partitioned lazy arrays, with the new Dask collection, dask-awkward. The entry point for this function isuproot.dask
.@kkothari2001 also simplified Uproot's Pandas backend, which used to "explode" ragged arrays from ROOT into Pandas DataFrames with a non-trivial MultiIndex. Now, it takes advantage of awkward-pandas to put ragged (and more complex) Awkward Arrays directly into Pandas columns.
If you want the old behavior, you can read data using
library="ak"
to get an Awkward Array, and use ak.to_dataframe to "explode" the data into a MultiIndex.TTree-reading with AwkwardForth
@aryan26roy added a new code path to the TTree-reading routines to read them with AwkwardForth instead of pure Python. Users won't see any interface changes due to this code, but the performance of reading TBranches with
AsObject
orAsStrings
Interpretations should be orders of magnitude faster. For example,std::vector<std::vector<float>>
reading is now 400× faster.Reading RNTuples
@Moelf added a complete reader of RNTuple data with most of an RNTuple-writer in an unmerged pull request (#705). Although the RNTuple format is still in development, this is a very good start at reading RNTuple data, whose structure is a close match to Awkward Arrays (so the translation is more one-to-one than it is for TTrees, for instance).
New features
from_map
like optimization for dask arrays by @kkothari2001 in feat:from_map
like optimization for dask arrays #679toXXX
methods awkward#1919. by @jpivarski in feat: adjust for name change in scikit-hep/awkward#1919. #788Bug-fixes and performance
from_map
optimization for delayed numpy arrays and add tests with empty branches for the same by @kkothari2001 in fix: Usefrom_map
optimization for delayed numpy arrays and add tests with empty branches for the same #703Other
semantic-pr-title
by @agoose77 in ci: use concurrency group forsemantic-pr-title
#691"uproot"
parameter. by @jpivarski in chore: remove Identifier and"uproot"
parameter. #770uproot.dask
docs by @kkothari2001 in docs:uproot.dask
docs #702New Contributors
semantic-pr-title
#691Full Changelog: 4.2.4...v5.0.0
This discussion was created from the release Version 5.0.0.
Beta Was this translation helpful? Give feedback.
All reactions