Changelog

v0.17.0 (2023-03-27)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

  • parquet_read panics when working with date64s #1400
  • Round Trip [Rust -> arrow2_convert -> Arrow -> Parquet -> Arrow -> Rust] #1376
  • Parquet writes incorrect List<u32> #1368
  • Slicing nullable list arrays into multiple parquet pages doesn't work #1356
  • Reading parquet file with multiple row groups and nested nullable struct types panics #1249
  • Changed float::Inf to be encoded as null in json #1427 (SimonSchneider)
  • Fixed statistics writing flag and correct null_count in dictionaries #1414 (ritchie46)
  • Fixed ahash dependency for wasm #1407 (hzuo)
  • Fixed writing of sliced arrays to parquet #1397 (jorgecarleitao)
  • Fixed writing nested parquet #1390 (jorgecarleitao)

Enhancements:

Documentation updates:

v0.16.0 (2023-02-09)

Full Changelog

Breaking changes:

Fixed bugs:

  • Invalid parquet file written for nested structures (mixing lists with structs) #1325
  • Fix incorrect downcast in estimated_size_bytes #1351 (jleibs)
  • fix(parquet): nested struct/list writing #1347 (ritchie46)
  • Fixed csv infer_schema on empty fields #1342 (tripokey)

Enhancements:

Documentation updates:

Testing updates:

v0.15.0 (2022-12-18)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

  • Parquet writes all values of sliced arrays? #1323
  • Avro schema: Invalid record names #1269
  • Fixed writing nested/sliced arrays to parquet #1326 (ritchie46)
  • Fixed failing to accept dictionary full of nulls #1312 (ritchie46)
  • Added support for Extension types in ffi #1300 (jondo2010)
  • Fixed error in memory usage of sliced binary/list/utf8 arrays #1293 (ritchie46)
  • Fixed descending ordering when specifying nulls first #1286 (sandflee)
  • Added avro record names when converting arrow schema to avro #1279 (Samrose-Ahmed)

Enhancements:

Documentation updates:

v0.14.2 (2022-10-05)

Full Changelog

New features:

Fixed bugs:

  • Unnecessary println in library code #1263

Testing updates:

v0.14.1 (2022-09-27)

Full Changelog

Fixed bugs:

Enhancements:

Testing updates:

v0.14.0 (2022-09-12)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

v0.13.0 (2022-07-31)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

v0.12.0 (2022-06-05)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

  • Writing of ListArray does not preserve all values #1008
  • Write a two-dimensional list to parquet file failed #992
  • Writing to Parquet fails for extension types that contain lists #830
  • Fixed using lower limit than size of first parquet row group #1046 (arxra)
  • Fixed error in consuming sliced FixedSizedBinary from c data interface (FFI) #1026 (jorgecarleitao)
  • Fixed lexsort limit equal or greater than row_count #1021 (b41sh)
  • Fixed error in reading nested parquet structs #1015 (jorgecarleitao)
  • Fixed panic on debug print of invalid timezones #1013 (jorgecarleitao)
  • Treat empty timezone string as no-timezone #1009 (dbr)
  • Fixed encoding of NaN to json #990 (SimonSchneider)
  • Fixed error in writing ListArray to parquet #984 (jorgecarleitao)
  • Fixed decoding Binary Plain pages with dictionary pages #982 (aptr322)

Enhancements:

Documentation updates:

Testing updates:

v0.11.2 (2022-05-05)

Full Changelog

New features:

Fixed bugs:

Enhancements:

v0.11.1 (2022-04-27)

Full Changelog

Fixed bugs:

v0.11.0 (2022-04-27)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

  • Parquet regression: exceptions.ArrowErrorException: NotYetImplemented("Can't read Dictionary(UInt32, LargeUtf8, false) from parquet") #955
  • Reading Parquet binary column panics during deserialization 'attempt to subtract with overflow' #944
  • Reading Parquet file written by pyarrow with lz4 compression fails with OutOfSpec("Thrift out of range") #940
  • Issues when trying to create a parquet file with FixedSizedListArray #691
  • Fixed bug in writing csv with buffer resizing #965 (ritchie46)
  • Fixed bug in reading binary parquet #945 (jorgecarleitao)
  • Fixed error in writing FixedSizeListArray to parquet #941 (jorgecarleitao)
  • Fixed support to read dict nested binary parquet #924 (jorgecarleitao)

Enhancements:

Documentation updates:

Testing updates:

v0.10.1 (2022-03-16)

Full Changelog

New features:

Fixed bugs:

Enhancements:

Testing updates:

v0.10.0 (2022-03-12)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

v0.9.1 (2022-01-19)

Full Changelog

New features:

Fixed bugs:

Enhancements:

Testing updates:

v0.9.0 (2022-01-14)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

v0.8.1 (2021-11-27)

Full Changelog

Fixed bugs:

  • Fixed compilation with individual features activated #642 (ritchie46)

v0.8.0 (2021-11-27)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

v0.7.0 (2021-10-29)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

v0.6.2 (2021-10-09)

Full Changelog

New features:

  • Added wrapping version arithmetics for PrimitiveArray #496 (yjhmelody)
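For readers unfamiliar with the term, "wrapping" arithmetic wraps around on overflow instead of panicking or returning an error. The sketch below illustrates the idea with the standard library only; `wrapping_add_i8` and the use of `Option<i8>` slices as a stand-in for a nullable primitive array are assumptions made for illustration and are not the API added in #496.

```rust
/// Element-wise wrapping addition over two nullable columns.
/// Hypothetical helper: slices of `Option<i8>` stand in for a nullable
/// primitive array; this is not the crate's actual kernel.
pub fn wrapping_add_i8(lhs: &[Option<i8>], rhs: &[Option<i8>]) -> Vec<Option<i8>> {
    assert_eq!(lhs.len(), rhs.len(), "inputs must have equal length");
    lhs.iter()
        .zip(rhs)
        .map(|(a, b)| match (a, b) {
            // Both slots are valid: add, wrapping around on overflow.
            (Some(a), Some(b)) => Some(a.wrapping_add(*b)),
            // A null on either side yields a null result.
            _ => None,
        })
        .collect()
}
```

With this helper, adding `Some(1)` to `Some(i8::MAX)` produces `Some(i8::MIN)` rather than an overflow panic.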

Fixed bugs:

Enhancements:

v0.6.1 (2021-10-07)

Full Changelog

Breaking changes:

  • Bring MutableFixedSizeListArray to the spec used by the rest of the Mutable API #475
  • Removed ALIGNMENT invariant from [Mutable]Buffer #449
  • Un-nested compute::arithmetics::basic #461 (jorgecarleitao)
  • Added more serialization options for csv writer. #453 (ritchie46)
  • Changed validity from &Option<Bitmap> to Option<&Bitmap>. #431 (jorgecarleitao)
  • Bumped parquet2 #422 (jorgecarleitao)
  • Changed IPC FileWriter to own the writer. #420 (yjshen)
  • Made DynComparator Send+Sync #414 (yjshen)
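As an illustration of the validity change in #431, the sketch below contrasts an accessor returning `&Option<T>` with one returning `Option<&T>`. The types `Bitmap` and `MyArray` and the function `null_count` are hypothetical stand-ins written for this note, not the crate's own definitions; only `Option::as_ref` and `Option::map_or` are real standard-library calls.

```rust
/// Stand-in for a validity bitmap (hypothetical; not the crate's `Bitmap`).
#[derive(Debug, PartialEq)]
pub struct Bitmap(pub Vec<bool>);

/// Hypothetical array type that optionally carries a validity bitmap.
pub struct MyArray {
    validity: Option<Bitmap>,
}

impl MyArray {
    pub fn new(validity: Option<Bitmap>) -> Self {
        Self { validity }
    }

    /// Older style: callers receive a reference to the stored `Option`.
    pub fn validity_ref(&self) -> &Option<Bitmap> {
        &self.validity
    }

    /// Newer style: callers receive an `Option` holding a reference.
    pub fn validity(&self) -> Option<&Bitmap> {
        self.validity.as_ref()
    }
}

/// Counts null slots (unset bits); a missing bitmap means no nulls.
/// Taking `Option<&Bitmap>` lets callers pass `None` or `Some(&bitmap)`
/// without owning an `Option<Bitmap>`.
pub fn null_count(validity: Option<&Bitmap>) -> usize {
    validity.map_or(0, |b| b.0.iter().filter(|bit| !**bit).count())
}
```

Code that still holds a `&Option<Bitmap>` can bridge to the newer shape by calling `.as_ref()` on it.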

New features:

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

v0.5.3 (2021-09-14)

Full Changelog

New features:

  • Added support to read and write extension types to and from parquet #396 (jorgecarleitao)

Fixed bugs:

Enhancements:

  • Added support to read dict-encoded required primitive types from parquet #402 (Dandandan)
  • Added Array::with_validity #399 (ritchie46)

Testing updates:

v0.5.1 (2021-09-09)

Full Changelog

Documentation updates:

  • [0.5] The docs io module has no submodules #390
  • Made docs be compiled with feature full #391 (jorgecarleitao)

v0.5.0 (2021-09-07)

Full Changelog

Breaking changes:

  • Added Extension to DataType #361
  • MonthDayNano added to enum IntervalUnit #360
  • Make io::parquet::write::write_* return size of file in bytes #354
  • Renamed bitmap::utils::null_count to bitmap::utils::count_zeros #342
  • Made GroupFilter optional in parquet's RecordReader and added method to set it. #386 (jorgecarleitao)
  • Removed PartialOrd and Ord of all enums in datatypes #379 (jorgecarleitao)
  • Made cargo features not default #369 (jorgecarleitao)
  • Prepare APIs for extension types #357 (jorgecarleitao)

New features:

Fixed bugs:

  • Parquet read skips a few rows at the end of the page #373
  • parquet_read fails when a column has too many rows with string values #366
  • parquet_read panics with index_out_of_bounds #351
  • Fixed error in MutableBitmap::push_unchecked #384 (jorgecarleitao)
  • Fixed display of timestamp with tz. #375 (jorgecarleitao)

Enhancements:

Documentation updates:

Testing updates:

v0.4.0 (2021-08-24)

Full Changelog

Breaking changes:

  • Change dictionary iterator of values from Arrays of one element to Scalars #335
  • Align FFI API with arrow's C++ API #328
  • Make *_compare_scalar not return Result #316
  • Make io::print, get_value_display and get_display not return Result #286
  • Add MetadataVersion to IPC interfaces #282
  • Change DataType::Union to enable round trips in IPC #281
  • Removed clone requirement in StructArray -> RecordBatch #307 (jorgecarleitao)
  • Fixed error in reading a non-finished IPC stream. #302 (jorgecarleitao)
  • Generalized ZipIterator to accept a BitmapIter #296 (jorgecarleitao)

New features:

Fixed bugs:

Enhancements:

Documentation updates:

Testing updates:

Closed issues:

  • Make parquet_read_record support async #331
  • Panic due to SIMD comparison #312
  • Bitmap::mutable line 155 may Panic/segfault #309
  • IPC's StreamReader may abort due to excessive memory by overflowing a usized variable #301
  • Improve performance of rem_scalar/div_scalar for integer types (4x-10x) #259

v0.3.0 (2021-08-11)

Full Changelog

Breaking changes:

  • Renamed sum to sum_primitive #273
  • Moved trait Index from array::Index to types::Index #272
  • Added optional projection to IPC FileReader #271
  • Added optional page_filter to parquet's RecordReader and get_page_iterator #270
  • Renamed parquet's CompressionCodec to Compression #269

New features:

Fixed bugs:

Enhancements:

  • Improve the performance in cast kernel using AsPrimitive trait in generic dispatch #252
  • Poor performance in sort::sort_to_indices with limit option in arrow2 #245
  • Support loading Feather v2 (IPC) files with more than 1 million tables #231
  • Migrated to parquet2 v0.3 #265 (jorgecarleitao)
  • Added more tests to cast and min/max #253 (jorgecarleitao)
  • Prettytable is unmaintained. Change to comfy-table #251 (PsiACE)
  • Added IndexRange to remove checks in hot loops #247 (jorgecarleitao)
  • Make merge_sort_slices MergeSortSlices public #243 (sundy-li)

Documentation updates:

Closed issues:

  • Allow projection pushdown to IPC files #261
  • Add support to write dictionary-encoded pages #211
  • Make IpcWriteOptions easier to find. #120

v0.2.0 (2021-07-30)

Full Changelog

Breaking changes:

New features:

Fixed bugs:

Enhancements:

  • Avoid materialization of indices in filter_record_batch for single arrays #234
  • Add integration tests for writing to parquet #80
  • Short-circuited boolean evaluation in GrowableList #228 (ritchie46)
  • Add extra inlining to speed up take #226 (Dandandan)
  • Removed un-needed unsafe #225 (jorgecarleitao)

Documentation updates:

  • Add documentation to guide #96
  • Add git submodule command to correct the test doc #223 (sundy-li)
  • Added badges to README #216 (sundy-li)
  • Clarified differences with arrow crate #210 (alamb)
  • Clarified differences with arrow crate #209 (alamb)

* This Changelog was automatically generated by github_changelog_generator