Releases: G-Research/ParquetSharp
ParquetSharp 15.0.0-beta2
What's Changed
- Document workarounds for writing TimeSpan values by @adamreeve in #422
- Support the TimeOnly and DateOnly types added in .NET 6 by @adamreeve in #424
- Add the key management tools API for encryption by @adamreeve in #426
Full Changelog: 15.0.0-beta1...15.0.0-beta2
ParquetSharp 15.0.0-beta1
This version of ParquetSharp upgrades the underlying Arrow C++ Parquet library to version 15.0.0.
What's Changed
- Upgrade C++ Arrow to 15.0.0 by @adamreeve in #417
- Add support for reading and writing the .NET Half type by @adamreeve in #418
- Expose page checksum verification reader and writer properties by @adamreeve in #419
Full Changelog: 14.0.2...15.0.0-beta1
ParquetSharp 14.0.2
This version of ParquetSharp is identical to 14.0.2-beta1, which has been promoted to stable.
This version of ParquetSharp upgrades the underlying Arrow C++ Parquet library to version 14.0.2.
New features
- Add Skip at the LogicalColumnReader level by @mfkl in #388
- Add support for custom conversion logic with the row-oriented API by @jescalada in #393
Bug fixes
- Fix segfault writing to no longer valid column writer by @adamreeve in #401
Miscellaneous
- Upgrade Arrow C++ to 14.0.1 by @adamreeve in #402
- Upgrade Arrow C++ to 14.0.2 by @adamreeve in #407
- Small bit of simplification by @philjdf in #397
- CI: enable CodeQL runs by @ljubon in #392
- Copy vcpkg.json before cloning vcpkg in devcontainer Dockerfile by @adamreeve in #408
- add caching to save time on CodeQL analysis by @ljubon in #410
- Lint: Update rules for formating code by @ljubon in #414
Full Changelog: 13.0.0...14.0.2
ParquetSharp 14.0.2-beta1
This version of ParquetSharp upgrades the underlying Arrow C++ Parquet library to version 14.0.2.
New features
- Add Skip at the LogicalColumnReader level by @mfkl in #388
- Add support for custom conversion logic with the row-oriented API by @jescalada in #393
Bug fixes
- Fix segfault writing to no longer valid column writer by @adamreeve in #401
Miscellaneous
- Upgrade Arrow C++ to 14.0.1 by @adamreeve in #402
- Upgrade Arrow C++ to 14.0.2 by @adamreeve in #407
- Small bit of simplification by @philjdf in #397
- CI: enable CodeQL runs by @ljubon in #392
- Copy vcpkg.json before cloning vcpkg in devcontainer Dockerfile by @adamreeve in #408
- add caching to save time on CodeQL analysis by @ljubon in #410
Full Changelog: 13.0.0...14.0.2-beta1
ParquetSharp 13.0.0
This version of ParquetSharp is identical to 13.0.0-beta1, which has been promoted to stable.
This version of ParquetSharp upgrades the underlying Arrow C++ Parquet library to version 13.0.0, and adds the ability to interoperate with the Arrow C# library, supporting reading Parquet data as Arrow record batches and writing Arrow data to Parquet. See the working with Arrow data documentation for further details.
Full list of changes
- fix ci warnings by @ljubon in #363
- Integrate self-hoster runners by @pavlovic-ivan in #364
- Finalize net7.0 migration by @jgiannuzzi in #365
- Fix "All required checks done" CI job to never be skipped by @jgiannuzzi in #366
- Create "All required checks succeeded" check run when "All required checks done" CI job is actually successful by @jgiannuzzi in #367
- Create MAINTAINERS.md by @demarillacizere in #359
- Upgrade to Arrow 13.0.0 by @adamreeve in #368
- Add prebuilt devcontainer by @jgiannuzzi in #369
- Split up LogicalBatchReader/Writer into separate files by @philjdf in #371
- Support reading and writing data with the Arrow format by @adamreeve in #354
- Upgrade the NUnit test adapter by @adamreeve in #372
- Document workarounds for using the dev container with podman and SELinux by @adamreeve in #373
- add ISSUE_TEMPLATE: Bug Report Form by @naskio in #370
- add ISSUE_TEMPLATE: feature request by @naskio in #374
- TestWriterDoubleDispose: remove outdated comment by @mfkl in #378
- README Enhancements by @jescalada in #379
- exceptions: explicitly catch and throw OutOfMemoryException by @mfkl in #380
- Run tests on Apple Silicon by @jgiannuzzi in #382
- Documentation Enhancements by @jescalada in #383
- Write vcpkg arrow port ABI info in CI by @adamreeve in #384
- Enhance PowerShell documentation by @jescalada in #386
- test: update Microsoft.NET.Test.Sdk by @mfkl in #389
New Contributors
- @demarillacizere made their first contribution in #359
- @naskio made their first contribution in #370
- @jescalada made their first contribution in #379
Full Changelog: 12.0.1...13.0.0
ParquetSharp 13.0.0-beta1
This version of ParquetSharp upgrades the underlying Arrow C++ Parquet library to version 13.0.0, and adds the ability to interoperate with the Arrow C# library, supporting reading Parquet data as Arrow record batches and writing Arrow data to Parquet. See the working with Arrow data documentation for further details.
Full list of changes
- fix ci warnings by @ljubon in #363
- Integrate self-hoster runners by @pavlovic-ivan in #364
- Finalize net7.0 migration by @jgiannuzzi in #365
- Fix "All required checks done" CI job to never be skipped by @jgiannuzzi in #366
- Create "All required checks succeeded" check run when "All required checks done" CI job is actually successful by @jgiannuzzi in #367
- Create MAINTAINERS.md by @demarillacizere in #359
- Upgrade to Arrow 13.0.0 by @adamreeve in #368
- Add prebuilt devcontainer by @jgiannuzzi in #369
- Split up LogicalBatchReader/Writer into separate files by @philjdf in #371
- Support reading and writing data with the Arrow format by @adamreeve in #354
- Upgrade the NUnit test adapter by @adamreeve in #372
- Document workarounds for using the dev container with podman and SELinux by @adamreeve in #373
- add ISSUE_TEMPLATE: Bug Report Form by @naskio in #370
- add ISSUE_TEMPLATE: feature request by @naskio in #374
- TestWriterDoubleDispose: remove outdated comment by @mfkl in #378
- README Enhancements by @jescalada in #379
- exceptions: explicitly catch and throw OutOfMemoryException by @mfkl in #380
- Run tests on Apple Silicon by @jgiannuzzi in #382
- Documentation Enhancements by @jescalada in #383
- Write vcpkg arrow port ABI info in CI by @adamreeve in #384
- Enhance PowerShell documentation by @jescalada in #386
- test: update Microsoft.NET.Test.Sdk by @mfkl in #389
New Contributors
- @demarillacizere made their first contribution in #359
- @naskio made their first contribution in #370
- @jescalada made their first contribution in #379
Full Changelog: 12.0.1...13.0.0-beta1
ParquetSharp 12.0.1
This version of ParquetSharp is identical to 12.0.1-beta1, which has been promoted to stable.
What's Changed
- Fix using internal F# types with the row oriented API by @adamreeve in #332
- Use ISO 8601 date format for Date.ToString by @adamreeve in #330
- Allow setting process-wide default writer properties by @adamreeve in #337
- Don't retry writing in Dispose after encountering an error in ParquetRowWriter by @adamreeve in #344
- Defer checking decimal precision and type length until creating converter by @adamreeve in #348
- Add WriteRowSpan method to ParquetRowWriter by @adamreeve in #349
- Upgrade to Arrow 12.0.1 by @adamreeve in #356
Misc
- Update benchmark code and results by @adamreeve in #341
- Update nested data documentation by @adamreeve in #342
- remove deprecated ubuntu-18.04 runners by @ljubon in #346
- Document lack of Int96 timestamp support by @adamreeve in #331
Full Changelog: 10.0.1...12.0.1
ParquetSharp 12.0.1 Beta1
What's Changed
- Fix using internal F# types with the row oriented API by @adamreeve in #332
- Use ISO 8601 date format for Date.ToString by @adamreeve in #330
- Allow setting process-wide default writer properties by @adamreeve in #337
- Don't retry writing in Dispose after encountering an error in ParquetRowWriter by @adamreeve in #344
- Defer checking decimal precision and type length until creating converter by @adamreeve in #348
- Add WriteRowSpan method to ParquetRowWriter by @adamreeve in #349
- Upgrade to Arrow 12.0.1 by @adamreeve in #356
Misc
- Update benchmark code and results by @adamreeve in #341
- Update nested data documentation by @adamreeve in #342
- remove deprecated ubuntu-18.04 runners by @ljubon in #346
- Document lack of Int96 timestamp support by @adamreeve in #331
Full Changelog: 10.0.1...12.0.1-beta1
ParquetSharp 10.0.1
This version of ParquetSharp is identical to 10.0.1-beta1, which has been promoted to stable.
This version upgrades Arrow to 10.0.1, which writes Parquet files with the version 2.4 format by default, and dictionary encoded data will use the RleDictionary
encoding rather than PlainDictionary
. Version 1 files can continue to be written by specifying the file format version in the WriterProperties
.
What's Changed
- Integrate Slack Github Action by @pavlovic-ivan in #308
- GitHub actions updates to fix deprecation warnings by @adamreeve in #310
- Support round-tripping nested values by @adamreeve in #301
- Avoid use of ManagedOutputStream and RandomAccessFile after GC by @adamreeve in #313
- Support reading decimal data with int32 or int64 physical type by @adamreeve in #315
- Convert paths to extend length format on Windows and add documentation on long path support by @adamreeve in #311
- Allow creating a ParquetFileReader/Writer directly from a .NET stream by @adamreeve in #320
- Support of optional nested-nested types reading by @oldukhno in #319
- Upgrade to Arrow 10.0.1 by @adamreeve in #321
- Change version to 10.0.1 to match Arrow by @adamreeve in #323
Incompatible API Changes
LogicalColumnStream.SchemaNodesPath
has been replaced byLogicalColumnStream.GetSchemaNodesPath
New Contributors
- @pavlovic-ivan made their first contribution in #308
- @oldukhno made their first contribution in #319
Full Changelog: 8.0.0...10.0.1
ParquetSharp 10.0.1 Beta1
This version upgrades Arrow to 10.0.1, which writes Parquet files with the version 2.4 format by default, and dictionary encoded data will use the RleDictionary
encoding rather than PlainDictionary
. Version 1 files can continue to be written by specifying the file format version in the WriterProperties
.
What's Changed
- Integrate Slack Github Action by @pavlovic-ivan in #308
- GitHub actions updates to fix deprecation warnings by @adamreeve in #310
- Support round-tripping nested values by @adamreeve in #301
- Avoid use of ManagedOutputStream and RandomAccessFile after GC by @adamreeve in #313
- Support reading decimal data with int32 or int64 physical type by @adamreeve in #315
- Convert paths to extend length format on Windows and add documentation on long path support by @adamreeve in #311
- Allow creating a ParquetFileReader/Writer directly from a .NET stream by @adamreeve in #320
- Support of optional nested-nested types reading by @oldukhno in #319
- Upgrade to Arrow 10.0.1 by @adamreeve in #321
- Change version to 10.0.1 to match Arrow by @adamreeve in #323
Incompatible API Changes
LogicalColumnStream.SchemaNodesPath
has been replaced byLogicalColumnStream.GetSchemaNodesPath
New Contributors
- @pavlovic-ivan made their first contribution in #308
- @oldukhno made their first contribution in #319
Full Changelog: 8.0.0...10.0.1-beta1