- Restrict dask < 2024.11.0 (:pr:`341`)
- Use github action to install poetry (:pr:`340`)
- Update readthedocs version (:pr:`339`)
- Pin minimum version of katdal to 0.23 (:pr:`337`)
- Pin minimum version of python-casacore to 3.6.1 (:pr:`337`)
- Pin minimum version of NumPy to 2.0.0 (:pr:`337`)
- Deprecate Python 3.9 support (:pr:`338`)
- Update minio server version (:pr:`338`)
- Remove complicated push setup in github action workflow (:pr:`338`)
- Fix date typo in HISTORY.rst (:pr:`336`)
- Restrict NumPy to < 2.0.0 (:pr:`333`)
- Bump donfig dependency to 0.8.0 due to install error with pip (:pr:`332`)
- Fix katdal import typos (:pr:`331`)
- Add an epoch argument to xds_{from,to}_zarr to uniquely identify datasets in a distributed context (:pr:`330`)
- Improve table schema handling (:pr:`329`)
- Identify channel and correlation-like dimensions in non-standard MS columns (:pr:`329`)
- DaskMSStore depends on
fsspec >= 2022.7.0
(:pr:`328`) - Optimise broadcast_arrays in katdal import (:pr:`326`)
- Change dask-ms katdal import to dask-ms import katdal (:pr:`325`)
- Configure dependabot (:pr:`319`)
- Add chunk specification to
dask-ms katdal import
(:pr:`318`) - Add a
dask-ms katdal import
application for exporting SARAO archive data directly to zarr (:pr:`315`) - Define dask-ms command line applications with click (:pr:`317`)
- Make poetry dev and docs groups optional (:pr:`316`)
- Only test Github Action Push events on master (:pr:`313`)
- Move consolidated metadata into partition subdirectories (:pr:`312`)
- Set
_ARRAY_DIMENSIONS
attribute on Data Variables (:pr:`311`) - Use JSON codec for writing zarr strings (:pr:`310`)
- Address warnings (:pr:`309`)
- Update calver dependencies to use inequality, rather than caret dependency specifiers (:pr:`307`)
- Update pre-commit to actions/[email protected] (:pr:`308`)
- Update readthedocs python version to 3.9 and poetry to 1.7.1 (:pr:`303`)
- Re-enable exceptions in multiprocessing test case (:pr:`302`)
- Fix auto-formatted f-strings (:pr:`301`)
- Upgrade to pyarrow 14.0.1 (:pr:`299`)
- Use new minio.MinioAdmin client in test suite and update minio server version (:pr:`298`)
- Replace black with ruff in pre-commit hooks (:pr:`297`)
- Lazily load casacore tables module (:pr:`294`)
- Deprecate Python 3.8 support (:pr:`296`)
- Temporarily add Pandas as an arrow extra dependency (:pr:`296`)
- Ignore non-existent columns (:pr:`290`)
- Add experimental fragments functionality (:pr:`282`, :pr:`291`)
- Add rechunk by size utility funtion (:pr:`284`)
- Run CI weekly on Monday @ 2h30 am UTC (:pr:`288`)
- Update minio server and client versions (:pr:`287`)
- Retain ROWID coordinates during MS conversion (:pr:`286`)
- Change setmaxcachesize to require a read lock, not a write lock (:pr:`281`)
- Update calver versioned software such as dask and xarray to 2023 variants (:pr:`279`)
- Remove unused requirements_dev.txt (:pr:`275`)
- Support optional CASA columns (:pr:`270`)
- Fix poetry install and cache hit detection on CI (:pr:`266`)
- Support --taql-where for Measurement Sets in dask-ms convert (:pr:`264`)
- xds_from_zarr should always open zarr groups in read mode (:pr:`262`)
- Fail on reads if non-existent or invalid store type found (:pr:`259`, :pr:`260`)
- Fix for nan chunks/dims breaking writes (:pr:`255`)
- Fix minio deb download URL (:pr:`257`)
- Apply black to the code base (:pr:`252`)
- Manage dask-ms with python poetry (:pr:`250`)
- Don't force conversion to tuple during attribute decoding (:pr:`248`)
- Fail if zarr chunks exceed 2GB (:pr:`247`)
- Add a
--exclude
argument todask-ms convert
that allows columns to be excluded during conversion (:pr:`246`). - Make
--output
a requireddask-ms convert
argument (:pr:`245`).
- Consolidate metadata by default during
xds_to_zarr
and always attempt to read consolidated metadata inxds_from_zarr
(:pr:`243`). - Fail if ROWID dimensions don't match associated dataset arrays (:pr:`239`)
- Pin flake8 to >= 4.0.0 and < 5.0.0 (:pr:`240`)
- Improve chunking in xds_to_zarr when rechunk==True. (:pr:`236`)
- Assign custom fsspec storage options from url match in yaml configuration files. (:pr:`237`)
- Minor DaskMSStore changes to improve URL handling. (:pr:`233`)
- Deprecate Python 3.7 support and test on Python 3.10 (:pr:`232`)
- Support loading of storage_options in dask-ms convert (:pr:`231`)
- Reintroduce group columns on output CASA format only (:pr:`230`)
- Stop converting results returned from DaskMSStore into Path objects (:pr:`229`)
- Accept storage_options in dataset read/write methods (:pr:`228`)
- Add
__repr__
and__str__
forDaskMSStore
(:pr:`224`)
- Recreate Grouping Columns when writing to CASA format in
dask-ms convert
(:pr:`222`) - Ignore SOURCE subtable in dask-ms convert (:pr:`221`)
- Fix performance bug in cached_array implementation. (:pr:`220`)
- Use DaskMSStore throughout dask-ms convert (:pr:`218`)
- Upgrade from deprecated
visit_Num
tovisit_Constant
(:pr:`217`) - Ensure url and table consistency in DaskMSStore (:pr:`216`)
- Wait for minio to start with greater precision (:pr:`215`)
- Chunk correctly when reading from parquet. (:pr:`210`)
- Fix minor bugs in zarr and conversion functionality. (:pr:`208`)
- Add xds_to_storage_table. (:pr:`207`)
- Add option to rechunk automatically on writes. (:pr:`204`)
- Raise more informative error. (:pr:`203`)
- Improve tiling. (:pr:`202`)
- Do not create spurious fields in zarr writes. (:pr:`200`)
- Error out when missing datavars should be written. (:pr:`197`, :pr:`198`)
- Allow non-standard columns to be tiled. (:pr:`196`)
- Fix roundtripping of boolean tensor arrays. (:pr:`194`)
- Ignore unsupported kwargs in
xds_from_storage_*
functions. (:pr:`192`) - Improve handling of subtables with variably sized rows in daskms-convert. (:pr:`191`)
- Ensure that
xds_from_zarr
sorts groups as integers and not strings (:pr:`188`) - Ensure Natural Ordering for parquet files (:pr:`183`)
- Fix
xds_from_zarr
andxds_from_parquet
chunking behaviour (:pr:`182`) - Add LazyProxy and LazyProxyMultiton patterns to dask-ms (:pr:`177`)
- Support cloud native storage formats via fsspec (:pr:`174`)
- Fix inclusion of MANIFEST.in files (:pr:`173`)
- Add --group-columns to
dask-ms convert
for CASA Formats (:pr:`169`) - Add ComplexArray -> numpy conversion (:pr:`168`)
- Ignore row dimension when fixing column shapes (:pr:`165`)
- Bump pip from 9.0.1 to 19.2 (:pr:`164`)
- Fix zarr coordinate writes (:pr:`162`)
- Deprecate Python 3.6 (:pr:`161`)
- Add IMAGING_WEIGHT_SPECTRUM to default Measurement Schema (:pr:`160`)
- Remove default time ordering from xds_from_ms (:pr:`156`)
- Make zarr writes completely lazy (:pr:`157`)
- Copy partitioning information when writing (:pr:`155`)
- Add a
dask-ms convert
script for converting between CASA, Zarr and Parquet formats (:pr:`145`) - Convert code-base to f-strings with flynt (:pr:`144`)
- Consolidate Dataset Types into daskms.dataset (:pr:`143`)
- Correct Dataset persistence issues (:pr:`140`)
- Experimental arrow support (:pr:`130`, :pr:`132`, :pr:`133`, :pr:`135`, :pr:`136`, :pr:`138`, :pr:`145`)
- Experimental zarr support (:pr:`129`, :pr:`133`, :pr:`139`, :pr:`142`, :pr:`150`, :pr:`145`)
- Test data fix (:pr:`128`)
- Fix array inlining for writes (:pr:`126`)
- Allow Multi-Layer Inlining (:pr:`125`)
- Support DATA Column Expressions (:pr:`124`, :pr:`134`, :pr:`146`, :pr:`147`, :pr:`148`, :pr:`151`)
- Remove table close in ThreadPool for the last time (:pr:`122`)
- Respect the High Level Graph specification better during inline array creation (:pr:`123`)
- Support dictionary writes via putvarcol (:pr:`119`)
- Use getcell instead of getcellslice in sorted orderings (:pr:`120`)
- Update to pytest-flake8 1.0.6 (:pr:`117`)
- Test on Python 3.8 (:pr:`116`)
- Depend on python-casacore 3.3.1 (:pr:`116`)
- Remove deadlock in TableProxy weakref.finalize on Python 3.6 (:pr:`113`)
- Use python-casacore wheels for travis testing, instead of kernsuite packages (:pr:`115`)
- Documentation updates (:pr:`110`)
- Provide better warnings for unusual ROWID graphs during table updates (:pr:`108`)
- Work around casacore getcolslice caching (:pr:`107`)
- Update LICENSE year (:pr:`105`)
- Update license and production status in pypi classifiers (:pr:`104`)
- Use WHERE rather than HAVING clause in group ordering TAQL (:pr:`98`)
- Improve the dask task key names for clearer graph visualization (:pr:`102`)
- Cache and inline row runs in write operations (:pr:`96`)
- Support getcolslice and putcolslice in TableProxy (:pr:`91`)
- Use weakref.finalize to cleanup TableProxy and Executor objects (:pr:`89`)
- Pickle Executor key argument (:pr:`88`)
- Deprecate Python 3.5 support and test on Python 3.7 (:pr:`87`)
- Optionally expose TableProxy objects in dataset read/write methods (:pr:`85`)
- Upgrade to python-casacore 3.2 (:pr:`84`)
- Re-introduce xarray handling in dataset.as_variable (:pr:`83`)
- Explicitly require dask Arrays on write datasets (:pr:`83`)
- Document python-casacore install process (:pr:`80`, :pr:`81`)
- Remove __future__ import (:pr:`79`)
- Update examples (:pr:`78`)
- Only log aggressively when the log level is DEBUG (:pr:`76`)
- Optimise dask graphs produced by dask-ms such that each data access node no longer has common ancestors but is instead an independent root node. This improves memory usage in case of the predict. (:pr:`75`)
- Read-lock TAQL row reference table by default (:pr:`74`)
- Produce write datasets rather a single concatenated dask array (:pr:`70`, :pr:`72`)
- Fix spacing in TAQL WHERE queries (:pr:`68`)
- Constrain table object access to Executor. Simplify table locking (:pr:`66`).
- Fix stress test (:pr:`65`)
- Remove keywords from variable attributes (:pr:`64`)
- Fix and test non-standard sub-table creation (:pr:`60`)
- Improve sub-table creation logic (:pr:`59`, :pr:`60`)
- Support table and column keywords (:pr:`58`, :pr:`62`)
- Support concurrent access of multiple independent tables (:pr:`57`)
- Fix WEIGHT_SPECTRUM schema dimensions (:pr:`56`)
- Pin python-casacore to 3.0.0 (:pr:`54`)
- Drop python 2 support (:pr:`51`)
- Simplify Table Schemas (:pr:`50`)
- Add Concepts + Tutorial Documentation (:pr:`48`)
- Supporting reading and updating column keywords (:pr:`48`)
- Add OBSERVATION, FEED, POINTING, SOURCE table schemas (:pr:`48`)
- Remove single row squeezing in the group_cols="__row__" case (:pr:`48`)
- Handle multi-dimensional string arrays (:pr:`48`)
- Add preliminary example_ms (:pr:`48`)
- Add Concepts + Tutorial Documentation (:pr:`48`)
- Make xarray an optional dependency (:pr:`45`)
- Rename xarray-ms to dask-ms (:pr:`43`)
- Allow chunking by arbitrary dimensions (:pr:`41`)
- Add a simple Dataset, making xarray an optional dependency. (:pr:`41`, :pr:`46`, :pr:`47`, :pr:`52`)
- Add support for writing new tables from Datasets (:pr:`41`, :pr:`53`)
- Add support for appending to tables from Datasets (:pr:`41`)