Releases: VIDA-NYU/openclean-core
Releases · VIDA-NYU/openclean-core
Release 0.4.1
This release includes the following changes:
- Remove environment variable OPENCLEAN_WORKERS.
Release 0.4.0
This release contains the following changes:
- Use compact serialization for HISTORE archives.
- Load and sample datasets from a data stream in
openclean.engine.base.OpencleanEngine
. - Support stream operators on dataset snapshots in
openclean.engine.base.OpencleanEngine
. - Add summary for data frame conflict groups.
Release 0.3.2
This release contains the following changes:
- Make checking out a committed dataset in the
openclean.data.archive.base.ArchiveStore
optional. - Enable cache refresh for cached datasets in
openclean.data.archive.cache.CachedDatastore
.
Release 0.3.1
This release includes:
- Add optional version parameter when requesting metadata for a dataset version in
openclean.engine.dataset.DatasetHandle
.
Release 0.3.0
This release contains the following changes:
- Add
openclean.function.token.base.Token
as separate class. - Rename
openclean.function.token.base.StringTokenizer
toTokenizer
- Adjust token transformer and tokenizer for new Token class.
- Change structure of datatype count in column profiler.
- Option to get set of conflicting values from
DataFrameGrouping
groups. - Multi-threading for
ValueFunction.apply()
. - Separate DBSCAN outlier class.
- Move us-street name functions to
openclean-geo
.
Release 0.2.0
This release includes:
- Standardize parameter names for sample methods (#115)
- Bug fix for
openclean-notebook
Release 0.1.1
This release introduces the following changes:
- Bump
flowserv-core
dependency to version0.8.0
for better support of running applications that are not implemented in Python. - Introduce environment variable OPENCLEAN_WORKERS to configure workers for
flowserv-core
. - Replace environment variables OPENCLEAN_METADATA_DIR with OPENCLEAN_DATADIR.