Releases: VIDA-NYU/openclean-core

Release 0.4.1

16 Jun 08:25
06c9b42

This release includes the following changes:

  • Remove environment variable OPENCLEAN_WORKERS.

Release 0.4.0

27 Apr 10:45

This release contains the following changes:

  • Use compact serialization for HISTORE archives.
  • Load and sample datasets from a data stream in openclean.engine.base.OpencleanEngine.
  • Support stream operators on dataset snapshots in openclean.engine.base.OpencleanEngine.
  • Add summary for data frame conflict groups.

Release 0.3.2

06 Apr 20:58
da6c501

This release contains the following changes:

  • Make checking out a committed dataset in the openclean.data.archive.base.ArchiveStore optional.
  • Enable cache refresh for cached datasets in openclean.data.archive.cache.CachedDatastore.

Release 0.3.1

30 Mar 18:04
584abd7

This release includes:

  • Add optional version parameter when requesting metadata for a dataset version in openclean.engine.dataset.DatasetHandle.

Release 0.3.0

29 Mar 11:51
b09c93a

This release contains the following changes:

  • Add openclean.function.token.base.Token as separate class.
  • Rename openclean.function.token.base.StringTokenizer to Tokenizer.
  • Adjust token transformer and tokenizer for new Token class.
  • Change structure of datatype count in column profiler.
  • Add option to get the set of conflicting values from DataFrameGrouping groups.
  • Multi-threading for ValueFunction.apply().
  • Separate DBSCAN outlier class.
  • Move US street name functions to openclean-geo.

Release 0.2.0

18 Mar 21:40
78f6292

This release includes:

  • Standardize parameter names for sample methods (#115).
  • Bug fix for openclean-notebook.

Release 0.1.1

08 Mar 18:15
be9ccaa

This release introduces the following changes:

  • Bump flowserv-core dependency to version 0.8.0 for better support of running applications that are not implemented in Python.
  • Introduce environment variable OPENCLEAN_WORKERS to configure workers for flowserv-core.
  • Replace environment variable OPENCLEAN_METADATA_DIR with OPENCLEAN_DATADIR.
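
Scripts written against an earlier setup may still set OPENCLEAN_METADATA_DIR. The following is a minimal sketch of one way to read the data directory during a transition, preferring the new variable name and falling back to the old one. The helper resolve_data_dir is hypothetical and not part of openclean; only the two variable names come from the notes above, and how openclean itself reads them is not shown here.

```python
import os
from typing import Mapping, Optional

# Variable names as given in the release notes.
OLD_VAR = "OPENCLEAN_METADATA_DIR"  # replaced in release 0.1.1
NEW_VAR = "OPENCLEAN_DATADIR"       # replacement name


def resolve_data_dir(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the configured data directory, or None if neither is set.

    Hypothetical helper (an assumption for illustration, not an
    openclean API). The new variable wins when both are set.
    """
    new_value = env.get(NEW_VAR)
    if new_value:
        return new_value
    # Fall back to the old name so that older setups keep working.
    return env.get(OLD_VAR)


if __name__ == "__main__":
    print(resolve_data_dir())
```

Passing the environment as a mapping keeps the helper easy to exercise with a plain dict instead of modifying the process environment.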

v0.1.0

24 Feb 03:37
5243a42
Update README.rst