Skip to content

Releases: elastic/eland

7.10.1b1

12 Jan 18:39
985afe7
Compare
Choose a tag to compare

Added

  • Added support for Pandas 1.2.0 (#336)

  • Added DataFrame.mode() and Series.mode() aggregation (#323, contributed by @V1NAY8)

  • Added support for pd.set_option("display.max_rows", None)
    (#308, contributed by @V1NAY8)

  • Added Elasticsearch storage usage to df.info() (#321, contributed by @V1NAY8)

Removed

  • Removed deprecated aliases read_es, read_csv, DataFrame.info_es,
    and MLModel(overwrite=True) (#331, contributed by @V1NAY8)

7.10.0b1

29 Oct 18:48
31760fe
Compare
Choose a tag to compare

Added

  • Added DataFrame.groupby() method with all aggregations (#278, #291, #292, #300 contributed by @V1NAY8)

  • Added es_match() method to DataFrame and Series for filtering rows with full-text search (#301)

  • Added support for type hints of the elasticsearch-py package (#295)

  • Added support for passing dictionaries to es_type_overrides parameter in the pandas_to_eland() function to directly control the field mapping generated in Elasticsearch (#310)

  • Added es_dtypes property to DataFrame and Series (#285)

Changed

  • Changed pandas_to_eland() to use the parallel_bulk() helper instead of single-threaded bulk() helper to improve performance (#279, contributed by @V1NAY8)

  • Changed the es_type_overrides parameter in pandas_to_eland() to raise ValueError if an unknown column is given (#302)

  • Changed DataFrame.filter() to preserve the order of items (#283, contributed by @V1NAY8)

  • Changed when setting es_type_overrides={"column": "text"} in pandas_to_eland() will automatically add the column.keyword sub-field so that aggregations are available for the field as well (#310)

Fixed

  • Fixed Series.__repr__ when the series is empty (#306)

7.9.1a1

30 Sep 16:19
Compare
Choose a tag to compare

Added

  • Added the predict() method and model_type, feature_names, and results_field properties to MLModel (#266)

Deprecated

  • Deprecated ImportedMLModel in favor of MLModel.import_model(...) (#266)

Changed

  • Changed DataFrame aggregations to use numeric_only=None instead of numeric_only=True by default. This is the same behavior as Pandas (#270, contributed by @V1NAY8)

Fixed

  • Fixed DataFrame.agg() when given a string instead of a list of aggregations will now properly return a Series instead of a DataFrame (#263, contributed by @V1NAY8)

7.9.0a1

18 Aug 17:03
7180c96
Compare
Choose a tag to compare

7.9.0a1 (2020-08-18)

Added

  • Added support for Pandas v1.1 (#253)
  • Added support for LightGBM LGBMRegressor and LGBMClassifier to ImportedMLModel (#247, #252)
  • Added support for multi:softmax and multi:softprob XGBoost operators to ImportedMLModel (#246)
  • Added column names to DataFrame.__dir__() for better auto-completion support (#223, contributed by @leonardbinet)
  • Added support for es_if_exists='append' to pandas_to_eland() (#217)
  • Added support for aggregating datetimes with nunique and mean (#253)
  • Added es_compress_model_definition parameter to ImportedMLModel constructor (#220)
  • Added .size and .ndim properties to DataFrame and Series (#231 and #233)
  • Added .dtype property to Series (#258)
  • Added support for using pandas.Series with Series.isin() (#231)
  • Added type hints to many APIs in DataFrame and Series (#231)

Deprecated

  • Deprecated the overwrite parameter in favor of es_if_exists in ImportedMLModel constructor (#249, contributed by @V1NAY8)

Changed

  • Changed aggregations for datetimes to be higher precision when available (#253)

Fixed

  • Fixed ImportedMLModel.predict() to fail when errors are present in the ingest.simulate response (#220)
  • Fixed Series.median() aggregation to return a scalar instead of pandas.Series (#253)
  • Fixed Series.describe() to return a pandas.Series instead of pandas.DataFrame (#258)
  • Fixed DataFrame.mean() and Series.mean() dtype (#258)
  • Fixed DataFrame.agg() aggregations when using extended_stats Elasticsearch aggregation (#253)

7.7.0a1

12 Aug 20:44
Compare
Choose a tag to compare

7.7.0a1 (2020-05-20)

Added

  • Added the package to Conda Forge, install via
    conda install -c conda-forge eland (#209)
  • Added DataFrame.sample() and Series.sample() for querying
    a random sample of data from the index (#196, contributed by @mesejo)
  • Added Series.isna() and Series.notna() for filtering out
    missing, NaN or null values from a column (#210, contributed by @mesejo)
  • Added DataFrame.filter() and Series.filter() for reducing an axis
    using a sequence of items or a pattern (#212)
  • Added DataFrame.to_pandas() and Series.to_pandas() for converting
    an Eland dataframe or series into a Pandas dataframe or series inline (#208)
  • Added support for XGBoost v1.0.0 (#200)

Deprecated

  • Deprecated info_es() in favor of es_info() (#208)
  • Deprecated eland.read_csv() in favor of eland.csv_to_eland() (#208)
  • Deprecated eland.read_es() in favor of eland.DataFrame() (#208)

Changed

  • Changed var and std aggregations to use sample instead of
    population in line with Pandas (#185)
  • Changed painless scripts to use source rather than inline to improve
    script caching performance (#191, contributed by @mesejo)
  • Changed minimum elasticsearch Python library version to v7.7.0 (#207)
  • Changed name of Index.field_name to Index.es_field_name (#208)

Fixed

  • Fixed DeprecationWarning raised from pandas.Series when an
    an empty series was created without specifying dtype (#188, contributed by @mesejo)
  • Fixed a bug when filtering columns on complex combinations of and and or (#204)
  • Fixed an issue where DataFrame.shape would return a larger value than
    in the index if a sized operation like .head(X) was applied to the data
    frame (#205, contributed by @mesejo)
  • Fixed issue where both scikit-learn and xgboost libraries were
    required to use eland.ml.ImportedMLModel, now only one library is
    required to use this feature (#206)

7.6.0a5

12 Aug 20:41
e71420c
Compare
Choose a tag to compare

7.6.0a5 (2020-04-14)

Added

  • Added support for Pandas v1.0.0 (#141, contributed by @mesejo)
  • Added use_pandas_index_for_es_ids parameter to pandas_to_eland() (#154)
  • Added es_type_overrides parameter to pandas_to_eland() (#181)
  • Added NDFrame.var(), .std() and .median() aggregations (#175, #176, contributed by @mesejo)
  • Added DataFrame.es_query() to allow modifying ES queries directly (#156)
  • Added eland.__version__ (#153, contributed by @mesejo)

Changed

  • Changed ML model serialization to be slightly smaller (#159)
  • Changed minimum elasticsearch Python library version to v7.6.0 (#181)

Fixed

  • Fixed inference_config being required on ML models for ES >=7.8 (#174)
  • Fixed unpacking for DataFrame.aggregate("median") (#161)

Removed

  • Removed support for Python 3.5 (#150)
  • Removed eland.Client() interface, use
    elasticsearch.Elasticsearch() client instead (#166)
  • Removed all private objects from top-level eland namespace (#170)
  • Removed geo_points from pandas_to_eland() in favor of es_type_overrides (#181)

"""

7.6.0a4

12 Aug 20:36
2e74a56
Compare
Choose a tag to compare

7.6.0a4 (2020-03-23)

Fixed

  • Fixed issue in DataFrame.info() when called on an empty frame (#135)
  • Fixed issues where many _source fields would generate a too_long_frame error (#135, #137)

Changed

  • Changed requirement for xgboost from >=0.90 to ==0.90