Skip to content

Releases: mljar/mljar-supervised

v0.1.3

23 Apr 14:24
Compare
Choose a tag to compare
v0.1.3 Pre-release
Pre-release
  • set metric to be optimized (#17)
  • create table with model details (#8)
  • progress bar for training (#9)
  • add reproducibility tests (#5)
  • callback to control number of iterations (#11)
  • fixed: set path for catboost snapshot (#16)
  • learning curves (#14)

Predict labels

13 Apr 12:54
Compare
Choose a tag to compare
Predict labels Pre-release
Pre-release

The autoML predicts categorical labels as addition to probabilities. There is an optimal threshold computed for the best model which maximize F1 score.

The predicted data frame right now looks like this:

p_0, p_1, label
0.1, 0.9, 1
0.1, 0.9, 1
0.9, 0.1, 0
...

The p_0 is probability for class 0. The p_1 is probability for class 1. The 'label' column is the prediction label decided based on threshold.

In case in target columns there are other values than 0 and 1, then they will be internally converted to 0, 1 but in predicted data frame they will appear in columns. For example if there are A and B values in a target column, then the predicted data frame will look like:

p_A, p_B, label
0.1, 0.9, B
0.1, 0.9, B
0.9, 0.1, A

The first release

09 Apr 13:52
Compare
Choose a tag to compare
The first release Pre-release
Pre-release

The AutoML solution that can solve binary classification tasks with respect to LogLoss metric. There are used following algorithms:

  • Random Forest
  • CatBoost
  • LightGBM
  • Xgboost
  • Neural Networks