Hyperparameter tuning #67
I suppose with some work we could create a wrapper that essentially "mutafies" the original model (overloading getproperty/setproperty! to forward to the wrapped struct).
I wonder if hyperparameter tuning could leverage Setfield.jl or its successor Accessors.jl instead of enforcing mutability of the structs? My understanding is that mutable structs in Julia are only justified in hot loops; in other words, they feel like an anti-pattern in the language unless you have an "array-like" struct that you need to update in place.

My main concern with enforcing mutability is that it may compromise parallelization later on. I am not an expert in compiler technology, but I know that immutable structs help the compiler do more aggressive optimizations, and sending copies of these tiny transforms to different threads and processes should be more performant than dealing with shared chunks of memory on the heap. My intuition points to the immutable design as the more conservative and optimal choice. @eliascarv do you have any comments to add?
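For illustration, a minimal sketch of the Accessors.jl approach, assuming `Scale` stores `low`/`high` as plain fields:

```julia
using Accessors
using TableTransforms

# immutable transforms stay immutable; @set returns an updated copy
transform  = Scale(low=0, high=1)
transform2 = @set transform.high = 2

# the original is untouched, so copies can safely go to other threads
@assert transform.high == 1 && transform2.high == 2
```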
I have an idea. We can make a wrapper type to make Transforms mutable (the setproperty! body is a sketch using ConstructionBase):

```julia
using ConstructionBase: setproperties

mutable struct Mut{T<:Transform}
  transform::T
end

# getfield avoids recursing into the overloaded getproperty
Base.getproperty(transf::Mut, prop::Symbol) =
  getproperty(getfield(transf, :transform), prop)

# rebuild the immutable transform with the updated field (sketch)
function Base.setproperty!(transf::Mut, prop::Symbol, val)
  t = getfield(transf, :transform)
  setfield!(transf, :transform, setproperties(t, NamedTuple{(prop,)}((val,))))
end

apply(transf::Mut, table) = apply(getfield(transf, :transform), table)

# usage
statictransf = Scale(low=0, high=1)
mutabletransf = Mut(statictransf)
```
Yes, it is always possible to wrap the structs as @ablaom mentioned. Let's postpone this decision to a future major release when hyperparameter tuning can be considered.
We already do (in-house) recursive …
I mean that there is nothing intrinsic to hyperparameter tuning that requires types to be mutable. We can certainly provide an API that extracts the parameters of a pipeline into a "flat vector", performs an optimization step, and then instantiates a new pipeline with the improved parameters. The amount of computing we save by forcing the structs to be mutable is not very clear, especially considering that we are talking about a couple of dozen parameters in general.
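To make the idea concrete, a minimal sketch of this immutable tuning loop over a single transform (`flatparams`, `withparams`, and `tunestep` are hypothetical helpers, not an existing API):

```julia
using ConstructionBase: setproperties

# extract tunable parameters as a flat vector (hypothetical helper)
flatparams(t) = collect(getfield(t, n) for n in fieldnames(typeof(t)))

# instantiate a new immutable transform with updated parameters
function withparams(t, θ)
  names = fieldnames(typeof(t))
  setproperties(t, NamedTuple{names}(Tuple(θ)))
end

# one optimization step: pick the candidate parameters with lowest loss
# and return a fresh transform; no mutation involved
function tunestep(t, loss, candidates)
  θbest = argmin(θ -> loss(withparams(t, θ)), candidates)
  withparams(t, θbest)
end
```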
This proposal for MLJTuning would allow MLJ to handle immutable pipelines.
@juliohm has there been any work on hyperparameter tuning with TableTransforms.jl? I'd like to add asinh (pseudolog) transformations to the package, but that would require the ability to tune 1 or 2 hyperparameters.
We are starting to brainstorm an API for this feature. It would be nice to have an additional hand :)
Sounds good! Should we just stick to the MLJ model interface for now?
What interface exactly? Can you share an MWE? Also, take a look at StatsLearnModels.jl, where we already have a `Learn` transform wrapping learning models. The goal is to be able to tune entire transform pipelines, which may or may not include learned models.
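For context, transform pipelines in TableTransforms.jl compose with the `→` operator, so tuning would target the parameters of every step, along these lines:

```julia
using TableTransforms

# a pipeline whose steps carry tunable parameters (low, high, ...)
pipeline = Center() → Scale(low=0.25, high=0.75) → ZScore()

# `table` is any Tables.jl-compatible table; tuning would search
# over the parameters of every step in the pipeline
newtable, cache = apply(pipeline, table)
```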
This interface is already supported via package extensions in StatsLearnModels.jl. That means you can use any MLJ.jl model there directly, and we wrap the interface into a more general one based on tables.
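If I recall the README correctly, usage looks roughly like the following; treat the exact `Learn` signature and column names as assumptions and check the package docs:

```julia
using StatsLearnModels
import MLJDecisionTreeInterface  # one possible MLJ model package

# wrap any MLJ.jl model in the table-based Learn transform
# (signature assumed: Learn(train, model, input_columns => output_column))
model = Learn(train, MLJDecisionTreeInterface.DecisionTreeClassifier(),
              [:x1, :x2] => :y)

# predictions on a test table via the usual transform application
pred = test |> model
```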
Ahh, sounds perfect!
Does the converse hold as well? That is, if I define a StatsLearnModels.jl model, will it work with MLJ.jl?
Probably not, as our interface is a bit more general. For example, we don't assume that predictions are univariate, but as far as I remember MLJ.fit assumes X is a matrix and y is a vector (a single column).
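For reference, the MLJ-side contract being discussed is roughly this (per MLJModelInterface; `MyModel` is a placeholder):

```julia
import MLJModelInterface as MMI

struct MyModel <: MMI.Deterministic end

# the MLJ contract: a single target vector `y`,
# returning the triple (fitresult, cache, report)
function MMI.fit(model::MyModel, verbosity, X, y)
  fitresult = nothing  # whatever predict needs
  cache     = nothing
  report    = (;)
  return fitresult, cache, report
end

MMI.predict(model::MyModel, fitresult, Xnew) = fill(0.0, MMI.nrows(Xnew))
```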
Hmm, if that's the way it goes, I think it should be possible to have SLM.jl models work with MLJModelInterface, so you could define an SLM model and have it work with MLJ (although not all SLM.jl features, e.g. multivariate prediction, would carry over). Alternatively, MLJModelInterface could be generalized to handle the cases in SLM.jl. @ablaom any thoughts?
What features of the MLJ stack are you missing @ParadaCarleton? Perhaps that is the right question to ask?
Right now, model tuning. In the future, I just want to avoid duplicating work across the MLJ and JuliaML ecosystems.
That is one of the next steps in our roadmap. Stay tuned.
These are different ecosystems with diverging design decisions. I would pick the one that fits your needs and would devote energy to it. Keep in mind that we assessed MLJ and other alternatives before attempting to start a parallel initiative. There are some fundamental differences that make a huge difference in practice.
How/why? MLJ doesn't support multivariate targets yet, but that's a feature they've wanted for quite a while now.
OK, but is there no way to use MLJTuning on a small transformation from TableTransforms.jl?
To name a few diverging views: …
I honestly don't know, because we don't use MLJ.jl in our industrial projects anymore. I appreciate your attempt to improve collaboration, but that is very unlikely given the different application requirements. We are happy to collaborate on specific modules that may benefit both stacks, but that is it.
These are implemented in MLJModels.jl, where these transforms are considered a special class of unsupervised models. I would agree that the name … How about we provide a separate type for transforms and host them in a separate package (TableTransforms.jl), but have them reexported by MLJModels.jl?
I think MLJModels would be ecstatic to support this interface! It's a very requested feature.
Looking a bit into MLJ integration. For better or worse, hyper-parameter optimization (e.g., grid search) in MLJ generally works by mutating the field values of the model struct. I wonder if TableTransforms.jl would consider changing their transformer types to mutable structs? I think in ML applications, at least, any loss in performance would be pretty minimal, but perhaps there are wider use-cases to consider?
The alternative for our use case is for the MLJ model wrapper to be mutable (for now a wrapper is necessary anyway) and for a user wanting to do a search to do something like the sketch below.
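Something along these lines, perhaps (a hedged sketch; `TransformWrapper` is a hypothetical name for the mutable MLJ wrapper, and the nominal range over whole transform values is my reading of the suggestion):

```julia
using MLJ
using TableTransforms

# hypothetical mutable wrapper turning an immutable transform into an MLJ model
mutable struct TransformWrapper <: MLJ.Unsupervised
  transform::Scale
end

wrapper = TransformWrapper(Scale(low=0, high=1))

# a nominal range listing whole candidate transforms, since the inner
# struct itself cannot be mutated field by field
r = range(wrapper, :transform,
          values=[Scale(low=0, high=1), Scale(low=0, high=2)])

tuned = TunedModel(model=wrapper, range=r, tuning=Grid())
```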
However, while this might be fine for Grid search, it doesn't really work for other optimization strategies.
Thoughts?