transformer to apply mean normalization #763

solegalli · 2024-05-15T12:01:48Z

In mean normalization, we subtract the mean from each value and then divide by the value range (maximum minus minimum). This centres the variables at 0 and scales their values to lie between -1 and 1. It is an alternative to standardization.
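As a minimal sketch of that definition, using only the standard library: the function name `mean_normalize` and the sample values are made up for illustration and are not part of any library.

```python
from statistics import mean


def mean_normalize(values):
    """Subtract the mean from each value, then divide by the range (max - min)."""
    mu = mean(values)
    value_range = max(values) - min(values)
    if value_range == 0:
        # A constant variable has no range, so the ratio is undefined.
        raise ValueError("cannot mean-normalize a constant variable")
    return [(v - mu) / value_range for v in values]


# Illustrative data: mean is 25, range is 30.
scaled = mean_normalize([10.0, 20.0, 30.0, 40.0])
print(scaled)
```

The scaled values sum to zero (they are centred at 0), and the gap between the largest and smallest scaled value is exactly 1, which is why every value ends up within -1 and 1.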

sklearn has no transformer to apply mean normalization, but we can combine the StandardScaler and the RobustScaler to do so. The catch is that both transformers need to be fit on the raw data, so we can't use them within a pipeline, because the pipeline transforms the data with the first transformer before the next one learns its parameters.
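A rough sketch of the combination described above, assuming `StandardScaler(with_std=False)` is used to remove the mean and `RobustScaler(with_centering=False, quantile_range=(0, 100))` to divide by the max minus min range. The toy array is invented for illustration, and both scalers are fit on the raw data, as the comment describes.

```python
import numpy as np
from sklearn.preprocessing import RobustScaler, StandardScaler

# Illustrative data: two variables with different ranges.
X = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 60.0]])

# With with_std=False, StandardScaler only learns and subtracts the mean.
mean_scaler = StandardScaler(with_mean=True, with_std=False)

# With quantile_range=(0, 100) the "interquantile" range is max - min;
# with_centering=False stops RobustScaler from subtracting the median.
range_scaler = RobustScaler(
    with_centering=False, with_scaling=True, quantile_range=(0, 100)
)

# Fit both scalers on the raw data, without transforming in between.
mean_scaler.fit(X)
range_scaler.fit(X)

# Apply the learned parameters: subtract the mean, then divide by the range.
X_scaled = range_scaler.transform(mean_scaler.transform(X))
print(X_scaled)
```

Here `mean_scaler.mean_` holds the per-column means and `range_scaler.scale_` the per-column ranges, so the two fitted objects together carry everything needed to transform new data.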

My idea is to wrap both transformers within a class, so that both are fit on the raw data without transforming it in between, and then we transform the data with the learned parameters. See, for example: https://github.com/solegalli/Python-Feature-Engineering-Cookbook-Second-Edition/blob/main/ch07-scaling/Recipe-4-mean-normalization.ipynb
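One way the wrapper idea could look, as a sketch only: the class name `WrappedMeanNormalizer` and the attributes `mean_scaler_` and `range_scaler_` are hypothetical, and this is not the code from the linked notebook.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.preprocessing import RobustScaler, StandardScaler


class WrappedMeanNormalizer(BaseEstimator, TransformerMixin):
    """Hypothetical wrapper: fits both scalers on the same raw data."""

    def fit(self, X, y=None):
        # Both scalers see the raw X; nothing is transformed during fit.
        self.mean_scaler_ = StandardScaler(with_std=False).fit(X)
        self.range_scaler_ = RobustScaler(
            with_centering=False, quantile_range=(0, 100)
        ).fit(X)
        return self

    def transform(self, X):
        # Subtract the learned mean, then divide by the learned range.
        return self.range_scaler_.transform(self.mean_scaler_.transform(X))


# Illustrative data: train mean is 5 and train range is 10.
X_train = np.array([[0.0], [5.0], [10.0]])
X_test = np.array([[20.0]])

normalizer = WrappedMeanNormalizer().fit(X_train)
X_train_scaled = normalizer.transform(X_train)
X_test_scaled = normalizer.transform(X_test)
print(X_train_scaled.ravel(), X_test_scaled.ravel())
```

Because the wrapper exposes a single `fit` and `transform`, it can itself be used as one step of a pipeline. Values outside the training range, like the test value here, can fall outside -1 and 1.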

VascoSch92 · 2024-05-15T21:24:36Z

Instead of wrapping two transformers within a class, would it also be an option to create a brand new transformer that performs mean normalisation?

I understand that this could create some duplicated code, but on the other hand, the new mean normalisation transformer will be easy to understand and to debug.
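A standalone transformer along these lines might look like the sketch below. The class name `MeanNormalizationSketch` and its attributes are assumptions made for illustration; this is not the transformer that was eventually added to the library.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin


class MeanNormalizationSketch(BaseEstimator, TransformerMixin):
    """Hypothetical standalone transformer: (x - mean) / (max - min)."""

    def fit(self, X, y=None):
        X = np.asarray(X, dtype=float)
        # Learn the per-column mean and range directly from the raw data.
        self.mean_ = X.mean(axis=0)
        self.range_ = X.max(axis=0) - X.min(axis=0)
        if np.any(self.range_ == 0):
            raise ValueError("constant columns cannot be mean-normalized")
        return self

    def transform(self, X):
        X = np.asarray(X, dtype=float)
        return (X - self.mean_) / self.range_

    def inverse_transform(self, X):
        # Undo the scaling: multiply by the range, then add the mean back.
        X = np.asarray(X, dtype=float)
        return X * self.range_ + self.mean_


# Illustrative data, not taken from the thread.
X_demo = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 60.0]])
scaler = MeanNormalizationSketch().fit(X_demo)
X_demo_scaled = scaler.transform(X_demo)
X_demo_restored = scaler.inverse_transform(X_demo_scaled)
```

Keeping the two learned parameters as plain arrays makes the arithmetic visible in one place, which is the readability and debugging benefit mentioned above.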

I'm just thinking :-)

solegalli · 2024-05-16T08:58:59Z

Yes, it's perhaps a better idea.

solegalli · 2024-08-24T16:03:03Z

By any chance, would you like to give this one a go, creating a new class and a new module? We don't have a module for scaling yet. @VascoSch92

VascoSch92 · 2024-08-25T11:23:17Z

Yes I can give it a try :-)

Just a little summary (please correct me if I'm wrong):

the idea is to create a new module, scaling, which contains transformers to scale columns
the first of these transformers will be the mean_normalization transformer, right?

On a side note: there is still this PR on the distance transformer which is awaiting review :-) I corrected it and it should be better now. Of course, only if you have time :-D

solegalli · 2024-08-25T16:38:09Z

Yes, that's correct. Thanks for the reminder. I'll take a look this week :)

solegalli · 2024-11-02T18:41:56Z

Closing as completed

VascoSch92 mentioned this issue Aug 29, 2024

Mean Normalisation Scaling #806

Merged

solegalli closed this as completed Nov 2, 2024
