Mean Normalisation Scaling #806

VascoSch92 · 2024-08-29T10:12:19Z

First version of the MeanNormalizationScaling as discussed in #763

I create a new module scaling as discussed.

Probably, you will also add new scaling in this module.

codecov · 2024-08-29T10:19:49Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.00%. Comparing base (5dfceb8) to head (b001017).
Report is 5 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #806      +/-   ##
==========================================
+ Coverage   97.98%   98.00%   +0.01%     
==========================================
  Files         107      109       +2     
  Lines        4320     4350      +30     
  Branches      857      709     -148     
==========================================
+ Hits         4233     4263      +30     
  Misses         54       54              
  Partials       33       33

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

feature_engine/scaling/mean_normalization.py

solegalli

Hey @VascoSch92

This is looking really good. Thank you for the first draft.

I think we could tidy it a bit so that we don't loop neither in fit nor in transform.

Could you take a look?

Thank you!

feature_engine/scaling/mean_normalization.py

tests/test_scaling/test_mean_normalization.py

VascoSch92 · 2024-08-30T13:50:09Z

Hey @solegalli

I addressed your comments.

Please let me know what you think :-)

feature_engine/scaling/mean_normalization.py

solegalli

Hi @VascoSch92

This is looking really good. The tests are great.

Regarding the logic, I think we can speed this up if we store the range instead of min max,, and if we use dictionaries instead of dataframes. Could you give that a go?

Feel free to start working on the dosctrings and on adding a user guide :)

VascoSch92 · 2024-08-31T06:44:03Z

Hey @solegalli

I changed what requested.
Now,

params_ is a dictionary Dict[str, pd.Series] with keys 'mean' and 'range'
MeanNormalizationScaling -> MeanNormalizationScaler
error message for constant columns is a little better :-)

feature_engine/scaling/mean_normalization.py

solegalli

Hey @VascoSch92 really good work here. Thank you so much!

We need to add a few files to create the docs now. Would you be able to do that as well?

Thanks a lot for the hard work.

solegalli · 2024-08-31T10:19:10Z

Last but not least: we need to add the new module on the readme and on the frontpage of the documentation, which lives here: https://github.com/feature-engine/feature_engine/blob/main/docs/index.rst

Thank you!!

VascoSch92 · 2024-09-03T07:09:46Z

Hey @solegalli

added the documentation for the new scaler
divided params_ into mean_ and range_. Now mean_ and range_ are pd.Series. We can also make them np.array if you want. Sklearn uses np.arrays, should we make the same?

docs/index.rst

feature_engine/scaling/mean_normalization.py

solegalli

Hey @VascoSch92 thanks for the quick turnaround. The api docs look great.

Co-authored-by: Soledad Galli <[email protected]>

VascoSch92 · 2024-09-04T08:41:07Z

Hey @solegalli

I changed to dictionaries instead of pd.Series, and it works :-)

solegalli · 2024-09-04T09:02:22Z

Amazing! Thanks a lot!

We just need to add a description / demo to the user guide folder in the docs and we are good to go then :)

VascoSch92 · 2024-09-04T09:08:29Z

let me give a look ;-)

VascoSch92 · 2024-09-04T09:16:15Z

@solegalli quick question: in general feature engine use scaling transformers from sklearn. Should we also include examples of these transformers?

VascoSch92 · 2024-09-05T10:32:11Z

Hey @solegalli

I updated the docs with a demo.

Let me know what do you think. It is just a first version :-)

VascoSch92 · 2024-09-17T09:02:53Z

Hey @solegalli :-) did you have time to look at the latest changes?

solegalli · 2024-10-06T10:46:29Z

Pending acceptance of suggested changes: VascoSch92#1

…ormalization rewords the documentation and adds missing links

first version of mean normalization

a98e016

augment coverage

97e55ab

solegalli reviewed Aug 30, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Show resolved Hide resolved

solegalli reviewed Aug 30, 2024

View reviewed changes

VascoSch92 added 2 commits August 30, 2024 15:49

changes after review

8b344bc

add new tests and fix after review

10c2c55

solegalli reviewed Aug 30, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 30, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 30, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 30, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 30, 2024

View reviewed changes

second update after discussion

52b5592

VascoSch92 requested a review from solegalli August 31, 2024 06:44

solegalli reviewed Aug 31, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 31, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 31, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 31, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 31, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Show resolved Hide resolved

solegalli reviewed Aug 31, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Aug 31, 2024

View reviewed changes

VascoSch92 added 3 commits September 3, 2024 09:06

add mean normalization to the docs

5a5ff73

improve docstrings

a10fec6

devide _params into _mean and _var

84caa2a

VascoSch92 added 2 commits September 4, 2024 09:48

deleted formula from docstring

1eb655f

add scaling into index

11c4057

solegalli reviewed Sep 4, 2024

View reviewed changes

docs/index.rst Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

feature_engine/scaling/mean_normalization.py Outdated Show resolved Hide resolved

solegalli reviewed Sep 4, 2024

View reviewed changes

VascoSch92 and others added 6 commits September 4, 2024 10:34

Update docs/index.rst

2ce24f4

Co-authored-by: Soledad Galli <[email protected]>

Update docs/index.rst

03c8805

Co-authored-by: Soledad Galli <[email protected]>

Update feature_engine/scaling/mean_normalization.py

03a9ba1

Co-authored-by: Soledad Galli <[email protected]>

Update feature_engine/scaling/mean_normalization.py

9350bbf

Co-authored-by: Soledad Galli <[email protected]>

Update feature_engine/scaling/mean_normalization.py

413d95a

Co-authored-by: Soledad Galli <[email protected]>

change to dictionaries

336d0a6

update docs with demo

df531d9

VascoSch92 added 3 commits September 5, 2024 13:39

fix

5124d9c

fix

138a793

fix

3b15034

minor rewording here and there

bcc13b1

Merge pull request #1 from feature-engine/transformer_to_apply_mean_n…

b001017

…ormalization rewords the documentation and adds missing links

solegalli merged commit ca28618 into feature-engine:main Oct 12, 2024
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mean Normalisation Scaling #806

Mean Normalisation Scaling #806

VascoSch92 commented Aug 29, 2024 •

edited

Loading

codecov bot commented Aug 29, 2024 •

edited

Loading

solegalli left a comment

VascoSch92 commented Aug 30, 2024

solegalli left a comment

VascoSch92 commented Aug 31, 2024

solegalli left a comment

solegalli commented Aug 31, 2024

VascoSch92 commented Sep 3, 2024

solegalli left a comment

VascoSch92 commented Sep 4, 2024

solegalli commented Sep 4, 2024

VascoSch92 commented Sep 4, 2024

VascoSch92 commented Sep 4, 2024

VascoSch92 commented Sep 5, 2024

VascoSch92 commented Sep 17, 2024

solegalli commented Oct 6, 2024

Mean Normalisation Scaling #806

Mean Normalisation Scaling #806

Conversation

VascoSch92 commented Aug 29, 2024 • edited Loading

codecov bot commented Aug 29, 2024 • edited Loading

Codecov Report

solegalli left a comment

Choose a reason for hiding this comment

VascoSch92 commented Aug 30, 2024

solegalli left a comment

Choose a reason for hiding this comment

VascoSch92 commented Aug 31, 2024

solegalli left a comment

Choose a reason for hiding this comment

solegalli commented Aug 31, 2024

VascoSch92 commented Sep 3, 2024

solegalli left a comment

Choose a reason for hiding this comment

VascoSch92 commented Sep 4, 2024

solegalli commented Sep 4, 2024

VascoSch92 commented Sep 4, 2024

VascoSch92 commented Sep 4, 2024

VascoSch92 commented Sep 5, 2024

VascoSch92 commented Sep 17, 2024

solegalli commented Oct 6, 2024

VascoSch92 commented Aug 29, 2024 •

edited

Loading

codecov bot commented Aug 29, 2024 •

edited

Loading