
How to get the p_value of the whole model #77

Open
SHEN-Cheng opened this issue Nov 4, 2020 · 6 comments
@SHEN-Cheng commented Nov 4, 2020

Yeah, through `my_pwlf.p_values()` I can calculate the p-value for each beta parameter: first the beta parameters (intercept + slopes), then the breakpoints.
But how do I get the whole model's p-value?

@cjekel (Owner) commented Nov 24, 2020

I just created an example that adds a test for model significance and gets a p-value for the entire model. https://github.com/cjekel/piecewise_linear_fit_py/blob/master/examples/test_for_model_significance.py

As defined in Section 2.4.1 of Myers RH, Montgomery DC, Anderson-Cook CM. Response Surface Methodology. Hoboken, New Jersey: John Wiley & Sons, Inc.; 2009.

In the linear model case we set up a hypothesis test as:

H0: β1 = β2 = ⋯ = βk = 0
H1: βj ≠ 0 for at least one j

In the non-linear model case, we'll include the breakpoints as beta parameters, since the breakpoints are unknown model parameters.

You reject H0 when the p-value is less than some alpha.

Please leave this issue open, as the object should include this method!
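The test above can be sketched in plain numpy/scipy for an ordinary linear fit. This is a sketch only, not the linked pwlf example: for a piecewise fit, k would additionally count the breakpoints among the model parameters, and the data here are made up for illustration.

```python
import numpy as np
from scipy import stats

# Synthetic data: a clearly significant linear trend plus noise.
rng = np.random.default_rng(0)
n = 50
x = np.linspace(0.0, 10.0, n)
y = 2.0 * x + 1.0 + rng.normal(scale=1.0, size=n)

# Least-squares fit of y = b0 + b1*x.
A = np.column_stack([np.ones(n), x])
beta, *_ = np.linalg.lstsq(A, y, rcond=None)
y_hat = A @ beta

k = 1                                   # number of non-intercept parameters
ssr = np.sum((y_hat - y.mean()) ** 2)   # regression sum of squares
sse = np.sum((y - y_hat) ** 2)          # error (residual) sum of squares

# F-statistic and p-value for the test of model significance.
f0 = (ssr / k) / (sse / (n - k - 1))
p_value = stats.f.sf(f0, k, n - k - 1)  # upper tail of the F distribution
print(f"F0 = {f0:.2f}, p-value = {p_value:.3g}")
```

With a strong trend like this, F0 is large and the p-value is essentially zero, so H0 is rejected at any reasonable alpha.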

@SHEN-Cheng (Author)

> I just created an example that adds a test for model significance, and gets a p-value for the entire model. […]

Great! You solved my problem.

@kM-Stone commented Sep 7, 2021

> I just created an example that adds a test for model significance, and gets a p-value for the entire model. […]

Hi~ Thanks for the great work! I ran your code above, but I am confused by the result:

  • Your last comment in the code says:

    > in both these cases, the p_value is very large, so we can't reject H0

    Indeed, the results show large p-values for both cases (0.85 and 0.95), but `my_pwlf.p_values()` shows `array([1.17134878e-06, 7.30540082e-51, 1.00331376e-21])`. So why is each beta significant but the whole model is not?

  • Line 77: `f0 = (ssr / k) / (sse / (n - k - 1))`. The form of the F-statistic is consistent with your reference, i.e.
    F0 = (SSR / k) / (SSE / (n − k − 1)),
    but the `ssr` in the code seems to be the sum of squares of the error (Section 2.1, formula 10), not the sum of squares of the regression. I swapped `ssr` and `sse` in the code and got quite small p-values.

@cjekel (Owner) commented Sep 7, 2021

> So why is each beta significant but the whole model is not?

Does the following change impact these results?

> Line 77: `f0 = (ssr / k) / (sse / (n - k - 1))` […] I swapped `ssr` and `sse` in the code, and got quite small p-values.

Yup, nice catch! SSR in my code is actually SSE in that book, and vice versa. Sorry about this.


(Look how this wiki article uses ESS and RSS; the E and R in these are swapped relative to the above book: https://en.wikipedia.org/wiki/Explained_sum_of_squares )
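As a quick numeric sanity check of that decomposition (a minimal sketch with a plain least-squares fit; the variable names are mine, chosen to be unambiguous, and are not taken from pwlf):

```python
import numpy as np

# Total SS = "explained" SS + "residual" SS for an OLS fit with an intercept,
# whatever letters a given book assigns to each term.
rng = np.random.default_rng(1)
x = np.linspace(0.0, 5.0, 30)
y = 3.0 * x - 2.0 + rng.normal(scale=0.5, size=30)

A = np.column_stack([np.ones_like(x), x])
beta, *_ = np.linalg.lstsq(A, y, rcond=None)
y_hat = A @ beta

ss_total = np.sum((y - y.mean()) ** 2)
ss_explained = np.sum((y_hat - y.mean()) ** 2)  # "SSR" in Myers, "ESS" on Wikipedia
ss_residual = np.sum((y - y_hat) ** 2)          # "SSE" in Myers, "RSS" on Wikipedia

print(np.isclose(ss_total, ss_explained + ss_residual))
```

Using explicit names like `ss_explained` / `ss_residual` sidesteps the SSR/SSE ambiguity entirely.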

@cjekel (Owner) commented Sep 11, 2021

> So why is each beta significant but the whole model is not?

> Does the following change impact these results?

The answer to this is yes. Fixed in 101711b. Many thanks to @kM-Stone for catching this mistake.

@cjekel (Owner) commented Sep 11, 2021

To clarify, all uses of `ssr` in PiecewiseLinFit are okay and don't need changing, including `PiecewiseLinFit.r_squared`.
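For illustration, here is the conventional coefficient-of-determination formula that is unaffected by which SSR/SSE naming convention a book uses. The comment tying the residual sum of squares to pwlf's internal `ssr` name is an assumption based on the discussion above, not taken from the pwlf source:

```python
import numpy as np

# R^2 = 1 - (residual SS) / (total SS), regardless of naming convention.
rng = np.random.default_rng(2)
x = np.linspace(0.0, 1.0, 40)
y = 4.0 * x + rng.normal(scale=0.2, size=40)

A = np.column_stack([np.ones_like(x), x])
beta, *_ = np.linalg.lstsq(A, y, rcond=None)
residuals = y - A @ beta

ss_res = np.sum(residuals ** 2)  # assumed to match what pwlf calls "ssr" internally
ss_tot = np.sum((y - y.mean()) ** 2)
r_squared = 1.0 - ss_res / ss_tot
print(f"R^2 = {r_squared:.3f}")
```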
