
Fast TopDown/BottomUp #109

Closed
wants to merge 12 commits into from

Conversation

@kdgutier (Collaborator) commented Nov 2, 2022

  • Added the option of sparse reconciliation to the BottomUp and TopDown methods.
  • The approach resulted in 15x speed gains.
  • Added a unit test checking equality against the previous version.

Pending:
The ERM method by design creates a sparse P matrix, so we should be able to see the same speed gains during inference. Check whether the gains come from clever ordering of the matrix operations; if so, update all such matrix operations accordingly.
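The ordering hypothesis is easy to check in isolation. A minimal sketch with random stand-ins for S, P, and y_hat (the shapes are hypothetical, not the project's data):

```python
import numpy as np

rng = np.random.default_rng(0)
n_total, n_bottom, horizon = 1200, 1000, 24
S = rng.random((n_total, n_bottom))     # stand-in summing matrix
P = rng.random((n_bottom, n_total))     # stand-in projection matrix
y_hat = rng.random((n_total, horizon))  # stand-in base forecasts

# (S @ P) @ y_hat materializes an n_total x n_total intermediate,
# while S @ (P @ y_hat) only materializes an n_bottom x horizon one,
# which is where the ordering speed-up comes from.
left = (S @ P) @ y_hat
right = S @ (P @ y_hat)
assert np.allclose(left, right)  # same result, very different cost
```

On large hierarchies the right-associated product avoids the dense n_total × n_total intermediate entirely.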

@review-notebook-app commented: Check out this pull request on ReviewNB.

@AzulGarza AzulGarza self-requested a review November 2, 2022 22:41
@AzulGarza (Member) left a comment

Please keep the changes as modular and clean as possible. Some changes from this PR are included in this new PR. Please separate them.

@@ -44,6 +44,7 @@
"\n",

Line #17.            res = {'mean': np.matmul(S @ P, y_hat)}

np.matmul(S @ P, y_hat) is equivalent to S.dot(P.dot(y_hat)) when using numpy arrays, right? If that's the case, we could use {'mean': S.dot(P.dot(y_hat))} for all cases and leave the transformation to sparse arrays to the methods (to avoid adding a new argument).
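For dense numpy arrays the two spellings do give the same result, since matrix multiplication is associative. A quick check with toy shapes (the matrices here are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)
S = rng.random((6, 3))       # toy summing matrix
P = rng.random((3, 6))       # toy projection matrix
y_hat = rng.random((6, 4))   # toy base forecasts

a = np.matmul(S @ P, y_hat)  # (S P) y_hat
b = S.dot(P.dot(y_hat))      # S (P y_hat)
assert np.allclose(a, b)     # equal up to floating-point error
```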



@kdgutier (Collaborator, Author) replied Nov 3, 2022

Sounds good; I added an issue to change the corresponding matrix multiplications to the dot method.

I would keep the tests and the sparsity parameter for now to track this and other methods' speed gains; they can help in future development.

Currently it is a default parameter that is not exposed to the higher-level methods.

@@ -44,6 +44,7 @@
"\n",

Line #24.                      sparsity: Optional[bool] = True):

sparsity could be an attribute of the class instead of an extra argument. If self.sparsity=True, then P and S could be passed to _reconcile as sparse matrices.
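A minimal sketch of that suggestion, assuming a simplified BottomUp class and a module-level _reconcile helper (both are illustrative, not the library's actual API):

```python
import numpy as np
from scipy import sparse

def _reconcile(S, P, y_hat):
    # The @ operator dispatches correctly for both dense and sparse inputs.
    return {'mean': S @ (P @ y_hat)}

class BottomUp:
    def __init__(self, sparsity: bool = True):
        # Stored as an attribute instead of threading an extra
        # argument through every method call.
        self.sparsity = sparsity

    def reconcile(self, S, P, y_hat):
        if self.sparsity:
            # Send P and S to _reconcile as sparse matrices.
            S = sparse.csr_matrix(S)
            P = sparse.csr_matrix(P)
        return _reconcile(S, P, y_hat)
```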



@kdgutier (Collaborator, Author) replied

My thought on skipping the self.sparsity attribute is to avoid adding extra complexity to the methods. Some reconciliation methods might report gains consistently while others don't.

We might want to make the selection a default.

@@ -44,6 +44,7 @@
"\n",

The nbs/methods.ipynb file is intended to work only with numpy arrays. If you want to test a large summing matrix, please create it from scratch using numpy instead of downloading and importing a dataframe. Since the inclusion of CodeTimer is not reviewed yet, you could test the performance of the change using the time module:

from time import time

# performance without sparsity
init_wo_sparsity = time()
cls_bu = BottomUp()
cls_bu.reconcile(...)
end_wo_sparsity = time()

# performance with sparsity
init_sparsity = time()
cls_bu = BottomUp(sparsity=True)
cls_bu.reconcile(...)
end_sparsity = time()

# test that sparsity helps
assert (end_sparsity - init_sparsity) < (end_wo_sparsity - init_wo_sparsity)




@@ -44,6 +44,7 @@
"\n",

See previous comment on the sparsity attribute. If it works, it should be included in all the methods.



@AzulGarza AzulGarza self-requested a review November 2, 2022 23:11
@AzulGarza (Member) left a comment

I like the idea. I think it would be better if the possibility of using sparse matrices were an argument of HierarchicalReconciliation instead of an argument/attribute of each class, for example HierarchicalReconciliation(sparsity=True, ...). In that case, the transformation of S is paid only once and all methods receive the same transformed matrix.
Subsequently, the _reconcile function could determine whether the matrix S is sparse and, in that case, transform P.
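A sketch of how that could look, assuming a heavily simplified HierarchicalReconciliation whose reconcilers are callables that build each method's P matrix (the names mirror the discussion; the real signatures differ):

```python
import numpy as np
from scipy import sparse

def _reconcile(S, P, y_hat):
    # If S arrived sparse, convert P here too so the whole chain stays sparse.
    if sparse.issparse(S) and not sparse.issparse(P):
        P = sparse.csr_matrix(P)
    return {'mean': S @ (P @ y_hat)}

class HierarchicalReconciliation:
    def __init__(self, reconcilers, sparsity: bool = False):
        self.reconcilers = reconcilers  # callables S -> P (illustrative)
        self.sparsity = sparsity

    def reconcile(self, S, y_hat):
        if self.sparsity:
            # The transformation of S is paid only once; every method
            # then receives the same sparse matrix.
            S = sparse.csr_matrix(S)
        return [_reconcile(S, make_P(S), y_hat) for make_P in self.reconcilers]
```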

@kdgutier kdgutier closed this Nov 5, 2022
@kdgutier kdgutier deleted the fast_topdown_bottomup branch November 5, 2022 04:15
@kdgutier (Collaborator, Author) commented Nov 5, 2022

While doing internal testing of computation speed, I realized that the speed gains come from two sources:

  1. Ordering of the matrix multiplications: (S P) y_[a,b] vs S (P y_[a,b])
  2. Sparsity of S and P (which comes with a fixed conversion cost)

The creation of the sparse S should be done once in HierarchicalForecast.reconcile, and the creation of the sparse P should be done once per method (BottomUp, TopDown, ERM-lasso), rather than in every call to the methods._reconcile function.
