You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hypothesis: the issue may be due to jit running non-optimized code on the first pass. This may have a different RNG behavior and/or different dtypes.
The text was updated successfully, but these errors were encountered:
justheuristic
changed the title
First and second predictions are slightly different when scripting
First and second predictions yield slightly different results when jit and dropout are enabled
Mar 17, 2022
justheuristic
changed the title
First and second predictions yield slightly different results when jit and dropout are enabled
First and second predictions yield slightly different results when both jit and dropout are enabled
Mar 17, 2022
Curiously, the first two iterations of LeanTransformer on CPU may differ by a small amount (~1e-5) even with use_deterministic_algorithms(True)
To reproduce, go to this test and remove "for i in range 2"
lean_transformer/tests/test_modifications.py
Lines 63 to 68 in e737a8f
Known facts:
Hypothesis: the issue may be due to jit running non-optimized code on the first pass. This may have a different RNG behavior and/or different dtypes.
The text was updated successfully, but these errors were encountered: