Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Force log_prob to return a tensordict when kwargs are passed to ProbabilisticTensorDictSequential.log_prob #1146

Merged
merged 2 commits into from
Dec 19, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Dec 18, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 18, 2024
…d to ProbabilisticTensorDictSequential.log_prob

ghstack-source-id: a7482a31fbb6b65562589cc3771e73a61ce0ed17
Pull Request resolved: #1146
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 18, 2024
Copy link

github-actions bot commented Dec 18, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}32$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 53.5790μs 22.1892μs 45.0670 KOps/s 50.3731 KOps/s $\textbf{\color{#d91a1a}-10.53\%}$
test_plain_set_stack_nested 56.9260μs 21.9783μs 45.4993 KOps/s 49.8115 KOps/s $\textbf{\color{#d91a1a}-8.66\%}$
test_plain_set_nested_inplace 69.4780μs 23.8154μs 41.9896 KOps/s 45.8989 KOps/s $\textbf{\color{#d91a1a}-8.52\%}$
test_plain_set_stack_nested_inplace 50.9550μs 23.9101μs 41.8233 KOps/s 44.0462 KOps/s $\textbf{\color{#d91a1a}-5.05\%}$
test_items 28.1420μs 4.2233μs 236.7797 KOps/s 235.6818 KOps/s $\color{#35bf28}+0.47\%$
test_items_nested 0.7959ms 0.4033ms 2.4795 KOps/s 2.4768 KOps/s $\color{#35bf28}+0.11\%$
test_items_nested_locked 0.8255ms 0.4039ms 2.4756 KOps/s 2.4808 KOps/s $\color{#d91a1a}-0.21\%$
test_items_nested_leaf 0.1455ms 76.6681μs 13.0432 KOps/s 12.8960 KOps/s $\color{#35bf28}+1.14\%$
test_items_stack_nested 0.6081ms 0.4075ms 2.4542 KOps/s 2.4667 KOps/s $\color{#d91a1a}-0.51\%$
test_items_stack_nested_leaf 0.1392ms 80.3848μs 12.4402 KOps/s 12.5665 KOps/s $\color{#d91a1a}-1.01\%$
test_items_stack_nested_locked 0.8010ms 0.4130ms 2.4215 KOps/s 2.4511 KOps/s $\color{#d91a1a}-1.21\%$
test_keys 92.4720μs 3.6045μs 277.4323 KOps/s 284.5814 KOps/s $\color{#d91a1a}-2.51\%$
test_keys_nested 0.2803ms 0.1669ms 5.9923 KOps/s 6.0938 KOps/s $\color{#d91a1a}-1.67\%$
test_keys_nested_locked 1.7822ms 0.1744ms 5.7345 KOps/s 5.8432 KOps/s $\color{#d91a1a}-1.86\%$
test_keys_nested_leaf 0.2849ms 0.1455ms 6.8734 KOps/s 7.0085 KOps/s $\color{#d91a1a}-1.93\%$
test_keys_stack_nested 0.3387ms 0.1653ms 6.0498 KOps/s 6.1227 KOps/s $\color{#d91a1a}-1.19\%$
test_keys_stack_nested_leaf 0.3534ms 0.1440ms 6.9433 KOps/s 7.1013 KOps/s $\color{#d91a1a}-2.22\%$
test_keys_stack_nested_locked 0.2663ms 0.1718ms 5.8194 KOps/s 5.9346 KOps/s $\color{#d91a1a}-1.94\%$
test_values 6.1314μs 1.0974μs 911.2512 KOps/s 976.4345 KOps/s $\textbf{\color{#d91a1a}-6.68\%}$
test_values_nested 0.1641ms 63.4527μs 15.7598 KOps/s 16.2551 KOps/s $\color{#d91a1a}-3.05\%$
test_values_nested_locked 0.1137ms 64.0611μs 15.6101 KOps/s 16.1125 KOps/s $\color{#d91a1a}-3.12\%$
test_values_nested_leaf 0.1667ms 72.3944μs 13.8132 KOps/s 13.9459 KOps/s $\color{#d91a1a}-0.95\%$
test_values_stack_nested 0.1451ms 64.8499μs 15.4202 KOps/s 15.6547 KOps/s $\color{#d91a1a}-1.50\%$
test_values_stack_nested_leaf 0.1650ms 72.8356μs 13.7296 KOps/s 14.0241 KOps/s $\color{#d91a1a}-2.10\%$
test_values_stack_nested_locked 0.1160ms 64.5711μs 15.4868 KOps/s 15.6869 KOps/s $\color{#d91a1a}-1.28\%$
test_membership 18.1230μs 0.8827μs 1.1329 MOps/s 1.1547 MOps/s $\color{#d91a1a}-1.89\%$
test_membership_nested 31.0580μs 2.9801μs 335.5648 KOps/s 346.9498 KOps/s $\color{#d91a1a}-3.28\%$
test_membership_nested_leaf 30.8170μs 2.9666μs 337.0919 KOps/s 342.5238 KOps/s $\color{#d91a1a}-1.59\%$
test_membership_stacked_nested 22.9530μs 2.9323μs 341.0270 KOps/s 344.6614 KOps/s $\color{#d91a1a}-1.05\%$
test_membership_stacked_nested_leaf 27.5520μs 2.9701μs 336.6852 KOps/s 343.9579 KOps/s $\color{#d91a1a}-2.11\%$
test_membership_nested_last 32.7110μs 4.5136μs 221.5507 KOps/s 228.0562 KOps/s $\color{#d91a1a}-2.85\%$
test_membership_nested_leaf_last 29.9150μs 4.4467μs 224.8882 KOps/s 227.5054 KOps/s $\color{#d91a1a}-1.15\%$
test_membership_stacked_nested_last 34.7740μs 5.7341μs 174.3965 KOps/s 193.2964 KOps/s $\textbf{\color{#d91a1a}-9.78\%}$
test_membership_stacked_nested_leaf_last 38.6510μs 5.7149μs 174.9797 KOps/s 191.1723 KOps/s $\textbf{\color{#d91a1a}-8.47\%}$
test_nested_getleaf 58.2980μs 10.9272μs 91.5148 KOps/s 91.9235 KOps/s $\color{#d91a1a}-0.44\%$
test_nested_get 36.5280μs 10.3528μs 96.5924 KOps/s 96.9270 KOps/s $\color{#d91a1a}-0.35\%$
test_stacked_getleaf 35.6760μs 10.8423μs 92.2311 KOps/s 92.2345 KOps/s $-0.00\%$
test_stacked_get 40.8360μs 10.3083μs 97.0088 KOps/s 97.5777 KOps/s $\color{#d91a1a}-0.58\%$
test_nested_getitemleaf 37.8910μs 11.3436μs 88.1556 KOps/s 89.1961 KOps/s $\color{#d91a1a}-1.17\%$
test_nested_getitem 48.9210μs 10.6309μs 94.0650 KOps/s 96.0844 KOps/s $\color{#d91a1a}-2.10\%$
test_stacked_getitemleaf 35.0050μs 11.3974μs 87.7391 KOps/s 90.4701 KOps/s $\color{#d91a1a}-3.02\%$
test_stacked_getitem 37.8100μs 10.7173μs 93.3067 KOps/s 97.0981 KOps/s $\color{#d91a1a}-3.90\%$
test_lock_nested 4.8723ms 0.4643ms 2.1536 KOps/s 2.1880 KOps/s $\color{#d91a1a}-1.57\%$
test_lock_stack_nested 0.6696ms 0.4234ms 2.3618 KOps/s 2.3392 KOps/s $\color{#35bf28}+0.97\%$
test_unlock_nested 0.7379ms 0.3737ms 2.6757 KOps/s 2.6576 KOps/s $\color{#35bf28}+0.68\%$
test_unlock_stack_nested 0.6343ms 0.3421ms 2.9229 KOps/s 2.8809 KOps/s $\color{#35bf28}+1.46\%$
test_flatten_speed 0.1996ms 0.1011ms 9.8919 KOps/s 9.9795 KOps/s $\color{#d91a1a}-0.88\%$
test_unflatten_speed 0.7243ms 0.5394ms 1.8540 KOps/s 1.8743 KOps/s $\color{#d91a1a}-1.08\%$
test_common_ops 1.6214ms 0.8149ms 1.2272 KOps/s 1.3162 KOps/s $\textbf{\color{#d91a1a}-6.77\%}$
test_creation 17.9430μs 2.5142μs 397.7381 KOps/s 403.0386 KOps/s $\color{#d91a1a}-1.32\%$
test_creation_empty 35.6260μs 13.5132μs 74.0015 KOps/s 104.9215 KOps/s $\textbf{\color{#d91a1a}-29.47\%}$
test_creation_nested_1 37.6400μs 16.5944μs 60.2611 KOps/s 81.7470 KOps/s $\textbf{\color{#d91a1a}-26.28\%}$
test_creation_nested_2 1.4016ms 21.4065μs 46.7148 KOps/s 59.5620 KOps/s $\textbf{\color{#d91a1a}-21.57\%}$
test_clone 66.4930μs 13.5458μs 73.8234 KOps/s 72.7205 KOps/s $\color{#35bf28}+1.52\%$
test_getitem[int] 0.1941ms 12.8137μs 78.0416 KOps/s 76.6752 KOps/s $\color{#35bf28}+1.78\%$
test_getitem[slice_int] 0.1373ms 24.7030μs 40.4809 KOps/s 39.8926 KOps/s $\color{#35bf28}+1.47\%$
test_getitem[range] 0.2168ms 50.3405μs 19.8647 KOps/s 19.9516 KOps/s $\color{#d91a1a}-0.44\%$
test_getitem[tuple] 0.1390ms 20.7670μs 48.1532 KOps/s 48.6160 KOps/s $\color{#d91a1a}-0.95\%$
test_getitem[list] 0.1702ms 44.2044μs 22.6222 KOps/s 21.8473 KOps/s $\color{#35bf28}+3.55\%$
test_setitem_dim[int] 52.6280μs 24.9079μs 40.1480 KOps/s 38.6012 KOps/s $\color{#35bf28}+4.01\%$
test_setitem_dim[slice_int] 83.4650μs 51.7423μs 19.3265 KOps/s 18.8539 KOps/s $\color{#35bf28}+2.51\%$
test_setitem_dim[range] 0.1048ms 72.8798μs 13.7212 KOps/s 13.4436 KOps/s $\color{#35bf28}+2.07\%$
test_setitem_dim[tuple] 76.7620μs 39.3745μs 25.3971 KOps/s 23.8396 KOps/s $\textbf{\color{#35bf28}+6.53\%}$
test_setitem 84.9180μs 21.6394μs 46.2120 KOps/s 50.0552 KOps/s $\textbf{\color{#d91a1a}-7.68\%}$
test_set 67.9260μs 21.0303μs 47.5504 KOps/s 52.7402 KOps/s $\textbf{\color{#d91a1a}-9.84\%}$
test_set_shared 1.1676ms 0.1688ms 5.9234 KOps/s 5.7979 KOps/s $\color{#35bf28}+2.17\%$
test_update 0.2213ms 24.9709μs 40.0466 KOps/s 48.9808 KOps/s $\textbf{\color{#d91a1a}-18.24\%}$
test_update_nested 0.1272ms 36.2706μs 27.5706 KOps/s 31.4965 KOps/s $\textbf{\color{#d91a1a}-12.46\%}$
test_update__nested 0.3800ms 34.4004μs 29.0694 KOps/s 28.5284 KOps/s $\color{#35bf28}+1.90\%$
test_set_nested 92.6920μs 23.5093μs 42.5364 KOps/s 47.1224 KOps/s $\textbf{\color{#d91a1a}-9.73\%}$
test_set_nested_new 0.1259ms 28.5105μs 35.0748 KOps/s 37.9869 KOps/s $\textbf{\color{#d91a1a}-7.67\%}$
test_select 0.2178ms 44.7826μs 22.3301 KOps/s 23.4915 KOps/s $\color{#d91a1a}-4.94\%$
test_select_nested 0.1278ms 63.5101μs 15.7455 KOps/s 15.3319 KOps/s $\color{#35bf28}+2.70\%$
test_exclude_nested 0.1867ms 82.9881μs 12.0499 KOps/s 11.9404 KOps/s $\color{#35bf28}+0.92\%$
test_empty[True] 0.8267ms 0.4146ms 2.4121 KOps/s 2.4377 KOps/s $\color{#d91a1a}-1.05\%$
test_empty[False] 6.4220μs 1.4632μs 683.4543 KOps/s 717.4568 KOps/s $\color{#d91a1a}-4.74\%$
test_unbind_speed 0.3912ms 0.2700ms 3.7035 KOps/s 3.6323 KOps/s $\color{#35bf28}+1.96\%$
test_unbind_speed_stack0 0.4074ms 0.2650ms 3.7735 KOps/s 3.7147 KOps/s $\color{#35bf28}+1.58\%$
test_unbind_speed_stack1 0.1020s 0.7860ms 1.2722 KOps/s 1.5353 KOps/s $\textbf{\color{#d91a1a}-17.13\%}$
test_split 1.7436ms 1.5837ms 631.4275 Ops/s 561.5414 Ops/s $\textbf{\color{#35bf28}+12.45\%}$
test_chunk 99.0406ms 1.7467ms 572.4983 Ops/s 561.2980 Ops/s $\color{#35bf28}+2.00\%$
test_consolidate_njt[False-None] 0.1024s 8.7481ms 114.3106 Ops/s 124.0720 Ops/s $\textbf{\color{#d91a1a}-7.87\%}$
test_creation[device0] 0.2286ms 90.3925μs 11.0629 KOps/s 10.6664 KOps/s $\color{#35bf28}+3.72\%$
test_creation_from_tensor 3.6381ms 94.0737μs 10.6300 KOps/s 10.5769 KOps/s $\color{#35bf28}+0.50\%$
test_add_one[memmap_tensor0] 0.1650ms 4.7927μs 208.6526 KOps/s 193.5440 KOps/s $\textbf{\color{#35bf28}+7.81\%}$
test_contiguous[memmap_tensor0] 9.6880μs 0.5324μs 1.8783 MOps/s 1.9028 MOps/s $\color{#d91a1a}-1.29\%$
test_stack[memmap_tensor0] 47.2380μs 3.4542μs 289.5018 KOps/s 287.3478 KOps/s $\color{#35bf28}+0.75\%$
test_memmaptd_index 1.1154ms 0.2396ms 4.1733 KOps/s 4.0747 KOps/s $\color{#35bf28}+2.42\%$
test_memmaptd_index_astensor 0.7698ms 0.3291ms 3.0383 KOps/s 3.0140 KOps/s $\color{#35bf28}+0.81\%$
test_memmaptd_index_op 1.1067ms 0.6215ms 1.6091 KOps/s 1.7372 KOps/s $\textbf{\color{#d91a1a}-7.37\%}$
test_serialize_model 0.1204s 0.1128s 8.8632 Ops/s 7.7955 Ops/s $\textbf{\color{#35bf28}+13.70\%}$
test_serialize_model_pickle 0.4586s 0.3946s 2.5343 Ops/s 2.5420 Ops/s $\color{#d91a1a}-0.30\%$
test_serialize_weights 0.2180s 0.1286s 7.7733 Ops/s 8.8836 Ops/s $\textbf{\color{#d91a1a}-12.50\%}$
test_serialize_weights_returnearly 0.2626s 0.1723s 5.8024 Ops/s 6.5266 Ops/s $\textbf{\color{#d91a1a}-11.10\%}$
test_serialize_weights_pickle 0.5574s 0.4288s 2.3321 Ops/s 2.5272 Ops/s $\textbf{\color{#d91a1a}-7.72\%}$
test_serialize_weights_filesystem 0.1462s 0.1388s 7.2035 Ops/s 7.1528 Ops/s $\color{#35bf28}+0.71\%$
test_serialize_model_filesystem 0.2394s 0.1575s 6.3493 Ops/s 6.2160 Ops/s $\color{#35bf28}+2.14\%$
test_reshape_pytree 61.3040μs 26.4612μs 37.7911 KOps/s 37.5257 KOps/s $\color{#35bf28}+0.71\%$
test_reshape_td 91.4290μs 33.2640μs 30.0625 KOps/s 29.3349 KOps/s $\color{#35bf28}+2.48\%$
test_view_pytree 85.5390μs 26.9645μs 37.0858 KOps/s 37.3013 KOps/s $\color{#d91a1a}-0.58\%$
test_view_td 84.8480μs 40.3569μs 24.7789 KOps/s 25.1460 KOps/s $\color{#d91a1a}-1.46\%$
test_unbind_pytree 86.5100μs 29.4340μs 33.9744 KOps/s 33.2930 KOps/s $\color{#35bf28}+2.05\%$
test_unbind_td 0.3369ms 39.9440μs 25.0350 KOps/s 24.9680 KOps/s $\color{#35bf28}+0.27\%$
test_split_pytree 85.2380μs 28.9991μs 34.4838 KOps/s 33.7221 KOps/s $\color{#35bf28}+2.26\%$
test_split_td 0.5407ms 45.0307μs 22.2071 KOps/s 21.5652 KOps/s $\color{#35bf28}+2.98\%$
test_add_pytree 82.7540μs 35.3708μs 28.2719 KOps/s 27.6480 KOps/s $\color{#35bf28}+2.26\%$
test_add_td 0.1630ms 63.2383μs 15.8132 KOps/s 17.4011 KOps/s $\textbf{\color{#d91a1a}-9.13\%}$
test_compile_add_one_nested[tensordict-compile] 0.1416ms 60.7840μs 16.4517 KOps/s 16.4657 KOps/s $\color{#d91a1a}-0.08\%$
test_compile_add_one_nested[tensordict-eager] 0.3741ms 0.1716ms 5.8277 KOps/s 5.8335 KOps/s $\color{#d91a1a}-0.10\%$
test_compile_add_one_nested[pytree-compile] 0.1186ms 44.8314μs 22.3058 KOps/s 22.1681 KOps/s $\color{#35bf28}+0.62\%$
test_compile_add_one_nested[pytree-eager] 0.2598ms 0.1191ms 8.3990 KOps/s 8.3839 KOps/s $\color{#35bf28}+0.18\%$
test_compile_copy_nested[tensordict-compile] 77.4530μs 25.7000μs 38.9105 KOps/s 39.2563 KOps/s $\color{#d91a1a}-0.88\%$
test_compile_copy_nested[tensordict-eager] 0.1118ms 58.9514μs 16.9631 KOps/s 17.0650 KOps/s $\color{#d91a1a}-0.60\%$
test_compile_copy_nested[pytree-compile] 0.1498ms 78.9290μs 12.6696 KOps/s 12.8630 KOps/s $\color{#d91a1a}-1.50\%$
test_compile_copy_nested[pytree-eager] 0.1589ms 67.1619μs 14.8894 KOps/s 14.9117 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_add_one_flat[tensordict-compile] 0.1914ms 0.1027ms 9.7405 KOps/s 9.5781 KOps/s $\color{#35bf28}+1.70\%$
test_compile_add_one_flat[tensordict-eager] 0.3917ms 0.2127ms 4.7020 KOps/s 4.6768 KOps/s $\color{#35bf28}+0.54\%$
test_compile_add_one_flat[tensorclass-compile] 0.1127ms 44.1429μs 22.6537 KOps/s 22.4187 KOps/s $\color{#35bf28}+1.05\%$
test_compile_add_one_flat[tensorclass-eager] 0.4833ms 65.1921μs 15.3393 KOps/s 15.2770 KOps/s $\color{#35bf28}+0.41\%$
test_compile_add_one_flat[pytree-compile] 0.1815ms 0.1028ms 9.7259 KOps/s 9.8347 KOps/s $\color{#d91a1a}-1.11\%$
test_compile_add_one_flat[pytree-eager] 0.4761ms 0.2023ms 4.9436 KOps/s 4.9634 KOps/s $\color{#d91a1a}-0.40\%$
test_compile_add_self_flat[tensordict-eager] 0.3783ms 0.2300ms 4.3484 KOps/s 4.2830 KOps/s $\color{#35bf28}+1.53\%$
test_compile_add_self_flat[tensordict-compile] 0.1725ms 0.1031ms 9.7001 KOps/s 9.5644 KOps/s $\color{#35bf28}+1.42\%$
test_compile_add_self_flat[tensorclass-eager] 0.1557ms 59.3033μs 16.8625 KOps/s 16.7337 KOps/s $\color{#35bf28}+0.77\%$
test_compile_add_self_flat[tensorclass-compile] 0.1002ms 45.3123μs 22.0690 KOps/s 21.8943 KOps/s $\color{#35bf28}+0.80\%$
test_compile_add_self_flat[pytree-eager] 0.2442ms 0.1590ms 6.2895 KOps/s 6.2370 KOps/s $\color{#35bf28}+0.84\%$
test_compile_add_self_flat[pytree-compile] 0.2416ms 0.1025ms 9.7600 KOps/s 9.8729 KOps/s $\color{#d91a1a}-1.14\%$
test_compile_copy_flat[tensordict-compile] 51.6760μs 20.6102μs 48.5196 KOps/s 46.6758 KOps/s $\color{#35bf28}+3.95\%$
test_compile_copy_flat[tensordict-eager] 0.1308ms 66.1000μs 15.1286 KOps/s 15.1500 KOps/s $\color{#d91a1a}-0.14\%$
test_compile_copy_flat[pytree-compile] 0.1550ms 81.4622μs 12.2756 KOps/s 12.5756 KOps/s $\color{#d91a1a}-2.39\%$
test_compile_copy_flat[pytree-eager] 0.1372ms 70.0715μs 14.2711 KOps/s 14.9347 KOps/s $\color{#d91a1a}-4.44\%$
test_compile_assign_and_add[tensordict-compile] 0.4251ms 0.2042ms 4.8974 KOps/s 4.9011 KOps/s $\color{#d91a1a}-0.08\%$
test_compile_assign_and_add[tensordict-eager] 2.4694ms 1.3489ms 741.3338 Ops/s 748.6825 Ops/s $\color{#d91a1a}-0.98\%$
test_compile_assign_and_add[pytree-compile] 0.2915ms 0.2026ms 4.9364 KOps/s 5.0108 KOps/s $\color{#d91a1a}-1.49\%$
test_compile_assign_and_add[pytree-eager] 1.3441ms 0.7909ms 1.2644 KOps/s 1.2743 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_assign_and_add_stack[compile] 0.9141ms 0.4620ms 2.1644 KOps/s 2.1868 KOps/s $\color{#d91a1a}-1.02\%$
test_compile_assign_and_add_stack[eager] 3.0547ms 2.7784ms 359.9247 Ops/s 396.3730 Ops/s $\textbf{\color{#d91a1a}-9.20\%}$
test_compile_indexing[tensor-tensordict-compile] 81.7120μs 35.3043μs 28.3251 KOps/s 27.9934 KOps/s $\color{#35bf28}+1.18\%$
test_compile_indexing[tensor-tensordict-eager] 0.5408ms 32.8440μs 30.4470 KOps/s 29.1746 KOps/s $\color{#35bf28}+4.36\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1239ms 29.2398μs 34.2000 KOps/s 34.7864 KOps/s $\color{#d91a1a}-1.69\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1903ms 23.7083μs 42.1793 KOps/s 42.5778 KOps/s $\color{#d91a1a}-0.94\%$
test_compile_indexing[tensor-pytree-compile] 80.4800μs 29.3333μs 34.0910 KOps/s 33.1407 KOps/s $\color{#35bf28}+2.87\%$
test_compile_indexing[tensor-pytree-eager] 93.8240μs 23.3010μs 42.9166 KOps/s 43.0716 KOps/s $\color{#d91a1a}-0.36\%$
test_compile_indexing[slice-tensordict-compile] 0.1336ms 49.9739μs 20.0105 KOps/s 19.7373 KOps/s $\color{#35bf28}+1.38\%$
test_compile_indexing[slice-tensordict-eager] 0.5768ms 20.4563μs 48.8848 KOps/s 47.9735 KOps/s $\color{#35bf28}+1.90\%$
test_compile_indexing[slice-tensorclass-compile] 95.7270μs 42.6726μs 23.4343 KOps/s 22.9221 KOps/s $\color{#35bf28}+2.23\%$
test_compile_indexing[slice-tensorclass-eager] 59.1600μs 18.6938μs 53.4936 KOps/s 52.0176 KOps/s $\color{#35bf28}+2.84\%$
test_compile_indexing[slice-pytree-compile] 96.5790μs 43.5194μs 22.9782 KOps/s 22.3887 KOps/s $\color{#35bf28}+2.63\%$
test_compile_indexing[slice-pytree-eager] 55.8940μs 18.4667μs 54.1515 KOps/s 53.5740 KOps/s $\color{#35bf28}+1.08\%$
test_compile_indexing[int-tensordict-compile] 0.1125ms 51.0208μs 19.5998 KOps/s 19.0586 KOps/s $\color{#35bf28}+2.84\%$
test_compile_indexing[int-tensordict-eager] 0.9950ms 20.3466μs 49.1482 KOps/s 48.8358 KOps/s $\color{#35bf28}+0.64\%$
test_compile_indexing[int-tensorclass-compile] 0.1101ms 43.9120μs 22.7728 KOps/s 22.5193 KOps/s $\color{#35bf28}+1.13\%$
test_compile_indexing[int-tensorclass-eager] 88.8350μs 18.3903μs 54.3764 KOps/s 54.3314 KOps/s $\color{#35bf28}+0.08\%$
test_compile_indexing[int-pytree-compile] 0.1087ms 43.9595μs 22.7482 KOps/s 22.5471 KOps/s $\color{#35bf28}+0.89\%$
test_compile_indexing[int-pytree-eager] 0.3421ms 19.0059μs 52.6153 KOps/s 54.2406 KOps/s $\color{#d91a1a}-3.00\%$
test_mod_add[eager] 97.6910μs 34.9725μs 28.5939 KOps/s 30.7839 KOps/s $\textbf{\color{#d91a1a}-7.11\%}$
test_mod_add[compile] 0.1563ms 47.4592μs 21.0707 KOps/s 21.1748 KOps/s $\color{#d91a1a}-0.49\%$
test_mod_add[compile-overhead] 0.1144ms 46.5416μs 21.4862 KOps/s 21.3407 KOps/s $\color{#35bf28}+0.68\%$
test_mod_wrap[eager] 0.4281ms 0.2178ms 4.5912 KOps/s 4.5606 KOps/s $\color{#35bf28}+0.67\%$
test_mod_wrap[compile] 0.4518ms 0.2027ms 4.9342 KOps/s 4.8766 KOps/s $\color{#35bf28}+1.18\%$
test_mod_wrap[compile-overhead] 0.2926ms 0.1982ms 5.0446 KOps/s 4.8834 KOps/s $\color{#35bf28}+3.30\%$
test_mod_wrap_and_backward[eager] 16.2745ms 11.6671ms 85.7115 Ops/s 85.5771 Ops/s $\color{#35bf28}+0.16\%$
test_mod_wrap_and_backward[compile] 13.4136ms 11.0625ms 90.3957 Ops/s 78.5474 Ops/s $\textbf{\color{#35bf28}+15.08\%}$
test_mod_wrap_and_backward[compile-overhead] 14.4505ms 12.6298ms 79.1777 Ops/s 79.6752 Ops/s $\color{#d91a1a}-0.62\%$
test_seq_add[eager] 0.2889ms 0.1156ms 8.6522 KOps/s 8.8085 KOps/s $\color{#d91a1a}-1.78\%$
test_seq_add[compile] 0.1309ms 60.8213μs 16.4416 KOps/s 16.3906 KOps/s $\color{#35bf28}+0.31\%$
test_seq_add[compile-overhead] 0.1535ms 58.9335μs 16.9683 KOps/s 16.7135 KOps/s $\color{#35bf28}+1.52\%$
test_seq_wrap[eager] 0.6802ms 0.4350ms 2.2989 KOps/s 2.2856 KOps/s $\color{#35bf28}+0.58\%$
test_seq_wrap[compile] 0.4803ms 0.2221ms 4.5024 KOps/s 4.4079 KOps/s $\color{#35bf28}+2.14\%$
test_seq_wrap[compile-overhead] 0.4126ms 0.2217ms 4.5107 KOps/s 4.4272 KOps/s $\color{#35bf28}+1.89\%$
test_func_call_runtime[False-eager] 0.8113ms 0.5299ms 1.8871 KOps/s 1.8078 KOps/s $\color{#35bf28}+4.39\%$
test_func_call_runtime[False-compile] 0.5130ms 0.4134ms 2.4190 KOps/s 2.3202 KOps/s $\color{#35bf28}+4.26\%$
test_func_call_runtime[False-compile-overhead] 0.8291ms 0.4153ms 2.4079 KOps/s 2.3358 KOps/s $\color{#35bf28}+3.09\%$
test_func_call_runtime[True-eager] 1.2692ms 0.7438ms 1.3445 KOps/s 1.3142 KOps/s $\color{#35bf28}+2.31\%$
test_func_call_runtime[True-compile] 0.5919ms 0.4529ms 2.2077 KOps/s 2.1421 KOps/s $\color{#35bf28}+3.07\%$
test_func_call_runtime[True-compile-overhead] 0.6101ms 0.4566ms 2.1901 KOps/s 2.1378 KOps/s $\color{#35bf28}+2.45\%$
test_func_call_cm_runtime[False-eager] 0.6577ms 0.5346ms 1.8707 KOps/s 1.8162 KOps/s $\color{#35bf28}+3.00\%$
test_func_call_cm_runtime[False-compile] 0.5286ms 0.4116ms 2.4297 KOps/s 2.3435 KOps/s $\color{#35bf28}+3.67\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5776ms 0.4125ms 2.4242 KOps/s 2.3461 KOps/s $\color{#35bf28}+3.33\%$
test_func_call_cm_runtime[True-eager] 1.0729ms 0.8842ms 1.1310 KOps/s 1.0930 KOps/s $\color{#35bf28}+3.47\%$
test_func_call_cm_runtime[True-compile] 0.5704ms 0.4779ms 2.0926 KOps/s 2.0206 KOps/s $\color{#35bf28}+3.56\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8964ms 0.4853ms 2.0604 KOps/s 2.0340 KOps/s $\color{#35bf28}+1.30\%$
test_vmap_func_call_cm_runtime[eager] 2.6340ms 1.8830ms 531.0697 Ops/s 516.9322 Ops/s $\color{#35bf28}+2.73\%$
test_vmap_func_call_cm_runtime[compile] 0.7121ms 0.5102ms 1.9601 KOps/s 1.9193 KOps/s $\color{#35bf28}+2.12\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7262ms 0.5121ms 1.9529 KOps/s 1.9498 KOps/s $\color{#35bf28}+0.16\%$
test_distributed 0.2828ms 0.1233ms 8.1072 KOps/s 7.7863 KOps/s $\color{#35bf28}+4.12\%$
test_tdmodule 44.3120μs 26.8256μs 37.2779 KOps/s 40.3613 KOps/s $\textbf{\color{#d91a1a}-7.64\%}$
test_tdmodule_dispatch 74.9590μs 50.0609μs 19.9757 KOps/s 22.0440 KOps/s $\textbf{\color{#d91a1a}-9.38\%}$
test_tdseq 65.1910μs 29.8380μs 33.5143 KOps/s 36.2948 KOps/s $\textbf{\color{#d91a1a}-7.66\%}$
test_tdseq_dispatch 89.3050μs 55.8641μs 17.9006 KOps/s 19.8994 KOps/s $\textbf{\color{#d91a1a}-10.04\%}$
test_instantiation_functorch 2.2074ms 1.5044ms 664.7150 Ops/s 649.4958 Ops/s $\color{#35bf28}+2.34\%$
test_exec_functorch 0.4233ms 0.1754ms 5.7028 KOps/s 5.5649 KOps/s $\color{#35bf28}+2.48\%$
test_exec_functional_call 0.3082ms 0.1662ms 6.0156 KOps/s 5.7698 KOps/s $\color{#35bf28}+4.26\%$
test_exec_td_decorator 0.4375ms 0.2274ms 4.3983 KOps/s 4.2667 KOps/s $\color{#35bf28}+3.08\%$
test_vmap_mlp_speed_decorator[True-True] 0.8168ms 0.6502ms 1.5380 KOps/s 1.5305 KOps/s $\color{#35bf28}+0.49\%$
test_vmap_mlp_speed_decorator[True-False] 0.8748ms 0.6491ms 1.5407 KOps/s 1.5594 KOps/s $\color{#d91a1a}-1.20\%$
test_vmap_mlp_speed_decorator[False-True] 0.8399ms 0.5240ms 1.9084 KOps/s 1.8852 KOps/s $\color{#35bf28}+1.23\%$
test_vmap_mlp_speed_decorator[False-False] 0.6912ms 0.5225ms 1.9140 KOps/s 1.9087 KOps/s $\color{#35bf28}+0.28\%$
test_to_module_speed[True] 2.0889ms 1.3528ms 739.1966 Ops/s 740.4467 Ops/s $\color{#d91a1a}-0.17\%$
test_to_module_speed[False] 2.5115ms 1.3245ms 755.0289 Ops/s 762.9415 Ops/s $\color{#d91a1a}-1.04\%$
test_tc_init 96.7600μs 51.5645μs 19.3932 KOps/s 22.2833 KOps/s $\textbf{\color{#d91a1a}-12.97\%}$
test_tc_init_nested 0.2189ms 0.1005ms 9.9485 KOps/s 10.9302 KOps/s $\textbf{\color{#d91a1a}-8.98\%}$
test_tc_first_layer_tensor 27.6010μs 1.5259μs 655.3562 KOps/s 652.6953 KOps/s $\color{#35bf28}+0.41\%$
test_tc_first_layer_nontensor 33.6320μs 4.6685μs 214.2011 KOps/s 214.6502 KOps/s $\color{#d91a1a}-0.21\%$
test_tc_second_layer_tensor 43.1400μs 2.8213μs 354.4511 KOps/s 352.4612 KOps/s $\color{#35bf28}+0.56\%$
test_tc_second_layer_nontensor 46.1460μs 6.0070μs 166.4711 KOps/s 167.5294 KOps/s $\color{#d91a1a}-0.63\%$
test_unbind 0.2065s 13.4079ms 74.5830 Ops/s 77.5016 Ops/s $\color{#d91a1a}-3.77\%$
test_full_like 7.7284ms 6.9854ms 143.1549 Ops/s 85.5219 Ops/s $\textbf{\color{#35bf28}+67.39\%}$
test_zeros_like 3.0553ms 2.7003ms 370.3350 Ops/s 143.3408 Ops/s $\textbf{\color{#35bf28}+158.36\%}$
test_ones_like 3.5846ms 3.1643ms 316.0252 Ops/s 130.8095 Ops/s $\textbf{\color{#35bf28}+141.59\%}$
test_clone 5.3504ms 4.8614ms 205.7027 Ops/s 108.6201 Ops/s $\textbf{\color{#35bf28}+89.38\%}$
test_squeeze 72.4540μs 12.6574μs 79.0052 KOps/s 80.3386 KOps/s $\color{#d91a1a}-1.66\%$
test_unsqueeze 0.1917ms 92.5474μs 10.8053 KOps/s 10.8697 KOps/s $\color{#d91a1a}-0.59\%$
test_split 0.3573ms 0.1962ms 5.0957 KOps/s 5.0913 KOps/s $\color{#35bf28}+0.09\%$
test_permute 0.3356ms 0.2116ms 4.7252 KOps/s 4.7095 KOps/s $\color{#35bf28}+0.33\%$
test_stack 27.6562ms 23.4369ms 42.6677 Ops/s 42.2642 Ops/s $\color{#35bf28}+0.95\%$
test_cat 27.9690ms 23.3047ms 42.9098 Ops/s 42.3350 Ops/s $\color{#35bf28}+1.36\%$

Copy link

github-actions bot commented Dec 18, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}28$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 34.4910μs 11.4828μs 87.0871 KOps/s 78.2187 KOps/s $\textbf{\color{#35bf28}+11.34\%}$
test_plain_set_stack_nested 65.6930μs 11.6132μs 86.1088 KOps/s 77.4860 KOps/s $\textbf{\color{#35bf28}+11.13\%}$
test_plain_set_nested_inplace 43.0130μs 12.5535μs 79.6590 KOps/s 72.3825 KOps/s $\textbf{\color{#35bf28}+10.05\%}$
test_plain_set_stack_nested_inplace 0.1822ms 12.5061μs 79.9612 KOps/s 72.5211 KOps/s $\textbf{\color{#35bf28}+10.26\%}$
test_items 0.1934ms 2.9012μs 344.6792 KOps/s 342.7524 KOps/s $\color{#35bf28}+0.56\%$
test_items_nested 0.3916ms 0.3600ms 2.7780 KOps/s 2.7915 KOps/s $\color{#d91a1a}-0.48\%$
test_items_nested_locked 0.4079ms 0.3596ms 2.7812 KOps/s 2.7836 KOps/s $\color{#d91a1a}-0.08\%$
test_items_nested_leaf 0.1597ms 58.2562μs 17.1655 KOps/s 17.1443 KOps/s $\color{#35bf28}+0.12\%$
test_items_stack_nested 0.4058ms 0.3647ms 2.7419 KOps/s 2.7519 KOps/s $\color{#d91a1a}-0.36\%$
test_items_stack_nested_leaf 0.1623ms 59.7399μs 16.7392 KOps/s 16.4610 KOps/s $\color{#35bf28}+1.69\%$
test_items_stack_nested_locked 0.4129ms 0.3635ms 2.7511 KOps/s 2.7640 KOps/s $\color{#d91a1a}-0.47\%$
test_keys 33.0120μs 3.4613μs 288.9114 KOps/s 288.9970 KOps/s $\color{#d91a1a}-0.03\%$
test_keys_nested 0.1345ms 81.5372μs 12.2643 KOps/s 12.3771 KOps/s $\color{#d91a1a}-0.91\%$
test_keys_nested_locked 0.7659ms 87.3105μs 11.4534 KOps/s 11.5944 KOps/s $\color{#d91a1a}-1.22\%$
test_keys_nested_leaf 0.1352ms 71.9691μs 13.8949 KOps/s 14.0044 KOps/s $\color{#d91a1a}-0.78\%$
test_keys_stack_nested 0.1244ms 82.4112μs 12.1343 KOps/s 12.1501 KOps/s $\color{#d91a1a}-0.13\%$
test_keys_stack_nested_leaf 0.1195ms 73.4164μs 13.6209 KOps/s 13.5119 KOps/s $\color{#35bf28}+0.81\%$
test_keys_stack_nested_locked 0.1351ms 88.9390μs 11.2437 KOps/s 11.3278 KOps/s $\color{#d91a1a}-0.74\%$
test_values 5.0473μs 0.8497μs 1.1769 MOps/s 1.1728 MOps/s $\color{#35bf28}+0.35\%$
test_values_nested 81.7950μs 34.5737μs 28.9238 KOps/s 29.1560 KOps/s $\color{#d91a1a}-0.80\%$
test_values_nested_locked 99.6450μs 36.4386μs 27.4434 KOps/s 27.9994 KOps/s $\color{#d91a1a}-1.99\%$
test_values_nested_leaf 0.1826ms 39.1558μs 25.5390 KOps/s 25.8149 KOps/s $\color{#d91a1a}-1.07\%$
test_values_stack_nested 0.2090ms 34.8440μs 28.6993 KOps/s 28.5270 KOps/s $\color{#35bf28}+0.60\%$
test_values_stack_nested_leaf 0.2334ms 39.6782μs 25.2028 KOps/s 25.4110 KOps/s $\color{#d91a1a}-0.82\%$
test_values_stack_nested_locked 70.9930μs 36.6822μs 27.2612 KOps/s 27.3074 KOps/s $\color{#d91a1a}-0.17\%$
test_membership 2.3201μs 0.5067μs 1.9734 MOps/s 1.9798 MOps/s $\color{#d91a1a}-0.32\%$
test_membership_nested 15.5010μs 2.0182μs 495.4920 KOps/s 495.1415 KOps/s $\color{#35bf28}+0.07\%$
test_membership_nested_leaf 20.0660μs 2.0167μs 495.8671 KOps/s 499.7909 KOps/s $\color{#d91a1a}-0.79\%$
test_membership_stacked_nested 56.5030μs 2.1150μs 472.8241 KOps/s 479.7154 KOps/s $\color{#d91a1a}-1.44\%$
test_membership_stacked_nested_leaf 29.1010μs 2.1021μs 475.7169 KOps/s 483.2556 KOps/s $\color{#d91a1a}-1.56\%$
test_membership_nested_last 33.2520μs 3.0776μs 324.9260 KOps/s 321.2040 KOps/s $\color{#35bf28}+1.16\%$
test_membership_nested_leaf_last 29.5820μs 3.1438μs 318.0867 KOps/s 317.5767 KOps/s $\color{#35bf28}+0.16\%$
test_membership_stacked_nested_last 31.7310μs 3.6610μs 273.1500 KOps/s 212.7867 KOps/s $\textbf{\color{#35bf28}+28.37\%}$
test_membership_stacked_nested_leaf_last 32.2020μs 3.6441μs 274.4189 KOps/s 214.7199 KOps/s $\textbf{\color{#35bf28}+27.80\%}$
test_nested_getleaf 39.1120μs 6.1093μs 163.6855 KOps/s 164.1866 KOps/s $\color{#d91a1a}-0.31\%$
test_nested_get 32.5820μs 5.8040μs 172.2964 KOps/s 176.5626 KOps/s $\color{#d91a1a}-2.42\%$
test_stacked_getleaf 62.0730μs 6.1494μs 162.6169 KOps/s 164.3402 KOps/s $\color{#d91a1a}-1.05\%$
test_stacked_get 34.2420μs 5.8078μs 172.1810 KOps/s 173.1613 KOps/s $\color{#d91a1a}-0.57\%$
test_nested_getitemleaf 31.3120μs 6.2693μs 159.5082 KOps/s 158.5850 KOps/s $\color{#35bf28}+0.58\%$
test_nested_getitem 31.8720μs 5.9759μs 167.3394 KOps/s 167.1278 KOps/s $\color{#35bf28}+0.13\%$
test_stacked_getitemleaf 31.2020μs 6.2624μs 159.6843 KOps/s 160.4982 KOps/s $\color{#d91a1a}-0.51\%$
test_stacked_getitem 33.6720μs 5.9442μs 168.2306 KOps/s 168.5256 KOps/s $\color{#d91a1a}-0.18\%$
test_lock_nested 9.7327ms 0.3895ms 2.5677 KOps/s 2.5754 KOps/s $\color{#d91a1a}-0.30\%$
test_lock_stack_nested 0.4425ms 0.3491ms 2.8642 KOps/s 2.8853 KOps/s $\color{#d91a1a}-0.73\%$
test_unlock_nested 0.7180ms 0.3170ms 3.1543 KOps/s 3.1300 KOps/s $\color{#35bf28}+0.78\%$
test_unlock_stack_nested 0.3995ms 0.2883ms 3.4688 KOps/s 3.4992 KOps/s $\color{#d91a1a}-0.87\%$
test_flatten_speed 0.1092ms 75.1480μs 13.3071 KOps/s 13.3478 KOps/s $\color{#d91a1a}-0.30\%$
test_unflatten_speed 0.3859ms 0.3204ms 3.1208 KOps/s 3.0932 KOps/s $\color{#35bf28}+0.89\%$
test_common_ops 1.6579ms 0.5771ms 1.7329 KOps/s 1.6062 KOps/s $\textbf{\color{#35bf28}+7.89\%}$
test_creation 0.1057ms 1.7545μs 569.9631 KOps/s 561.6564 KOps/s $\color{#35bf28}+1.48\%$
test_creation_empty 48.5620μs 6.8486μs 146.0157 KOps/s 105.9181 KOps/s $\textbf{\color{#35bf28}+37.86\%}$
test_creation_nested_1 37.3020μs 8.5745μs 116.6254 KOps/s 89.8712 KOps/s $\textbf{\color{#35bf28}+29.77\%}$
test_creation_nested_2 63.2530μs 11.3301μs 88.2608 KOps/s 71.3109 KOps/s $\textbf{\color{#35bf28}+23.77\%}$
test_clone 76.3940μs 10.1796μs 98.2357 KOps/s 95.9733 KOps/s $\color{#35bf28}+2.36\%$
test_getitem[int] 1.7805ms 11.0506μs 90.4927 KOps/s 90.4705 KOps/s $\color{#35bf28}+0.02\%$
test_getitem[slice_int] 0.1113ms 21.1793μs 47.2159 KOps/s 47.0141 KOps/s $\color{#35bf28}+0.43\%$
test_getitem[range] 0.1457ms 36.9335μs 27.0757 KOps/s 26.9479 KOps/s $\color{#35bf28}+0.47\%$
test_getitem[tuple] 0.1109ms 18.4538μs 54.1893 KOps/s 52.5800 KOps/s $\color{#35bf28}+3.06\%$
test_getitem[list] 0.2230ms 32.4635μs 30.8038 KOps/s 30.5951 KOps/s $\color{#35bf28}+0.68\%$
test_setitem_dim[int] 50.8330μs 18.2308μs 54.8522 KOps/s 56.0910 KOps/s $\color{#d91a1a}-2.21\%$
test_setitem_dim[slice_int] 70.4940μs 37.6286μs 26.5755 KOps/s 26.7904 KOps/s $\color{#d91a1a}-0.80\%$
test_setitem_dim[range] 94.6550μs 51.0595μs 19.5850 KOps/s 19.1238 KOps/s $\color{#35bf28}+2.41\%$
test_setitem_dim[tuple] 60.0930μs 30.8371μs 32.4285 KOps/s 30.9900 KOps/s $\color{#35bf28}+4.64\%$
test_setitem 84.8940μs 13.9279μs 71.7985 KOps/s 65.0391 KOps/s $\textbf{\color{#35bf28}+10.39\%}$
test_set 94.8660μs 13.1618μs 75.9776 KOps/s 67.5034 KOps/s $\textbf{\color{#35bf28}+12.55\%}$
test_set_shared 1.4938ms 0.1479ms 6.7626 KOps/s 6.7893 KOps/s $\color{#d91a1a}-0.39\%$
test_update 0.3804ms 15.6940μs 63.7185 KOps/s 52.1748 KOps/s $\textbf{\color{#35bf28}+22.13\%}$
test_update_nested 93.1350μs 21.1190μs 47.3507 KOps/s 41.0110 KOps/s $\textbf{\color{#35bf28}+15.46\%}$
test_update__nested 0.4824ms 24.8186μs 40.2923 KOps/s 39.6074 KOps/s $\color{#35bf28}+1.73\%$
test_set_nested 80.4940μs 14.6613μs 68.2067 KOps/s 60.9255 KOps/s $\textbf{\color{#35bf28}+11.95\%}$
test_set_nested_new 91.5950μs 16.8707μs 59.2743 KOps/s 53.3406 KOps/s $\textbf{\color{#35bf28}+11.12\%}$
test_select 0.2293ms 29.6433μs 33.7344 KOps/s 32.2647 KOps/s $\color{#35bf28}+4.56\%$
test_select_nested 0.2413ms 43.8745μs 22.7923 KOps/s 22.5435 KOps/s $\color{#35bf28}+1.10\%$
test_exclude_nested 94.3750μs 62.8253μs 15.9172 KOps/s 15.7001 KOps/s $\color{#35bf28}+1.38\%$
test_empty[True] 0.3842ms 0.2892ms 3.4572 KOps/s 3.4725 KOps/s $\color{#d91a1a}-0.44\%$
test_empty[False] 4.6892μs 0.8312μs 1.2031 MOps/s 1.2014 MOps/s $\color{#35bf28}+0.14\%$
test_to 93.2250μs 57.5474μs 17.3770 KOps/s 17.7084 KOps/s $\color{#d91a1a}-1.87\%$
test_to_nonblocking 0.1981ms 47.4605μs 21.0702 KOps/s 21.0740 KOps/s $\color{#d91a1a}-0.02\%$
test_unbind_speed 0.2955ms 0.2372ms 4.2159 KOps/s 4.2092 KOps/s $\color{#35bf28}+0.16\%$
test_unbind_speed_stack0 0.3768ms 0.2369ms 4.2213 KOps/s 4.1764 KOps/s $\color{#35bf28}+1.08\%$
test_unbind_speed_stack1 0.7158ms 0.6140ms 1.6287 KOps/s 1.6522 KOps/s $\color{#d91a1a}-1.42\%$
test_split 95.7698ms 1.6193ms 617.5530 Ops/s 615.2225 Ops/s $\color{#35bf28}+0.38\%$
test_chunk 99.0344ms 1.6204ms 617.1414 Ops/s 612.1799 Ops/s $\color{#35bf28}+0.81\%$
test_consolidate[False-None] 0.1009s 2.9823ms 335.3148 Ops/s 332.0126 Ops/s $\color{#35bf28}+0.99\%$
test_consolidate[default-None] 1.9052ms 1.6998ms 588.2889 Ops/s 589.7314 Ops/s $\color{#d91a1a}-0.24\%$
test_consolidate[reduce-overhead-None] 1.8726ms 1.7400ms 574.6985 Ops/s 572.8154 Ops/s $\color{#35bf28}+0.33\%$
test_consolidate_njt[False-None] 6.7363ms 6.5956ms 151.6159 Ops/s 151.5442 Ops/s $\color{#35bf28}+0.05\%$
test_to[False-False-None] 1.8494ms 1.6914ms 591.2099 Ops/s 600.6610 Ops/s $\color{#d91a1a}-1.57\%$
test_to[True-False-None] 1.5566ms 1.3413ms 745.5382 Ops/s 741.1838 Ops/s $\color{#35bf28}+0.59\%$
test_to[within-False-None] 4.4675ms 4.2294ms 236.4375 Ops/s 237.6263 Ops/s $\color{#d91a1a}-0.50\%$
test_to[True-default-None] 5.5598ms 5.2889ms 189.0760 Ops/s 188.7127 Ops/s $\color{#35bf28}+0.19\%$
test_to_njt[False-False-None] 7.0210ms 6.8296ms 146.4214 Ops/s 145.3480 Ops/s $\color{#35bf28}+0.74\%$
test_to_njt[True-False-None] 5.9306ms 5.5650ms 179.6948 Ops/s 179.7828 Ops/s $\color{#d91a1a}-0.05\%$
test_to_njt[within-False-None] 12.3824ms 12.2270ms 81.7860 Ops/s 80.8158 Ops/s $\color{#35bf28}+1.20\%$
test_creation[device0] 0.5361ms 78.8507μs 12.6822 KOps/s 12.7532 KOps/s $\color{#d91a1a}-0.56\%$
test_creation_from_tensor 0.5296ms 82.0582μs 12.1865 KOps/s 12.0993 KOps/s $\color{#35bf28}+0.72\%$
test_add_one[memmap_tensor0] 0.2403ms 6.2697μs 159.4971 KOps/s 156.8847 KOps/s $\color{#35bf28}+1.67\%$
test_contiguous[memmap_tensor0] 3.6537μs 0.4125μs 2.4244 MOps/s 2.4163 MOps/s $\color{#35bf28}+0.34\%$
test_stack[memmap_tensor0] 0.2128ms 4.6225μs 216.3321 KOps/s 209.5802 KOps/s $\color{#35bf28}+3.22\%$
test_memmaptd_index 1.9841ms 0.2538ms 3.9397 KOps/s 3.8633 KOps/s $\color{#35bf28}+1.98\%$
test_memmaptd_index_astensor 0.9807ms 0.3171ms 3.1539 KOps/s 3.1231 KOps/s $\color{#35bf28}+0.99\%$
test_memmaptd_index_op 0.9487ms 0.5510ms 1.8150 KOps/s 1.6733 KOps/s $\textbf{\color{#35bf28}+8.47\%}$
test_serialize_model 0.1319s 0.1305s 7.6623 Ops/s 7.6294 Ops/s $\color{#35bf28}+0.43\%$
test_serialize_model_pickle 1.3483s 1.2134s 0.8241 Ops/s 0.8209 Ops/s $\color{#35bf28}+0.39\%$
test_serialize_weights 0.1311s 0.1302s 7.6783 Ops/s 7.6882 Ops/s $\color{#d91a1a}-0.13\%$
test_serialize_weights_returnearly 0.3337s 54.9439ms 18.2004 Ops/s 15.5861 Ops/s $\textbf{\color{#35bf28}+16.77\%}$
test_serialize_weights_pickle 1.4002s 1.2264s 0.8154 Ops/s 0.8211 Ops/s $\color{#d91a1a}-0.70\%$
test_reshape_pytree 0.1592ms 22.2274μs 44.9895 KOps/s 43.7700 KOps/s $\color{#35bf28}+2.79\%$
test_reshape_td 61.4540μs 26.8891μs 37.1898 KOps/s 36.4837 KOps/s $\color{#35bf28}+1.94\%$
test_view_pytree 51.0830μs 22.1740μs 45.0979 KOps/s 44.5274 KOps/s $\color{#35bf28}+1.28\%$
test_view_td 87.6250μs 30.1251μs 33.1949 KOps/s 30.5854 KOps/s $\textbf{\color{#35bf28}+8.53\%}$
test_unbind_pytree 0.1487ms 27.8709μs 35.8797 KOps/s 34.8655 KOps/s $\color{#35bf28}+2.91\%$
test_unbind_td 0.8464ms 36.7153μs 27.2366 KOps/s 26.4494 KOps/s $\color{#35bf28}+2.98\%$
test_split_pytree 0.1480ms 30.1305μs 33.1890 KOps/s 32.1021 KOps/s $\color{#35bf28}+3.39\%$
test_split_td 0.2009ms 39.3501μs 25.4129 KOps/s 24.6636 KOps/s $\color{#35bf28}+3.04\%$
test_add_pytree 0.1822ms 34.2780μs 29.1732 KOps/s 30.0866 KOps/s $\color{#d91a1a}-3.04\%$
test_add_td 0.2101ms 48.2735μs 20.7153 KOps/s 20.4069 KOps/s $\color{#35bf28}+1.51\%$
test_compile_add_one_nested[tensordict-compile] 0.2658ms 0.1197ms 8.3568 KOps/s 8.1339 KOps/s $\color{#35bf28}+2.74\%$
test_compile_add_one_nested[tensordict-eager] 0.2790ms 0.1305ms 7.6649 KOps/s 7.6042 KOps/s $\color{#35bf28}+0.80\%$
test_compile_add_one_nested[pytree-compile] 0.2230ms 95.9686μs 10.4201 KOps/s 10.2060 KOps/s $\color{#35bf28}+2.10\%$
test_compile_add_one_nested[pytree-eager] 1.3300ms 0.1469ms 6.8070 KOps/s 6.7421 KOps/s $\color{#35bf28}+0.96\%$
test_compile_copy_nested[tensordict-compile] 0.1609ms 23.3179μs 42.8855 KOps/s 43.4001 KOps/s $\color{#d91a1a}-1.19\%$
test_compile_copy_nested[tensordict-eager] 0.1776ms 29.3493μs 34.0724 KOps/s 33.1579 KOps/s $\color{#35bf28}+2.76\%$
test_compile_copy_nested[pytree-compile] 0.3840ms 65.1327μs 15.3533 KOps/s 15.1766 KOps/s $\color{#35bf28}+1.16\%$
test_compile_copy_nested[pytree-eager] 79.4540μs 49.0630μs 20.3820 KOps/s 19.9642 KOps/s $\color{#35bf28}+2.09\%$
test_compile_add_one_flat[tensordict-compile] 0.2675ms 0.1414ms 7.0735 KOps/s 7.0063 KOps/s $\color{#35bf28}+0.96\%$
test_compile_add_one_flat[tensordict-eager] 0.3995ms 0.2160ms 4.6302 KOps/s 4.5935 KOps/s $\color{#35bf28}+0.80\%$
test_compile_add_one_flat[tensorclass-compile] 0.2260ms 97.4457μs 10.2621 KOps/s 10.1864 KOps/s $\color{#35bf28}+0.74\%$
test_compile_add_one_flat[tensorclass-eager] 0.2357ms 54.3602μs 18.3958 KOps/s 18.6018 KOps/s $\color{#d91a1a}-1.11\%$
test_compile_add_one_flat[pytree-compile] 0.1757ms 0.1344ms 7.4420 KOps/s 7.3611 KOps/s $\color{#35bf28}+1.10\%$
test_compile_add_one_flat[pytree-eager] 0.6233ms 0.4657ms 2.1474 KOps/s 2.1364 KOps/s $\color{#35bf28}+0.52\%$
test_compile_add_self_flat[tensordict-eager] 0.3868ms 0.2577ms 3.8808 KOps/s 3.8467 KOps/s $\color{#35bf28}+0.89\%$
test_compile_add_self_flat[tensordict-compile] 0.2848ms 0.1416ms 7.0613 KOps/s 7.0529 KOps/s $\color{#35bf28}+0.12\%$
test_compile_add_self_flat[tensorclass-eager] 0.2219ms 65.6058μs 15.2425 KOps/s 15.4580 KOps/s $\color{#d91a1a}-1.39\%$
test_compile_add_self_flat[tensorclass-compile] 0.2403ms 97.2314μs 10.2847 KOps/s 10.1804 KOps/s $\color{#35bf28}+1.02\%$
test_compile_add_self_flat[pytree-eager] 0.5451ms 0.3948ms 2.5331 KOps/s 2.5372 KOps/s $\color{#d91a1a}-0.16\%$
test_compile_add_self_flat[pytree-compile] 0.2454ms 0.1341ms 7.4563 KOps/s 7.4456 KOps/s $\color{#35bf28}+0.14\%$
test_compile_copy_flat[tensordict-compile] 0.2092ms 20.8551μs 47.9500 KOps/s 53.9267 KOps/s $\textbf{\color{#d91a1a}-11.08\%}$
test_compile_copy_flat[tensordict-eager] 88.2750μs 31.7113μs 31.5345 KOps/s 32.0948 KOps/s $\color{#d91a1a}-1.75\%$
test_compile_copy_flat[pytree-compile] 99.6360μs 70.1006μs 14.2652 KOps/s 13.9199 KOps/s $\color{#35bf28}+2.48\%$
test_compile_copy_flat[pytree-eager] 80.2040μs 51.6386μs 19.3653 KOps/s 18.8013 KOps/s $\color{#35bf28}+3.00\%$
test_compile_assign_and_add[tensordict-compile] 1.6758ms 0.3999ms 2.5009 KOps/s 2.1940 KOps/s $\textbf{\color{#35bf28}+13.99\%}$
test_compile_assign_and_add[tensordict-eager] 2.8745ms 2.6242ms 381.0739 Ops/s 388.5012 Ops/s $\color{#d91a1a}-1.91\%$
test_compile_assign_and_add[pytree-compile] 1.6514ms 0.4501ms 2.2218 KOps/s 2.2321 KOps/s $\color{#d91a1a}-0.46\%$
test_compile_assign_and_add[pytree-eager] 2.7745ms 2.5771ms 388.0342 Ops/s 388.8494 Ops/s $\color{#d91a1a}-0.21\%$
test_compile_indexing[tensor-tensordict-compile] 0.2812ms 0.1159ms 8.6251 KOps/s 8.9853 KOps/s $\color{#d91a1a}-4.01\%$
test_compile_indexing[tensor-tensordict-eager] 0.5661ms 80.7637μs 12.3818 KOps/s 12.9052 KOps/s $\color{#d91a1a}-4.06\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2856ms 0.1080ms 9.2591 KOps/s 9.7751 KOps/s $\textbf{\color{#d91a1a}-5.28\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.2516ms 69.3905μs 14.4112 KOps/s 15.1051 KOps/s $\color{#d91a1a}-4.59\%$
test_compile_indexing[tensor-pytree-compile] 0.2937ms 0.1106ms 9.0419 KOps/s 9.6942 KOps/s $\textbf{\color{#d91a1a}-6.73\%}$
test_compile_indexing[tensor-pytree-eager] 0.2602ms 69.4142μs 14.4063 KOps/s 15.1129 KOps/s $\color{#d91a1a}-4.68\%$
test_compile_indexing[slice-tensordict-compile] 0.3127ms 0.1021ms 9.7947 KOps/s 9.9744 KOps/s $\color{#d91a1a}-1.80\%$
test_compile_indexing[slice-tensordict-eager] 0.1423ms 17.2539μs 57.9578 KOps/s 53.7358 KOps/s $\textbf{\color{#35bf28}+7.86\%}$
test_compile_indexing[slice-tensorclass-compile] 0.2720ms 99.6251μs 10.0376 KOps/s 10.3490 KOps/s $\color{#d91a1a}-3.01\%$
test_compile_indexing[slice-tensorclass-eager] 0.1497ms 15.9427μs 62.7246 KOps/s 61.7833 KOps/s $\color{#35bf28}+1.52\%$
test_compile_indexing[slice-pytree-compile] 0.2827ms 0.1002ms 9.9779 KOps/s 10.2616 KOps/s $\color{#d91a1a}-2.76\%$
test_compile_indexing[slice-pytree-eager] 0.1244ms 15.6586μs 63.8627 KOps/s 61.9275 KOps/s $\color{#35bf28}+3.12\%$
test_compile_indexing[int-tensordict-compile] 0.2728ms 0.1052ms 9.5063 KOps/s 9.8691 KOps/s $\color{#d91a1a}-3.68\%$
test_compile_indexing[int-tensordict-eager] 0.6960ms 17.8945μs 55.8832 KOps/s 56.7415 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_indexing[int-tensorclass-compile] 0.2546ms 99.3711μs 10.0633 KOps/s 10.2974 KOps/s $\color{#d91a1a}-2.27\%$
test_compile_indexing[int-tensorclass-eager] 0.1369ms 15.8150μs 63.2311 KOps/s 61.6860 KOps/s $\color{#35bf28}+2.50\%$
test_compile_indexing[int-pytree-compile] 0.2222ms 95.9184μs 10.4255 KOps/s 10.3139 KOps/s $\color{#35bf28}+1.08\%$
test_compile_indexing[int-pytree-eager] 48.6030μs 15.7376μs 63.5421 KOps/s 61.2154 KOps/s $\color{#35bf28}+3.80\%$
test_mod_add[eager] 0.2108ms 39.1981μs 25.5115 KOps/s 26.2216 KOps/s $\color{#d91a1a}-2.71\%$
test_mod_add[compile] 0.1463ms 83.8191μs 11.9305 KOps/s 12.2819 KOps/s $\color{#d91a1a}-2.86\%$
test_mod_add[compile-overhead] 0.3240ms 0.1665ms 6.0074 KOps/s 5.4864 KOps/s $\textbf{\color{#35bf28}+9.49\%}$
test_mod_wrap[eager] 0.4398ms 0.2561ms 3.9041 KOps/s 4.0503 KOps/s $\color{#d91a1a}-3.61\%$
test_mod_wrap[compile] 0.4863ms 0.2784ms 3.5919 KOps/s 3.4915 KOps/s $\color{#35bf28}+2.87\%$
test_mod_wrap[compile-overhead] 7.1078ms 3.7244ms 268.4968 Ops/s 275.3967 Ops/s $\color{#d91a1a}-2.51\%$
test_mod_wrap_and_backward[eager] 1.7093ms 1.3499ms 740.7772 Ops/s 697.2666 Ops/s $\textbf{\color{#35bf28}+6.24\%}$
test_mod_wrap_and_backward[compile] 1.4771ms 1.2562ms 796.0803 Ops/s 728.7811 Ops/s $\textbf{\color{#35bf28}+9.23\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3664ms 0.9221ms 1.0845 KOps/s 970.2342 Ops/s $\textbf{\color{#35bf28}+11.77\%}$
test_seq_add[eager] 0.3003ms 0.1179ms 8.4842 KOps/s 8.5146 KOps/s $\color{#d91a1a}-0.36\%$
test_seq_add[compile] 0.5979ms 90.8828μs 11.0032 KOps/s 11.1287 KOps/s $\color{#d91a1a}-1.13\%$
test_seq_add[compile-overhead] 0.2640ms 0.1294ms 7.7280 KOps/s 7.7116 KOps/s $\color{#35bf28}+0.21\%$
test_seq_wrap[eager] 0.5484ms 0.4044ms 2.4725 KOps/s 2.3489 KOps/s $\textbf{\color{#35bf28}+5.26\%}$
test_seq_wrap[compile] 0.5015ms 0.3064ms 3.2640 KOps/s 3.3043 KOps/s $\color{#d91a1a}-1.22\%$
test_seq_wrap[compile-overhead] 0.4083ms 0.2278ms 4.3897 KOps/s 4.4163 KOps/s $\color{#d91a1a}-0.60\%$
test_func_call_runtime[False-eager] 0.9415ms 0.7373ms 1.3564 KOps/s 1.3728 KOps/s $\color{#d91a1a}-1.19\%$
test_func_call_runtime[False-compile] 0.8894ms 0.7366ms 1.3576 KOps/s 1.3351 KOps/s $\color{#35bf28}+1.68\%$
test_func_call_runtime[False-compile-overhead] 0.5057ms 0.3617ms 2.7648 KOps/s 2.7486 KOps/s $\color{#35bf28}+0.59\%$
test_func_call_runtime[True-eager] 1.0984ms 0.8774ms 1.1397 KOps/s 1.1225 KOps/s $\color{#35bf28}+1.53\%$
test_func_call_runtime[True-compile] 0.9063ms 0.7526ms 1.3287 KOps/s 1.3039 KOps/s $\color{#35bf28}+1.90\%$
test_func_call_runtime[True-compile-overhead] 0.4973ms 0.3857ms 2.5925 KOps/s 2.5945 KOps/s $\color{#d91a1a}-0.08\%$
test_func_call_cm_runtime[False-eager] 0.9611ms 0.7165ms 1.3957 KOps/s 1.3830 KOps/s $\color{#35bf28}+0.91\%$
test_func_call_cm_runtime[False-compile] 0.8883ms 0.7385ms 1.3541 KOps/s 1.3346 KOps/s $\color{#35bf28}+1.46\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5015ms 0.3650ms 2.7399 KOps/s 2.7210 KOps/s $\color{#35bf28}+0.70\%$
test_func_call_cm_runtime[True-eager] 1.1142ms 0.9695ms 1.0315 KOps/s 1.0025 KOps/s $\color{#35bf28}+2.90\%$
test_func_call_cm_runtime[True-compile] 0.9205ms 0.7818ms 1.2791 KOps/s 1.2587 KOps/s $\color{#35bf28}+1.63\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6325ms 0.4105ms 2.4362 KOps/s 2.4282 KOps/s $\color{#35bf28}+0.33\%$
test_vmap_func_call_cm_runtime[eager] 2.5136ms 2.0144ms 496.4296 Ops/s 490.3833 Ops/s $\color{#35bf28}+1.23\%$
test_vmap_func_call_cm_runtime[compile] 1.0242ms 0.8223ms 1.2161 KOps/s 1.2296 KOps/s $\color{#d91a1a}-1.10\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5551ms 0.4117ms 2.4287 KOps/s 2.4121 KOps/s $\color{#35bf28}+0.69\%$
test_distributed 5.7234ms 0.2998ms 3.3359 KOps/s 7.4103 KOps/s $\textbf{\color{#d91a1a}-54.98\%}$
test_tdmodule 51.8830μs 19.7034μs 50.7526 KOps/s 47.7680 KOps/s $\textbf{\color{#35bf28}+6.25\%}$
test_tdmodule_dispatch 0.2023ms 35.1029μs 28.4877 KOps/s 27.3016 KOps/s $\color{#35bf28}+4.34\%$
test_tdseq 75.1940μs 20.9274μs 47.7842 KOps/s 46.7695 KOps/s $\color{#35bf28}+2.17\%$
test_tdseq_dispatch 0.1582ms 38.7413μs 25.8122 KOps/s 25.2797 KOps/s $\color{#35bf28}+2.11\%$
test_instantiation_functorch 1.7020ms 1.5733ms 635.5967 Ops/s 640.6480 Ops/s $\color{#d91a1a}-0.79\%$
test_exec_functorch 0.2365ms 0.1420ms 7.0416 KOps/s 7.0422 KOps/s $-0.01\%$
test_exec_functional_call 0.3232ms 0.1315ms 7.6019 KOps/s 7.4933 KOps/s $\color{#35bf28}+1.45\%$
test_exec_td_decorator 0.4062ms 0.1902ms 5.2581 KOps/s 5.4487 KOps/s $\color{#d91a1a}-3.50\%$
test_vmap_mlp_speed_decorator[True-True] 0.8253ms 0.6637ms 1.5067 KOps/s 1.4895 KOps/s $\color{#35bf28}+1.15\%$
test_vmap_mlp_speed_decorator[True-False] 0.8405ms 0.6666ms 1.5002 KOps/s 1.4777 KOps/s $\color{#35bf28}+1.52\%$
test_vmap_mlp_speed_decorator[False-True] 0.8019ms 0.5917ms 1.6900 KOps/s 1.7144 KOps/s $\color{#d91a1a}-1.43\%$
test_vmap_mlp_speed_decorator[False-False] 0.8111ms 0.5986ms 1.6706 KOps/s 1.7158 KOps/s $\color{#d91a1a}-2.64\%$
test_vmap_transformer_speed_decorator[True-True] 19.3622ms 18.6550ms 53.6050 Ops/s 53.3398 Ops/s $\color{#35bf28}+0.50\%$
test_vmap_transformer_speed_decorator[True-False] 19.4611ms 18.6961ms 53.4872 Ops/s 53.2849 Ops/s $\color{#35bf28}+0.38\%$
test_vmap_transformer_speed_decorator[False-True] 18.8643ms 18.4753ms 54.1264 Ops/s 53.7772 Ops/s $\color{#35bf28}+0.65\%$
test_vmap_transformer_speed_decorator[False-False] 19.2365ms 18.5605ms 53.8779 Ops/s 53.7467 Ops/s $\color{#35bf28}+0.24\%$
test_to_module_speed[True] 1.0968ms 0.9606ms 1.0411 KOps/s 1.0201 KOps/s $\color{#35bf28}+2.06\%$
test_to_module_speed[False] 1.3516ms 0.9541ms 1.0481 KOps/s 1.0468 KOps/s $\color{#35bf28}+0.13\%$
test_tc_init 71.0240μs 36.8684μs 27.1235 KOps/s 26.0737 KOps/s $\color{#35bf28}+4.03\%$
test_tc_init_nested 0.1390ms 73.2858μs 13.6452 KOps/s 13.1872 KOps/s $\color{#35bf28}+3.47\%$
test_tc_first_layer_tensor 7.6376μs 0.6903μs 1.4487 MOps/s 1.4123 MOps/s $\color{#35bf28}+2.58\%$
test_tc_first_layer_nontensor 29.2610μs 2.3363μs 428.0361 KOps/s 425.1189 KOps/s $\color{#35bf28}+0.69\%$
test_tc_second_layer_tensor 10.5607μs 1.4212μs 703.6193 KOps/s 700.4956 KOps/s $\color{#35bf28}+0.45\%$
test_tc_second_layer_nontensor 38.6720μs 3.1060μs 321.9536 KOps/s 325.7942 KOps/s $\color{#d91a1a}-1.18\%$
test_unbind 0.2443s 10.6312ms 94.0624 Ops/s 143.9015 Ops/s $\textbf{\color{#d91a1a}-34.63\%}$
test_full_like 11.9940ms 10.1034ms 98.9770 Ops/s 99.0367 Ops/s $\color{#d91a1a}-0.06\%$
test_zeros_like 9.4859ms 7.2940ms 137.0986 Ops/s 112.8620 Ops/s $\textbf{\color{#35bf28}+21.47\%}$
test_ones_like 5.0588ms 4.3642ms 229.1362 Ops/s 221.9504 Ops/s $\color{#35bf28}+3.24\%$
test_clone 7.5834ms 7.0421ms 142.0027 Ops/s 139.6824 Ops/s $\color{#35bf28}+1.66\%$
test_squeeze 61.1940μs 9.5538μs 104.6709 KOps/s 104.1985 KOps/s $\color{#35bf28}+0.45\%$
test_unsqueeze 0.2318ms 73.8532μs 13.5404 KOps/s 14.0328 KOps/s $\color{#d91a1a}-3.51\%$
test_split 0.3245ms 0.1626ms 6.1519 KOps/s 6.1592 KOps/s $\color{#d91a1a}-0.12\%$
test_permute 0.3507ms 0.1809ms 5.5279 KOps/s 5.4709 KOps/s $\color{#35bf28}+1.04\%$
test_stack 52.7764ms 52.1104ms 19.1900 Ops/s 19.2250 Ops/s $\color{#d91a1a}-0.18\%$
test_cat 52.7344ms 52.1113ms 19.1897 Ops/s 19.2543 Ops/s $\color{#d91a1a}-0.34\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 18, 2024
…d to ProbabilisticTensorDictSequential.log_prob

ghstack-source-id: 326d0763c9bbb13b51daac91edca4f0e821adf62
Pull Request resolved: #1146
@vmoens vmoens added the enhancement New feature or request label Dec 18, 2024
@vmoens vmoens merged commit 2b9ef57 into gh/vmoens/40/base Dec 19, 2024
28 of 55 checks passed
vmoens added a commit that referenced this pull request Dec 19, 2024
…d to ProbabilisticTensorDictSequential.log_prob

ghstack-source-id: 326d0763c9bbb13b51daac91edca4f0e821adf62
Pull Request resolved: #1146
@vmoens vmoens deleted the gh/vmoens/40/head branch December 19, 2024 08:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants