Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Refactor] Refactor context managers #1098

Merged
merged 7 commits into from
Nov 25, 2024
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 20, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 20, 2024
ghstack-source-id: 9666ab7653889907c139ba14eb2cb4e6151c3f5d
Pull Request resolved: #1098
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 20, 2024
Copy link

github-actions bot commented Nov 20, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 41.3470μs 17.6624μs 56.6174 KOps/s 56.7636 KOps/s $\color{#d91a1a}-0.26\%$
test_plain_set_stack_nested 41.3370μs 17.9069μs 55.8444 KOps/s 55.8640 KOps/s $\color{#d91a1a}-0.03\%$
test_plain_set_nested_inplace 73.1170μs 19.4251μs 51.4799 KOps/s 51.1795 KOps/s $\color{#35bf28}+0.59\%$
test_plain_set_stack_nested_inplace 51.0550μs 19.6623μs 50.8587 KOps/s 51.0534 KOps/s $\color{#d91a1a}-0.38\%$
test_items 32.1600μs 4.1826μs 239.0841 KOps/s 232.1014 KOps/s $\color{#35bf28}+3.01\%$
test_items_nested 0.5594ms 0.3424ms 2.9206 KOps/s 2.9346 KOps/s $\color{#d91a1a}-0.48\%$
test_items_nested_locked 0.4949ms 0.3422ms 2.9222 KOps/s 2.9058 KOps/s $\color{#35bf28}+0.56\%$
test_items_nested_leaf 0.1339ms 72.5926μs 13.7755 KOps/s 13.7323 KOps/s $\color{#35bf28}+0.31\%$
test_items_stack_nested 0.4782ms 0.3465ms 2.8863 KOps/s 2.8425 KOps/s $\color{#35bf28}+1.54\%$
test_items_stack_nested_leaf 0.1358ms 75.7729μs 13.1973 KOps/s 13.5517 KOps/s $\color{#d91a1a}-2.61\%$
test_items_stack_nested_locked 0.4344ms 0.3478ms 2.8752 KOps/s 2.8641 KOps/s $\color{#35bf28}+0.39\%$
test_keys 41.6180μs 3.4839μs 287.0377 KOps/s 262.5652 KOps/s $\textbf{\color{#35bf28}+9.32\%}$
test_keys_nested 0.2305ms 0.1348ms 7.4209 KOps/s 7.3786 KOps/s $\color{#35bf28}+0.57\%$
test_keys_nested_locked 1.8704ms 0.1396ms 7.1634 KOps/s 7.0522 KOps/s $\color{#35bf28}+1.58\%$
test_keys_nested_leaf 0.2083ms 0.1160ms 8.6199 KOps/s 8.5777 KOps/s $\color{#35bf28}+0.49\%$
test_keys_stack_nested 0.2225ms 0.1355ms 7.3801 KOps/s 7.4044 KOps/s $\color{#d91a1a}-0.33\%$
test_keys_stack_nested_leaf 0.1986ms 0.1163ms 8.6006 KOps/s 8.7592 KOps/s $\color{#d91a1a}-1.81\%$
test_keys_stack_nested_locked 0.2427ms 0.1399ms 7.1464 KOps/s 7.1982 KOps/s $\color{#d91a1a}-0.72\%$
test_values 7.0752μs 1.0378μs 963.6228 KOps/s 983.2852 KOps/s $\color{#d91a1a}-2.00\%$
test_values_nested 0.1021ms 55.5676μs 17.9961 KOps/s 18.0888 KOps/s $\color{#d91a1a}-0.51\%$
test_values_nested_locked 0.1020ms 55.3077μs 18.0807 KOps/s 18.1247 KOps/s $\color{#d91a1a}-0.24\%$
test_values_nested_leaf 0.1070ms 60.1119μs 16.6356 KOps/s 16.4418 KOps/s $\color{#35bf28}+1.18\%$
test_values_stack_nested 0.1073ms 56.5165μs 17.6940 KOps/s 17.4677 KOps/s $\color{#35bf28}+1.30\%$
test_values_stack_nested_leaf 0.1263ms 60.3562μs 16.5683 KOps/s 16.5024 KOps/s $\color{#35bf28}+0.40\%$
test_values_stack_nested_locked 0.1187ms 56.8134μs 17.6015 KOps/s 17.7180 KOps/s $\color{#d91a1a}-0.66\%$
test_membership 16.1210μs 0.8972μs 1.1145 MOps/s 1.1659 MOps/s $\color{#d91a1a}-4.40\%$
test_membership_nested 23.9750μs 2.7375μs 365.3033 KOps/s 368.3903 KOps/s $\color{#d91a1a}-0.84\%$
test_membership_nested_leaf 29.6050μs 2.7703μs 360.9690 KOps/s 365.1590 KOps/s $\color{#d91a1a}-1.15\%$
test_membership_stacked_nested 22.5420μs 2.7218μs 367.4016 KOps/s 360.6474 KOps/s $\color{#35bf28}+1.87\%$
test_membership_stacked_nested_leaf 30.2470μs 2.7456μs 364.2236 KOps/s 368.7912 KOps/s $\color{#d91a1a}-1.24\%$
test_membership_nested_last 26.3890μs 4.1082μs 243.4155 KOps/s 245.6866 KOps/s $\color{#d91a1a}-0.92\%$
test_membership_nested_leaf_last 33.7330μs 4.0536μs 246.6952 KOps/s 245.2453 KOps/s $\color{#35bf28}+0.59\%$
test_membership_stacked_nested_last 29.8650μs 6.1769μs 161.8947 KOps/s 77.6187 KOps/s $\textbf{\color{#35bf28}+108.58\%}$
test_membership_stacked_nested_leaf_last 30.3470μs 6.0524μs 165.2245 KOps/s 77.3379 KOps/s $\textbf{\color{#35bf28}+113.64\%}$
test_nested_getleaf 46.5070μs 10.5216μs 95.0426 KOps/s 94.5150 KOps/s $\color{#35bf28}+0.56\%$
test_nested_get 49.3420μs 10.0916μs 99.0923 KOps/s 99.5563 KOps/s $\color{#d91a1a}-0.47\%$
test_stacked_getleaf 45.7190μs 10.4543μs 95.6540 KOps/s 95.3065 KOps/s $\color{#35bf28}+0.36\%$
test_stacked_get 33.6530μs 10.6972μs 93.4827 KOps/s 98.7096 KOps/s $\textbf{\color{#d91a1a}-5.30\%}$
test_nested_getitemleaf 53.1190μs 10.9957μs 90.9443 KOps/s 90.4221 KOps/s $\color{#35bf28}+0.58\%$
test_nested_getitem 54.2120μs 10.2243μs 97.8064 KOps/s 95.3629 KOps/s $\color{#35bf28}+2.56\%$
test_stacked_getitemleaf 38.2410μs 11.0913μs 90.1607 KOps/s 92.2064 KOps/s $\color{#d91a1a}-2.22\%$
test_stacked_getitem 46.9380μs 10.2852μs 97.2269 KOps/s 97.7581 KOps/s $\color{#d91a1a}-0.54\%$
test_lock_nested 4.4064ms 0.4410ms 2.2677 KOps/s 1.8491 KOps/s $\textbf{\color{#35bf28}+22.63\%}$
test_lock_stack_nested 0.6473ms 0.4111ms 2.4326 KOps/s 2.5126 KOps/s $\color{#d91a1a}-3.18\%$
test_unlock_nested 0.7465ms 0.3559ms 2.8096 KOps/s 2.7658 KOps/s $\color{#35bf28}+1.58\%$
test_unlock_stack_nested 0.6550ms 0.3308ms 3.0225 KOps/s 3.1630 KOps/s $\color{#d91a1a}-4.44\%$
test_flatten_speed 0.1700ms 92.5366μs 10.8065 KOps/s 10.8869 KOps/s $\color{#d91a1a}-0.74\%$
test_unflatten_speed 0.8051ms 0.4800ms 2.0835 KOps/s 2.1300 KOps/s $\color{#d91a1a}-2.19\%$
test_common_ops 5.9916ms 0.7563ms 1.3222 KOps/s 1.3226 KOps/s $\color{#d91a1a}-0.03\%$
test_creation 72.6960μs 2.1255μs 470.4715 KOps/s 479.5434 KOps/s $\color{#d91a1a}-1.89\%$
test_creation_empty 42.2590μs 10.6214μs 94.1499 KOps/s 96.9390 KOps/s $\color{#d91a1a}-2.88\%$
test_creation_nested_1 41.2370μs 13.3425μs 74.9487 KOps/s 76.7621 KOps/s $\color{#d91a1a}-2.36\%$
test_creation_nested_2 50.5440μs 17.5625μs 56.9396 KOps/s 58.3215 KOps/s $\color{#d91a1a}-2.37\%$
test_clone 63.7690μs 12.9650μs 77.1305 KOps/s 77.6310 KOps/s $\color{#d91a1a}-0.64\%$
test_getitem[int] 0.8954ms 12.4510μs 80.3151 KOps/s 79.8338 KOps/s $\color{#35bf28}+0.60\%$
test_getitem[slice_int] 0.1418ms 23.5893μs 42.3922 KOps/s 43.2876 KOps/s $\color{#d91a1a}-2.07\%$
test_getitem[range] 0.2345ms 46.9362μs 21.3055 KOps/s 21.7915 KOps/s $\color{#d91a1a}-2.23\%$
test_getitem[tuple] 0.1317ms 19.6531μs 50.8825 KOps/s 52.5643 KOps/s $\color{#d91a1a}-3.20\%$
test_getitem[list] 0.2396ms 42.6182μs 23.4641 KOps/s 24.0290 KOps/s $\color{#d91a1a}-2.35\%$
test_setitem_dim[int] 42.8000μs 24.6545μs 40.5605 KOps/s 41.5562 KOps/s $\color{#d91a1a}-2.40\%$
test_setitem_dim[slice_int] 85.8410μs 50.1648μs 19.9343 KOps/s 19.9411 KOps/s $\color{#d91a1a}-0.03\%$
test_setitem_dim[range] 0.1514ms 75.1223μs 13.3116 KOps/s 13.8293 KOps/s $\color{#d91a1a}-3.74\%$
test_setitem_dim[tuple] 64.6310μs 39.6102μs 25.2461 KOps/s 25.2968 KOps/s $\color{#d91a1a}-0.20\%$
test_setitem 91.2410μs 19.5651μs 51.1115 KOps/s 52.4296 KOps/s $\color{#d91a1a}-2.51\%$
test_set 97.1720μs 19.0811μs 52.4079 KOps/s 53.6435 KOps/s $\color{#d91a1a}-2.30\%$
test_set_shared 4.3407ms 0.1674ms 5.9739 KOps/s 6.0585 KOps/s $\color{#d91a1a}-1.40\%$
test_update 0.1784ms 21.5802μs 46.3388 KOps/s 47.7035 KOps/s $\color{#d91a1a}-2.86\%$
test_update_nested 0.1473ms 30.7722μs 32.4969 KOps/s 32.9577 KOps/s $\color{#d91a1a}-1.40\%$
test_update__nested 0.4277ms 32.0378μs 31.2131 KOps/s 32.9269 KOps/s $\textbf{\color{#d91a1a}-5.20\%}$
test_set_nested 0.1237ms 20.8514μs 47.9584 KOps/s 49.5198 KOps/s $\color{#d91a1a}-3.15\%$
test_set_nested_new 0.1300ms 25.4356μs 39.3150 KOps/s 40.2267 KOps/s $\color{#d91a1a}-2.27\%$
test_select 0.2174ms 41.7443μs 23.9554 KOps/s 24.6740 KOps/s $\color{#d91a1a}-2.91\%$
test_select_nested 0.1240ms 60.7423μs 16.4630 KOps/s 16.9252 KOps/s $\color{#d91a1a}-2.73\%$
test_exclude_nested 0.1762ms 76.1823μs 13.1264 KOps/s 13.4922 KOps/s $\color{#d91a1a}-2.71\%$
test_empty[True] 0.6499ms 0.3490ms 2.8651 KOps/s 2.8627 KOps/s $\color{#35bf28}+0.08\%$
test_empty[False] 15.4613μs 1.2337μs 810.5962 KOps/s 791.2869 KOps/s $\color{#35bf28}+2.44\%$
test_unbind_speed 0.3530ms 0.2600ms 3.8461 KOps/s 3.8488 KOps/s $\color{#d91a1a}-0.07\%$
test_unbind_speed_stack0 0.5594ms 0.2585ms 3.8690 KOps/s 3.9919 KOps/s $\color{#d91a1a}-3.08\%$
test_unbind_speed_stack1 0.1064s 0.7018ms 1.4249 KOps/s 1.4493 KOps/s $\color{#d91a1a}-1.69\%$
test_split 99.1645ms 1.7286ms 578.5172 Ops/s 580.6146 Ops/s $\color{#d91a1a}-0.36\%$
test_chunk 0.1031s 1.7166ms 582.5351 Ops/s 577.3398 Ops/s $\color{#35bf28}+0.90\%$
test_consolidate_njt[False-None] 0.1115s 9.0389ms 110.6335 Ops/s 121.2632 Ops/s $\textbf{\color{#d91a1a}-8.77\%}$
test_creation[device0] 0.2287ms 89.0368μs 11.2313 KOps/s 10.9725 KOps/s $\color{#35bf28}+2.36\%$
test_creation_from_tensor 3.3852ms 92.4340μs 10.8185 KOps/s 10.4939 KOps/s $\color{#35bf28}+3.09\%$
test_add_one[memmap_tensor0] 0.3162ms 4.8127μs 207.7850 KOps/s 212.0505 KOps/s $\color{#d91a1a}-2.01\%$
test_contiguous[memmap_tensor0] 10.9310μs 0.5015μs 1.9940 MOps/s 1.9726 MOps/s $\color{#35bf28}+1.08\%$
test_stack[memmap_tensor0] 40.8560μs 3.2939μs 303.5891 KOps/s 300.7255 KOps/s $\color{#35bf28}+0.95\%$
test_memmaptd_index 1.1231ms 0.2339ms 4.2753 KOps/s 4.3147 KOps/s $\color{#d91a1a}-0.91\%$
test_memmaptd_index_astensor 0.5767ms 0.3089ms 3.2375 KOps/s 3.2503 KOps/s $\color{#d91a1a}-0.39\%$
test_memmaptd_index_op 1.0152ms 0.5624ms 1.7780 KOps/s 1.7785 KOps/s $\color{#d91a1a}-0.03\%$
test_serialize_model 0.1237s 0.1155s 8.6578 Ops/s 7.5609 Ops/s $\textbf{\color{#35bf28}+14.51\%}$
test_serialize_model_pickle 0.4425s 0.3889s 2.5716 Ops/s 2.4947 Ops/s $\color{#35bf28}+3.09\%$
test_serialize_weights 0.2174s 0.1281s 7.8074 Ops/s 8.5053 Ops/s $\textbf{\color{#d91a1a}-8.21\%}$
test_serialize_weights_returnearly 0.1685s 0.1558s 6.4203 Ops/s 6.3006 Ops/s $\color{#35bf28}+1.90\%$
test_serialize_weights_pickle 0.4829s 0.4332s 2.3086 Ops/s 2.4973 Ops/s $\textbf{\color{#d91a1a}-7.55\%}$
test_serialize_weights_filesystem 0.1540s 0.1433s 6.9779 Ops/s 6.9060 Ops/s $\color{#35bf28}+1.04\%$
test_serialize_model_filesystem 0.2520s 0.1655s 6.0426 Ops/s 6.6536 Ops/s $\textbf{\color{#d91a1a}-9.18\%}$
test_reshape_pytree 65.9230μs 26.5034μs 37.7310 KOps/s 35.7981 KOps/s $\textbf{\color{#35bf28}+5.40\%}$
test_reshape_td 69.4700μs 32.6564μs 30.6219 KOps/s 30.3447 KOps/s $\color{#35bf28}+0.91\%$
test_view_pytree 76.8540μs 27.1380μs 36.8487 KOps/s 37.7258 KOps/s $\color{#d91a1a}-2.32\%$
test_view_td 99.7160μs 37.0768μs 26.9711 KOps/s 26.9687 KOps/s $+0.01\%$
test_unbind_pytree 63.4190μs 29.7006μs 33.6694 KOps/s 33.4410 KOps/s $\color{#35bf28}+0.68\%$
test_unbind_td 0.3104ms 37.6894μs 26.5327 KOps/s 26.1285 KOps/s $\color{#35bf28}+1.55\%$
test_split_pytree 69.0690μs 29.3075μs 34.1209 KOps/s 34.4503 KOps/s $\color{#d91a1a}-0.96\%$
test_split_td 0.5012ms 43.5336μs 22.9708 KOps/s 23.2882 KOps/s $\color{#d91a1a}-1.36\%$
test_add_pytree 0.1200ms 35.6907μs 28.0185 KOps/s 28.6158 KOps/s $\color{#d91a1a}-2.09\%$
test_add_td 0.1326ms 53.0812μs 18.8391 KOps/s 18.2367 KOps/s $\color{#35bf28}+3.30\%$
test_compile_add_one_nested[tensordict-compile] 0.1486ms 61.9162μs 16.1509 KOps/s 16.1148 KOps/s $\color{#35bf28}+0.22\%$
test_compile_add_one_nested[tensordict-eager] 0.3685ms 0.1601ms 6.2442 KOps/s 6.3673 KOps/s $\color{#d91a1a}-1.93\%$
test_compile_add_one_nested[pytree-compile] 0.1109ms 44.1145μs 22.6683 KOps/s 22.0807 KOps/s $\color{#35bf28}+2.66\%$
test_compile_add_one_nested[pytree-eager] 0.2262ms 0.1185ms 8.4409 KOps/s 8.5051 KOps/s $\color{#d91a1a}-0.76\%$
test_compile_copy_nested[tensordict-compile] 87.1670μs 26.6743μs 37.4893 KOps/s 38.6247 KOps/s $\color{#d91a1a}-2.94\%$
test_compile_copy_nested[tensordict-eager] 0.1085ms 53.3582μs 18.7412 KOps/s 18.4736 KOps/s $\color{#35bf28}+1.45\%$
test_compile_copy_nested[pytree-compile] 0.1468ms 78.2186μs 12.7847 KOps/s 12.7749 KOps/s $\color{#35bf28}+0.08\%$
test_compile_copy_nested[pytree-eager] 0.1316ms 68.1005μs 14.6842 KOps/s 14.7767 KOps/s $\color{#d91a1a}-0.63\%$
test_compile_add_one_flat[tensordict-compile] 0.1634ms 0.1042ms 9.5985 KOps/s 9.4652 KOps/s $\color{#35bf28}+1.41\%$
test_compile_add_one_flat[tensordict-eager] 0.2770ms 0.1989ms 5.0286 KOps/s 5.0286 KOps/s $-0.00\%$
test_compile_add_one_flat[tensorclass-compile] 0.1242ms 44.4186μs 22.5131 KOps/s 21.7060 KOps/s $\color{#35bf28}+3.72\%$
test_compile_add_one_flat[tensorclass-eager] 0.4600ms 59.8997μs 16.6946 KOps/s 16.6343 KOps/s $\color{#35bf28}+0.36\%$
test_compile_add_one_flat[pytree-compile] 0.1606ms 0.1019ms 9.8132 KOps/s 9.9503 KOps/s $\color{#d91a1a}-1.38\%$
test_compile_add_one_flat[pytree-eager] 0.3754ms 0.2043ms 4.8953 KOps/s 5.0000 KOps/s $\color{#d91a1a}-2.09\%$
test_compile_add_self_flat[tensordict-eager] 0.3088ms 0.2105ms 4.7510 KOps/s 4.8133 KOps/s $\color{#d91a1a}-1.29\%$
test_compile_add_self_flat[tensordict-compile] 0.2421ms 0.1052ms 9.5086 KOps/s 9.6548 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_add_self_flat[tensorclass-eager] 0.1840ms 54.7347μs 18.2700 KOps/s 19.0334 KOps/s $\color{#d91a1a}-4.01\%$
test_compile_add_self_flat[tensorclass-compile] 94.2760μs 45.6476μs 21.9070 KOps/s 22.0266 KOps/s $\color{#d91a1a}-0.54\%$
test_compile_add_self_flat[pytree-eager] 0.6233ms 0.1606ms 6.2279 KOps/s 6.4152 KOps/s $\color{#d91a1a}-2.92\%$
test_compile_add_self_flat[pytree-compile] 0.1883ms 0.1029ms 9.7192 KOps/s 9.7409 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_copy_flat[tensordict-compile] 86.4070μs 22.1924μs 45.0604 KOps/s 46.3956 KOps/s $\color{#d91a1a}-2.88\%$
test_compile_copy_flat[tensordict-eager] 0.1230ms 59.0104μs 16.9462 KOps/s 16.7845 KOps/s $\color{#35bf28}+0.96\%$
test_compile_copy_flat[pytree-compile] 0.1608ms 81.5886μs 12.2566 KOps/s 12.5578 KOps/s $\color{#d91a1a}-2.40\%$
test_compile_copy_flat[pytree-eager] 0.1465ms 70.6854μs 14.1472 KOps/s 14.8474 KOps/s $\color{#d91a1a}-4.72\%$
test_compile_assign_and_add[tensordict-compile] 0.4186ms 0.2129ms 4.6979 KOps/s 4.8696 KOps/s $\color{#d91a1a}-3.53\%$
test_compile_assign_and_add[tensordict-eager] 1.5416ms 1.2664ms 789.6654 Ops/s 794.4088 Ops/s $\color{#d91a1a}-0.60\%$
test_compile_assign_and_add[pytree-compile] 0.3772ms 0.2061ms 4.8520 KOps/s 4.9671 KOps/s $\color{#d91a1a}-2.32\%$
test_compile_assign_and_add[pytree-eager] 1.3810ms 0.7854ms 1.2733 KOps/s 1.3004 KOps/s $\color{#d91a1a}-2.09\%$
test_compile_assign_and_add_stack[compile] 0.5901ms 0.4708ms 2.1241 KOps/s 2.2203 KOps/s $\color{#d91a1a}-4.33\%$
test_compile_assign_and_add_stack[eager] 3.9344ms 2.5674ms 389.4997 Ops/s 393.3065 Ops/s $\color{#d91a1a}-0.97\%$
test_compile_indexing[tensor-tensordict-compile] 95.1980μs 35.8714μs 27.8773 KOps/s 28.6317 KOps/s $\color{#d91a1a}-2.63\%$
test_compile_indexing[tensor-tensordict-eager] 0.6943ms 31.5340μs 31.7118 KOps/s 32.3085 KOps/s $\color{#d91a1a}-1.85\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1091ms 28.8126μs 34.7071 KOps/s 35.5802 KOps/s $\color{#d91a1a}-2.45\%$
test_compile_indexing[tensor-tensorclass-eager] 83.2550μs 22.7275μs 43.9996 KOps/s 44.4863 KOps/s $\color{#d91a1a}-1.09\%$
test_compile_indexing[tensor-pytree-compile] 89.9980μs 29.5121μs 33.8844 KOps/s 33.9752 KOps/s $\color{#d91a1a}-0.27\%$
test_compile_indexing[tensor-pytree-eager] 84.5780μs 22.9006μs 43.6670 KOps/s 44.5172 KOps/s $\color{#d91a1a}-1.91\%$
test_compile_indexing[slice-tensordict-compile] 0.1219ms 51.3783μs 19.4635 KOps/s 19.6617 KOps/s $\color{#d91a1a}-1.01\%$
test_compile_indexing[slice-tensordict-eager] 0.5457ms 19.2141μs 52.0452 KOps/s 52.9059 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_indexing[slice-tensorclass-compile] 0.1002ms 43.6058μs 22.9328 KOps/s 22.9951 KOps/s $\color{#d91a1a}-0.27\%$
test_compile_indexing[slice-tensorclass-eager] 57.0470μs 18.9612μs 52.7392 KOps/s 54.4758 KOps/s $\color{#d91a1a}-3.19\%$
test_compile_indexing[slice-pytree-compile] 0.1411ms 44.6619μs 22.3904 KOps/s 22.5197 KOps/s $\color{#d91a1a}-0.57\%$
test_compile_indexing[slice-pytree-eager] 64.8210μs 18.8073μs 53.1707 KOps/s 54.5906 KOps/s $\color{#d91a1a}-2.60\%$
test_compile_indexing[int-tensordict-compile] 0.1318ms 52.4848μs 19.0531 KOps/s 19.1540 KOps/s $\color{#d91a1a}-0.53\%$
test_compile_indexing[int-tensordict-eager] 0.8613ms 19.1538μs 52.2090 KOps/s 53.1633 KOps/s $\color{#d91a1a}-1.80\%$
test_compile_indexing[int-tensorclass-compile] 0.1243ms 45.8399μs 21.8150 KOps/s 22.6748 KOps/s $\color{#d91a1a}-3.79\%$
test_compile_indexing[int-tensorclass-eager] 66.6550μs 18.6447μs 53.6346 KOps/s 54.5286 KOps/s $\color{#d91a1a}-1.64\%$
test_compile_indexing[int-pytree-compile] 93.3450μs 44.1838μs 22.6327 KOps/s 22.6266 KOps/s $\color{#35bf28}+0.03\%$
test_compile_indexing[int-pytree-eager] 60.6130μs 18.6453μs 53.6328 KOps/s 54.5548 KOps/s $\color{#d91a1a}-1.69\%$
test_mod_add[eager] 94.0750μs 29.7088μs 33.6601 KOps/s 37.8884 KOps/s $\textbf{\color{#d91a1a}-11.16\%}$
test_mod_add[compile] 89.6280μs 44.3149μs 22.5658 KOps/s 22.9864 KOps/s $\color{#d91a1a}-1.83\%$
test_mod_add[compile-overhead] 0.1091ms 45.3179μs 22.0663 KOps/s 22.2194 KOps/s $\color{#d91a1a}-0.69\%$
test_mod_wrap[eager] 0.3533ms 0.2185ms 4.5776 KOps/s 4.7970 KOps/s $\color{#d91a1a}-4.57\%$
test_mod_wrap[compile] 1.7766ms 0.2052ms 4.8743 KOps/s 4.9371 KOps/s $\color{#d91a1a}-1.27\%$
test_mod_wrap[compile-overhead] 2.1775ms 0.2019ms 4.9533 KOps/s 5.0552 KOps/s $\color{#d91a1a}-2.02\%$
test_mod_wrap_and_backward[eager] 15.2559ms 12.0860ms 82.7401 Ops/s 84.1716 Ops/s $\color{#d91a1a}-1.70\%$
test_mod_wrap_and_backward[compile] 12.7849ms 11.0617ms 90.4019 Ops/s 79.7100 Ops/s $\textbf{\color{#35bf28}+13.41\%}$
test_mod_wrap_and_backward[compile-overhead] 16.6570ms 13.7658ms 72.6438 Ops/s 80.1235 Ops/s $\textbf{\color{#d91a1a}-9.34\%}$
test_seq_add[eager] 0.2060ms 0.1022ms 9.7830 KOps/s 10.9897 KOps/s $\textbf{\color{#d91a1a}-10.98\%}$
test_seq_add[compile] 0.1301ms 59.7009μs 16.7502 KOps/s 17.2070 KOps/s $\color{#d91a1a}-2.65\%$
test_seq_add[compile-overhead] 0.1303ms 58.0787μs 17.2180 KOps/s 17.6799 KOps/s $\color{#d91a1a}-2.61\%$
test_seq_wrap[eager] 0.7266ms 0.4059ms 2.4638 KOps/s 2.6247 KOps/s $\textbf{\color{#d91a1a}-6.13\%}$
test_seq_wrap[compile] 0.3608ms 0.2271ms 4.4034 KOps/s 4.5584 KOps/s $\color{#d91a1a}-3.40\%$
test_seq_wrap[compile-overhead] 0.4508ms 0.2261ms 4.4231 KOps/s 4.5013 KOps/s $\color{#d91a1a}-1.74\%$
test_func_call_runtime[False-eager] 1.2951ms 0.5542ms 1.8043 KOps/s 1.8847 KOps/s $\color{#d91a1a}-4.27\%$
test_func_call_runtime[False-compile] 0.8075ms 0.4288ms 2.3320 KOps/s 2.4159 KOps/s $\color{#d91a1a}-3.47\%$
test_func_call_runtime[False-compile-overhead] 0.9341ms 0.4257ms 2.3489 KOps/s 2.4010 KOps/s $\color{#d91a1a}-2.17\%$
test_func_call_runtime[True-eager] 1.2962ms 0.7649ms 1.3073 KOps/s 1.3360 KOps/s $\color{#d91a1a}-2.15\%$
test_func_call_runtime[True-compile] 0.6129ms 0.4680ms 2.1369 KOps/s 2.2017 KOps/s $\color{#d91a1a}-2.94\%$
test_func_call_runtime[True-compile-overhead] 0.7240ms 0.4644ms 2.1533 KOps/s 2.1759 KOps/s $\color{#d91a1a}-1.04\%$
test_func_call_cm_runtime[False-eager] 0.7844ms 0.5485ms 1.8230 KOps/s 1.8909 KOps/s $\color{#d91a1a}-3.59\%$
test_func_call_cm_runtime[False-compile] 0.5357ms 0.4248ms 2.3541 KOps/s 2.4140 KOps/s $\color{#d91a1a}-2.48\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5335ms 0.4255ms 2.3504 KOps/s 2.4011 KOps/s $\color{#d91a1a}-2.11\%$
test_func_call_cm_runtime[True-eager] 1.0895ms 0.8946ms 1.1178 KOps/s 1.1463 KOps/s $\color{#d91a1a}-2.49\%$
test_func_call_cm_runtime[True-compile] 0.8773ms 0.4946ms 2.0220 KOps/s 2.0781 KOps/s $\color{#d91a1a}-2.70\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8579ms 0.4910ms 2.0367 KOps/s 2.0752 KOps/s $\color{#d91a1a}-1.86\%$
test_vmap_func_call_cm_runtime[eager] 2.5576ms 1.8885ms 529.5116 Ops/s 535.9459 Ops/s $\color{#d91a1a}-1.20\%$
test_vmap_func_call_cm_runtime[compile] 0.9208ms 0.5171ms 1.9340 KOps/s 1.9433 KOps/s $\color{#d91a1a}-0.48\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.9938ms 0.5199ms 1.9235 KOps/s 1.9582 KOps/s $\color{#d91a1a}-1.77\%$
test_distributed 0.2587ms 0.1261ms 7.9281 KOps/s 7.7970 KOps/s $\color{#35bf28}+1.68\%$
test_tdmodule 83.1160μs 21.6752μs 46.1357 KOps/s 54.0843 KOps/s $\textbf{\color{#d91a1a}-14.70\%}$
test_tdmodule_dispatch 60.7940μs 39.3361μs 25.4220 KOps/s 27.7407 KOps/s $\textbf{\color{#d91a1a}-8.36\%}$
test_tdseq 49.6220μs 24.1157μs 41.4668 KOps/s 47.2016 KOps/s $\textbf{\color{#d91a1a}-12.15\%}$
test_tdseq_dispatch 72.8160μs 44.4486μs 22.4979 KOps/s 24.2681 KOps/s $\textbf{\color{#d91a1a}-7.29\%}$
test_instantiation_functorch 1.6723ms 1.5112ms 661.7476 Ops/s 661.0114 Ops/s $\color{#35bf28}+0.11\%$
test_exec_functorch 0.3285ms 0.1804ms 5.5437 KOps/s 5.6221 KOps/s $\color{#d91a1a}-1.39\%$
test_exec_functional_call 0.3686ms 0.1722ms 5.8060 KOps/s 5.8988 KOps/s $\color{#d91a1a}-1.57\%$
test_exec_td_decorator 0.5969ms 0.2309ms 4.3308 KOps/s 4.4104 KOps/s $\color{#d91a1a}-1.80\%$
test_vmap_mlp_speed_decorator[True-True] 0.9645ms 0.6286ms 1.5909 KOps/s 1.5992 KOps/s $\color{#d91a1a}-0.52\%$
test_vmap_mlp_speed_decorator[True-False] 0.9793ms 0.6298ms 1.5879 KOps/s 1.5353 KOps/s $\color{#35bf28}+3.43\%$
test_vmap_mlp_speed_decorator[False-True] 0.8715ms 0.5170ms 1.9344 KOps/s 1.9354 KOps/s $\color{#d91a1a}-0.05\%$
test_vmap_mlp_speed_decorator[False-False] 0.8454ms 0.5153ms 1.9405 KOps/s 1.9084 KOps/s $\color{#35bf28}+1.68\%$
test_to_module_speed[True] 1.6739ms 1.3017ms 768.2122 Ops/s 779.6275 Ops/s $\color{#d91a1a}-1.46\%$
test_to_module_speed[False] 1.8726ms 1.2617ms 792.5538 Ops/s 800.6814 Ops/s $\color{#d91a1a}-1.02\%$
test_tc_init 82.7850μs 43.2287μs 23.1328 KOps/s 22.5988 KOps/s $\color{#35bf28}+2.36\%$
test_tc_init_nested 0.1413ms 86.6660μs 11.5385 KOps/s 11.5950 KOps/s $\color{#d91a1a}-0.49\%$
test_tc_first_layer_tensor 40.1750μs 1.5249μs 655.7632 KOps/s 662.9040 KOps/s $\color{#d91a1a}-1.08\%$
test_tc_first_layer_nontensor 33.9740μs 4.8412μs 206.5604 KOps/s 212.7714 KOps/s $\color{#d91a1a}-2.92\%$
test_tc_second_layer_tensor 48.0900μs 2.8094μs 355.9426 KOps/s 349.0668 KOps/s $\color{#35bf28}+1.97\%$
test_tc_second_layer_nontensor 32.8710μs 6.0532μs 165.2020 KOps/s 165.6716 KOps/s $\color{#d91a1a}-0.28\%$
test_unbind 0.2313s 12.7459ms 78.4566 Ops/s 82.7458 Ops/s $\textbf{\color{#d91a1a}-5.18\%}$
test_full_like 8.3245ms 7.5707ms 132.0883 Ops/s 133.4384 Ops/s $\color{#d91a1a}-1.01\%$
test_zeros_like 3.6451ms 2.9382ms 340.3415 Ops/s 349.9355 Ops/s $\color{#d91a1a}-2.74\%$
test_ones_like 3.8113ms 3.3310ms 300.2064 Ops/s 306.6540 Ops/s $\color{#d91a1a}-2.10\%$
test_clone 6.4951ms 5.3274ms 187.7085 Ops/s 187.9645 Ops/s $\color{#d91a1a}-0.14\%$
test_squeeze 60.8430μs 11.7130μs 85.3755 KOps/s 81.9014 KOps/s $\color{#35bf28}+4.24\%$
test_unsqueeze 0.2498ms 89.2921μs 11.1992 KOps/s 11.4916 KOps/s $\color{#d91a1a}-2.54\%$
test_split 0.4780ms 0.1919ms 5.2105 KOps/s 5.3186 KOps/s $\color{#d91a1a}-2.03\%$
test_permute 0.3362ms 0.2151ms 4.6501 KOps/s 4.6358 KOps/s $\color{#35bf28}+0.31\%$
test_stack 29.9206ms 25.2599ms 39.5885 Ops/s 39.5196 Ops/s $\color{#35bf28}+0.17\%$
test_cat 28.2657ms 25.3406ms 39.4623 Ops/s 39.8864 Ops/s $\color{#d91a1a}-1.06\%$

Copy link

github-actions bot commented Nov 20, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}5$. Worsened: $\large\color{#d91a1a}69$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 32.3510μs 11.0208μs 90.7371 KOps/s 102.6233 KOps/s $\textbf{\color{#d91a1a}-11.58\%}$
test_plain_set_stack_nested 0.1271ms 11.0634μs 90.3884 KOps/s 102.5486 KOps/s $\textbf{\color{#d91a1a}-11.86\%}$
test_plain_set_nested_inplace 68.6140μs 11.9400μs 83.7522 KOps/s 94.2056 KOps/s $\textbf{\color{#d91a1a}-11.10\%}$
test_plain_set_stack_nested_inplace 40.5920μs 11.8576μs 84.3339 KOps/s 93.7387 KOps/s $\textbf{\color{#d91a1a}-10.03\%}$
test_items 28.6510μs 2.8913μs 345.8629 KOps/s 340.2915 KOps/s $\color{#35bf28}+1.64\%$
test_items_nested 0.4478ms 0.3573ms 2.7986 KOps/s 3.0722 KOps/s $\textbf{\color{#d91a1a}-8.91\%}$
test_items_nested_locked 0.3838ms 0.3582ms 2.7918 KOps/s 3.0634 KOps/s $\textbf{\color{#d91a1a}-8.87\%}$
test_items_nested_leaf 83.3440μs 57.8820μs 17.2765 KOps/s 17.1515 KOps/s $\color{#35bf28}+0.73\%$
test_items_stack_nested 0.3812ms 0.3584ms 2.7903 KOps/s 3.0607 KOps/s $\textbf{\color{#d91a1a}-8.83\%}$
test_items_stack_nested_leaf 91.6740μs 59.0671μs 16.9299 KOps/s 16.8616 KOps/s $\color{#35bf28}+0.41\%$
test_items_stack_nested_locked 0.5298ms 0.3556ms 2.8123 KOps/s 3.0201 KOps/s $\textbf{\color{#d91a1a}-6.88\%}$
test_keys 72.9930μs 3.5137μs 284.5985 KOps/s 285.7738 KOps/s $\color{#d91a1a}-0.41\%$
test_keys_nested 0.1710ms 69.9501μs 14.2959 KOps/s 14.2231 KOps/s $\color{#35bf28}+0.51\%$
test_keys_nested_locked 0.7832ms 76.8277μs 13.0161 KOps/s 13.2482 KOps/s $\color{#d91a1a}-1.75\%$
test_keys_nested_leaf 0.2255ms 61.2667μs 16.3221 KOps/s 16.3226 KOps/s $-0.00\%$
test_keys_stack_nested 0.2308ms 70.2784μs 14.2291 KOps/s 14.0253 KOps/s $\color{#35bf28}+1.45\%$
test_keys_stack_nested_leaf 0.2455ms 60.9476μs 16.4076 KOps/s 16.0040 KOps/s $\color{#35bf28}+2.52\%$
test_keys_stack_nested_locked 0.1212ms 75.5471μs 13.2368 KOps/s 13.0935 KOps/s $\color{#35bf28}+1.09\%$
test_values 5.3202μs 0.8391μs 1.1918 MOps/s 1.1493 MOps/s $\color{#35bf28}+3.69\%$
test_values_nested 63.2030μs 31.2955μs 31.9535 KOps/s 31.9626 KOps/s $\color{#d91a1a}-0.03\%$
test_values_nested_locked 74.9440μs 33.0014μs 30.3018 KOps/s 30.4346 KOps/s $\color{#d91a1a}-0.44\%$
test_values_nested_leaf 60.7230μs 33.7843μs 29.5995 KOps/s 29.6392 KOps/s $\color{#d91a1a}-0.13\%$
test_values_stack_nested 0.1536ms 31.7883μs 31.4581 KOps/s 31.6173 KOps/s $\color{#d91a1a}-0.50\%$
test_values_stack_nested_leaf 60.3030μs 34.2071μs 29.2337 KOps/s 29.1413 KOps/s $\color{#35bf28}+0.32\%$
test_values_stack_nested_locked 61.6130μs 33.1456μs 30.1699 KOps/s 29.9858 KOps/s $\color{#35bf28}+0.61\%$
test_membership 2.1921μs 0.5076μs 1.9699 MOps/s 1.9741 MOps/s $\color{#d91a1a}-0.21\%$
test_membership_nested 17.8860μs 2.0133μs 496.6973 KOps/s 516.8007 KOps/s $\color{#d91a1a}-3.89\%$
test_membership_nested_leaf 15.4905μs 2.0045μs 498.8892 KOps/s 515.8343 KOps/s $\color{#d91a1a}-3.28\%$
test_membership_stacked_nested 27.3910μs 2.1221μs 471.2354 KOps/s 494.9491 KOps/s $\color{#d91a1a}-4.79\%$
test_membership_stacked_nested_leaf 32.5310μs 2.0890μs 478.7032 KOps/s 499.0756 KOps/s $\color{#d91a1a}-4.08\%$
test_membership_nested_last 43.7220μs 2.9564μs 338.2494 KOps/s 357.5477 KOps/s $\textbf{\color{#d91a1a}-5.40\%}$
test_membership_nested_leaf_last 29.1510μs 3.0137μs 331.8203 KOps/s 356.9817 KOps/s $\textbf{\color{#d91a1a}-7.05\%}$
test_membership_stacked_nested_last 32.6220μs 7.9285μs 126.1272 KOps/s 349.2016 KOps/s $\textbf{\color{#d91a1a}-63.88\%}$
test_membership_stacked_nested_leaf_last 0.1859ms 7.9129μs 126.3751 KOps/s 350.1396 KOps/s $\textbf{\color{#d91a1a}-63.91\%}$
test_nested_getleaf 0.1751ms 6.1286μs 163.1685 KOps/s 165.8826 KOps/s $\color{#d91a1a}-1.64\%$
test_nested_get 28.4810μs 5.8110μs 172.0866 KOps/s 174.3789 KOps/s $\color{#d91a1a}-1.31\%$
test_stacked_getleaf 26.9610μs 6.1527μs 162.5290 KOps/s 166.3464 KOps/s $\color{#d91a1a}-2.29\%$
test_stacked_get 0.1700ms 5.8403μs 171.2240 KOps/s 177.3222 KOps/s $\color{#d91a1a}-3.44\%$
test_nested_getitemleaf 27.1910μs 6.2457μs 160.1110 KOps/s 164.1864 KOps/s $\color{#d91a1a}-2.48\%$
test_nested_getitem 0.2004ms 5.9281μs 168.6893 KOps/s 173.0667 KOps/s $\color{#d91a1a}-2.53\%$
test_stacked_getitemleaf 35.1520μs 6.2730μs 159.4146 KOps/s 164.0452 KOps/s $\color{#d91a1a}-2.82\%$
test_stacked_getitem 36.2710μs 5.9144μs 169.0790 KOps/s 174.6795 KOps/s $\color{#d91a1a}-3.21\%$
test_lock_nested 0.7289ms 0.3688ms 2.7117 KOps/s 2.7244 KOps/s $\color{#d91a1a}-0.47\%$
test_lock_stack_nested 0.4610ms 0.3323ms 3.0089 KOps/s 3.0111 KOps/s $\color{#d91a1a}-0.07\%$
test_unlock_nested 0.5943ms 0.3060ms 3.2682 KOps/s 3.3003 KOps/s $\color{#d91a1a}-0.97\%$
test_unlock_stack_nested 0.3951ms 0.2710ms 3.6907 KOps/s 3.7013 KOps/s $\color{#d91a1a}-0.29\%$
test_flatten_speed 0.1098ms 74.0892μs 13.4972 KOps/s 13.7955 KOps/s $\color{#d91a1a}-2.16\%$
test_unflatten_speed 0.3991ms 0.3052ms 3.2761 KOps/s 3.4313 KOps/s $\color{#d91a1a}-4.52\%$
test_common_ops 1.5938ms 0.6274ms 1.5939 KOps/s 1.8176 KOps/s $\textbf{\color{#d91a1a}-12.31\%}$
test_creation 0.1822ms 1.4954μs 668.7046 KOps/s 673.2649 KOps/s $\color{#d91a1a}-0.68\%$
test_creation_empty 40.9820μs 8.2006μs 121.9429 KOps/s 172.4207 KOps/s $\textbf{\color{#d91a1a}-29.28\%}$
test_creation_nested_1 30.1110μs 9.7786μs 102.2645 KOps/s 136.5522 KOps/s $\textbf{\color{#d91a1a}-25.11\%}$
test_creation_nested_2 35.0620μs 12.4100μs 80.5804 KOps/s 102.3451 KOps/s $\textbf{\color{#d91a1a}-21.27\%}$
test_clone 0.1635ms 11.2751μs 88.6913 KOps/s 95.5740 KOps/s $\textbf{\color{#d91a1a}-7.20\%}$
test_getitem[int] 1.9469ms 11.0981μs 90.1058 KOps/s 95.4690 KOps/s $\textbf{\color{#d91a1a}-5.62\%}$
test_getitem[slice_int] 0.2232ms 21.8159μs 45.8380 KOps/s 50.1014 KOps/s $\textbf{\color{#d91a1a}-8.51\%}$
test_getitem[range] 0.1663ms 39.7811μs 25.1376 KOps/s 26.1824 KOps/s $\color{#d91a1a}-3.99\%$
test_getitem[tuple] 0.1902ms 18.5145μs 54.0116 KOps/s 56.8233 KOps/s $\color{#d91a1a}-4.95\%$
test_getitem[list] 0.3095ms 34.8243μs 28.7156 KOps/s 29.9197 KOps/s $\color{#d91a1a}-4.02\%$
test_setitem_dim[int] 39.3220μs 20.0882μs 49.7804 KOps/s 53.6439 KOps/s $\textbf{\color{#d91a1a}-7.20\%}$
test_setitem_dim[slice_int] 78.3140μs 39.5848μs 25.2622 KOps/s 26.6444 KOps/s $\textbf{\color{#d91a1a}-5.19\%}$
test_setitem_dim[range] 0.1833ms 55.8060μs 17.9192 KOps/s 18.8774 KOps/s $\textbf{\color{#d91a1a}-5.08\%}$
test_setitem_dim[tuple] 53.8020μs 33.7860μs 29.5981 KOps/s 31.4656 KOps/s $\textbf{\color{#d91a1a}-5.94\%}$
test_setitem 0.1222ms 15.9524μs 62.6865 KOps/s 72.6510 KOps/s $\textbf{\color{#d91a1a}-13.72\%}$
test_set 0.1910ms 15.3697μs 65.0632 KOps/s 76.9461 KOps/s $\textbf{\color{#d91a1a}-15.44\%}$
test_set_shared 1.6234ms 0.1490ms 6.7131 KOps/s 6.8546 KOps/s $\color{#d91a1a}-2.06\%$
test_update 0.3624ms 18.5618μs 53.8740 KOps/s 67.1948 KOps/s $\textbf{\color{#d91a1a}-19.82\%}$
test_update_nested 0.1520ms 23.5323μs 42.4947 KOps/s 51.4879 KOps/s $\textbf{\color{#d91a1a}-17.47\%}$
test_update__nested 0.7078ms 25.3117μs 39.5074 KOps/s 39.8918 KOps/s $\color{#d91a1a}-0.96\%$
test_set_nested 0.1517ms 16.6798μs 59.9529 KOps/s 70.3483 KOps/s $\textbf{\color{#d91a1a}-14.78\%}$
test_set_nested_new 0.1681ms 18.7669μs 53.2853 KOps/s 61.4704 KOps/s $\textbf{\color{#d91a1a}-13.32\%}$
test_select 0.1646ms 31.0731μs 32.1822 KOps/s 35.1610 KOps/s $\textbf{\color{#d91a1a}-8.47\%}$
test_select_nested 73.1640μs 41.7137μs 23.9729 KOps/s 24.0101 KOps/s $\color{#d91a1a}-0.15\%$
test_exclude_nested 0.1620ms 62.1346μs 16.0941 KOps/s 16.9019 KOps/s $\color{#d91a1a}-4.78\%$
test_empty[True] 0.3274ms 0.2759ms 3.6248 KOps/s 3.9263 KOps/s $\textbf{\color{#d91a1a}-7.68\%}$
test_empty[False] 3.1981μs 0.7509μs 1.3317 MOps/s 1.3607 MOps/s $\color{#d91a1a}-2.14\%$
test_to 92.0240μs 56.4114μs 17.7269 KOps/s 18.1960 KOps/s $\color{#d91a1a}-2.58\%$
test_to_nonblocking 0.1929ms 46.9142μs 21.3155 KOps/s 22.2775 KOps/s $\color{#d91a1a}-4.32\%$
test_unbind_speed 0.2673ms 0.2347ms 4.2608 KOps/s 4.3954 KOps/s $\color{#d91a1a}-3.06\%$
test_unbind_speed_stack0 0.2757ms 0.2290ms 4.3668 KOps/s 4.3492 KOps/s $\color{#35bf28}+0.41\%$
test_unbind_speed_stack1 96.4313ms 0.6442ms 1.5523 KOps/s 1.5446 KOps/s $\color{#35bf28}+0.50\%$
test_split 97.1829ms 1.6410ms 609.3884 Ops/s 643.5567 Ops/s $\textbf{\color{#d91a1a}-5.31\%}$
test_chunk 0.1059s 1.6439ms 608.2934 Ops/s 582.2561 Ops/s $\color{#35bf28}+4.47\%$
test_consolidate[False-None] 0.1001s 2.8531ms 350.4924 Ops/s 383.4670 Ops/s $\textbf{\color{#d91a1a}-8.60\%}$
test_consolidate[default-None] 2.1777ms 1.8406ms 543.2865 Ops/s 596.8840 Ops/s $\textbf{\color{#d91a1a}-8.98\%}$
test_consolidate[reduce-overhead-None] 1.9075ms 1.7753ms 563.2969 Ops/s 579.6660 Ops/s $\color{#d91a1a}-2.82\%$
test_consolidate_njt[False-None] 7.0175ms 6.5847ms 151.8683 Ops/s 151.0358 Ops/s $\color{#35bf28}+0.55\%$
test_to[False-False-None] 1.9468ms 1.7026ms 587.3289 Ops/s 597.0286 Ops/s $\color{#d91a1a}-1.62\%$
test_to[True-False-None] 1.6018ms 1.3465ms 742.6737 Ops/s 763.7735 Ops/s $\color{#d91a1a}-2.76\%$
test_to[within-False-None] 4.2642ms 4.0541ms 246.6619 Ops/s 250.5716 Ops/s $\color{#d91a1a}-1.56\%$
test_to[True-default-None] 5.6504ms 5.2596ms 190.1284 Ops/s 196.6031 Ops/s $\color{#d91a1a}-3.29\%$
test_to_njt[False-False-None] 7.3854ms 7.0404ms 142.0371 Ops/s 143.4402 Ops/s $\color{#d91a1a}-0.98\%$
test_to_njt[True-False-None] 5.8243ms 5.5613ms 179.8150 Ops/s 182.0736 Ops/s $\color{#d91a1a}-1.24\%$
test_to_njt[within-False-None] 12.6459ms 12.4266ms 80.4724 Ops/s 81.1961 Ops/s $\color{#d91a1a}-0.89\%$
test_creation[device0] 0.6413ms 80.6248μs 12.4031 KOps/s 12.4965 KOps/s $\color{#d91a1a}-0.75\%$
test_creation_from_tensor 0.7169ms 83.4966μs 11.9765 KOps/s 12.0180 KOps/s $\color{#d91a1a}-0.34\%$
test_add_one[memmap_tensor0] 0.3135ms 7.3387μs 136.2637 KOps/s 143.9797 KOps/s $\textbf{\color{#d91a1a}-5.36\%}$
test_contiguous[memmap_tensor0] 1.7886μs 0.4091μs 2.4444 MOps/s 2.4409 MOps/s $\color{#35bf28}+0.14\%$
test_stack[memmap_tensor0] 44.4230μs 4.7195μs 211.8874 KOps/s 227.7469 KOps/s $\textbf{\color{#d91a1a}-6.96\%}$
test_memmaptd_index 1.6763ms 0.2553ms 3.9167 KOps/s 4.0542 KOps/s $\color{#d91a1a}-3.39\%$
test_memmaptd_index_astensor 0.5845ms 0.3154ms 3.1707 KOps/s 3.2787 KOps/s $\color{#d91a1a}-3.29\%$
test_memmaptd_index_op 1.0549ms 0.6092ms 1.6414 KOps/s 1.7783 KOps/s $\textbf{\color{#d91a1a}-7.70\%}$
test_serialize_model 0.1333s 0.1316s 7.5972 Ops/s 7.6228 Ops/s $\color{#d91a1a}-0.34\%$
test_serialize_model_pickle 1.3476s 1.2190s 0.8204 Ops/s 0.8440 Ops/s $\color{#d91a1a}-2.80\%$
test_serialize_weights 0.4438s 0.1752s 5.7068 Ops/s 7.7231 Ops/s $\textbf{\color{#d91a1a}-26.11\%}$
test_serialize_weights_returnearly 0.3608s 55.3662ms 18.0615 Ops/s 13.9720 Ops/s $\textbf{\color{#35bf28}+29.27\%}$
test_serialize_weights_pickle 1.3659s 1.2213s 0.8188 Ops/s 0.8374 Ops/s $\color{#d91a1a}-2.22\%$
test_reshape_pytree 43.6520μs 22.3801μs 44.6825 KOps/s 44.1551 KOps/s $\color{#35bf28}+1.19\%$
test_reshape_td 0.1383ms 28.0113μs 35.6999 KOps/s 36.9193 KOps/s $\color{#d91a1a}-3.30\%$
test_view_pytree 0.2149ms 22.1816μs 45.0823 KOps/s 44.7668 KOps/s $\color{#35bf28}+0.70\%$
test_view_td 0.1242ms 30.5297μs 32.7550 KOps/s 32.7994 KOps/s $\color{#d91a1a}-0.14\%$
test_unbind_pytree 0.1375ms 28.1553μs 35.5173 KOps/s 35.6106 KOps/s $\color{#d91a1a}-0.26\%$
test_unbind_td 0.7492ms 37.8533μs 26.4178 KOps/s 27.6365 KOps/s $\color{#d91a1a}-4.41\%$
test_split_pytree 0.1611ms 30.1535μs 33.1636 KOps/s 32.0652 KOps/s $\color{#35bf28}+3.43\%$
test_split_td 1.0475ms 39.2681μs 25.4659 KOps/s 26.4282 KOps/s $\color{#d91a1a}-3.64\%$
test_add_pytree 0.1835ms 37.0414μs 26.9968 KOps/s 28.9662 KOps/s $\textbf{\color{#d91a1a}-6.80\%}$
test_add_td 0.1956ms 51.9011μs 19.2674 KOps/s 23.1996 KOps/s $\textbf{\color{#d91a1a}-16.95\%}$
test_compile_add_one_nested[tensordict-compile] 0.2742ms 0.1240ms 8.0677 KOps/s 7.9956 KOps/s $\color{#35bf28}+0.90\%$
test_compile_add_one_nested[tensordict-eager] 0.3425ms 0.1254ms 7.9766 KOps/s 7.9573 KOps/s $\color{#35bf28}+0.24\%$
test_compile_add_one_nested[pytree-compile] 0.2425ms 97.2560μs 10.2821 KOps/s 10.1826 KOps/s $\color{#35bf28}+0.98\%$
test_compile_add_one_nested[pytree-eager] 0.3076ms 0.1544ms 6.4779 KOps/s 6.5578 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_copy_nested[tensordict-compile] 0.1663ms 22.3462μs 44.7503 KOps/s 43.8703 KOps/s $\color{#35bf28}+2.01\%$
test_compile_copy_nested[tensordict-eager] 0.1513ms 26.7986μs 37.3154 KOps/s 36.8164 KOps/s $\color{#35bf28}+1.36\%$
test_compile_copy_nested[pytree-compile] 0.4521ms 65.2094μs 15.3352 KOps/s 15.2593 KOps/s $\color{#35bf28}+0.50\%$
test_compile_copy_nested[pytree-eager] 79.4440μs 49.6151μs 20.1552 KOps/s 19.9495 KOps/s $\color{#35bf28}+1.03\%$
test_compile_add_one_flat[tensordict-compile] 0.2959ms 0.1441ms 6.9414 KOps/s 6.8805 KOps/s $\color{#35bf28}+0.88\%$
test_compile_add_one_flat[tensordict-eager] 0.3555ms 0.2070ms 4.8313 KOps/s 4.8420 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_add_one_flat[tensorclass-compile] 0.2551ms 0.1006ms 9.9355 KOps/s 10.0703 KOps/s $\color{#d91a1a}-1.34\%$
test_compile_add_one_flat[tensorclass-eager] 0.2542ms 53.4296μs 18.7162 KOps/s 19.1145 KOps/s $\color{#d91a1a}-2.08\%$
test_compile_add_one_flat[pytree-compile] 0.3335ms 0.1392ms 7.1813 KOps/s 7.2231 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_add_one_flat[pytree-eager] 0.7150ms 0.5328ms 1.8768 KOps/s 1.9796 KOps/s $\textbf{\color{#d91a1a}-5.19\%}$
test_compile_add_self_flat[tensordict-eager] 0.4305ms 0.2467ms 4.0530 KOps/s 4.0909 KOps/s $\color{#d91a1a}-0.93\%$
test_compile_add_self_flat[tensordict-compile] 0.2899ms 0.1461ms 6.8454 KOps/s 6.9127 KOps/s $\color{#d91a1a}-0.97\%$
test_compile_add_self_flat[tensorclass-eager] 0.2250ms 64.2807μs 15.5568 KOps/s 16.0065 KOps/s $\color{#d91a1a}-2.81\%$
test_compile_add_self_flat[tensorclass-compile] 0.2477ms 0.1022ms 9.7874 KOps/s 10.0197 KOps/s $\color{#d91a1a}-2.32\%$
test_compile_add_self_flat[pytree-eager] 0.5784ms 0.4263ms 2.3460 KOps/s 2.3750 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_add_self_flat[pytree-compile] 0.2787ms 0.1395ms 7.1666 KOps/s 7.2781 KOps/s $\color{#d91a1a}-1.53\%$
test_compile_copy_flat[tensordict-compile] 0.1695ms 19.8052μs 50.4918 KOps/s 55.2374 KOps/s $\textbf{\color{#d91a1a}-8.59\%}$
test_compile_copy_flat[tensordict-eager] 0.1384ms 26.2679μs 38.0693 KOps/s 37.6753 KOps/s $\color{#35bf28}+1.05\%$
test_compile_copy_flat[pytree-compile] 0.1410ms 70.0008μs 14.2856 KOps/s 14.1028 KOps/s $\color{#35bf28}+1.30\%$
test_compile_copy_flat[pytree-eager] 81.5440μs 52.4496μs 19.0659 KOps/s 19.2475 KOps/s $\color{#d91a1a}-0.94\%$
test_compile_assign_and_add[tensordict-compile] 1.6980ms 0.4595ms 2.1764 KOps/s 2.2117 KOps/s $\color{#d91a1a}-1.60\%$
test_compile_assign_and_add[tensordict-eager] 3.1529ms 2.7487ms 363.8079 Ops/s 389.4260 Ops/s $\textbf{\color{#d91a1a}-6.58\%}$
test_compile_assign_and_add[pytree-compile] 1.5907ms 0.4324ms 2.3124 KOps/s 2.2273 KOps/s $\color{#35bf28}+3.82\%$
test_compile_assign_and_add[pytree-eager] 3.4803ms 2.8905ms 345.9572 Ops/s 373.2112 Ops/s $\textbf{\color{#d91a1a}-7.30\%}$
test_compile_indexing[tensor-tensordict-compile] 0.5057ms 0.1229ms 8.1397 KOps/s 8.6449 KOps/s $\textbf{\color{#d91a1a}-5.84\%}$
test_compile_indexing[tensor-tensordict-eager] 0.5817ms 85.3586μs 11.7153 KOps/s 11.9090 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_indexing[tensor-tensorclass-compile] 0.3198ms 0.1167ms 8.5677 KOps/s 8.9240 KOps/s $\color{#d91a1a}-3.99\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2897ms 77.2752μs 12.9408 KOps/s 14.0026 KOps/s $\textbf{\color{#d91a1a}-7.58\%}$
test_compile_indexing[tensor-pytree-compile] 0.3490ms 0.1150ms 8.6981 KOps/s 8.8835 KOps/s $\color{#d91a1a}-2.09\%$
test_compile_indexing[tensor-pytree-eager] 0.2868ms 75.5755μs 13.2318 KOps/s 14.2393 KOps/s $\textbf{\color{#d91a1a}-7.08\%}$
test_compile_indexing[slice-tensordict-compile] 0.3112ms 0.1101ms 9.0821 KOps/s 9.2675 KOps/s $\color{#d91a1a}-2.00\%$
test_compile_indexing[slice-tensordict-eager] 0.1798ms 18.4063μs 54.3292 KOps/s 58.5195 KOps/s $\textbf{\color{#d91a1a}-7.16\%}$
test_compile_indexing[slice-tensorclass-compile] 0.2951ms 0.1039ms 9.6224 KOps/s 10.3028 KOps/s $\textbf{\color{#d91a1a}-6.60\%}$
test_compile_indexing[slice-tensorclass-eager] 0.2049ms 16.5530μs 60.4122 KOps/s 64.4570 KOps/s $\textbf{\color{#d91a1a}-6.28\%}$
test_compile_indexing[slice-pytree-compile] 0.2957ms 0.1059ms 9.4464 KOps/s 10.1587 KOps/s $\textbf{\color{#d91a1a}-7.01\%}$
test_compile_indexing[slice-pytree-eager] 0.1266ms 16.5904μs 60.2757 KOps/s 62.0362 KOps/s $\color{#d91a1a}-2.84\%$
test_compile_indexing[int-tensordict-compile] 0.3031ms 0.1123ms 8.9067 KOps/s 9.6922 KOps/s $\textbf{\color{#d91a1a}-8.10\%}$
test_compile_indexing[int-tensordict-eager] 0.6428ms 17.9412μs 55.7375 KOps/s 60.4074 KOps/s $\textbf{\color{#d91a1a}-7.73\%}$
test_compile_indexing[int-tensorclass-compile] 0.2822ms 0.1050ms 9.5270 KOps/s 10.1207 KOps/s $\textbf{\color{#d91a1a}-5.87\%}$
test_compile_indexing[int-tensorclass-eager] 0.1801ms 16.4586μs 60.7585 KOps/s 63.9174 KOps/s $\color{#d91a1a}-4.94\%$
test_compile_indexing[int-pytree-compile] 0.2506ms 98.9899μs 10.1020 KOps/s 9.6624 KOps/s $\color{#35bf28}+4.55\%$
test_compile_indexing[int-pytree-eager] 89.7740μs 16.3562μs 61.1388 KOps/s 64.4265 KOps/s $\textbf{\color{#d91a1a}-5.10\%}$
test_mod_add[eager] 0.1886ms 38.0288μs 26.2959 KOps/s 29.1649 KOps/s $\textbf{\color{#d91a1a}-9.84\%}$
test_mod_add[compile] 0.3000ms 84.2434μs 11.8704 KOps/s 12.3034 KOps/s $\color{#d91a1a}-3.52\%$
test_mod_add[compile-overhead] 0.3297ms 0.1706ms 5.8626 KOps/s 5.5799 KOps/s $\textbf{\color{#35bf28}+5.07\%}$
test_mod_wrap[eager] 0.4087ms 0.2542ms 3.9343 KOps/s 3.9421 KOps/s $\color{#d91a1a}-0.20\%$
test_mod_wrap[compile] 0.4957ms 0.3039ms 3.2903 KOps/s 3.3132 KOps/s $\color{#d91a1a}-0.69\%$
test_mod_wrap[compile-overhead] 7.3821ms 3.7932ms 263.6262 Ops/s 263.3723 Ops/s $\color{#35bf28}+0.10\%$
test_mod_wrap_and_backward[eager] 1.6639ms 1.3813ms 723.9353 Ops/s 689.4583 Ops/s $\textbf{\color{#35bf28}+5.00\%}$
test_mod_wrap_and_backward[compile] 1.6468ms 1.3056ms 765.9194 Ops/s 720.6396 Ops/s $\textbf{\color{#35bf28}+6.28\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3759ms 0.9417ms 1.0619 KOps/s 948.4177 Ops/s $\textbf{\color{#35bf28}+11.97\%}$
test_seq_add[eager] 0.2902ms 0.1135ms 8.8110 KOps/s 10.2908 KOps/s $\textbf{\color{#d91a1a}-14.38\%}$
test_seq_add[compile] 0.2700ms 90.0464μs 11.1054 KOps/s 10.6914 KOps/s $\color{#35bf28}+3.87\%$
test_seq_add[compile-overhead] 0.2796ms 0.1317ms 7.5922 KOps/s 7.4414 KOps/s $\color{#35bf28}+2.03\%$
test_seq_wrap[eager] 0.6132ms 0.4324ms 2.3129 KOps/s 2.5486 KOps/s $\textbf{\color{#d91a1a}-9.25\%}$
test_seq_wrap[compile] 0.5022ms 0.3192ms 3.1325 KOps/s 3.0991 KOps/s $\color{#35bf28}+1.08\%$
test_seq_wrap[compile-overhead] 0.4072ms 0.2261ms 4.4226 KOps/s 4.3835 KOps/s $\color{#35bf28}+0.89\%$
test_func_call_runtime[False-eager] 0.9685ms 0.7764ms 1.2881 KOps/s 1.3058 KOps/s $\color{#d91a1a}-1.36\%$
test_func_call_runtime[False-compile] 0.9469ms 0.7650ms 1.3071 KOps/s 1.3148 KOps/s $\color{#d91a1a}-0.58\%$
test_func_call_runtime[False-compile-overhead] 0.5149ms 0.3679ms 2.7184 KOps/s 2.7238 KOps/s $\color{#d91a1a}-0.20\%$
test_func_call_runtime[True-eager] 1.0742ms 0.9205ms 1.0864 KOps/s 1.0834 KOps/s $\color{#35bf28}+0.27\%$
test_func_call_runtime[True-compile] 0.9608ms 0.7860ms 1.2723 KOps/s 1.2935 KOps/s $\color{#d91a1a}-1.63\%$
test_func_call_runtime[True-compile-overhead] 0.5306ms 0.3888ms 2.5723 KOps/s 2.5724 KOps/s $-0.00\%$
test_func_call_cm_runtime[False-eager] 0.8997ms 0.7514ms 1.3309 KOps/s 1.3331 KOps/s $\color{#d91a1a}-0.17\%$
test_func_call_cm_runtime[False-compile] 0.9295ms 0.7695ms 1.2995 KOps/s 1.2558 KOps/s $\color{#35bf28}+3.49\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5075ms 0.3690ms 2.7100 KOps/s 2.7069 KOps/s $\color{#35bf28}+0.11\%$
test_func_call_cm_runtime[True-eager] 1.1619ms 1.0077ms 992.3572 Ops/s 992.0077 Ops/s $\color{#35bf28}+0.04\%$
test_func_call_cm_runtime[True-compile] 0.9876ms 0.8163ms 1.2250 KOps/s 1.2333 KOps/s $\color{#d91a1a}-0.67\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5563ms 0.4160ms 2.4037 KOps/s 2.4161 KOps/s $\color{#d91a1a}-0.51\%$
test_vmap_func_call_cm_runtime[eager] 2.5642ms 2.0965ms 476.9956 Ops/s 475.1750 Ops/s $\color{#35bf28}+0.38\%$
test_vmap_func_call_cm_runtime[compile] 0.9825ms 0.8266ms 1.2098 KOps/s 1.2174 KOps/s $\color{#d91a1a}-0.63\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5557ms 0.4146ms 2.4118 KOps/s 2.3938 KOps/s $\color{#35bf28}+0.75\%$
test_distributed 2.6069ms 0.2101ms 4.7587 KOps/s 8.7162 KOps/s $\textbf{\color{#d91a1a}-45.40\%}$
test_tdmodule 88.6540μs 19.0874μs 52.3905 KOps/s 70.3082 KOps/s $\textbf{\color{#d91a1a}-25.48\%}$
test_tdmodule_dispatch 0.1352ms 35.6627μs 28.0405 KOps/s 36.5878 KOps/s $\textbf{\color{#d91a1a}-23.36\%}$
test_tdseq 55.5430μs 19.5942μs 51.0354 KOps/s 71.7430 KOps/s $\textbf{\color{#d91a1a}-28.86\%}$
test_tdseq_dispatch 56.7920μs 36.8622μs 27.1281 KOps/s 34.0130 KOps/s $\textbf{\color{#d91a1a}-20.24\%}$
test_instantiation_functorch 1.7842ms 1.5800ms 632.8971 Ops/s 633.6815 Ops/s $\color{#d91a1a}-0.12\%$
test_exec_functorch 0.1914ms 0.1499ms 6.6733 KOps/s 6.8751 KOps/s $\color{#d91a1a}-2.94\%$
test_exec_functional_call 0.2568ms 0.1493ms 6.6982 KOps/s 7.1441 KOps/s $\textbf{\color{#d91a1a}-6.24\%}$
test_exec_td_decorator 0.4089ms 0.1987ms 5.0337 KOps/s 5.4635 KOps/s $\textbf{\color{#d91a1a}-7.87\%}$
test_vmap_mlp_speed_decorator[True-True] 0.9322ms 0.7265ms 1.3764 KOps/s 1.4767 KOps/s $\textbf{\color{#d91a1a}-6.79\%}$
test_vmap_mlp_speed_decorator[True-False] 5.9392ms 0.6969ms 1.4350 KOps/s 1.4758 KOps/s $\color{#d91a1a}-2.76\%$
test_vmap_mlp_speed_decorator[False-True] 0.8532ms 0.6238ms 1.6031 KOps/s 1.6760 KOps/s $\color{#d91a1a}-4.35\%$
test_vmap_mlp_speed_decorator[False-False] 0.8163ms 0.6319ms 1.5825 KOps/s 1.6714 KOps/s $\textbf{\color{#d91a1a}-5.32\%}$
test_vmap_transformer_speed_decorator[True-True] 20.5156ms 19.5019ms 51.2772 Ops/s 52.0163 Ops/s $\color{#d91a1a}-1.42\%$
test_vmap_transformer_speed_decorator[True-False] 19.8522ms 19.4626ms 51.3806 Ops/s 52.0076 Ops/s $\color{#d91a1a}-1.21\%$
test_vmap_transformer_speed_decorator[False-True] 19.5639ms 19.2838ms 51.8571 Ops/s 52.5710 Ops/s $\color{#d91a1a}-1.36\%$
test_vmap_transformer_speed_decorator[False-False] 19.5646ms 19.2974ms 51.8204 Ops/s 52.3909 Ops/s $\color{#d91a1a}-1.09\%$
test_to_module_speed[True] 2.2049ms 0.9396ms 1.0643 KOps/s 1.0741 KOps/s $\color{#d91a1a}-0.92\%$
test_to_module_speed[False] 1.0227ms 0.9205ms 1.0864 KOps/s 1.0933 KOps/s $\color{#d91a1a}-0.63\%$
test_tc_init 0.1704ms 36.7765μs 27.1913 KOps/s 31.1243 KOps/s $\textbf{\color{#d91a1a}-12.64\%}$
test_tc_init_nested 0.1982ms 74.9199μs 13.3476 KOps/s 15.4809 KOps/s $\textbf{\color{#d91a1a}-13.78\%}$
test_tc_first_layer_tensor 5.7474μs 0.6963μs 1.4363 MOps/s 1.4214 MOps/s $\color{#35bf28}+1.04\%$
test_tc_first_layer_nontensor 22.5520μs 2.3363μs 428.0364 KOps/s 433.6499 KOps/s $\color{#d91a1a}-1.29\%$
test_tc_second_layer_tensor 7.9855μs 1.4339μs 697.3783 KOps/s 698.0164 KOps/s $\color{#d91a1a}-0.09\%$
test_tc_second_layer_nontensor 27.2510μs 3.0419μs 328.7457 KOps/s 324.3498 KOps/s $\color{#35bf28}+1.36\%$
test_unbind 6.8538ms 6.6274ms 150.8894 Ops/s 149.4326 Ops/s $\color{#35bf28}+0.97\%$
test_full_like 12.3394ms 10.0736ms 99.2694 Ops/s 95.5097 Ops/s $\color{#35bf28}+3.94\%$
test_zeros_like 5.4689ms 4.4794ms 223.2417 Ops/s 223.9305 Ops/s $\color{#d91a1a}-0.31\%$
test_ones_like 5.4531ms 4.5296ms 220.7699 Ops/s 219.5411 Ops/s $\color{#35bf28}+0.56\%$
test_clone 8.5756ms 7.1977ms 138.9334 Ops/s 139.1438 Ops/s $\color{#d91a1a}-0.15\%$
test_squeeze 81.5040μs 9.0951μs 109.9490 KOps/s 106.0246 KOps/s $\color{#35bf28}+3.70\%$
test_unsqueeze 0.1800ms 70.2594μs 14.2330 KOps/s 14.0553 KOps/s $\color{#35bf28}+1.26\%$
test_split 0.2905ms 0.1580ms 6.3281 KOps/s 6.2797 KOps/s $\color{#35bf28}+0.77\%$
test_permute 0.3722ms 0.1824ms 5.4838 KOps/s 5.5752 KOps/s $\color{#d91a1a}-1.64\%$
test_stack 53.2498ms 52.1037ms 19.1925 Ops/s 19.2064 Ops/s $\color{#d91a1a}-0.07\%$
test_cat 53.7446ms 51.8667ms 19.2802 Ops/s 19.1905 Ops/s $\color{#35bf28}+0.47\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 21, 2024
ghstack-source-id: 23f30be0b5b36329dc6dfb2d7eb303aaf87b8474
Pull Request resolved: #1098
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@vmoens vmoens added the Refactor Refactoring code - not a new feature label Nov 25, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 25, 2024
ghstack-source-id: c16baa83f6e41c4afd6637f3b3739d4e5cf25f1e
Pull Request resolved: #1098
@vmoens vmoens merged commit 45d0dea into gh/vmoens/35/base Nov 25, 2024
45 of 50 checks passed
vmoens added a commit that referenced this pull request Nov 25, 2024
ghstack-source-id: c16baa83f6e41c4afd6637f3b3739d4e5cf25f1e
Pull Request resolved: #1098
@vmoens vmoens deleted the gh/vmoens/35/head branch November 25, 2024 16:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Refactor Refactoring code - not a new feature
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants