-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] NonTensorStack.data #1132
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Dec 9, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 50.6950μs | 18.3295μs | 54.5568 KOps/s | 60.8564 KOps/s | |
test_plain_set_stack_nested | 48.0700μs | 18.3965μs | 54.3583 KOps/s | 59.0330 KOps/s | |
test_plain_set_nested_inplace | 58.5700μs | 20.1941μs | 49.5194 KOps/s | 54.4133 KOps/s | |
test_plain_set_stack_nested_inplace | 51.1250μs | 19.9710μs | 50.0725 KOps/s | 54.0211 KOps/s | |
test_items | 52.1480μs | 4.2718μs | 234.0945 KOps/s | 239.3360 KOps/s | |
test_items_nested | 0.5082ms | 0.3990ms | 2.5062 KOps/s | 2.4805 KOps/s | |
test_items_nested_locked | 0.5889ms | 0.3982ms | 2.5114 KOps/s | 2.4759 KOps/s | |
test_items_nested_leaf | 0.1253ms | 70.9331μs | 14.0978 KOps/s | 14.0378 KOps/s | |
test_items_stack_nested | 0.5855ms | 0.4036ms | 2.4774 KOps/s | 2.4712 KOps/s | |
test_items_stack_nested_leaf | 0.1325ms | 73.7310μs | 13.5628 KOps/s | 13.8373 KOps/s | |
test_items_stack_nested_locked | 0.5703ms | 0.4014ms | 2.4915 KOps/s | 2.4903 KOps/s | |
test_keys | 19.2960μs | 3.5143μs | 284.5503 KOps/s | 277.3255 KOps/s | |
test_keys_nested | 0.2232ms | 0.1337ms | 7.4786 KOps/s | 7.2112 KOps/s | |
test_keys_nested_locked | 1.5954ms | 0.1404ms | 7.1203 KOps/s | 6.9131 KOps/s | |
test_keys_nested_leaf | 0.2022ms | 0.1149ms | 8.7008 KOps/s | 8.3447 KOps/s | |
test_keys_stack_nested | 0.2396ms | 0.1367ms | 7.3156 KOps/s | 7.1619 KOps/s | |
test_keys_stack_nested_leaf | 0.1998ms | 0.1170ms | 8.5507 KOps/s | 8.3781 KOps/s | |
test_keys_stack_nested_locked | 0.2310ms | 0.1416ms | 7.0603 KOps/s | 7.0802 KOps/s | |
test_values | 5.4642μs | 1.0560μs | 946.9491 KOps/s | 920.4211 KOps/s | |
test_values_nested | 0.1087ms | 56.5278μs | 17.6904 KOps/s | 18.2354 KOps/s | |
test_values_nested_locked | 0.1124ms | 56.5343μs | 17.6884 KOps/s | 18.0473 KOps/s | |
test_values_nested_leaf | 0.1110ms | 61.1055μs | 16.3651 KOps/s | 16.6252 KOps/s | |
test_values_stack_nested | 0.1135ms | 57.4219μs | 17.4149 KOps/s | 18.0107 KOps/s | |
test_values_stack_nested_leaf | 0.1212ms | 61.7741μs | 16.1880 KOps/s | 16.5032 KOps/s | |
test_values_stack_nested_locked | 0.1146ms | 57.5626μs | 17.3724 KOps/s | 16.5943 KOps/s | |
test_membership | 14.6770μs | 0.8803μs | 1.1360 MOps/s | 1.1127 MOps/s | |
test_membership_nested | 25.2870μs | 2.9027μs | 344.5075 KOps/s | 339.7393 KOps/s | |
test_membership_nested_leaf | 19.8270μs | 2.9386μs | 340.3029 KOps/s | 339.0843 KOps/s | |
test_membership_stacked_nested | 33.7440μs | 2.8723μs | 348.1570 KOps/s | 340.4675 KOps/s | |
test_membership_stacked_nested_leaf | 39.0530μs | 2.8781μs | 347.4562 KOps/s | 344.3698 KOps/s | |
test_membership_nested_last | 26.5800μs | 4.1762μs | 239.4528 KOps/s | 235.3182 KOps/s | |
test_membership_nested_leaf_last | 23.1230μs | 4.1945μs | 238.4077 KOps/s | 233.3137 KOps/s | |
test_membership_stacked_nested_last | 24.2550μs | 4.1606μs | 240.3508 KOps/s | 234.7756 KOps/s | |
test_membership_stacked_nested_leaf_last | 28.2940μs | 4.1978μs | 238.2189 KOps/s | 235.8222 KOps/s | |
test_nested_getleaf | 32.4010μs | 10.9253μs | 91.5309 KOps/s | 94.7713 KOps/s | |
test_nested_get | 33.0110μs | 10.3804μs | 96.3352 KOps/s | 100.1626 KOps/s | |
test_stacked_getleaf | 32.2000μs | 10.7112μs | 93.3603 KOps/s | 95.3148 KOps/s | |
test_stacked_get | 30.9580μs | 10.3474μs | 96.6429 KOps/s | 100.5680 KOps/s | |
test_nested_getitemleaf | 42.1690μs | 11.3037μs | 88.4666 KOps/s | 91.6584 KOps/s | |
test_nested_getitem | 33.7740μs | 10.5648μs | 94.6536 KOps/s | 97.5263 KOps/s | |
test_stacked_getitemleaf | 32.8010μs | 11.2573μs | 88.8312 KOps/s | 90.5577 KOps/s | |
test_stacked_getitem | 29.9470μs | 10.4441μs | 95.7477 KOps/s | 96.8300 KOps/s | |
test_lock_nested | 1.9421ms | 0.4361ms | 2.2932 KOps/s | 2.2912 KOps/s | |
test_lock_stack_nested | 1.0679ms | 0.4137ms | 2.4174 KOps/s | 2.4367 KOps/s | |
test_unlock_nested | 0.6874ms | 0.3519ms | 2.8417 KOps/s | 2.8009 KOps/s | |
test_unlock_stack_nested | 0.7714ms | 0.3301ms | 3.0293 KOps/s | 3.0138 KOps/s | |
test_flatten_speed | 0.1896ms | 94.8769μs | 10.5400 KOps/s | 10.6178 KOps/s | |
test_unflatten_speed | 0.6605ms | 0.4919ms | 2.0329 KOps/s | 2.0947 KOps/s | |
test_common_ops | 4.5796ms | 0.7801ms | 1.2819 KOps/s | 1.3975 KOps/s | |
test_creation | 16.7410μs | 2.0752μs | 481.8754 KOps/s | 491.9960 KOps/s | |
test_creation_empty | 33.7340μs | 11.5451μs | 86.6171 KOps/s | 114.7263 KOps/s | |
test_creation_nested_1 | 42.2490μs | 14.6156μs | 68.4200 KOps/s | 87.1738 KOps/s | |
test_creation_nested_2 | 51.7270μs | 18.6730μs | 53.5533 KOps/s | 62.1488 KOps/s | |
test_clone | 0.1510ms | 13.0343μs | 76.7209 KOps/s | 76.7884 KOps/s | |
test_getitem[int] | 1.4056ms | 12.3772μs | 80.7939 KOps/s | 80.7295 KOps/s | |
test_getitem[slice_int] | 0.1318ms | 23.2411μs | 43.0272 KOps/s | 40.9555 KOps/s | |
test_getitem[range] | 0.2546ms | 47.6705μs | 20.9773 KOps/s | 21.0158 KOps/s | |
test_getitem[tuple] | 0.1378ms | 19.8252μs | 50.4408 KOps/s | 50.6969 KOps/s | |
test_getitem[list] | 0.2616ms | 41.6645μs | 24.0013 KOps/s | 23.6489 KOps/s | |
test_setitem_dim[int] | 44.3730μs | 24.7102μs | 40.4691 KOps/s | 41.1356 KOps/s | |
test_setitem_dim[slice_int] | 94.7380μs | 49.5842μs | 20.1677 KOps/s | 19.3640 KOps/s | |
test_setitem_dim[range] | 0.1323ms | 71.7324μs | 13.9407 KOps/s | 13.7122 KOps/s | |
test_setitem_dim[tuple] | 79.0180μs | 39.7147μs | 25.1796 KOps/s | 24.1736 KOps/s | |
test_setitem | 0.1392ms | 20.1970μs | 49.5124 KOps/s | 54.0237 KOps/s | |
test_set | 0.1104ms | 19.7308μs | 50.6822 KOps/s | 56.4686 KOps/s | |
test_set_shared | 1.4354ms | 0.1635ms | 6.1157 KOps/s | 5.9837 KOps/s | |
test_update | 0.1986ms | 22.4986μs | 44.4472 KOps/s | 51.6716 KOps/s | |
test_update_nested | 0.1159ms | 32.9067μs | 30.3890 KOps/s | 34.0368 KOps/s | |
test_update__nested | 0.9270ms | 30.9901μs | 32.2684 KOps/s | 31.2881 KOps/s | |
test_set_nested | 0.1085ms | 21.5595μs | 46.3832 KOps/s | 50.2571 KOps/s | |
test_set_nested_new | 0.1253ms | 26.0878μs | 38.3321 KOps/s | 41.5333 KOps/s | |
test_select | 0.1275ms | 42.0479μs | 23.7824 KOps/s | 25.2574 KOps/s | |
test_select_nested | 0.1412ms | 59.1882μs | 16.8952 KOps/s | 16.7418 KOps/s | |
test_exclude_nested | 0.3695ms | 77.4933μs | 12.9043 KOps/s | 12.7520 KOps/s | |
test_empty[True] | 0.4434ms | 0.3791ms | 2.6375 KOps/s | 2.6138 KOps/s | |
test_empty[False] | 7.8648μs | 1.2469μs | 801.9674 KOps/s | 789.0224 KOps/s | |
test_unbind_speed | 0.4216ms | 0.2625ms | 3.8096 KOps/s | 3.8544 KOps/s | |
test_unbind_speed_stack0 | 0.4133ms | 0.2557ms | 3.9113 KOps/s | 3.8595 KOps/s | |
test_unbind_speed_stack1 | 95.5769ms | 0.7489ms | 1.3354 KOps/s | 1.4480 KOps/s | |
test_split | 0.1020s | 1.7034ms | 587.0658 Ops/s | 578.2894 Ops/s | |
test_chunk | 94.8459ms | 1.6861ms | 593.0758 Ops/s | 575.0217 Ops/s | |
test_consolidate_njt[False-None] | 8.5766ms | 8.1478ms | 122.7327 Ops/s | 121.2887 Ops/s | |
test_creation[device0] | 0.2399ms | 89.8033μs | 11.1354 KOps/s | 10.9853 KOps/s | |
test_creation_from_tensor | 4.0734ms | 93.9360μs | 10.6455 KOps/s | 10.2539 KOps/s | |
test_add_one[memmap_tensor0] | 0.2242ms | 4.6382μs | 215.6029 KOps/s | 202.7073 KOps/s | |
test_contiguous[memmap_tensor0] | 9.1570μs | 0.5138μs | 1.9464 MOps/s | 2.0035 MOps/s | |
test_stack[memmap_tensor0] | 76.0930μs | 3.1861μs | 313.8679 KOps/s | 304.6542 KOps/s | |
test_memmaptd_index | 1.0027ms | 0.2331ms | 4.2908 KOps/s | 4.2861 KOps/s | |
test_memmaptd_index_astensor | 0.5824ms | 0.3112ms | 3.2132 KOps/s | 3.2140 KOps/s | |
test_memmaptd_index_op | 1.1259ms | 0.5853ms | 1.7085 KOps/s | 1.8678 KOps/s | |
test_serialize_model | 0.1214s | 0.1157s | 8.6443 Ops/s | 7.7748 Ops/s | |
test_serialize_model_pickle | 0.4479s | 0.3898s | 2.5653 Ops/s | 2.5638 Ops/s | |
test_serialize_weights | 0.2085s | 0.1292s | 7.7412 Ops/s | 8.9181 Ops/s | |
test_serialize_weights_returnearly | 0.3344s | 0.1788s | 5.5930 Ops/s | 6.3216 Ops/s | |
test_serialize_weights_pickle | 1.1519s | 0.6711s | 1.4900 Ops/s | 2.4823 Ops/s | |
test_serialize_weights_filesystem | 0.1420s | 0.1393s | 7.1784 Ops/s | 6.6214 Ops/s | |
test_serialize_model_filesystem | 0.2488s | 0.1530s | 6.5370 Ops/s | 6.8980 Ops/s | |
test_reshape_pytree | 59.2410μs | 26.0054μs | 38.4535 KOps/s | 37.4041 KOps/s | |
test_reshape_td | 69.8210μs | 32.7904μs | 30.4967 KOps/s | 31.1006 KOps/s | |
test_view_pytree | 76.6830μs | 25.8744μs | 38.6482 KOps/s | 37.4505 KOps/s | |
test_view_td | 80.1000μs | 36.7203μs | 27.2329 KOps/s | 26.8618 KOps/s | |
test_unbind_pytree | 65.1020μs | 29.0753μs | 34.3934 KOps/s | 33.7566 KOps/s | |
test_unbind_td | 0.3129ms | 37.7453μs | 26.4934 KOps/s | 26.0134 KOps/s | |
test_split_pytree | 64.5620μs | 28.9229μs | 34.5747 KOps/s | 33.8534 KOps/s | |
test_split_td | 0.5371ms | 42.7798μs | 23.3755 KOps/s | 22.5180 KOps/s | |
test_add_pytree | 93.7560μs | 34.5744μs | 28.9231 KOps/s | 27.8459 KOps/s | |
test_add_td | 0.1242ms | 53.2089μs | 18.7939 KOps/s | 19.6107 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1198ms | 63.0446μs | 15.8618 KOps/s | 15.5969 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3737ms | 0.1590ms | 6.2879 KOps/s | 6.1862 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1171ms | 46.4719μs | 21.5184 KOps/s | 21.3895 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2256ms | 0.1151ms | 8.6869 KOps/s | 8.4192 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 89.7780μs | 26.8150μs | 37.2925 KOps/s | 38.6049 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1279ms | 53.5271μs | 18.6821 KOps/s | 18.6632 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1688ms | 78.4244μs | 12.7511 KOps/s | 12.5326 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1603ms | 67.5746μs | 14.7985 KOps/s | 14.4990 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2454ms | 0.1062ms | 9.4177 KOps/s | 9.3112 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4164ms | 0.1999ms | 5.0015 KOps/s | 5.0565 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1373ms | 45.8197μs | 21.8247 KOps/s | 21.3873 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4678ms | 59.9832μs | 16.6713 KOps/s | 16.4549 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1852ms | 0.1044ms | 9.5767 KOps/s | 9.4873 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3695ms | 0.1985ms | 5.0369 KOps/s | 4.9877 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4125ms | 0.2088ms | 4.7884 KOps/s | 4.7560 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2028ms | 0.1092ms | 9.1561 KOps/s | 9.2029 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1344ms | 54.8461μs | 18.2328 KOps/s | 18.3414 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1115ms | 45.9510μs | 21.7623 KOps/s | 20.9172 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5993ms | 0.1566ms | 6.3837 KOps/s | 6.3516 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2261ms | 0.1049ms | 9.5363 KOps/s | 9.2849 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 76.4830μs | 22.5369μs | 44.3717 KOps/s | 46.6798 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1649ms | 59.5941μs | 16.7802 KOps/s | 16.7426 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1589ms | 80.9587μs | 12.3520 KOps/s | 12.2376 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1342ms | 69.0003μs | 14.4927 KOps/s | 14.2083 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4035ms | 0.2152ms | 4.6462 KOps/s | 4.5846 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.2091ms | 1.2947ms | 772.4059 Ops/s | 760.4812 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4680ms | 0.2044ms | 4.8934 KOps/s | 4.6341 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3596ms | 0.7690ms | 1.3004 KOps/s | 1.2481 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5583ms | 0.4542ms | 2.2016 KOps/s | 2.0663 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9302ms | 2.6496ms | 377.4143 Ops/s | 397.1554 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1065ms | 36.3522μs | 27.5087 KOps/s | 27.0584 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.7026ms | 31.9390μs | 31.3097 KOps/s | 30.9813 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 78.7380μs | 29.7638μs | 33.5978 KOps/s | 33.6177 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 63.0780μs | 22.8529μs | 43.7580 KOps/s | 43.1401 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 77.8060μs | 31.0664μs | 32.1891 KOps/s | 33.3028 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 82.5950μs | 23.0885μs | 43.3116 KOps/s | 43.4584 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1171ms | 52.1649μs | 19.1700 KOps/s | 19.0087 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5978ms | 19.1345μs | 52.2616 KOps/s | 49.6038 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 92.0430μs | 44.3981μs | 22.5235 KOps/s | 22.6519 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 59.7520μs | 18.6206μs | 53.7038 KOps/s | 51.9075 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 92.7140μs | 45.2537μs | 22.0976 KOps/s | 22.1754 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 64.8410μs | 18.6688μs | 53.5652 KOps/s | 51.7726 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1125ms | 53.2045μs | 18.7954 KOps/s | 19.0202 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9041ms | 19.2571μs | 51.9289 KOps/s | 49.5603 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1296ms | 45.9281μs | 21.7731 KOps/s | 21.9119 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1034ms | 18.5041μs | 54.0420 KOps/s | 52.0810 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1170ms | 45.6887μs | 21.8872 KOps/s | 22.2679 KOps/s | |
test_compile_indexing[int-pytree-eager] | 76.0620μs | 18.5518μs | 53.9032 KOps/s | 51.9886 KOps/s | |
test_mod_add[eager] | 90.6800μs | 33.1000μs | 30.2115 KOps/s | 30.5172 KOps/s | |
test_mod_add[compile] | 0.1559ms | 47.6030μs | 21.0071 KOps/s | 20.0105 KOps/s | |
test_mod_add[compile-overhead] | 93.8960μs | 46.3795μs | 21.5612 KOps/s | 19.6083 KOps/s | |
test_mod_wrap[eager] | 0.4698ms | 0.2268ms | 4.4086 KOps/s | 4.4823 KOps/s | |
test_mod_wrap[compile] | 0.4008ms | 0.2066ms | 4.8403 KOps/s | 4.7179 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4605ms | 0.2063ms | 4.8469 KOps/s | 4.8485 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.5437ms | 11.0509ms | 90.4906 Ops/s | 83.6058 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.4817ms | 12.4986ms | 80.0087 Ops/s | 76.0080 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 19.2086ms | 13.4076ms | 74.5843 Ops/s | 76.1937 Ops/s | |
test_seq_add[eager] | 0.2429ms | 0.1150ms | 8.6961 KOps/s | 9.3930 KOps/s | |
test_seq_add[compile] | 0.1223ms | 63.0774μs | 15.8535 KOps/s | 15.6372 KOps/s | |
test_seq_add[compile-overhead] | 0.1325ms | 59.6880μs | 16.7538 KOps/s | 15.9516 KOps/s | |
test_seq_wrap[eager] | 0.7734ms | 0.4512ms | 2.2161 KOps/s | 2.2309 KOps/s | |
test_seq_wrap[compile] | 0.3841ms | 0.2314ms | 4.3208 KOps/s | 4.2459 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3509ms | 0.2290ms | 4.3662 KOps/s | 4.2715 KOps/s | |
test_func_call_runtime[False-eager] | 1.1194ms | 0.5596ms | 1.7869 KOps/s | 1.7842 KOps/s | |
test_func_call_runtime[False-compile] | 0.7280ms | 0.4251ms | 2.3523 KOps/s | 2.3182 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6074ms | 0.4264ms | 2.3455 KOps/s | 2.3251 KOps/s | |
test_func_call_runtime[True-eager] | 1.5848ms | 0.7648ms | 1.3075 KOps/s | 1.2918 KOps/s | |
test_func_call_runtime[True-compile] | 0.7501ms | 0.4664ms | 2.1442 KOps/s | 2.1266 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6068ms | 0.4710ms | 2.1233 KOps/s | 2.1224 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8716ms | 0.5508ms | 1.8156 KOps/s | 1.7966 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6160ms | 0.4264ms | 2.3452 KOps/s | 2.3483 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 1.4262ms | 0.4300ms | 2.3256 KOps/s | 2.3419 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4410ms | 0.8937ms | 1.1190 KOps/s | 1.1013 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6434ms | 0.4878ms | 2.0499 KOps/s | 2.0122 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1521ms | 0.4920ms | 2.0326 KOps/s | 2.0301 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5300ms | 1.8635ms | 536.6183 Ops/s | 522.3332 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8867ms | 0.5108ms | 1.9578 KOps/s | 1.9037 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7187ms | 0.5125ms | 1.9514 KOps/s | 1.8783 KOps/s | |
test_distributed | 0.2987ms | 0.1262ms | 7.9267 KOps/s | 7.7721 KOps/s | |
test_tdmodule | 67.1360μs | 25.9348μs | 38.5582 KOps/s | 39.8064 KOps/s | |
test_tdmodule_dispatch | 64.1100μs | 47.3672μs | 21.1117 KOps/s | 21.6395 KOps/s | |
test_tdseq | 57.1870μs | 26.2845μs | 38.0453 KOps/s | 40.2754 KOps/s | |
test_tdseq_dispatch | 77.8060μs | 50.5365μs | 19.7877 KOps/s | 21.3088 KOps/s | |
test_instantiation_functorch | 1.6233ms | 1.5068ms | 663.6607 Ops/s | 649.6264 Ops/s | |
test_exec_functorch | 0.3199ms | 0.1791ms | 5.5837 KOps/s | 5.4619 KOps/s | |
test_exec_functional_call | 0.3175ms | 0.1719ms | 5.8190 KOps/s | 5.8059 KOps/s | |
test_exec_td_decorator | 0.4997ms | 0.2300ms | 4.3474 KOps/s | 4.3603 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7704ms | 0.6349ms | 1.5750 KOps/s | 1.5401 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9025ms | 0.6357ms | 1.5730 KOps/s | 1.4977 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7797ms | 0.5149ms | 1.9421 KOps/s | 1.8783 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8134ms | 0.5148ms | 1.9423 KOps/s | 1.8480 KOps/s | |
test_to_module_speed[True] | 1.4699ms | 1.2879ms | 776.4306 Ops/s | 763.5412 Ops/s | |
test_to_module_speed[False] | 2.0292ms | 1.2763ms | 783.5108 Ops/s | 788.2762 Ops/s | |
test_tc_init | 89.3480μs | 46.8443μs | 21.3473 KOps/s | 22.4492 KOps/s | |
test_tc_init_nested | 0.1734ms | 95.4040μs | 10.4817 KOps/s | 11.3033 KOps/s | |
test_tc_first_layer_tensor | 18.3050μs | 1.5213μs | 657.3375 KOps/s | 658.0320 KOps/s | |
test_tc_first_layer_nontensor | 23.7140μs | 4.7491μs | 210.5680 KOps/s | 211.4761 KOps/s | |
test_tc_second_layer_tensor | 24.8770μs | 2.8144μs | 355.3144 KOps/s | 351.4010 KOps/s | |
test_tc_second_layer_nontensor | 26.9710μs | 6.0338μs | 165.7335 KOps/s | 162.8539 KOps/s | |
test_unbind | 0.2178s | 12.3689ms | 80.8481 Ops/s | 83.0624 Ops/s | |
test_full_like | 15.3104ms | 11.6661ms | 85.7188 Ops/s | 89.2642 Ops/s | |
test_zeros_like | 11.0764ms | 7.0486ms | 141.8712 Ops/s | 138.9173 Ops/s | |
test_ones_like | 10.4874ms | 7.4841ms | 133.6163 Ops/s | 137.5399 Ops/s | |
test_clone | 14.8723ms | 8.9787ms | 111.3749 Ops/s | 112.2437 Ops/s | |
test_squeeze | 60.9650μs | 11.8461μs | 84.4158 KOps/s | 84.3499 KOps/s | |
test_unsqueeze | 0.1781ms | 86.8284μs | 11.5170 KOps/s | 11.1615 KOps/s | |
test_split | 0.4673ms | 0.1871ms | 5.3436 KOps/s | 5.1854 KOps/s | |
test_permute | 0.3300ms | 0.2035ms | 4.9133 KOps/s | 4.9863 KOps/s | |
test_stack | 28.6176ms | 23.4384ms | 42.6650 Ops/s | 39.5085 Ops/s | |
test_cat | 25.9119ms | 23.0661ms | 43.3537 Ops/s | 40.9527 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 33.8300μs | 11.2274μs | 89.0679 KOps/s | 97.5241 KOps/s | |
test_plain_set_stack_nested | 38.0700μs | 11.2849μs | 88.6136 KOps/s | 97.3940 KOps/s | |
test_plain_set_nested_inplace | 48.8800μs | 11.9319μs | 83.8087 KOps/s | 91.2193 KOps/s | |
test_plain_set_stack_nested_inplace | 39.0200μs | 11.9508μs | 83.6762 KOps/s | 90.4664 KOps/s | |
test_items | 22.4610μs | 2.8642μs | 349.1318 KOps/s | 346.4585 KOps/s | |
test_items_nested | 0.4013ms | 0.3626ms | 2.7582 KOps/s | 2.7326 KOps/s | |
test_items_nested_locked | 0.4397ms | 0.3607ms | 2.7723 KOps/s | 2.6827 KOps/s | |
test_items_nested_leaf | 84.4410μs | 57.9927μs | 17.2435 KOps/s | 17.1195 KOps/s | |
test_items_stack_nested | 0.4004ms | 0.3634ms | 2.7520 KOps/s | 2.7233 KOps/s | |
test_items_stack_nested_leaf | 89.6610μs | 59.4658μs | 16.8164 KOps/s | 17.0020 KOps/s | |
test_items_stack_nested_locked | 0.4122ms | 0.3637ms | 2.7492 KOps/s | 2.6971 KOps/s | |
test_keys | 0.1893ms | 3.4610μs | 288.9356 KOps/s | 269.5632 KOps/s | |
test_keys_nested | 0.2532ms | 70.0158μs | 14.2825 KOps/s | 14.2986 KOps/s | |
test_keys_nested_locked | 0.7908ms | 75.8346μs | 13.1866 KOps/s | 13.2203 KOps/s | |
test_keys_nested_leaf | 3.0033ms | 61.8239μs | 16.1750 KOps/s | 16.3196 KOps/s | |
test_keys_stack_nested | 0.1076ms | 69.9874μs | 14.2883 KOps/s | 14.3265 KOps/s | |
test_keys_stack_nested_leaf | 0.1295ms | 61.3684μs | 16.2950 KOps/s | 16.4564 KOps/s | |
test_keys_stack_nested_locked | 0.1602ms | 75.2213μs | 13.2941 KOps/s | 13.2783 KOps/s | |
test_values | 7.3200μs | 0.8458μs | 1.1823 MOps/s | 1.1890 MOps/s | |
test_values_nested | 61.4900μs | 31.1237μs | 32.1299 KOps/s | 32.3267 KOps/s | |
test_values_nested_locked | 60.5510μs | 32.6317μs | 30.6451 KOps/s | 30.9100 KOps/s | |
test_values_nested_leaf | 0.1132ms | 33.5016μs | 29.8493 KOps/s | 29.9344 KOps/s | |
test_values_stack_nested | 84.9110μs | 31.2705μs | 31.9790 KOps/s | 31.8413 KOps/s | |
test_values_stack_nested_leaf | 0.2137ms | 34.0886μs | 29.3353 KOps/s | 29.6967 KOps/s | |
test_values_stack_nested_locked | 0.1226ms | 32.9648μs | 30.3354 KOps/s | 30.7635 KOps/s | |
test_membership | 1.5885μs | 0.5065μs | 1.9743 MOps/s | 1.9370 MOps/s | |
test_membership_nested | 24.8005μs | 1.9884μs | 502.9044 KOps/s | 478.0304 KOps/s | |
test_membership_nested_leaf | 16.7255μs | 1.9870μs | 503.2756 KOps/s | 487.7148 KOps/s | |
test_membership_stacked_nested | 29.8310μs | 2.0824μs | 480.2152 KOps/s | 472.0930 KOps/s | |
test_membership_stacked_nested_leaf | 35.6710μs | 2.0840μs | 479.8405 KOps/s | 468.9147 KOps/s | |
test_membership_nested_last | 0.1788ms | 2.9078μs | 343.8981 KOps/s | 337.5746 KOps/s | |
test_membership_nested_leaf_last | 0.1733ms | 2.9203μs | 342.4268 KOps/s | 334.3441 KOps/s | |
test_membership_stacked_nested_last | 44.6400μs | 3.8588μs | 259.1455 KOps/s | 125.5689 KOps/s | |
test_membership_stacked_nested_leaf_last | 0.1896ms | 3.8191μs | 261.8387 KOps/s | 126.2107 KOps/s | |
test_nested_getleaf | 35.7400μs | 6.1408μs | 162.8455 KOps/s | 162.4509 KOps/s | |
test_nested_get | 26.9800μs | 5.8633μs | 170.5520 KOps/s | 170.0335 KOps/s | |
test_stacked_getleaf | 45.6300μs | 6.1051μs | 163.7980 KOps/s | 162.7402 KOps/s | |
test_stacked_get | 34.8000μs | 5.8339μs | 171.4112 KOps/s | 170.6465 KOps/s | |
test_nested_getitemleaf | 34.8900μs | 6.1953μs | 161.4116 KOps/s | 158.7665 KOps/s | |
test_nested_getitem | 27.5700μs | 5.8860μs | 169.8954 KOps/s | 167.0299 KOps/s | |
test_stacked_getitemleaf | 32.6200μs | 6.2057μs | 161.1420 KOps/s | 158.5977 KOps/s | |
test_stacked_getitem | 31.5800μs | 5.8879μs | 169.8409 KOps/s | 166.8264 KOps/s | |
test_lock_nested | 2.5842ms | 0.3691ms | 2.7095 KOps/s | 2.6579 KOps/s | |
test_lock_stack_nested | 0.4401ms | 0.3348ms | 2.9868 KOps/s | 3.0403 KOps/s | |
test_unlock_nested | 0.6444ms | 0.3078ms | 3.2486 KOps/s | 3.2653 KOps/s | |
test_unlock_stack_nested | 0.3831ms | 0.2742ms | 3.6471 KOps/s | 3.7314 KOps/s | |
test_flatten_speed | 0.1962ms | 74.4532μs | 13.4313 KOps/s | 13.4218 KOps/s | |
test_unflatten_speed | 0.3448ms | 0.3040ms | 3.2893 KOps/s | 3.2456 KOps/s | |
test_common_ops | 1.5986ms | 0.6163ms | 1.6225 KOps/s | 1.6858 KOps/s | |
test_creation | 0.1070ms | 1.4811μs | 675.1585 KOps/s | 680.2831 KOps/s | |
test_creation_empty | 29.8400μs | 8.4854μs | 117.8500 KOps/s | 152.7940 KOps/s | |
test_creation_nested_1 | 47.2010μs | 10.0802μs | 99.2048 KOps/s | 124.3574 KOps/s | |
test_creation_nested_2 | 40.8310μs | 12.5938μs | 79.4040 KOps/s | 94.3487 KOps/s | |
test_clone | 0.1535ms | 11.0253μs | 90.7002 KOps/s | 85.8167 KOps/s | |
test_getitem[int] | 2.0948ms | 11.0735μs | 90.3056 KOps/s | 90.4040 KOps/s | |
test_getitem[slice_int] | 0.1350ms | 21.5938μs | 46.3096 KOps/s | 45.3283 KOps/s | |
test_getitem[range] | 0.1404ms | 38.4207μs | 26.0276 KOps/s | 25.1725 KOps/s | |
test_getitem[tuple] | 0.1160ms | 18.6225μs | 53.6985 KOps/s | 52.7840 KOps/s | |
test_getitem[list] | 0.3112ms | 34.2196μs | 29.2230 KOps/s | 28.8363 KOps/s | |
test_setitem_dim[int] | 0.1355ms | 19.8589μs | 50.3552 KOps/s | 49.3857 KOps/s | |
test_setitem_dim[slice_int] | 67.7510μs | 38.7665μs | 25.7954 KOps/s | 24.9769 KOps/s | |
test_setitem_dim[range] | 77.7210μs | 53.7133μs | 18.6174 KOps/s | 18.4228 KOps/s | |
test_setitem_dim[tuple] | 55.4010μs | 32.5890μs | 30.6852 KOps/s | 29.9826 KOps/s | |
test_setitem | 0.1550ms | 16.0993μs | 62.1145 KOps/s | 65.2317 KOps/s | |
test_set | 47.1800μs | 15.5161μs | 64.4491 KOps/s | 65.8824 KOps/s | |
test_set_shared | 1.6131ms | 0.1479ms | 6.7634 KOps/s | 6.6641 KOps/s | |
test_update | 0.5647ms | 18.7469μs | 53.3423 KOps/s | 58.6842 KOps/s | |
test_update_nested | 1.3621ms | 24.5879μs | 40.6705 KOps/s | 44.2882 KOps/s | |
test_update__nested | 53.8900μs | 24.8802μs | 40.1926 KOps/s | 39.4366 KOps/s | |
test_set_nested | 0.1286ms | 16.6081μs | 60.2116 KOps/s | 60.9865 KOps/s | |
test_set_nested_new | 0.1244ms | 19.3718μs | 51.6214 KOps/s | 54.5783 KOps/s | |
test_select | 0.1424ms | 30.7956μs | 32.4722 KOps/s | 32.2470 KOps/s | |
test_select_nested | 70.5310μs | 41.6823μs | 23.9910 KOps/s | 24.0988 KOps/s | |
test_exclude_nested | 87.6600μs | 62.0264μs | 16.1222 KOps/s | 16.5421 KOps/s | |
test_empty[True] | 0.3141ms | 0.2802ms | 3.5690 KOps/s | 3.5553 KOps/s | |
test_empty[False] | 2.8780μs | 0.7457μs | 1.3410 MOps/s | 1.3508 MOps/s | |
test_to | 89.4310μs | 57.5264μs | 17.3833 KOps/s | 17.7925 KOps/s | |
test_to_nonblocking | 0.1966ms | 45.9295μs | 21.7725 KOps/s | 21.7264 KOps/s | |
test_unbind_speed | 0.2766ms | 0.2340ms | 4.2737 KOps/s | 4.2298 KOps/s | |
test_unbind_speed_stack0 | 0.2801ms | 0.2329ms | 4.2940 KOps/s | 4.3754 KOps/s | |
test_unbind_speed_stack1 | 0.1095s | 0.6611ms | 1.5127 KOps/s | 1.5692 KOps/s | |
test_split | 0.1081s | 1.6440ms | 608.2894 Ops/s | 558.4870 Ops/s | |
test_chunk | 0.1148s | 1.6592ms | 602.6844 Ops/s | 670.4188 Ops/s | |
test_consolidate[False-None] | 0.1175s | 2.9162ms | 342.9149 Ops/s | 345.5091 Ops/s | |
test_consolidate[default-None] | 1.8314ms | 1.7128ms | 583.8341 Ops/s | 551.9655 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8426ms | 1.7336ms | 576.8296 Ops/s | 542.5153 Ops/s | |
test_consolidate_njt[False-None] | 6.9746ms | 6.5213ms | 153.3446 Ops/s | 151.1992 Ops/s | |
test_to[False-False-None] | 1.8727ms | 1.7136ms | 583.5642 Ops/s | 591.2369 Ops/s | |
test_to[True-False-None] | 1.5969ms | 1.3412ms | 745.6276 Ops/s | 748.7579 Ops/s | |
test_to[within-False-None] | 4.3798ms | 4.1165ms | 242.9255 Ops/s | 246.6199 Ops/s | |
test_to[True-default-None] | 5.4653ms | 5.2306ms | 191.1828 Ops/s | 193.0480 Ops/s | |
test_to_njt[False-False-None] | 7.4208ms | 6.9670ms | 143.5343 Ops/s | 140.3792 Ops/s | |
test_to_njt[True-False-None] | 5.8246ms | 5.4935ms | 182.0341 Ops/s | 181.0300 Ops/s | |
test_to_njt[within-False-None] | 12.4054ms | 12.0356ms | 83.0871 Ops/s | 81.5159 Ops/s | |
test_creation[device0] | 0.4550ms | 79.4487μs | 12.5867 KOps/s | 12.2868 KOps/s | |
test_creation_from_tensor | 0.5344ms | 82.7864μs | 12.0793 KOps/s | 11.8730 KOps/s | |
test_add_one[memmap_tensor0] | 0.4718ms | 7.3856μs | 135.3981 KOps/s | 133.7711 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8240μs | 0.4117μs | 2.4287 MOps/s | 2.3572 MOps/s | |
test_stack[memmap_tensor0] | 36.1400μs | 4.7252μs | 211.6313 KOps/s | 197.6503 KOps/s | |
test_memmaptd_index | 1.7650ms | 0.2611ms | 3.8305 KOps/s | 3.6815 KOps/s | |
test_memmaptd_index_astensor | 0.8050ms | 0.3162ms | 3.1621 KOps/s | 3.0289 KOps/s | |
test_memmaptd_index_op | 1.0664ms | 0.6210ms | 1.6103 KOps/s | 1.6523 KOps/s | |
test_serialize_model | 0.1317s | 0.1309s | 7.6368 Ops/s | 7.6255 Ops/s | |
test_serialize_model_pickle | 1.3486s | 1.2149s | 0.8231 Ops/s | 0.8194 Ops/s | |
test_serialize_weights | 0.1331s | 0.1306s | 7.6595 Ops/s | 7.6195 Ops/s | |
test_serialize_weights_returnearly | 50.7625ms | 42.9437ms | 23.2863 Ops/s | 13.9262 Ops/s | |
test_serialize_weights_pickle | 1.3537s | 1.2126s | 0.8247 Ops/s | 0.8202 Ops/s | |
test_reshape_pytree | 0.1632ms | 21.9927μs | 45.4696 KOps/s | 43.7694 KOps/s | |
test_reshape_td | 63.8910μs | 27.0365μs | 36.9870 KOps/s | 36.4963 KOps/s | |
test_view_pytree | 0.1675ms | 22.0562μs | 45.3387 KOps/s | 45.0365 KOps/s | |
test_view_td | 0.1504ms | 28.6776μs | 34.8705 KOps/s | 32.8029 KOps/s | |
test_unbind_pytree | 0.1278ms | 28.3565μs | 35.2653 KOps/s | 35.4084 KOps/s | |
test_unbind_td | 0.8024ms | 36.4330μs | 27.4476 KOps/s | 27.3036 KOps/s | |
test_split_pytree | 0.1585ms | 30.0045μs | 33.3283 KOps/s | 32.7103 KOps/s | |
test_split_td | 0.1750ms | 39.1917μs | 25.5156 KOps/s | 25.0789 KOps/s | |
test_add_pytree | 95.1110μs | 35.6447μs | 28.0546 KOps/s | 27.5569 KOps/s | |
test_add_td | 0.1628ms | 50.7146μs | 19.7182 KOps/s | 20.8387 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1851ms | 0.1212ms | 8.2497 KOps/s | 7.9286 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2587ms | 0.1255ms | 7.9684 KOps/s | 8.0787 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2337ms | 96.1806μs | 10.3971 KOps/s | 10.2172 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.7515ms | 0.1520ms | 6.5793 KOps/s | 6.6340 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1076ms | 21.9203μs | 45.6198 KOps/s | 45.2888 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1444ms | 26.7616μs | 37.3670 KOps/s | 36.7996 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4659ms | 64.7143μs | 15.4525 KOps/s | 15.1891 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1969ms | 49.0141μs | 20.4023 KOps/s | 19.8826 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2801ms | 0.1446ms | 6.9169 KOps/s | 6.9453 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3544ms | 0.2069ms | 4.8323 KOps/s | 4.9040 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2519ms | 98.9603μs | 10.1051 KOps/s | 9.7524 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2098ms | 51.8961μs | 19.2693 KOps/s | 19.5962 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2901ms | 0.1365ms | 7.3245 KOps/s | 7.1149 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6489ms | 0.4922ms | 2.0315 KOps/s | 2.0514 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3922ms | 0.2478ms | 4.0351 KOps/s | 4.0715 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2919ms | 0.1445ms | 6.9209 KOps/s | 6.7377 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2315ms | 61.9239μs | 16.1489 KOps/s | 16.1623 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2490ms | 99.6184μs | 10.0383 KOps/s | 9.6209 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5779ms | 0.4182ms | 2.3914 KOps/s | 2.4924 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3002ms | 0.1376ms | 7.2696 KOps/s | 7.4188 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1340ms | 17.9738μs | 55.6365 KOps/s | 52.6101 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 79.8810μs | 26.9575μs | 37.0954 KOps/s | 37.8855 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1227ms | 69.9691μs | 14.2920 KOps/s | 14.1942 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.2121ms | 51.4904μs | 19.4211 KOps/s | 19.2922 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6771ms | 0.4556ms | 2.1950 KOps/s | 2.2050 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8877ms | 2.6417ms | 378.5423 Ops/s | 375.7824 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6330ms | 0.4474ms | 2.2351 KOps/s | 2.2131 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8476ms | 2.6975ms | 370.7130 Ops/s | 369.4097 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.6608ms | 0.1132ms | 8.8349 KOps/s | 8.7405 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5650ms | 79.0472μs | 12.6507 KOps/s | 12.6622 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5171ms | 0.1063ms | 9.4115 KOps/s | 9.5184 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2194ms | 67.5268μs | 14.8089 KOps/s | 14.4405 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2626ms | 0.1064ms | 9.3999 KOps/s | 9.1098 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2208ms | 67.2957μs | 14.8598 KOps/s | 14.4794 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2252ms | 0.1019ms | 9.8179 KOps/s | 9.7470 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.2006ms | 21.1633μs | 47.2516 KOps/s | 53.9243 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2124ms | 97.1171μs | 10.2968 KOps/s | 10.2453 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1610ms | 16.0310μs | 62.3792 KOps/s | 61.0685 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2792ms | 97.1823μs | 10.2899 KOps/s | 10.1957 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1514ms | 16.1368μs | 61.9703 KOps/s | 60.9305 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2483ms | 0.1025ms | 9.7573 KOps/s | 9.7720 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.7268ms | 18.1271μs | 55.1660 KOps/s | 54.7762 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2789ms | 97.6248μs | 10.2433 KOps/s | 10.2030 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 52.6310μs | 16.1691μs | 61.8464 KOps/s | 61.2270 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2484ms | 97.4935μs | 10.2571 KOps/s | 10.1206 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1445ms | 16.1777μs | 61.8134 KOps/s | 60.8025 KOps/s | |
test_mod_add[eager] | 0.1994ms | 37.7821μs | 26.4676 KOps/s | 27.1051 KOps/s | |
test_mod_add[compile] | 0.4857ms | 81.4218μs | 12.2817 KOps/s | 12.1685 KOps/s | |
test_mod_add[compile-overhead] | 0.3381ms | 0.1728ms | 5.7885 KOps/s | 5.5621 KOps/s | |
test_mod_wrap[eager] | 0.4094ms | 0.2533ms | 3.9484 KOps/s | 3.9717 KOps/s | |
test_mod_wrap[compile] | 0.4969ms | 0.2859ms | 3.4975 KOps/s | 3.4684 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5014ms | 3.8078ms | 262.6155 Ops/s | 263.4250 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6099ms | 1.3845ms | 722.2594 Ops/s | 686.3277 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4735ms | 1.2837ms | 778.9910 Ops/s | 722.7631 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4377ms | 0.9578ms | 1.0440 KOps/s | 947.8438 Ops/s | |
test_seq_add[eager] | 0.2764ms | 0.1157ms | 8.6415 KOps/s | 8.9073 KOps/s | |
test_seq_add[compile] | 0.2269ms | 89.7736μs | 11.1391 KOps/s | 11.1310 KOps/s | |
test_seq_add[compile-overhead] | 0.2683ms | 0.1298ms | 7.7065 KOps/s | 7.6218 KOps/s | |
test_seq_wrap[eager] | 0.6157ms | 0.4175ms | 2.3955 KOps/s | 2.3752 KOps/s | |
test_seq_wrap[compile] | 0.4414ms | 0.3021ms | 3.3104 KOps/s | 3.2742 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3910ms | 0.2283ms | 4.3801 KOps/s | 4.3575 KOps/s | |
test_func_call_runtime[False-eager] | 0.9172ms | 0.7510ms | 1.3316 KOps/s | 1.3327 KOps/s | |
test_func_call_runtime[False-compile] | 0.9734ms | 0.7536ms | 1.3270 KOps/s | 1.3296 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5057ms | 0.3663ms | 2.7297 KOps/s | 2.7400 KOps/s | |
test_func_call_runtime[True-eager] | 1.0684ms | 0.9016ms | 1.1091 KOps/s | 1.1093 KOps/s | |
test_func_call_runtime[True-compile] | 1.1795ms | 0.7751ms | 1.2902 KOps/s | 1.2954 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8201ms | 0.3874ms | 2.5812 KOps/s | 2.5920 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.1370ms | 0.7641ms | 1.3088 KOps/s | 1.3365 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.1659ms | 0.7721ms | 1.2951 KOps/s | 1.3242 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5098ms | 0.3681ms | 2.7163 KOps/s | 2.7118 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4443ms | 1.0080ms | 992.0225 Ops/s | 985.3693 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9989ms | 0.8083ms | 1.2371 KOps/s | 1.2462 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5595ms | 0.4115ms | 2.4304 KOps/s | 2.4173 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5926ms | 2.0837ms | 479.9163 Ops/s | 478.0493 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9812ms | 0.8108ms | 1.2333 KOps/s | 1.2203 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5878ms | 0.4190ms | 2.3864 KOps/s | 2.3851 KOps/s | |
test_distributed | 10.1024ms | 0.1895ms | 5.2774 KOps/s | 8.3380 KOps/s | |
test_tdmodule | 30.0600μs | 19.2066μs | 52.0653 KOps/s | 53.0592 KOps/s | |
test_tdmodule_dispatch | 0.1535ms | 35.4424μs | 28.2148 KOps/s | 30.3537 KOps/s | |
test_tdseq | 41.1900μs | 19.4461μs | 51.4241 KOps/s | 55.5802 KOps/s | |
test_tdseq_dispatch | 0.1359ms | 37.0717μs | 26.9748 KOps/s | 29.2454 KOps/s | |
test_instantiation_functorch | 1.8008ms | 1.5510ms | 644.7369 Ops/s | 644.5727 Ops/s | |
test_exec_functorch | 0.3259ms | 0.1483ms | 6.7423 KOps/s | 6.7011 KOps/s | |
test_exec_functional_call | 0.2655ms | 0.1412ms | 7.0803 KOps/s | 6.8930 KOps/s | |
test_exec_td_decorator | 0.3884ms | 0.1875ms | 5.3320 KOps/s | 5.3003 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8473ms | 0.6829ms | 1.4644 KOps/s | 1.4485 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8506ms | 0.6933ms | 1.4423 KOps/s | 1.4731 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8375ms | 0.6129ms | 1.6317 KOps/s | 1.6349 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7864ms | 0.5985ms | 1.6708 KOps/s | 1.6823 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.3721ms | 19.1903ms | 52.1097 Ops/s | 52.1071 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.9784ms | 19.2575ms | 51.9279 Ops/s | 52.2138 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.3690ms | 19.1012ms | 52.3527 Ops/s | 52.7614 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.2786ms | 19.1068ms | 52.3373 Ops/s | 52.5830 Ops/s | |
test_to_module_speed[True] | 1.0433ms | 0.9242ms | 1.0820 KOps/s | 1.0745 KOps/s | |
test_to_module_speed[False] | 1.2642ms | 0.8973ms | 1.1144 KOps/s | 1.0954 KOps/s | |
test_tc_init | 60.7600μs | 34.6365μs | 28.8713 KOps/s | 31.2086 KOps/s | |
test_tc_init_nested | 0.1699ms | 70.7452μs | 14.1352 KOps/s | 15.1731 KOps/s | |
test_tc_first_layer_tensor | 3.8829μs | 0.7031μs | 1.4224 MOps/s | 1.4665 MOps/s | |
test_tc_first_layer_nontensor | 24.5300μs | 2.3421μs | 426.9742 KOps/s | 433.9179 KOps/s | |
test_tc_second_layer_tensor | 11.7803μs | 1.4465μs | 691.3214 KOps/s | 710.6662 KOps/s | |
test_tc_second_layer_nontensor | 31.6400μs | 3.0580μs | 327.0125 KOps/s | 325.2082 KOps/s | |
test_unbind | 0.2376s | 10.5737ms | 94.5747 Ops/s | 152.0598 Ops/s | |
test_full_like | 13.3555ms | 10.6983ms | 93.4731 Ops/s | 96.4927 Ops/s | |
test_zeros_like | 5.9593ms | 4.6778ms | 213.7777 Ops/s | 133.7812 Ops/s | |
test_ones_like | 6.1252ms | 4.6734ms | 213.9785 Ops/s | 212.6321 Ops/s | |
test_clone | 9.1718ms | 7.5823ms | 131.8867 Ops/s | 133.0232 Ops/s | |
test_squeeze | 0.1423ms | 9.6053μs | 104.1090 KOps/s | 105.2535 KOps/s | |
test_unsqueeze | 0.2105ms | 70.4188μs | 14.2008 KOps/s | 14.0742 KOps/s | |
test_split | 0.3395ms | 0.1651ms | 6.0568 KOps/s | 6.3211 KOps/s | |
test_permute | 0.2754ms | 0.1781ms | 5.6162 KOps/s | 5.8023 KOps/s | |
test_stack | 54.1348ms | 52.7653ms | 18.9518 Ops/s | 18.8126 Ops/s | |
test_cat | 55.2989ms | 52.4453ms | 19.0675 Ops/s | 18.7652 Ops/s |
vmoens
added a commit
that referenced
this pull request
Dec 9, 2024
ghstack-source-id: 86065377cc1cd7c7283ed0a468f5d5602d60526d Pull Request resolved: #1132
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):