-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Refactor] Refactor context managers #1098
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Nov 20, 2024
ghstack-source-id: 9666ab7653889907c139ba14eb2cb4e6151c3f5d Pull Request resolved: #1098
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 20, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 41.3470μs | 17.6624μs | 56.6174 KOps/s | 56.7636 KOps/s | |
test_plain_set_stack_nested | 41.3370μs | 17.9069μs | 55.8444 KOps/s | 55.8640 KOps/s | |
test_plain_set_nested_inplace | 73.1170μs | 19.4251μs | 51.4799 KOps/s | 51.1795 KOps/s | |
test_plain_set_stack_nested_inplace | 51.0550μs | 19.6623μs | 50.8587 KOps/s | 51.0534 KOps/s | |
test_items | 32.1600μs | 4.1826μs | 239.0841 KOps/s | 232.1014 KOps/s | |
test_items_nested | 0.5594ms | 0.3424ms | 2.9206 KOps/s | 2.9346 KOps/s | |
test_items_nested_locked | 0.4949ms | 0.3422ms | 2.9222 KOps/s | 2.9058 KOps/s | |
test_items_nested_leaf | 0.1339ms | 72.5926μs | 13.7755 KOps/s | 13.7323 KOps/s | |
test_items_stack_nested | 0.4782ms | 0.3465ms | 2.8863 KOps/s | 2.8425 KOps/s | |
test_items_stack_nested_leaf | 0.1358ms | 75.7729μs | 13.1973 KOps/s | 13.5517 KOps/s | |
test_items_stack_nested_locked | 0.4344ms | 0.3478ms | 2.8752 KOps/s | 2.8641 KOps/s | |
test_keys | 41.6180μs | 3.4839μs | 287.0377 KOps/s | 262.5652 KOps/s | |
test_keys_nested | 0.2305ms | 0.1348ms | 7.4209 KOps/s | 7.3786 KOps/s | |
test_keys_nested_locked | 1.8704ms | 0.1396ms | 7.1634 KOps/s | 7.0522 KOps/s | |
test_keys_nested_leaf | 0.2083ms | 0.1160ms | 8.6199 KOps/s | 8.5777 KOps/s | |
test_keys_stack_nested | 0.2225ms | 0.1355ms | 7.3801 KOps/s | 7.4044 KOps/s | |
test_keys_stack_nested_leaf | 0.1986ms | 0.1163ms | 8.6006 KOps/s | 8.7592 KOps/s | |
test_keys_stack_nested_locked | 0.2427ms | 0.1399ms | 7.1464 KOps/s | 7.1982 KOps/s | |
test_values | 7.0752μs | 1.0378μs | 963.6228 KOps/s | 983.2852 KOps/s | |
test_values_nested | 0.1021ms | 55.5676μs | 17.9961 KOps/s | 18.0888 KOps/s | |
test_values_nested_locked | 0.1020ms | 55.3077μs | 18.0807 KOps/s | 18.1247 KOps/s | |
test_values_nested_leaf | 0.1070ms | 60.1119μs | 16.6356 KOps/s | 16.4418 KOps/s | |
test_values_stack_nested | 0.1073ms | 56.5165μs | 17.6940 KOps/s | 17.4677 KOps/s | |
test_values_stack_nested_leaf | 0.1263ms | 60.3562μs | 16.5683 KOps/s | 16.5024 KOps/s | |
test_values_stack_nested_locked | 0.1187ms | 56.8134μs | 17.6015 KOps/s | 17.7180 KOps/s | |
test_membership | 16.1210μs | 0.8972μs | 1.1145 MOps/s | 1.1659 MOps/s | |
test_membership_nested | 23.9750μs | 2.7375μs | 365.3033 KOps/s | 368.3903 KOps/s | |
test_membership_nested_leaf | 29.6050μs | 2.7703μs | 360.9690 KOps/s | 365.1590 KOps/s | |
test_membership_stacked_nested | 22.5420μs | 2.7218μs | 367.4016 KOps/s | 360.6474 KOps/s | |
test_membership_stacked_nested_leaf | 30.2470μs | 2.7456μs | 364.2236 KOps/s | 368.7912 KOps/s | |
test_membership_nested_last | 26.3890μs | 4.1082μs | 243.4155 KOps/s | 245.6866 KOps/s | |
test_membership_nested_leaf_last | 33.7330μs | 4.0536μs | 246.6952 KOps/s | 245.2453 KOps/s | |
test_membership_stacked_nested_last | 29.8650μs | 6.1769μs | 161.8947 KOps/s | 77.6187 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.3470μs | 6.0524μs | 165.2245 KOps/s | 77.3379 KOps/s | |
test_nested_getleaf | 46.5070μs | 10.5216μs | 95.0426 KOps/s | 94.5150 KOps/s | |
test_nested_get | 49.3420μs | 10.0916μs | 99.0923 KOps/s | 99.5563 KOps/s | |
test_stacked_getleaf | 45.7190μs | 10.4543μs | 95.6540 KOps/s | 95.3065 KOps/s | |
test_stacked_get | 33.6530μs | 10.6972μs | 93.4827 KOps/s | 98.7096 KOps/s | |
test_nested_getitemleaf | 53.1190μs | 10.9957μs | 90.9443 KOps/s | 90.4221 KOps/s | |
test_nested_getitem | 54.2120μs | 10.2243μs | 97.8064 KOps/s | 95.3629 KOps/s | |
test_stacked_getitemleaf | 38.2410μs | 11.0913μs | 90.1607 KOps/s | 92.2064 KOps/s | |
test_stacked_getitem | 46.9380μs | 10.2852μs | 97.2269 KOps/s | 97.7581 KOps/s | |
test_lock_nested | 4.4064ms | 0.4410ms | 2.2677 KOps/s | 1.8491 KOps/s | |
test_lock_stack_nested | 0.6473ms | 0.4111ms | 2.4326 KOps/s | 2.5126 KOps/s | |
test_unlock_nested | 0.7465ms | 0.3559ms | 2.8096 KOps/s | 2.7658 KOps/s | |
test_unlock_stack_nested | 0.6550ms | 0.3308ms | 3.0225 KOps/s | 3.1630 KOps/s | |
test_flatten_speed | 0.1700ms | 92.5366μs | 10.8065 KOps/s | 10.8869 KOps/s | |
test_unflatten_speed | 0.8051ms | 0.4800ms | 2.0835 KOps/s | 2.1300 KOps/s | |
test_common_ops | 5.9916ms | 0.7563ms | 1.3222 KOps/s | 1.3226 KOps/s | |
test_creation | 72.6960μs | 2.1255μs | 470.4715 KOps/s | 479.5434 KOps/s | |
test_creation_empty | 42.2590μs | 10.6214μs | 94.1499 KOps/s | 96.9390 KOps/s | |
test_creation_nested_1 | 41.2370μs | 13.3425μs | 74.9487 KOps/s | 76.7621 KOps/s | |
test_creation_nested_2 | 50.5440μs | 17.5625μs | 56.9396 KOps/s | 58.3215 KOps/s | |
test_clone | 63.7690μs | 12.9650μs | 77.1305 KOps/s | 77.6310 KOps/s | |
test_getitem[int] | 0.8954ms | 12.4510μs | 80.3151 KOps/s | 79.8338 KOps/s | |
test_getitem[slice_int] | 0.1418ms | 23.5893μs | 42.3922 KOps/s | 43.2876 KOps/s | |
test_getitem[range] | 0.2345ms | 46.9362μs | 21.3055 KOps/s | 21.7915 KOps/s | |
test_getitem[tuple] | 0.1317ms | 19.6531μs | 50.8825 KOps/s | 52.5643 KOps/s | |
test_getitem[list] | 0.2396ms | 42.6182μs | 23.4641 KOps/s | 24.0290 KOps/s | |
test_setitem_dim[int] | 42.8000μs | 24.6545μs | 40.5605 KOps/s | 41.5562 KOps/s | |
test_setitem_dim[slice_int] | 85.8410μs | 50.1648μs | 19.9343 KOps/s | 19.9411 KOps/s | |
test_setitem_dim[range] | 0.1514ms | 75.1223μs | 13.3116 KOps/s | 13.8293 KOps/s | |
test_setitem_dim[tuple] | 64.6310μs | 39.6102μs | 25.2461 KOps/s | 25.2968 KOps/s | |
test_setitem | 91.2410μs | 19.5651μs | 51.1115 KOps/s | 52.4296 KOps/s | |
test_set | 97.1720μs | 19.0811μs | 52.4079 KOps/s | 53.6435 KOps/s | |
test_set_shared | 4.3407ms | 0.1674ms | 5.9739 KOps/s | 6.0585 KOps/s | |
test_update | 0.1784ms | 21.5802μs | 46.3388 KOps/s | 47.7035 KOps/s | |
test_update_nested | 0.1473ms | 30.7722μs | 32.4969 KOps/s | 32.9577 KOps/s | |
test_update__nested | 0.4277ms | 32.0378μs | 31.2131 KOps/s | 32.9269 KOps/s | |
test_set_nested | 0.1237ms | 20.8514μs | 47.9584 KOps/s | 49.5198 KOps/s | |
test_set_nested_new | 0.1300ms | 25.4356μs | 39.3150 KOps/s | 40.2267 KOps/s | |
test_select | 0.2174ms | 41.7443μs | 23.9554 KOps/s | 24.6740 KOps/s | |
test_select_nested | 0.1240ms | 60.7423μs | 16.4630 KOps/s | 16.9252 KOps/s | |
test_exclude_nested | 0.1762ms | 76.1823μs | 13.1264 KOps/s | 13.4922 KOps/s | |
test_empty[True] | 0.6499ms | 0.3490ms | 2.8651 KOps/s | 2.8627 KOps/s | |
test_empty[False] | 15.4613μs | 1.2337μs | 810.5962 KOps/s | 791.2869 KOps/s | |
test_unbind_speed | 0.3530ms | 0.2600ms | 3.8461 KOps/s | 3.8488 KOps/s | |
test_unbind_speed_stack0 | 0.5594ms | 0.2585ms | 3.8690 KOps/s | 3.9919 KOps/s | |
test_unbind_speed_stack1 | 0.1064s | 0.7018ms | 1.4249 KOps/s | 1.4493 KOps/s | |
test_split | 99.1645ms | 1.7286ms | 578.5172 Ops/s | 580.6146 Ops/s | |
test_chunk | 0.1031s | 1.7166ms | 582.5351 Ops/s | 577.3398 Ops/s | |
test_consolidate_njt[False-None] | 0.1115s | 9.0389ms | 110.6335 Ops/s | 121.2632 Ops/s | |
test_creation[device0] | 0.2287ms | 89.0368μs | 11.2313 KOps/s | 10.9725 KOps/s | |
test_creation_from_tensor | 3.3852ms | 92.4340μs | 10.8185 KOps/s | 10.4939 KOps/s | |
test_add_one[memmap_tensor0] | 0.3162ms | 4.8127μs | 207.7850 KOps/s | 212.0505 KOps/s | |
test_contiguous[memmap_tensor0] | 10.9310μs | 0.5015μs | 1.9940 MOps/s | 1.9726 MOps/s | |
test_stack[memmap_tensor0] | 40.8560μs | 3.2939μs | 303.5891 KOps/s | 300.7255 KOps/s | |
test_memmaptd_index | 1.1231ms | 0.2339ms | 4.2753 KOps/s | 4.3147 KOps/s | |
test_memmaptd_index_astensor | 0.5767ms | 0.3089ms | 3.2375 KOps/s | 3.2503 KOps/s | |
test_memmaptd_index_op | 1.0152ms | 0.5624ms | 1.7780 KOps/s | 1.7785 KOps/s | |
test_serialize_model | 0.1237s | 0.1155s | 8.6578 Ops/s | 7.5609 Ops/s | |
test_serialize_model_pickle | 0.4425s | 0.3889s | 2.5716 Ops/s | 2.4947 Ops/s | |
test_serialize_weights | 0.2174s | 0.1281s | 7.8074 Ops/s | 8.5053 Ops/s | |
test_serialize_weights_returnearly | 0.1685s | 0.1558s | 6.4203 Ops/s | 6.3006 Ops/s | |
test_serialize_weights_pickle | 0.4829s | 0.4332s | 2.3086 Ops/s | 2.4973 Ops/s | |
test_serialize_weights_filesystem | 0.1540s | 0.1433s | 6.9779 Ops/s | 6.9060 Ops/s | |
test_serialize_model_filesystem | 0.2520s | 0.1655s | 6.0426 Ops/s | 6.6536 Ops/s | |
test_reshape_pytree | 65.9230μs | 26.5034μs | 37.7310 KOps/s | 35.7981 KOps/s | |
test_reshape_td | 69.4700μs | 32.6564μs | 30.6219 KOps/s | 30.3447 KOps/s | |
test_view_pytree | 76.8540μs | 27.1380μs | 36.8487 KOps/s | 37.7258 KOps/s | |
test_view_td | 99.7160μs | 37.0768μs | 26.9711 KOps/s | 26.9687 KOps/s | |
test_unbind_pytree | 63.4190μs | 29.7006μs | 33.6694 KOps/s | 33.4410 KOps/s | |
test_unbind_td | 0.3104ms | 37.6894μs | 26.5327 KOps/s | 26.1285 KOps/s | |
test_split_pytree | 69.0690μs | 29.3075μs | 34.1209 KOps/s | 34.4503 KOps/s | |
test_split_td | 0.5012ms | 43.5336μs | 22.9708 KOps/s | 23.2882 KOps/s | |
test_add_pytree | 0.1200ms | 35.6907μs | 28.0185 KOps/s | 28.6158 KOps/s | |
test_add_td | 0.1326ms | 53.0812μs | 18.8391 KOps/s | 18.2367 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1486ms | 61.9162μs | 16.1509 KOps/s | 16.1148 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3685ms | 0.1601ms | 6.2442 KOps/s | 6.3673 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1109ms | 44.1145μs | 22.6683 KOps/s | 22.0807 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2262ms | 0.1185ms | 8.4409 KOps/s | 8.5051 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 87.1670μs | 26.6743μs | 37.4893 KOps/s | 38.6247 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1085ms | 53.3582μs | 18.7412 KOps/s | 18.4736 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1468ms | 78.2186μs | 12.7847 KOps/s | 12.7749 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1316ms | 68.1005μs | 14.6842 KOps/s | 14.7767 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1634ms | 0.1042ms | 9.5985 KOps/s | 9.4652 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2770ms | 0.1989ms | 5.0286 KOps/s | 5.0286 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1242ms | 44.4186μs | 22.5131 KOps/s | 21.7060 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4600ms | 59.8997μs | 16.6946 KOps/s | 16.6343 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1606ms | 0.1019ms | 9.8132 KOps/s | 9.9503 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3754ms | 0.2043ms | 4.8953 KOps/s | 5.0000 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3088ms | 0.2105ms | 4.7510 KOps/s | 4.8133 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2421ms | 0.1052ms | 9.5086 KOps/s | 9.6548 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1840ms | 54.7347μs | 18.2700 KOps/s | 19.0334 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 94.2760μs | 45.6476μs | 21.9070 KOps/s | 22.0266 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6233ms | 0.1606ms | 6.2279 KOps/s | 6.4152 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1883ms | 0.1029ms | 9.7192 KOps/s | 9.7409 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 86.4070μs | 22.1924μs | 45.0604 KOps/s | 46.3956 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1230ms | 59.0104μs | 16.9462 KOps/s | 16.7845 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1608ms | 81.5886μs | 12.2566 KOps/s | 12.5578 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1465ms | 70.6854μs | 14.1472 KOps/s | 14.8474 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4186ms | 0.2129ms | 4.6979 KOps/s | 4.8696 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5416ms | 1.2664ms | 789.6654 Ops/s | 794.4088 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3772ms | 0.2061ms | 4.8520 KOps/s | 4.9671 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3810ms | 0.7854ms | 1.2733 KOps/s | 1.3004 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5901ms | 0.4708ms | 2.1241 KOps/s | 2.2203 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.9344ms | 2.5674ms | 389.4997 Ops/s | 393.3065 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 95.1980μs | 35.8714μs | 27.8773 KOps/s | 28.6317 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6943ms | 31.5340μs | 31.7118 KOps/s | 32.3085 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1091ms | 28.8126μs | 34.7071 KOps/s | 35.5802 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 83.2550μs | 22.7275μs | 43.9996 KOps/s | 44.4863 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 89.9980μs | 29.5121μs | 33.8844 KOps/s | 33.9752 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 84.5780μs | 22.9006μs | 43.6670 KOps/s | 44.5172 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1219ms | 51.3783μs | 19.4635 KOps/s | 19.6617 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5457ms | 19.2141μs | 52.0452 KOps/s | 52.9059 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1002ms | 43.6058μs | 22.9328 KOps/s | 22.9951 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 57.0470μs | 18.9612μs | 52.7392 KOps/s | 54.4758 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1411ms | 44.6619μs | 22.3904 KOps/s | 22.5197 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 64.8210μs | 18.8073μs | 53.1707 KOps/s | 54.5906 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1318ms | 52.4848μs | 19.0531 KOps/s | 19.1540 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8613ms | 19.1538μs | 52.2090 KOps/s | 53.1633 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1243ms | 45.8399μs | 21.8150 KOps/s | 22.6748 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 66.6550μs | 18.6447μs | 53.6346 KOps/s | 54.5286 KOps/s | |
test_compile_indexing[int-pytree-compile] | 93.3450μs | 44.1838μs | 22.6327 KOps/s | 22.6266 KOps/s | |
test_compile_indexing[int-pytree-eager] | 60.6130μs | 18.6453μs | 53.6328 KOps/s | 54.5548 KOps/s | |
test_mod_add[eager] | 94.0750μs | 29.7088μs | 33.6601 KOps/s | 37.8884 KOps/s | |
test_mod_add[compile] | 89.6280μs | 44.3149μs | 22.5658 KOps/s | 22.9864 KOps/s | |
test_mod_add[compile-overhead] | 0.1091ms | 45.3179μs | 22.0663 KOps/s | 22.2194 KOps/s | |
test_mod_wrap[eager] | 0.3533ms | 0.2185ms | 4.5776 KOps/s | 4.7970 KOps/s | |
test_mod_wrap[compile] | 1.7766ms | 0.2052ms | 4.8743 KOps/s | 4.9371 KOps/s | |
test_mod_wrap[compile-overhead] | 2.1775ms | 0.2019ms | 4.9533 KOps/s | 5.0552 KOps/s | |
test_mod_wrap_and_backward[eager] | 15.2559ms | 12.0860ms | 82.7401 Ops/s | 84.1716 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.7849ms | 11.0617ms | 90.4019 Ops/s | 79.7100 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 16.6570ms | 13.7658ms | 72.6438 Ops/s | 80.1235 Ops/s | |
test_seq_add[eager] | 0.2060ms | 0.1022ms | 9.7830 KOps/s | 10.9897 KOps/s | |
test_seq_add[compile] | 0.1301ms | 59.7009μs | 16.7502 KOps/s | 17.2070 KOps/s | |
test_seq_add[compile-overhead] | 0.1303ms | 58.0787μs | 17.2180 KOps/s | 17.6799 KOps/s | |
test_seq_wrap[eager] | 0.7266ms | 0.4059ms | 2.4638 KOps/s | 2.6247 KOps/s | |
test_seq_wrap[compile] | 0.3608ms | 0.2271ms | 4.4034 KOps/s | 4.5584 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4508ms | 0.2261ms | 4.4231 KOps/s | 4.5013 KOps/s | |
test_func_call_runtime[False-eager] | 1.2951ms | 0.5542ms | 1.8043 KOps/s | 1.8847 KOps/s | |
test_func_call_runtime[False-compile] | 0.8075ms | 0.4288ms | 2.3320 KOps/s | 2.4159 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.9341ms | 0.4257ms | 2.3489 KOps/s | 2.4010 KOps/s | |
test_func_call_runtime[True-eager] | 1.2962ms | 0.7649ms | 1.3073 KOps/s | 1.3360 KOps/s | |
test_func_call_runtime[True-compile] | 0.6129ms | 0.4680ms | 2.1369 KOps/s | 2.2017 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.7240ms | 0.4644ms | 2.1533 KOps/s | 2.1759 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7844ms | 0.5485ms | 1.8230 KOps/s | 1.8909 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5357ms | 0.4248ms | 2.3541 KOps/s | 2.4140 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5335ms | 0.4255ms | 2.3504 KOps/s | 2.4011 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0895ms | 0.8946ms | 1.1178 KOps/s | 1.1463 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8773ms | 0.4946ms | 2.0220 KOps/s | 2.0781 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8579ms | 0.4910ms | 2.0367 KOps/s | 2.0752 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5576ms | 1.8885ms | 529.5116 Ops/s | 535.9459 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9208ms | 0.5171ms | 1.9340 KOps/s | 1.9433 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9938ms | 0.5199ms | 1.9235 KOps/s | 1.9582 KOps/s | |
test_distributed | 0.2587ms | 0.1261ms | 7.9281 KOps/s | 7.7970 KOps/s | |
test_tdmodule | 83.1160μs | 21.6752μs | 46.1357 KOps/s | 54.0843 KOps/s | |
test_tdmodule_dispatch | 60.7940μs | 39.3361μs | 25.4220 KOps/s | 27.7407 KOps/s | |
test_tdseq | 49.6220μs | 24.1157μs | 41.4668 KOps/s | 47.2016 KOps/s | |
test_tdseq_dispatch | 72.8160μs | 44.4486μs | 22.4979 KOps/s | 24.2681 KOps/s | |
test_instantiation_functorch | 1.6723ms | 1.5112ms | 661.7476 Ops/s | 661.0114 Ops/s | |
test_exec_functorch | 0.3285ms | 0.1804ms | 5.5437 KOps/s | 5.6221 KOps/s | |
test_exec_functional_call | 0.3686ms | 0.1722ms | 5.8060 KOps/s | 5.8988 KOps/s | |
test_exec_td_decorator | 0.5969ms | 0.2309ms | 4.3308 KOps/s | 4.4104 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9645ms | 0.6286ms | 1.5909 KOps/s | 1.5992 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9793ms | 0.6298ms | 1.5879 KOps/s | 1.5353 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8715ms | 0.5170ms | 1.9344 KOps/s | 1.9354 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8454ms | 0.5153ms | 1.9405 KOps/s | 1.9084 KOps/s | |
test_to_module_speed[True] | 1.6739ms | 1.3017ms | 768.2122 Ops/s | 779.6275 Ops/s | |
test_to_module_speed[False] | 1.8726ms | 1.2617ms | 792.5538 Ops/s | 800.6814 Ops/s | |
test_tc_init | 82.7850μs | 43.2287μs | 23.1328 KOps/s | 22.5988 KOps/s | |
test_tc_init_nested | 0.1413ms | 86.6660μs | 11.5385 KOps/s | 11.5950 KOps/s | |
test_tc_first_layer_tensor | 40.1750μs | 1.5249μs | 655.7632 KOps/s | 662.9040 KOps/s | |
test_tc_first_layer_nontensor | 33.9740μs | 4.8412μs | 206.5604 KOps/s | 212.7714 KOps/s | |
test_tc_second_layer_tensor | 48.0900μs | 2.8094μs | 355.9426 KOps/s | 349.0668 KOps/s | |
test_tc_second_layer_nontensor | 32.8710μs | 6.0532μs | 165.2020 KOps/s | 165.6716 KOps/s | |
test_unbind | 0.2313s | 12.7459ms | 78.4566 Ops/s | 82.7458 Ops/s | |
test_full_like | 8.3245ms | 7.5707ms | 132.0883 Ops/s | 133.4384 Ops/s | |
test_zeros_like | 3.6451ms | 2.9382ms | 340.3415 Ops/s | 349.9355 Ops/s | |
test_ones_like | 3.8113ms | 3.3310ms | 300.2064 Ops/s | 306.6540 Ops/s | |
test_clone | 6.4951ms | 5.3274ms | 187.7085 Ops/s | 187.9645 Ops/s | |
test_squeeze | 60.8430μs | 11.7130μs | 85.3755 KOps/s | 81.9014 KOps/s | |
test_unsqueeze | 0.2498ms | 89.2921μs | 11.1992 KOps/s | 11.4916 KOps/s | |
test_split | 0.4780ms | 0.1919ms | 5.2105 KOps/s | 5.3186 KOps/s | |
test_permute | 0.3362ms | 0.2151ms | 4.6501 KOps/s | 4.6358 KOps/s | |
test_stack | 29.9206ms | 25.2599ms | 39.5885 Ops/s | 39.5196 Ops/s | |
test_cat | 28.2657ms | 25.3406ms | 39.4623 Ops/s | 39.8864 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 32.3510μs | 11.0208μs | 90.7371 KOps/s | 102.6233 KOps/s | |
test_plain_set_stack_nested | 0.1271ms | 11.0634μs | 90.3884 KOps/s | 102.5486 KOps/s | |
test_plain_set_nested_inplace | 68.6140μs | 11.9400μs | 83.7522 KOps/s | 94.2056 KOps/s | |
test_plain_set_stack_nested_inplace | 40.5920μs | 11.8576μs | 84.3339 KOps/s | 93.7387 KOps/s | |
test_items | 28.6510μs | 2.8913μs | 345.8629 KOps/s | 340.2915 KOps/s | |
test_items_nested | 0.4478ms | 0.3573ms | 2.7986 KOps/s | 3.0722 KOps/s | |
test_items_nested_locked | 0.3838ms | 0.3582ms | 2.7918 KOps/s | 3.0634 KOps/s | |
test_items_nested_leaf | 83.3440μs | 57.8820μs | 17.2765 KOps/s | 17.1515 KOps/s | |
test_items_stack_nested | 0.3812ms | 0.3584ms | 2.7903 KOps/s | 3.0607 KOps/s | |
test_items_stack_nested_leaf | 91.6740μs | 59.0671μs | 16.9299 KOps/s | 16.8616 KOps/s | |
test_items_stack_nested_locked | 0.5298ms | 0.3556ms | 2.8123 KOps/s | 3.0201 KOps/s | |
test_keys | 72.9930μs | 3.5137μs | 284.5985 KOps/s | 285.7738 KOps/s | |
test_keys_nested | 0.1710ms | 69.9501μs | 14.2959 KOps/s | 14.2231 KOps/s | |
test_keys_nested_locked | 0.7832ms | 76.8277μs | 13.0161 KOps/s | 13.2482 KOps/s | |
test_keys_nested_leaf | 0.2255ms | 61.2667μs | 16.3221 KOps/s | 16.3226 KOps/s | |
test_keys_stack_nested | 0.2308ms | 70.2784μs | 14.2291 KOps/s | 14.0253 KOps/s | |
test_keys_stack_nested_leaf | 0.2455ms | 60.9476μs | 16.4076 KOps/s | 16.0040 KOps/s | |
test_keys_stack_nested_locked | 0.1212ms | 75.5471μs | 13.2368 KOps/s | 13.0935 KOps/s | |
test_values | 5.3202μs | 0.8391μs | 1.1918 MOps/s | 1.1493 MOps/s | |
test_values_nested | 63.2030μs | 31.2955μs | 31.9535 KOps/s | 31.9626 KOps/s | |
test_values_nested_locked | 74.9440μs | 33.0014μs | 30.3018 KOps/s | 30.4346 KOps/s | |
test_values_nested_leaf | 60.7230μs | 33.7843μs | 29.5995 KOps/s | 29.6392 KOps/s | |
test_values_stack_nested | 0.1536ms | 31.7883μs | 31.4581 KOps/s | 31.6173 KOps/s | |
test_values_stack_nested_leaf | 60.3030μs | 34.2071μs | 29.2337 KOps/s | 29.1413 KOps/s | |
test_values_stack_nested_locked | 61.6130μs | 33.1456μs | 30.1699 KOps/s | 29.9858 KOps/s | |
test_membership | 2.1921μs | 0.5076μs | 1.9699 MOps/s | 1.9741 MOps/s | |
test_membership_nested | 17.8860μs | 2.0133μs | 496.6973 KOps/s | 516.8007 KOps/s | |
test_membership_nested_leaf | 15.4905μs | 2.0045μs | 498.8892 KOps/s | 515.8343 KOps/s | |
test_membership_stacked_nested | 27.3910μs | 2.1221μs | 471.2354 KOps/s | 494.9491 KOps/s | |
test_membership_stacked_nested_leaf | 32.5310μs | 2.0890μs | 478.7032 KOps/s | 499.0756 KOps/s | |
test_membership_nested_last | 43.7220μs | 2.9564μs | 338.2494 KOps/s | 357.5477 KOps/s | |
test_membership_nested_leaf_last | 29.1510μs | 3.0137μs | 331.8203 KOps/s | 356.9817 KOps/s | |
test_membership_stacked_nested_last | 32.6220μs | 7.9285μs | 126.1272 KOps/s | 349.2016 KOps/s | |
test_membership_stacked_nested_leaf_last | 0.1859ms | 7.9129μs | 126.3751 KOps/s | 350.1396 KOps/s | |
test_nested_getleaf | 0.1751ms | 6.1286μs | 163.1685 KOps/s | 165.8826 KOps/s | |
test_nested_get | 28.4810μs | 5.8110μs | 172.0866 KOps/s | 174.3789 KOps/s | |
test_stacked_getleaf | 26.9610μs | 6.1527μs | 162.5290 KOps/s | 166.3464 KOps/s | |
test_stacked_get | 0.1700ms | 5.8403μs | 171.2240 KOps/s | 177.3222 KOps/s | |
test_nested_getitemleaf | 27.1910μs | 6.2457μs | 160.1110 KOps/s | 164.1864 KOps/s | |
test_nested_getitem | 0.2004ms | 5.9281μs | 168.6893 KOps/s | 173.0667 KOps/s | |
test_stacked_getitemleaf | 35.1520μs | 6.2730μs | 159.4146 KOps/s | 164.0452 KOps/s | |
test_stacked_getitem | 36.2710μs | 5.9144μs | 169.0790 KOps/s | 174.6795 KOps/s | |
test_lock_nested | 0.7289ms | 0.3688ms | 2.7117 KOps/s | 2.7244 KOps/s | |
test_lock_stack_nested | 0.4610ms | 0.3323ms | 3.0089 KOps/s | 3.0111 KOps/s | |
test_unlock_nested | 0.5943ms | 0.3060ms | 3.2682 KOps/s | 3.3003 KOps/s | |
test_unlock_stack_nested | 0.3951ms | 0.2710ms | 3.6907 KOps/s | 3.7013 KOps/s | |
test_flatten_speed | 0.1098ms | 74.0892μs | 13.4972 KOps/s | 13.7955 KOps/s | |
test_unflatten_speed | 0.3991ms | 0.3052ms | 3.2761 KOps/s | 3.4313 KOps/s | |
test_common_ops | 1.5938ms | 0.6274ms | 1.5939 KOps/s | 1.8176 KOps/s | |
test_creation | 0.1822ms | 1.4954μs | 668.7046 KOps/s | 673.2649 KOps/s | |
test_creation_empty | 40.9820μs | 8.2006μs | 121.9429 KOps/s | 172.4207 KOps/s | |
test_creation_nested_1 | 30.1110μs | 9.7786μs | 102.2645 KOps/s | 136.5522 KOps/s | |
test_creation_nested_2 | 35.0620μs | 12.4100μs | 80.5804 KOps/s | 102.3451 KOps/s | |
test_clone | 0.1635ms | 11.2751μs | 88.6913 KOps/s | 95.5740 KOps/s | |
test_getitem[int] | 1.9469ms | 11.0981μs | 90.1058 KOps/s | 95.4690 KOps/s | |
test_getitem[slice_int] | 0.2232ms | 21.8159μs | 45.8380 KOps/s | 50.1014 KOps/s | |
test_getitem[range] | 0.1663ms | 39.7811μs | 25.1376 KOps/s | 26.1824 KOps/s | |
test_getitem[tuple] | 0.1902ms | 18.5145μs | 54.0116 KOps/s | 56.8233 KOps/s | |
test_getitem[list] | 0.3095ms | 34.8243μs | 28.7156 KOps/s | 29.9197 KOps/s | |
test_setitem_dim[int] | 39.3220μs | 20.0882μs | 49.7804 KOps/s | 53.6439 KOps/s | |
test_setitem_dim[slice_int] | 78.3140μs | 39.5848μs | 25.2622 KOps/s | 26.6444 KOps/s | |
test_setitem_dim[range] | 0.1833ms | 55.8060μs | 17.9192 KOps/s | 18.8774 KOps/s | |
test_setitem_dim[tuple] | 53.8020μs | 33.7860μs | 29.5981 KOps/s | 31.4656 KOps/s | |
test_setitem | 0.1222ms | 15.9524μs | 62.6865 KOps/s | 72.6510 KOps/s | |
test_set | 0.1910ms | 15.3697μs | 65.0632 KOps/s | 76.9461 KOps/s | |
test_set_shared | 1.6234ms | 0.1490ms | 6.7131 KOps/s | 6.8546 KOps/s | |
test_update | 0.3624ms | 18.5618μs | 53.8740 KOps/s | 67.1948 KOps/s | |
test_update_nested | 0.1520ms | 23.5323μs | 42.4947 KOps/s | 51.4879 KOps/s | |
test_update__nested | 0.7078ms | 25.3117μs | 39.5074 KOps/s | 39.8918 KOps/s | |
test_set_nested | 0.1517ms | 16.6798μs | 59.9529 KOps/s | 70.3483 KOps/s | |
test_set_nested_new | 0.1681ms | 18.7669μs | 53.2853 KOps/s | 61.4704 KOps/s | |
test_select | 0.1646ms | 31.0731μs | 32.1822 KOps/s | 35.1610 KOps/s | |
test_select_nested | 73.1640μs | 41.7137μs | 23.9729 KOps/s | 24.0101 KOps/s | |
test_exclude_nested | 0.1620ms | 62.1346μs | 16.0941 KOps/s | 16.9019 KOps/s | |
test_empty[True] | 0.3274ms | 0.2759ms | 3.6248 KOps/s | 3.9263 KOps/s | |
test_empty[False] | 3.1981μs | 0.7509μs | 1.3317 MOps/s | 1.3607 MOps/s | |
test_to | 92.0240μs | 56.4114μs | 17.7269 KOps/s | 18.1960 KOps/s | |
test_to_nonblocking | 0.1929ms | 46.9142μs | 21.3155 KOps/s | 22.2775 KOps/s | |
test_unbind_speed | 0.2673ms | 0.2347ms | 4.2608 KOps/s | 4.3954 KOps/s | |
test_unbind_speed_stack0 | 0.2757ms | 0.2290ms | 4.3668 KOps/s | 4.3492 KOps/s | |
test_unbind_speed_stack1 | 96.4313ms | 0.6442ms | 1.5523 KOps/s | 1.5446 KOps/s | |
test_split | 97.1829ms | 1.6410ms | 609.3884 Ops/s | 643.5567 Ops/s | |
test_chunk | 0.1059s | 1.6439ms | 608.2934 Ops/s | 582.2561 Ops/s | |
test_consolidate[False-None] | 0.1001s | 2.8531ms | 350.4924 Ops/s | 383.4670 Ops/s | |
test_consolidate[default-None] | 2.1777ms | 1.8406ms | 543.2865 Ops/s | 596.8840 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.9075ms | 1.7753ms | 563.2969 Ops/s | 579.6660 Ops/s | |
test_consolidate_njt[False-None] | 7.0175ms | 6.5847ms | 151.8683 Ops/s | 151.0358 Ops/s | |
test_to[False-False-None] | 1.9468ms | 1.7026ms | 587.3289 Ops/s | 597.0286 Ops/s | |
test_to[True-False-None] | 1.6018ms | 1.3465ms | 742.6737 Ops/s | 763.7735 Ops/s | |
test_to[within-False-None] | 4.2642ms | 4.0541ms | 246.6619 Ops/s | 250.5716 Ops/s | |
test_to[True-default-None] | 5.6504ms | 5.2596ms | 190.1284 Ops/s | 196.6031 Ops/s | |
test_to_njt[False-False-None] | 7.3854ms | 7.0404ms | 142.0371 Ops/s | 143.4402 Ops/s | |
test_to_njt[True-False-None] | 5.8243ms | 5.5613ms | 179.8150 Ops/s | 182.0736 Ops/s | |
test_to_njt[within-False-None] | 12.6459ms | 12.4266ms | 80.4724 Ops/s | 81.1961 Ops/s | |
test_creation[device0] | 0.6413ms | 80.6248μs | 12.4031 KOps/s | 12.4965 KOps/s | |
test_creation_from_tensor | 0.7169ms | 83.4966μs | 11.9765 KOps/s | 12.0180 KOps/s | |
test_add_one[memmap_tensor0] | 0.3135ms | 7.3387μs | 136.2637 KOps/s | 143.9797 KOps/s | |
test_contiguous[memmap_tensor0] | 1.7886μs | 0.4091μs | 2.4444 MOps/s | 2.4409 MOps/s | |
test_stack[memmap_tensor0] | 44.4230μs | 4.7195μs | 211.8874 KOps/s | 227.7469 KOps/s | |
test_memmaptd_index | 1.6763ms | 0.2553ms | 3.9167 KOps/s | 4.0542 KOps/s | |
test_memmaptd_index_astensor | 0.5845ms | 0.3154ms | 3.1707 KOps/s | 3.2787 KOps/s | |
test_memmaptd_index_op | 1.0549ms | 0.6092ms | 1.6414 KOps/s | 1.7783 KOps/s | |
test_serialize_model | 0.1333s | 0.1316s | 7.5972 Ops/s | 7.6228 Ops/s | |
test_serialize_model_pickle | 1.3476s | 1.2190s | 0.8204 Ops/s | 0.8440 Ops/s | |
test_serialize_weights | 0.4438s | 0.1752s | 5.7068 Ops/s | 7.7231 Ops/s | |
test_serialize_weights_returnearly | 0.3608s | 55.3662ms | 18.0615 Ops/s | 13.9720 Ops/s | |
test_serialize_weights_pickle | 1.3659s | 1.2213s | 0.8188 Ops/s | 0.8374 Ops/s | |
test_reshape_pytree | 43.6520μs | 22.3801μs | 44.6825 KOps/s | 44.1551 KOps/s | |
test_reshape_td | 0.1383ms | 28.0113μs | 35.6999 KOps/s | 36.9193 KOps/s | |
test_view_pytree | 0.2149ms | 22.1816μs | 45.0823 KOps/s | 44.7668 KOps/s | |
test_view_td | 0.1242ms | 30.5297μs | 32.7550 KOps/s | 32.7994 KOps/s | |
test_unbind_pytree | 0.1375ms | 28.1553μs | 35.5173 KOps/s | 35.6106 KOps/s | |
test_unbind_td | 0.7492ms | 37.8533μs | 26.4178 KOps/s | 27.6365 KOps/s | |
test_split_pytree | 0.1611ms | 30.1535μs | 33.1636 KOps/s | 32.0652 KOps/s | |
test_split_td | 1.0475ms | 39.2681μs | 25.4659 KOps/s | 26.4282 KOps/s | |
test_add_pytree | 0.1835ms | 37.0414μs | 26.9968 KOps/s | 28.9662 KOps/s | |
test_add_td | 0.1956ms | 51.9011μs | 19.2674 KOps/s | 23.1996 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2742ms | 0.1240ms | 8.0677 KOps/s | 7.9956 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3425ms | 0.1254ms | 7.9766 KOps/s | 7.9573 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2425ms | 97.2560μs | 10.2821 KOps/s | 10.1826 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3076ms | 0.1544ms | 6.4779 KOps/s | 6.5578 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1663ms | 22.3462μs | 44.7503 KOps/s | 43.8703 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1513ms | 26.7986μs | 37.3154 KOps/s | 36.8164 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4521ms | 65.2094μs | 15.3352 KOps/s | 15.2593 KOps/s | |
test_compile_copy_nested[pytree-eager] | 79.4440μs | 49.6151μs | 20.1552 KOps/s | 19.9495 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2959ms | 0.1441ms | 6.9414 KOps/s | 6.8805 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3555ms | 0.2070ms | 4.8313 KOps/s | 4.8420 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2551ms | 0.1006ms | 9.9355 KOps/s | 10.0703 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2542ms | 53.4296μs | 18.7162 KOps/s | 19.1145 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3335ms | 0.1392ms | 7.1813 KOps/s | 7.2231 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7150ms | 0.5328ms | 1.8768 KOps/s | 1.9796 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4305ms | 0.2467ms | 4.0530 KOps/s | 4.0909 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2899ms | 0.1461ms | 6.8454 KOps/s | 6.9127 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2250ms | 64.2807μs | 15.5568 KOps/s | 16.0065 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2477ms | 0.1022ms | 9.7874 KOps/s | 10.0197 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5784ms | 0.4263ms | 2.3460 KOps/s | 2.3750 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2787ms | 0.1395ms | 7.1666 KOps/s | 7.2781 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1695ms | 19.8052μs | 50.4918 KOps/s | 55.2374 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1384ms | 26.2679μs | 38.0693 KOps/s | 37.6753 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1410ms | 70.0008μs | 14.2856 KOps/s | 14.1028 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.5440μs | 52.4496μs | 19.0659 KOps/s | 19.2475 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6980ms | 0.4595ms | 2.1764 KOps/s | 2.2117 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.1529ms | 2.7487ms | 363.8079 Ops/s | 389.4260 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5907ms | 0.4324ms | 2.3124 KOps/s | 2.2273 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.4803ms | 2.8905ms | 345.9572 Ops/s | 373.2112 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5057ms | 0.1229ms | 8.1397 KOps/s | 8.6449 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5817ms | 85.3586μs | 11.7153 KOps/s | 11.9090 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.3198ms | 0.1167ms | 8.5677 KOps/s | 8.9240 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2897ms | 77.2752μs | 12.9408 KOps/s | 14.0026 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.3490ms | 0.1150ms | 8.6981 KOps/s | 8.8835 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2868ms | 75.5755μs | 13.2318 KOps/s | 14.2393 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.3112ms | 0.1101ms | 9.0821 KOps/s | 9.2675 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1798ms | 18.4063μs | 54.3292 KOps/s | 58.5195 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2951ms | 0.1039ms | 9.6224 KOps/s | 10.3028 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.2049ms | 16.5530μs | 60.4122 KOps/s | 64.4570 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2957ms | 0.1059ms | 9.4464 KOps/s | 10.1587 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1266ms | 16.5904μs | 60.2757 KOps/s | 62.0362 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.3031ms | 0.1123ms | 8.9067 KOps/s | 9.6922 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6428ms | 17.9412μs | 55.7375 KOps/s | 60.4074 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2822ms | 0.1050ms | 9.5270 KOps/s | 10.1207 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1801ms | 16.4586μs | 60.7585 KOps/s | 63.9174 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2506ms | 98.9899μs | 10.1020 KOps/s | 9.6624 KOps/s | |
test_compile_indexing[int-pytree-eager] | 89.7740μs | 16.3562μs | 61.1388 KOps/s | 64.4265 KOps/s | |
test_mod_add[eager] | 0.1886ms | 38.0288μs | 26.2959 KOps/s | 29.1649 KOps/s | |
test_mod_add[compile] | 0.3000ms | 84.2434μs | 11.8704 KOps/s | 12.3034 KOps/s | |
test_mod_add[compile-overhead] | 0.3297ms | 0.1706ms | 5.8626 KOps/s | 5.5799 KOps/s | |
test_mod_wrap[eager] | 0.4087ms | 0.2542ms | 3.9343 KOps/s | 3.9421 KOps/s | |
test_mod_wrap[compile] | 0.4957ms | 0.3039ms | 3.2903 KOps/s | 3.3132 KOps/s | |
test_mod_wrap[compile-overhead] | 7.3821ms | 3.7932ms | 263.6262 Ops/s | 263.3723 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6639ms | 1.3813ms | 723.9353 Ops/s | 689.4583 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.6468ms | 1.3056ms | 765.9194 Ops/s | 720.6396 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3759ms | 0.9417ms | 1.0619 KOps/s | 948.4177 Ops/s | |
test_seq_add[eager] | 0.2902ms | 0.1135ms | 8.8110 KOps/s | 10.2908 KOps/s | |
test_seq_add[compile] | 0.2700ms | 90.0464μs | 11.1054 KOps/s | 10.6914 KOps/s | |
test_seq_add[compile-overhead] | 0.2796ms | 0.1317ms | 7.5922 KOps/s | 7.4414 KOps/s | |
test_seq_wrap[eager] | 0.6132ms | 0.4324ms | 2.3129 KOps/s | 2.5486 KOps/s | |
test_seq_wrap[compile] | 0.5022ms | 0.3192ms | 3.1325 KOps/s | 3.0991 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4072ms | 0.2261ms | 4.4226 KOps/s | 4.3835 KOps/s | |
test_func_call_runtime[False-eager] | 0.9685ms | 0.7764ms | 1.2881 KOps/s | 1.3058 KOps/s | |
test_func_call_runtime[False-compile] | 0.9469ms | 0.7650ms | 1.3071 KOps/s | 1.3148 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5149ms | 0.3679ms | 2.7184 KOps/s | 2.7238 KOps/s | |
test_func_call_runtime[True-eager] | 1.0742ms | 0.9205ms | 1.0864 KOps/s | 1.0834 KOps/s | |
test_func_call_runtime[True-compile] | 0.9608ms | 0.7860ms | 1.2723 KOps/s | 1.2935 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5306ms | 0.3888ms | 2.5723 KOps/s | 2.5724 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8997ms | 0.7514ms | 1.3309 KOps/s | 1.3331 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9295ms | 0.7695ms | 1.2995 KOps/s | 1.2558 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5075ms | 0.3690ms | 2.7100 KOps/s | 2.7069 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1619ms | 1.0077ms | 992.3572 Ops/s | 992.0077 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9876ms | 0.8163ms | 1.2250 KOps/s | 1.2333 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5563ms | 0.4160ms | 2.4037 KOps/s | 2.4161 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5642ms | 2.0965ms | 476.9956 Ops/s | 475.1750 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9825ms | 0.8266ms | 1.2098 KOps/s | 1.2174 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5557ms | 0.4146ms | 2.4118 KOps/s | 2.3938 KOps/s | |
test_distributed | 2.6069ms | 0.2101ms | 4.7587 KOps/s | 8.7162 KOps/s | |
test_tdmodule | 88.6540μs | 19.0874μs | 52.3905 KOps/s | 70.3082 KOps/s | |
test_tdmodule_dispatch | 0.1352ms | 35.6627μs | 28.0405 KOps/s | 36.5878 KOps/s | |
test_tdseq | 55.5430μs | 19.5942μs | 51.0354 KOps/s | 71.7430 KOps/s | |
test_tdseq_dispatch | 56.7920μs | 36.8622μs | 27.1281 KOps/s | 34.0130 KOps/s | |
test_instantiation_functorch | 1.7842ms | 1.5800ms | 632.8971 Ops/s | 633.6815 Ops/s | |
test_exec_functorch | 0.1914ms | 0.1499ms | 6.6733 KOps/s | 6.8751 KOps/s | |
test_exec_functional_call | 0.2568ms | 0.1493ms | 6.6982 KOps/s | 7.1441 KOps/s | |
test_exec_td_decorator | 0.4089ms | 0.1987ms | 5.0337 KOps/s | 5.4635 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9322ms | 0.7265ms | 1.3764 KOps/s | 1.4767 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 5.9392ms | 0.6969ms | 1.4350 KOps/s | 1.4758 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8532ms | 0.6238ms | 1.6031 KOps/s | 1.6760 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8163ms | 0.6319ms | 1.5825 KOps/s | 1.6714 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.5156ms | 19.5019ms | 51.2772 Ops/s | 52.0163 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.8522ms | 19.4626ms | 51.3806 Ops/s | 52.0076 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.5639ms | 19.2838ms | 51.8571 Ops/s | 52.5710 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.5646ms | 19.2974ms | 51.8204 Ops/s | 52.3909 Ops/s | |
test_to_module_speed[True] | 2.2049ms | 0.9396ms | 1.0643 KOps/s | 1.0741 KOps/s | |
test_to_module_speed[False] | 1.0227ms | 0.9205ms | 1.0864 KOps/s | 1.0933 KOps/s | |
test_tc_init | 0.1704ms | 36.7765μs | 27.1913 KOps/s | 31.1243 KOps/s | |
test_tc_init_nested | 0.1982ms | 74.9199μs | 13.3476 KOps/s | 15.4809 KOps/s | |
test_tc_first_layer_tensor | 5.7474μs | 0.6963μs | 1.4363 MOps/s | 1.4214 MOps/s | |
test_tc_first_layer_nontensor | 22.5520μs | 2.3363μs | 428.0364 KOps/s | 433.6499 KOps/s | |
test_tc_second_layer_tensor | 7.9855μs | 1.4339μs | 697.3783 KOps/s | 698.0164 KOps/s | |
test_tc_second_layer_nontensor | 27.2510μs | 3.0419μs | 328.7457 KOps/s | 324.3498 KOps/s | |
test_unbind | 6.8538ms | 6.6274ms | 150.8894 Ops/s | 149.4326 Ops/s | |
test_full_like | 12.3394ms | 10.0736ms | 99.2694 Ops/s | 95.5097 Ops/s | |
test_zeros_like | 5.4689ms | 4.4794ms | 223.2417 Ops/s | 223.9305 Ops/s | |
test_ones_like | 5.4531ms | 4.5296ms | 220.7699 Ops/s | 219.5411 Ops/s | |
test_clone | 8.5756ms | 7.1977ms | 138.9334 Ops/s | 139.1438 Ops/s | |
test_squeeze | 81.5040μs | 9.0951μs | 109.9490 KOps/s | 106.0246 KOps/s | |
test_unsqueeze | 0.1800ms | 70.2594μs | 14.2330 KOps/s | 14.0553 KOps/s | |
test_split | 0.2905ms | 0.1580ms | 6.3281 KOps/s | 6.2797 KOps/s | |
test_permute | 0.3722ms | 0.1824ms | 5.4838 KOps/s | 5.5752 KOps/s | |
test_stack | 53.2498ms | 52.1037ms | 19.1925 Ops/s | 19.2064 Ops/s | |
test_cat | 53.7446ms | 51.8667ms | 19.2802 Ops/s | 19.1905 Ops/s |
vmoens
added a commit
that referenced
this pull request
Nov 21, 2024
ghstack-source-id: 23f30be0b5b36329dc6dfb2d7eb303aaf87b8474 Pull Request resolved: #1098
vmoens
added a commit
that referenced
this pull request
Nov 25, 2024
ghstack-source-id: c16baa83f6e41c4afd6637f3b3739d4e5cf25f1e Pull Request resolved: #1098
vmoens
added a commit
that referenced
this pull request
Nov 25, 2024
ghstack-source-id: c16baa83f6e41c4afd6637f3b3739d4e5cf25f1e Pull Request resolved: #1098
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Refactor
Refactoring code - not a new feature
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):