[BugFix] Fix update_ KeyError when a key from source is missing in dest #1150
Merged
vmoens added a commit that referenced this pull request Dec 19, 2024
ghstack-source-id: 63013752ba61f05079cb6a60bb06312968b79ae9
Pull Request resolved: #1150
facebook-github-bot added the CLA Signed label Dec 19, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 48.4310μs | 20.9272μs | 47.7847 KOps/s | 49.7646 KOps/s | |
test_plain_set_stack_nested | 50.2540μs | 21.1283μs | 47.3298 KOps/s | 49.1133 KOps/s | |
test_plain_set_nested_inplace | 68.4580μs | 22.8945μs | 43.6786 KOps/s | 45.5770 KOps/s | |
test_plain_set_stack_nested_inplace | 79.6990μs | 22.9083μs | 43.6522 KOps/s | 45.6875 KOps/s | |
test_items | 21.9210μs | 4.1469μs | 241.1416 KOps/s | 238.1587 KOps/s | |
test_items_nested | 0.5754ms | 0.4045ms | 2.4722 KOps/s | 2.5002 KOps/s | |
test_items_nested_locked | 0.9251ms | 0.4082ms | 2.4498 KOps/s | 2.4831 KOps/s | |
test_items_nested_leaf | 0.1477ms | 76.7797μs | 13.0243 KOps/s | 12.9588 KOps/s | |
test_items_stack_nested | 0.6173ms | 0.4058ms | 2.4641 KOps/s | 2.4512 KOps/s | |
test_items_stack_nested_leaf | 0.1364ms | 76.7312μs | 13.0325 KOps/s | 12.8750 KOps/s | |
test_items_stack_nested_locked | 0.6032ms | 0.4042ms | 2.4738 KOps/s | 2.4832 KOps/s | |
test_keys | 20.8590μs | 3.9637μs | 252.2904 KOps/s | 287.4870 KOps/s | |
test_keys_nested | 0.3072ms | 0.1645ms | 6.0792 KOps/s | 6.0392 KOps/s | |
test_keys_nested_locked | 0.7032ms | 0.1700ms | 5.8824 KOps/s | 5.8111 KOps/s | |
test_keys_nested_leaf | 0.2593ms | 0.1427ms | 7.0070 KOps/s | 6.9354 KOps/s | |
test_keys_stack_nested | 0.3088ms | 0.1636ms | 6.1125 KOps/s | 6.0501 KOps/s | |
test_keys_stack_nested_leaf | 0.2356ms | 0.1417ms | 7.0561 KOps/s | 6.9116 KOps/s | |
test_keys_stack_nested_locked | 0.2662ms | 0.1688ms | 5.9237 KOps/s | 5.8090 KOps/s | |
test_values | 8.4056μs | 1.0309μs | 970.0094 KOps/s | 937.2334 KOps/s | |
test_values_nested | 0.1138ms | 62.2819μs | 16.0560 KOps/s | 15.8655 KOps/s | |
test_values_nested_locked | 0.1141ms | 62.4646μs | 16.0091 KOps/s | 14.9510 KOps/s | |
test_values_nested_leaf | 0.1226ms | 71.5943μs | 13.9676 KOps/s | 13.8431 KOps/s | |
test_values_stack_nested | 0.1195ms | 62.5943μs | 15.9759 KOps/s | 15.8162 KOps/s | |
test_values_stack_nested_leaf | 0.1323ms | 71.3668μs | 14.0121 KOps/s | 13.7596 KOps/s | |
test_values_stack_nested_locked | 0.1302ms | 62.5306μs | 15.9922 KOps/s | 15.7774 KOps/s | |
test_membership | 14.4370μs | 0.9043μs | 1.1058 MOps/s | 1.1110 MOps/s | |
test_membership_nested | 30.0960μs | 2.9638μs | 337.3996 KOps/s | 343.9988 KOps/s | |
test_membership_nested_leaf | 25.8580μs | 2.9569μs | 338.1873 KOps/s | 340.2090 KOps/s | |
test_membership_stacked_nested | 35.2560μs | 2.9296μs | 341.3414 KOps/s | 342.0619 KOps/s | |
test_membership_stacked_nested_leaf | 33.3420μs | 2.9063μs | 344.0785 KOps/s | 342.1405 KOps/s | |
test_membership_nested_last | 33.2710μs | 4.3620μs | 229.2523 KOps/s | 231.2246 KOps/s | |
test_membership_nested_leaf_last | 28.2930μs | 4.4337μs | 225.5439 KOps/s | 225.7354 KOps/s | |
test_membership_stacked_nested_last | 52.5080μs | 4.3424μs | 230.2848 KOps/s | 227.8468 KOps/s | |
test_membership_stacked_nested_leaf_last | 24.6760μs | 4.3792μs | 228.3508 KOps/s | 227.9146 KOps/s | |
test_nested_getleaf | 33.9430μs | 11.0557μs | 90.4512 KOps/s | 94.0494 KOps/s | |
test_nested_get | 46.6970μs | 10.5714μs | 94.5953 KOps/s | 99.0378 KOps/s | |
test_stacked_getleaf | 41.1760μs | 10.9389μs | 91.4168 KOps/s | 92.0304 KOps/s | |
test_stacked_get | 43.6920μs | 10.3549μs | 96.5726 KOps/s | 97.9919 KOps/s | |
test_nested_getitemleaf | 52.4590μs | 11.4828μs | 87.0869 KOps/s | 90.1002 KOps/s | |
test_nested_getitem | 38.7230μs | 10.7898μs | 92.6797 KOps/s | 95.1008 KOps/s | |
test_stacked_getitemleaf | 45.3340μs | 11.3052μs | 88.4552 KOps/s | 89.6784 KOps/s | |
test_stacked_getitem | 37.5110μs | 10.6390μs | 93.9940 KOps/s | 93.7442 KOps/s | |
test_lock_nested | 0.9675ms | 0.4571ms | 2.1877 KOps/s | 2.1758 KOps/s | |
test_lock_stack_nested | 0.8585ms | 0.4295ms | 2.3284 KOps/s | 2.3263 KOps/s | |
test_unlock_nested | 0.9443ms | 0.3732ms | 2.6795 KOps/s | 2.6101 KOps/s | |
test_unlock_stack_nested | 0.6932ms | 0.3486ms | 2.8685 KOps/s | 2.8583 KOps/s | |
test_flatten_speed | 0.1866ms | 0.1014ms | 9.8596 KOps/s | 10.1195 KOps/s | |
test_unflatten_speed | 0.7564ms | 0.5382ms | 1.8579 KOps/s | 1.8907 KOps/s | |
test_common_ops | 3.7566ms | 0.8411ms | 1.1889 KOps/s | 1.3143 KOps/s | |
test_creation | 27.9220μs | 2.5282μs | 395.5385 KOps/s | 399.7091 KOps/s | |
test_creation_empty | 37.8910μs | 12.0735μs | 82.8260 KOps/s | 98.6878 KOps/s | |
test_creation_nested_1 | 45.8460μs | 15.0740μs | 66.3393 KOps/s | 76.9227 KOps/s | |
test_creation_nested_2 | 53.6000μs | 19.5805μs | 51.0713 KOps/s | 56.8470 KOps/s | |
test_clone | 0.1182ms | 13.6908μs | 73.0420 KOps/s | 74.2290 KOps/s | |
test_getitem[int] | 1.4131ms | 12.9495μs | 77.2228 KOps/s | 76.0797 KOps/s | |
test_getitem[slice_int] | 0.1434ms | 24.5116μs | 40.7971 KOps/s | 39.5322 KOps/s | |
test_getitem[range] | 0.1771ms | 47.6233μs | 20.9981 KOps/s | 20.8434 KOps/s | |
test_getitem[tuple] | 0.1628ms | 20.3551μs | 49.1277 KOps/s | 49.3542 KOps/s | |
test_getitem[list] | 0.1649ms | 42.8678μs | 23.3276 KOps/s | 23.3083 KOps/s | |
test_setitem_dim[int] | 41.1970μs | 24.1844μs | 41.3490 KOps/s | 40.6556 KOps/s | |
test_setitem_dim[slice_int] | 95.2980μs | 51.5413μs | 19.4019 KOps/s | 19.6632 KOps/s | |
test_setitem_dim[range] | 0.1162ms | 72.4813μs | 13.7967 KOps/s | 13.5316 KOps/s | |
test_setitem_dim[tuple] | 60.3530μs | 40.1940μs | 24.8793 KOps/s | 24.6735 KOps/s | |
test_setitem | 70.3110μs | 21.0492μs | 47.5078 KOps/s | 51.0069 KOps/s | |
test_set | 0.2588ms | 20.4029μs | 49.0127 KOps/s | 52.4551 KOps/s | |
test_set_shared | 1.1640ms | 0.1701ms | 5.8795 KOps/s | 5.7168 KOps/s | |
test_update | 0.2606ms | 23.5850μs | 42.3999 KOps/s | 48.1234 KOps/s | |
test_update_nested | 0.2668ms | 33.8190μs | 29.5692 KOps/s | 32.8172 KOps/s | |
test_update__nested | 1.0981ms | 34.3070μs | 29.1486 KOps/s | 30.3114 KOps/s | |
test_set_nested | 0.3272ms | 22.7337μs | 43.9875 KOps/s | 47.1801 KOps/s | |
test_set_nested_new | 0.2720ms | 27.8588μs | 35.8953 KOps/s | 38.6736 KOps/s | |
test_select | 0.3075ms | 45.4244μs | 22.0146 KOps/s | 23.5096 KOps/s | |
test_select_nested | 0.1205ms | 63.1158μs | 15.8439 KOps/s | 15.6205 KOps/s | |
test_exclude_nested | 0.3524ms | 82.9522μs | 12.0551 KOps/s | 12.1901 KOps/s | |
test_empty[True] | 0.6015ms | 0.4121ms | 2.4267 KOps/s | 2.4278 KOps/s | |
test_empty[False] | 8.0725μs | 1.3878μs | 720.5868 KOps/s | 699.6292 KOps/s | |
test_unbind_speed | 0.3801ms | 0.2682ms | 3.7287 KOps/s | 3.6915 KOps/s | |
test_unbind_speed_stack0 | 0.4646ms | 0.2681ms | 3.7306 KOps/s | 3.7052 KOps/s | |
test_unbind_speed_stack1 | 0.1111s | 0.8233ms | 1.2146 KOps/s | 1.3488 KOps/s | |
test_split | 0.1134s | 1.7876ms | 559.4131 Ops/s | 553.8499 Ops/s | |
test_chunk | 0.1136s | 1.8021ms | 554.8938 Ops/s | 555.0341 Ops/s | |
test_consolidate_njt[False-None] | 8.9112ms | 8.1635ms | 122.4965 Ops/s | 118.7560 Ops/s | |
test_creation[device0] | 0.3197ms | 91.9915μs | 10.8706 KOps/s | 10.9239 KOps/s | |
test_creation_from_tensor | 0.2959ms | 95.3561μs | 10.4870 KOps/s | 10.3457 KOps/s | |
test_add_one[memmap_tensor0] | 0.1421ms | 4.7095μs | 212.3356 KOps/s | 212.5005 KOps/s | |
test_contiguous[memmap_tensor0] | 23.9740μs | 0.5323μs | 1.8786 MOps/s | 1.9362 MOps/s | |
test_stack[memmap_tensor0] | 0.1030ms | 3.3430μs | 299.1343 KOps/s | 295.2226 KOps/s | |
test_memmaptd_index | 1.0510ms | 0.2375ms | 4.2099 KOps/s | 4.1378 KOps/s | |
test_memmaptd_index_astensor | 0.6487ms | 0.3253ms | 3.0739 KOps/s | 3.0390 KOps/s | |
test_memmaptd_index_op | 1.0216ms | 0.5874ms | 1.7023 KOps/s | 1.8057 KOps/s | |
test_serialize_model | 0.1300s | 0.1170s | 8.5496 Ops/s | 7.1991 Ops/s | |
test_serialize_model_pickle | 0.5040s | 0.4001s | 2.4997 Ops/s | 2.5060 Ops/s | |
test_serialize_weights | 0.2514s | 0.1403s | 7.1273 Ops/s | 8.5499 Ops/s | |
test_serialize_weights_returnearly | 0.1837s | 0.1631s | 6.1325 Ops/s | 6.1970 Ops/s | |
test_serialize_weights_pickle | 1.3468s | 0.7405s | 1.3505 Ops/s | 2.5348 Ops/s | |
test_serialize_weights_filesystem | 0.1516s | 0.1441s | 6.9384 Ops/s | 6.8537 Ops/s | |
test_serialize_model_filesystem | 0.1525s | 0.1436s | 6.9649 Ops/s | 6.4401 Ops/s | |
test_reshape_pytree | 68.8790μs | 27.0051μs | 37.0301 KOps/s | 37.4539 KOps/s | |
test_reshape_td | 74.7500μs | 33.5292μs | 29.8248 KOps/s | 30.2022 KOps/s | |
test_view_pytree | 81.4320μs | 27.1970μs | 36.7687 KOps/s | 37.6621 KOps/s | |
test_view_td | 73.9980μs | 38.2424μs | 26.1490 KOps/s | 26.2793 KOps/s | |
test_unbind_pytree | 62.2670μs | 29.8474μs | 33.5038 KOps/s | 33.8897 KOps/s | |
test_unbind_td | 0.3436ms | 39.8394μs | 25.1008 KOps/s | 25.0914 KOps/s | |
test_split_pytree | 69.0390μs | 29.7276μs | 33.6388 KOps/s | 33.4278 KOps/s | |
test_split_td | 0.1126s | 59.6246μs | 16.7716 KOps/s | 22.0156 KOps/s | |
test_add_pytree | 0.1345ms | 35.9035μs | 27.8525 KOps/s | 28.6260 KOps/s | |
test_add_td | 0.1016ms | 55.1671μs | 18.1267 KOps/s | 17.8906 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1390ms | 63.0086μs | 15.8709 KOps/s | 16.0557 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4060ms | 0.1685ms | 5.9351 KOps/s | 5.9202 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1286ms | 46.9862μs | 21.2829 KOps/s | 21.7781 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.1992ms | 0.1195ms | 8.3683 KOps/s | 8.5072 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 94.0050μs | 27.0241μs | 37.0040 KOps/s | 38.8383 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1182ms | 58.8675μs | 16.9873 KOps/s | 17.1652 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1521ms | 79.4455μs | 12.5872 KOps/s | 12.9002 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1295ms | 68.0341μs | 14.6985 KOps/s | 14.7960 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1881ms | 0.1053ms | 9.4975 KOps/s | 9.5270 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4137ms | 0.2120ms | 4.7172 KOps/s | 4.6867 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1080ms | 45.5908μs | 21.9343 KOps/s | 22.3300 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4870ms | 63.7836μs | 15.6780 KOps/s | 15.5076 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2116ms | 0.1039ms | 9.6264 KOps/s | 9.7539 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3562ms | 0.2023ms | 4.9425 KOps/s | 4.9540 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4554ms | 0.2309ms | 4.3317 KOps/s | 4.2796 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2225ms | 0.1112ms | 8.9940 KOps/s | 9.5439 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2485ms | 57.9800μs | 17.2473 KOps/s | 16.7457 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1132ms | 47.2201μs | 21.1774 KOps/s | 22.1542 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6926ms | 0.1599ms | 6.2551 KOps/s | 6.2663 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1896ms | 0.1061ms | 9.4258 KOps/s | 9.3941 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 73.0470μs | 22.1060μs | 45.2365 KOps/s | 47.8726 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1793ms | 66.0165μs | 15.1477 KOps/s | 15.3421 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1600ms | 83.2329μs | 12.0145 KOps/s | 12.3004 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1293ms | 70.6030μs | 14.1637 KOps/s | 14.3144 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4317ms | 0.2118ms | 4.7220 KOps/s | 4.7984 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5223ms | 1.3036ms | 767.1353 Ops/s | 741.0378 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3016ms | 0.2056ms | 4.8629 KOps/s | 4.8094 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9608ms | 0.7677ms | 1.3026 KOps/s | 1.2833 KOps/s | |
test_compile_assign_and_add_stack[compile] | 1.0069ms | 0.4601ms | 2.1737 KOps/s | 2.1766 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9936ms | 2.7489ms | 363.7844 Ops/s | 385.1345 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1085ms | 37.3215μs | 26.7942 KOps/s | 28.1077 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.7943ms | 32.8645μs | 30.4279 KOps/s | 30.2193 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 83.9570μs | 29.8237μs | 33.5304 KOps/s | 33.6702 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 63.1380μs | 23.4219μs | 42.6951 KOps/s | 43.9636 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1035ms | 30.9098μs | 32.3522 KOps/s | 32.7069 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 66.6450μs | 23.3548μs | 42.8177 KOps/s | 44.3556 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1079ms | 52.7944μs | 18.9414 KOps/s | 19.4408 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6322ms | 20.3344μs | 49.1778 KOps/s | 48.6080 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1091ms | 45.5308μs | 21.9632 KOps/s | 22.5390 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 58.3490μs | 19.0750μs | 52.4248 KOps/s | 53.1075 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1295ms | 45.5132μs | 21.9717 KOps/s | 22.3126 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 55.5140μs | 19.1121μs | 52.3229 KOps/s | 53.1901 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1317ms | 54.0525μs | 18.5005 KOps/s | 19.0704 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.1444ms | 20.1737μs | 49.5696 KOps/s | 49.0113 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2088ms | 45.8006μs | 21.8338 KOps/s | 22.3785 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1087ms | 18.7700μs | 53.2765 KOps/s | 53.1362 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1359ms | 45.5172μs | 21.9697 KOps/s | 22.5205 KOps/s | |
test_compile_indexing[int-pytree-eager] | 56.8760μs | 19.0418μs | 52.5161 KOps/s | 52.6291 KOps/s | |
test_mod_add[eager] | 95.8090μs | 34.8679μs | 28.6797 KOps/s | 29.5124 KOps/s | |
test_mod_add[compile] | 0.1275ms | 48.2863μs | 20.7098 KOps/s | 19.6606 KOps/s | |
test_mod_add[compile-overhead] | 0.1068ms | 47.8485μs | 20.8993 KOps/s | 20.2374 KOps/s | |
test_mod_wrap[eager] | 0.4319ms | 0.2291ms | 4.3646 KOps/s | 4.4048 KOps/s | |
test_mod_wrap[compile] | 0.3069ms | 0.2089ms | 4.7875 KOps/s | 4.7148 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4091ms | 0.2061ms | 4.8511 KOps/s | 4.8312 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.4814ms | 11.0578ms | 90.4342 Ops/s | 83.4922 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.5256ms | 11.0239ms | 90.7120 Ops/s | 75.3529 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.4975ms | 10.9688ms | 91.1678 Ops/s | 73.4791 Ops/s | |
test_seq_add[eager] | 0.2016ms | 0.1137ms | 8.7957 KOps/s | 8.5493 KOps/s | |
test_seq_add[compile] | 0.1582ms | 63.1604μs | 15.8327 KOps/s | 15.8158 KOps/s | |
test_seq_add[compile-overhead] | 0.1454ms | 63.4172μs | 15.7686 KOps/s | 15.9624 KOps/s | |
test_seq_wrap[eager] | 0.7599ms | 0.4589ms | 2.1790 KOps/s | 2.2284 KOps/s | |
test_seq_wrap[compile] | 0.4224ms | 0.2333ms | 4.2859 KOps/s | 4.2895 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4017ms | 0.2321ms | 4.3077 KOps/s | 4.2897 KOps/s | |
test_func_call_runtime[False-eager] | 0.9028ms | 0.5480ms | 1.8248 KOps/s | 1.8284 KOps/s | |
test_func_call_runtime[False-compile] | 0.8165ms | 0.4320ms | 2.3147 KOps/s | 2.3748 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5850ms | 0.4264ms | 2.3451 KOps/s | 2.3579 KOps/s | |
test_func_call_runtime[True-eager] | 1.0026ms | 0.7692ms | 1.3000 KOps/s | 1.3201 KOps/s | |
test_func_call_runtime[True-compile] | 0.7263ms | 0.4686ms | 2.1339 KOps/s | 2.1496 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8287ms | 0.4682ms | 2.1357 KOps/s | 2.1463 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7410ms | 0.5441ms | 1.8379 KOps/s | 1.8547 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6656ms | 0.4267ms | 2.3434 KOps/s | 2.3498 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5398ms | 0.4257ms | 2.3490 KOps/s | 2.3604 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1362ms | 0.9103ms | 1.0986 KOps/s | 1.0936 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8880ms | 0.4976ms | 2.0095 KOps/s | 2.0333 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6253ms | 0.4957ms | 2.0175 KOps/s | 2.0480 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5601ms | 1.9137ms | 522.5509 Ops/s | 513.2470 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9609ms | 0.5262ms | 1.9006 KOps/s | 1.9149 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7500ms | 0.5256ms | 1.9027 KOps/s | 1.9338 KOps/s | |
test_distributed | 0.3386ms | 0.1263ms | 7.9146 KOps/s | 7.8007 KOps/s | |
test_tdmodule | 85.5800μs | 26.4302μs | 37.8355 KOps/s | 38.2026 KOps/s | |
test_tdmodule_dispatch | 67.8270μs | 47.9876μs | 20.8387 KOps/s | 21.3724 KOps/s | |
test_tdseq | 48.5410μs | 29.2665μs | 34.1688 KOps/s | 35.5751 KOps/s | |
test_tdseq_dispatch | 81.1610μs | 54.5949μs | 18.3167 KOps/s | 19.0295 KOps/s | |
test_instantiation_functorch | 2.2463ms | 1.5445ms | 647.4766 Ops/s | 647.1322 Ops/s | |
test_exec_functorch | 0.4060ms | 0.1823ms | 5.4863 KOps/s | 5.6643 KOps/s | |
test_exec_functional_call | 0.3246ms | 0.1739ms | 5.7489 KOps/s | 5.9216 KOps/s | |
test_exec_td_decorator | 0.4945ms | 0.2359ms | 4.2387 KOps/s | 4.1965 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.0131ms | 0.6682ms | 1.4965 KOps/s | 1.4964 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9682ms | 0.6699ms | 1.4927 KOps/s | 1.5003 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.9573ms | 0.5487ms | 1.8225 KOps/s | 1.8480 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 1.1514ms | 0.5519ms | 1.8119 KOps/s | 1.8566 KOps/s | |
test_to_module_speed[True] | 2.2162ms | 1.3619ms | 734.2706 Ops/s | 737.2073 Ops/s | |
test_to_module_speed[False] | 1.7896ms | 1.3163ms | 759.6971 Ops/s | 755.5638 Ops/s | |
test_tc_init | 85.7100μs | 48.6871μs | 20.5393 KOps/s | 20.6667 KOps/s | |
test_tc_init_nested | 0.2582ms | 96.4770μs | 10.3652 KOps/s | 10.2852 KOps/s | |
test_tc_first_layer_tensor | 29.1850μs | 1.5055μs | 664.2414 KOps/s | 644.5369 KOps/s | |
test_tc_first_layer_nontensor | 57.8850μs | 4.6275μs | 216.0992 KOps/s | 210.2918 KOps/s | |
test_tc_second_layer_tensor | 25.7680μs | 2.8244μs | 354.0604 KOps/s | 349.0051 KOps/s | |
test_tc_second_layer_nontensor | 33.4630μs | 6.0225μs | 166.0439 KOps/s | 163.2802 KOps/s | |
test_unbind | 0.2256s | 14.3516ms | 69.6785 Ops/s | 76.5862 Ops/s | |
test_full_like | 10.1407ms | 8.3595ms | 119.6241 Ops/s | 138.5599 Ops/s | |
test_zeros_like | 3.8951ms | 2.9655ms | 337.2077 Ops/s | 345.1981 Ops/s | |
test_ones_like | 4.1348ms | 3.3978ms | 294.3115 Ops/s | 288.2015 Ops/s | |
test_clone | 6.2949ms | 5.1030ms | 195.9628 Ops/s | 182.9463 Ops/s | |
test_squeeze | 86.0710μs | 12.3447μs | 81.0061 KOps/s | 79.8689 KOps/s | |
test_unsqueeze | 0.1709ms | 92.2111μs | 10.8447 KOps/s | 10.6510 KOps/s | |
test_split | 0.3615ms | 0.2021ms | 4.9481 KOps/s | 5.0020 KOps/s | |
test_permute | 0.3005ms | 0.2048ms | 4.8826 KOps/s | 4.7976 KOps/s | |
test_stack | 28.0541ms | 24.3841ms | 41.0103 Ops/s | 40.3021 Ops/s | |
test_cat | 28.7607ms | 24.5967ms | 40.6559 Ops/s | 40.4931 Ops/s |
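
In the rows above, the Ops value appears to be the reciprocal of the Mean time, and the Change column is empty in this capture. The following is a minimal sketch, not the benchmark bot's own code, of how a single row could be parsed to check that relationship and to estimate the change against Repo HEAD. The helper names (`parse_time`, `parse_ops`, `parse_row`, `relative_change`) and the relative-difference formula are assumptions made for illustration.

```python
# Hypothetical helpers for reading one row of the benchmark table.
# Nothing here is taken from the benchmark tooling itself.

# Both the Greek mu (U+03BC) and the micro sign (U+00B5) are accepted.
TIME_UNITS = {"s": 1.0, "ms": 1e-3, "\u03bcs": 1e-6, "\u00b5s": 1e-6, "ns": 1e-9}
OPS_UNITS = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_time(text: str) -> float:
    """Convert a cell such as '0.2606ms' to seconds."""
    text = text.strip()
    # Try two-character units before the bare 's' suffix.
    for unit in sorted(TIME_UNITS, key=len, reverse=True):
        if text.endswith(unit):
            return float(text[: -len(unit)]) * TIME_UNITS[unit]
    raise ValueError(f"unrecognized time cell: {text!r}")


def parse_ops(text: str) -> float:
    """Convert a cell such as '42.3999 KOps/s' to operations per second."""
    value, unit = text.split()
    return float(value) * OPS_UNITS[unit]


def parse_row(line: str) -> dict:
    """Split a pipe-delimited row into name, timings and throughput."""
    cells = [cell.strip() for cell in line.split("|")]
    name, max_time, mean_time, ops, ops_head = cells[:5]
    return {
        "name": name,
        "max": parse_time(max_time),
        "mean": parse_time(mean_time),
        "ops": parse_ops(ops),
        "ops_head": parse_ops(ops_head),
    }


def relative_change(row: dict) -> float:
    """Assumed definition: (Ops - Ops on HEAD) / Ops on HEAD."""
    return (row["ops"] - row["ops_head"]) / row["ops_head"]


row = parse_row(
    "test_update | 0.2606ms | 23.5850\u03bcs | 42.3999 KOps/s | 48.1234 KOps/s | |"
)
print(f"{row['name']}: 1/mean = {1.0 / row['mean']:.1f} ops/s, "
      f"reported = {row['ops']:.1f} ops/s")
print(f"change vs HEAD: {relative_change(row):+.1%}")
```

For the `test_update` row, the reciprocal of the mean matches the reported Ops value to within rounding, and under the assumed formula the change against HEAD comes out at roughly -11.9%.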

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 32.6910μs | 12.8856μs | 77.6063 KOps/s | 76.0318 KOps/s | |
test_plain_set_stack_nested | 40.0100μs | 13.0196μs | 76.8070 KOps/s | 74.6865 KOps/s | |
test_plain_set_nested_inplace | 54.0610μs | 14.1093μs | 70.8751 KOps/s | 70.2829 KOps/s | |
test_plain_set_stack_nested_inplace | 50.6210μs | 14.0040μs | 71.4081 KOps/s | 71.2742 KOps/s | |
test_items | 35.1300μs | 2.8635μs | 349.2225 KOps/s | 342.5167 KOps/s | |
test_items_nested | 0.4203ms | 0.3566ms | 2.8041 KOps/s | 2.8043 KOps/s | |
test_items_nested_locked | 0.4232ms | 0.3573ms | 2.7988 KOps/s | 2.7785 KOps/s | |
test_items_nested_leaf | 89.2520μs | 58.5798μs | 17.0707 KOps/s | 17.1214 KOps/s | |
test_items_stack_nested | 0.4299ms | 0.3597ms | 2.7803 KOps/s | 2.7659 KOps/s | |
test_items_stack_nested_leaf | 87.6820μs | 60.2014μs | 16.6109 KOps/s | 17.1346 KOps/s | |
test_items_stack_nested_locked | 0.4679ms | 0.3574ms | 2.7983 KOps/s | 2.7882 KOps/s | |
test_keys | 30.7910μs | 3.4656μs | 288.5537 KOps/s | 270.1796 KOps/s | |
test_keys_nested | 0.1217ms | 81.0563μs | 12.3371 KOps/s | 12.3618 KOps/s | |
test_keys_nested_locked | 0.7810ms | 86.3647μs | 11.5788 KOps/s | 11.5156 KOps/s | |
test_keys_nested_leaf | 0.1070ms | 71.4106μs | 14.0035 KOps/s | 13.9427 KOps/s | |
test_keys_stack_nested | 0.1295ms | 82.1291μs | 12.1760 KOps/s | 12.4628 KOps/s | |
test_keys_stack_nested_leaf | 0.1326ms | 73.3370μs | 13.6357 KOps/s | 13.9142 KOps/s | |
test_keys_stack_nested_locked | 0.1233ms | 87.6400μs | 11.4103 KOps/s | 11.5040 KOps/s | |
test_values | 5.3967μs | 0.8503μs | 1.1760 MOps/s | 1.1807 MOps/s | |
test_values_nested | 85.8620μs | 34.6611μs | 28.8507 KOps/s | 29.4519 KOps/s | |
test_values_nested_locked | 78.6120μs | 36.0016μs | 27.7765 KOps/s | 27.9670 KOps/s | |
test_values_nested_leaf | 76.0710μs | 38.7169μs | 25.8285 KOps/s | 25.8738 KOps/s | |
test_values_stack_nested | 84.9110μs | 35.0494μs | 28.5312 KOps/s | 29.2866 KOps/s | |
test_values_stack_nested_leaf | 64.2410μs | 39.2835μs | 25.4560 KOps/s | 25.6782 KOps/s | |
test_values_stack_nested_locked | 81.6420μs | 36.3632μs | 27.5004 KOps/s | 27.7917 KOps/s | |
test_membership | 2.0755μs | 0.5142μs | 1.9449 MOps/s | 1.9559 MOps/s | |
test_membership_nested | 30.5610μs | 2.0783μs | 481.1695 KOps/s | 494.3043 KOps/s | |
test_membership_nested_leaf | 18.9605μs | 1.9937μs | 501.5807 KOps/s | 494.3853 KOps/s | |
test_membership_stacked_nested | 25.4700μs | 2.0928μs | 477.8373 KOps/s | 476.6855 KOps/s | |
test_membership_stacked_nested_leaf | 28.5710μs | 2.0562μs | 486.3258 KOps/s | 482.7436 KOps/s | |
test_membership_nested_last | 30.9900μs | 3.0375μs | 329.2147 KOps/s | 320.3901 KOps/s | |
test_membership_nested_leaf_last | 28.6100μs | 3.0204μs | 331.0783 KOps/s | 324.3136 KOps/s | |
test_membership_stacked_nested_last | 26.5410μs | 3.5849μs | 278.9455 KOps/s | 320.0903 KOps/s | |
test_membership_stacked_nested_leaf_last | 0.1100ms | 3.5783μs | 279.4644 KOps/s | 320.8859 KOps/s | |
test_nested_getleaf | 30.0700μs | 6.0715μs | 164.7031 KOps/s | 160.8461 KOps/s | |
test_nested_get | 48.8100μs | 5.8271μs | 171.6106 KOps/s | 171.2119 KOps/s | |
test_stacked_getleaf | 37.7910μs | 6.2353μs | 160.3760 KOps/s | 162.4153 KOps/s | |
test_stacked_get | 33.2000μs | 5.8312μs | 171.4911 KOps/s | 170.7976 KOps/s | |
test_nested_getitemleaf | 0.7335ms | 6.1840μs | 161.7083 KOps/s | 158.8203 KOps/s | |
test_nested_getitem | 30.7500μs | 5.8547μs | 170.8036 KOps/s | 166.2618 KOps/s | |
test_stacked_getitemleaf | 34.9300μs | 6.3087μs | 158.5111 KOps/s | 159.7898 KOps/s | |
test_stacked_getitem | 29.9410μs | 5.9244μs | 168.7924 KOps/s | 168.0495 KOps/s | |
test_lock_nested | 1.0319ms | 0.3734ms | 2.6781 KOps/s | 2.5387 KOps/s | |
test_lock_stack_nested | 0.4164ms | 0.3436ms | 2.9103 KOps/s | 2.7966 KOps/s | |
test_unlock_nested | 0.6414ms | 0.3169ms | 3.1558 KOps/s | 3.0608 KOps/s | |
test_unlock_stack_nested | 0.3337ms | 0.2827ms | 3.5379 KOps/s | 3.3638 KOps/s | |
test_flatten_speed | 0.1161ms | 75.4816μs | 13.2483 KOps/s | 13.2651 KOps/s | |
test_unflatten_speed | 0.3914ms | 0.3216ms | 3.1090 KOps/s | 3.1905 KOps/s | |
test_common_ops | 1.5728ms | 0.6372ms | 1.5694 KOps/s | 1.5211 KOps/s | |
test_creation | 0.1176ms | 1.7672μs | 565.8684 KOps/s | 561.5147 KOps/s | |
test_creation_empty | 30.2910μs | 9.6758μs | 103.3509 KOps/s | 98.9972 KOps/s | |
test_creation_nested_1 | 46.0010μs | 11.2601μs | 88.8089 KOps/s | 85.1835 KOps/s | |
test_creation_nested_2 | 43.4410μs | 13.9608μs | 71.6290 KOps/s | 69.4288 KOps/s | |
test_clone | 0.1121ms | 11.1969μs | 89.3106 KOps/s | 85.5929 KOps/s | |
test_getitem[int] | 1.4080ms | 10.6344μs | 94.0346 KOps/s | 88.4853 KOps/s | |
test_getitem[slice_int] | 0.1122ms | 20.8806μs | 47.8913 KOps/s | 45.3637 KOps/s | |
test_getitem[range] | 0.1315ms | 38.7675μs | 25.7948 KOps/s | 25.3587 KOps/s | |
test_getitem[tuple] | 0.1041ms | 18.3028μs | 54.6364 KOps/s | 52.5856 KOps/s | |
test_getitem[list] | 0.1297ms | 33.7857μs | 29.5983 KOps/s | 28.4675 KOps/s | |
test_setitem_dim[int] | 41.4710μs | 19.2854μs | 51.8526 KOps/s | 48.8879 KOps/s | |
test_setitem_dim[slice_int] | 61.1610μs | 39.3255μs | 25.4288 KOps/s | 24.8822 KOps/s | |
test_setitem_dim[range] | 78.3920μs | 54.6153μs | 18.3099 KOps/s | 18.1021 KOps/s | |
test_setitem_dim[tuple] | 64.3310μs | 33.2625μs | 30.0639 KOps/s | 29.8930 KOps/s | |
test_setitem | 48.2510μs | 16.3623μs | 61.1162 KOps/s | 57.8380 KOps/s | |
test_set | 41.7610μs | 15.9700μs | 62.6173 KOps/s | 59.1529 KOps/s | |
test_set_shared | 1.5316ms | 0.1510ms | 6.6216 KOps/s | 6.5601 KOps/s | |
test_update | 0.3481ms | 19.3415μs | 51.7024 KOps/s | 48.4434 KOps/s | |
test_update_nested | 75.5810μs | 24.8110μs | 40.3047 KOps/s | 38.5607 KOps/s | |
test_update__nested | 0.5222ms | 26.0655μs | 38.3649 KOps/s | 36.9151 KOps/s | |
test_set_nested | 0.1186ms | 17.4789μs | 57.2118 KOps/s | 55.2037 KOps/s | |
test_set_nested_new | 0.1200ms | 19.5805μs | 51.0712 KOps/s | 49.1904 KOps/s | |
test_select | 62.0210μs | 30.6576μs | 32.6183 KOps/s | 31.3142 KOps/s | |
test_select_nested | 69.3310μs | 43.6214μs | 22.9245 KOps/s | 22.9630 KOps/s | |
test_exclude_nested | 92.8720μs | 62.6179μs | 15.9699 KOps/s | 15.8566 KOps/s | |
test_empty[True] | 0.3349ms | 0.2851ms | 3.5079 KOps/s | 3.5105 KOps/s | |
test_empty[False] | 3.1190μs | 0.8432μs | 1.1860 MOps/s | 1.2099 MOps/s | |
test_to | 86.4210μs | 57.1080μs | 17.5107 KOps/s | 17.6137 KOps/s | |
test_to_nonblocking | 86.4520μs | 49.0194μs | 20.4001 KOps/s | 20.7306 KOps/s | |
test_unbind_speed | 1.4404ms | 0.2399ms | 4.1680 KOps/s | 4.0448 KOps/s | |
test_unbind_speed_stack0 | 0.3215ms | 0.2361ms | 4.2361 KOps/s | 4.0070 KOps/s | |
test_unbind_speed_stack1 | 92.3967ms | 0.6664ms | 1.5005 KOps/s | 1.5747 KOps/s | |
test_split | 93.3719ms | 1.5996ms | 625.1454 Ops/s | 612.1593 Ops/s | |
test_chunk | 93.5363ms | 1.5971ms | 626.1354 Ops/s | 615.0765 Ops/s | |
test_consolidate[False-None] | 96.0181ms | 2.9518ms | 338.7792 Ops/s | 334.5206 Ops/s | |
test_consolidate[default-None] | 1.8352ms | 1.7386ms | 575.1758 Ops/s | 574.5203 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8715ms | 1.7840ms | 560.5535 Ops/s | 561.5731 Ops/s | |
test_consolidate_njt[False-None] | 7.1226ms | 6.8482ms | 146.0229 Ops/s | 147.6372 Ops/s | |
test_to[False-False-None] | 1.8831ms | 1.7689ms | 565.3390 Ops/s | 572.7462 Ops/s | |
test_to[True-False-None] | 1.6188ms | 1.3465ms | 742.6754 Ops/s | 723.0302 Ops/s | |
test_to[within-False-None] | 4.4645ms | 4.1829ms | 239.0697 Ops/s | 236.3845 Ops/s | |
test_to[True-default-None] | 5.6859ms | 5.4208ms | 184.4738 Ops/s | 182.0049 Ops/s | |
test_to_njt[False-False-None] | 7.2827ms | 7.1096ms | 140.6557 Ops/s | 141.6759 Ops/s | |
test_to_njt[True-False-None] | 6.1666ms | 5.7454ms | 174.0520 Ops/s | 176.2351 Ops/s | |
test_to_njt[within-False-None] | 12.7954ms | 12.6358ms | 79.1403 Ops/s | 79.3883 Ops/s | |
test_creation[device0] | 0.3876ms | 79.6406μs | 12.5564 KOps/s | 12.3589 KOps/s | |
test_creation_from_tensor | 0.8761ms | 84.1111μs | 11.8890 KOps/s | 11.7161 KOps/s | |
test_add_one[memmap_tensor0] | 0.4159ms | 7.2505μs | 137.9213 KOps/s | 135.7426 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8260μs | 0.4502μs | 2.2213 MOps/s | 2.2189 MOps/s | |
test_stack[memmap_tensor0] | 40.3410μs | 4.4545μs | 224.4908 KOps/s | 217.8628 KOps/s | |
test_memmaptd_index | 1.5395ms | 0.2571ms | 3.8896 KOps/s | 3.6992 KOps/s | |
test_memmaptd_index_astensor | 0.6071ms | 0.3173ms | 3.1520 KOps/s | 2.9891 KOps/s | |
test_memmaptd_index_op | 1.0983ms | 0.6271ms | 1.5946 KOps/s | 1.5342 KOps/s | |
test_serialize_model | 0.1326s | 0.1309s | 7.6398 Ops/s | 7.6206 Ops/s | |
test_serialize_model_pickle | 1.3907s | 1.2212s | 0.8189 Ops/s | 0.8255 Ops/s | |
test_serialize_weights | 0.1330s | 0.1308s | 7.6439 Ops/s | 7.6734 Ops/s | |
test_serialize_weights_returnearly | 0.4391s | 71.0460ms | 14.0754 Ops/s | 22.6659 Ops/s | |
test_serialize_weights_pickle | 1.3792s | 1.1954s | 0.8366 Ops/s | 0.8407 Ops/s | |
test_reshape_pytree | 57.4710μs | 22.1407μs | 45.1657 KOps/s | 43.4166 KOps/s | |
test_reshape_td | 66.4210μs | 27.3618μs | 36.5473 KOps/s | 32.5330 KOps/s | |
test_view_pytree | 54.3510μs | 22.1557μs | 45.1351 KOps/s | 44.6806 KOps/s | |
test_view_td | 72.8810μs | 31.4069μs | 31.8401 KOps/s | 30.2525 KOps/s | |
test_unbind_pytree | 62.5410μs | 27.7729μs | 36.0063 KOps/s | 35.0576 KOps/s | |
test_unbind_td | 0.7502ms | 36.3538μs | 27.5074 KOps/s | 25.5968 KOps/s | |
test_split_pytree | 64.4020μs | 29.7939μs | 33.5639 KOps/s | 32.1934 KOps/s | |
test_split_td | 0.9107ms | 38.9325μs | 25.6855 KOps/s | 25.2203 KOps/s | |
test_add_pytree | 77.7620μs | 36.7586μs | 27.2046 KOps/s | 27.3976 KOps/s | |
test_add_td | 0.2141ms | 52.5324μs | 19.0359 KOps/s | 18.2884 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1755ms | 0.1212ms | 8.2535 KOps/s | 7.9497 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2192ms | 0.1298ms | 7.7016 KOps/s | 7.6450 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1995ms | 94.5794μs | 10.5731 KOps/s | 9.9414 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.5013ms | 0.1555ms | 6.4300 KOps/s | 6.5189 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1052ms | 23.4253μs | 42.6888 KOps/s | 42.5635 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 56.1510μs | 30.1035μs | 33.2187 KOps/s | 32.8129 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3376ms | 64.7860μs | 15.4354 KOps/s | 15.1000 KOps/s | |
test_compile_copy_nested[pytree-eager] | 77.3910μs | 49.7133μs | 20.1153 KOps/s | 19.8763 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1823ms | 0.1462ms | 6.8381 KOps/s | 6.8613 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3119ms | 0.2157ms | 4.6362 KOps/s | 4.6685 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1352ms | 98.0172μs | 10.2023 KOps/s | 9.2357 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1092ms | 54.7546μs | 18.2633 KOps/s | 19.0186 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2251ms | 0.1348ms | 7.4168 KOps/s | 7.2288 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5517ms | 0.5040ms | 1.9840 KOps/s | 2.0169 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3649ms | 0.2581ms | 3.8742 KOps/s | 3.8544 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1849ms | 0.1433ms | 6.9778 KOps/s | 6.8822 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1476ms | 64.0829μs | 15.6048 KOps/s | 15.3532 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1564ms | 97.9896μs | 10.2052 KOps/s | 9.9396 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5018ms | 0.4221ms | 2.3693 KOps/s | 2.4338 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 1.5782ms | 0.1382ms | 7.2344 KOps/s | 7.3253 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 59.2810μs | 19.7316μs | 50.6801 KOps/s | 55.0404 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 67.2410μs | 31.4424μs | 31.8041 KOps/s | 31.6738 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2408ms | 70.7730μs | 14.1297 KOps/s | 14.2741 KOps/s | |
test_compile_copy_flat[pytree-eager] | 94.7220μs | 51.6299μs | 19.3686 KOps/s | 19.4373 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6699ms | 0.4051ms | 2.4686 KOps/s | 2.2089 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9459ms | 2.7201ms | 367.6282 Ops/s | 373.1937 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5998ms | 0.3806ms | 2.6276 KOps/s | 2.2282 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8896ms | 2.7447ms | 364.3332 Ops/s | 353.8397 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2365ms | 0.1178ms | 8.4893 KOps/s | 8.1204 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5718ms | 84.1162μs | 11.8883 KOps/s | 11.7885 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1737ms | 0.1125ms | 8.8875 KOps/s | 8.8019 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1307ms | 70.2830μs | 14.2282 KOps/s | 14.4112 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1768ms | 0.1158ms | 8.6364 KOps/s | 8.9834 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2086ms | 70.1565μs | 14.2538 KOps/s | 13.9633 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1549ms | 0.1027ms | 9.7338 KOps/s | 9.4689 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1445ms | 17.2215μs | 58.0668 KOps/s | 55.0152 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1418ms | 95.9037μs | 10.4271 KOps/s | 9.7580 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 52.0910μs | 15.7401μs | 63.5321 KOps/s | 61.9711 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1511ms | 96.9084μs | 10.3190 KOps/s | 9.8690 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 54.6300μs | 15.8151μs | 63.2308 KOps/s | 61.4268 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1655ms | 0.1026ms | 9.7434 KOps/s | 9.6302 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6010ms | 17.2680μs | 57.9105 KOps/s | 55.4305 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1462ms | 97.1754μs | 10.2907 KOps/s | 10.0493 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 60.3210μs | 15.7664μs | 63.4260 KOps/s | 61.6969 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1937ms | 97.8095μs | 10.2240 KOps/s | 10.0678 KOps/s | |
test_compile_indexing[int-pytree-eager] | 43.4200μs | 15.8206μs | 63.2085 KOps/s | 61.6397 KOps/s | |
test_mod_add[eager] | 89.9210μs | 39.7850μs | 25.1351 KOps/s | 24.6303 KOps/s | |
test_mod_add[compile] | 0.1394ms | 81.1664μs | 12.3204 KOps/s | 12.2616 KOps/s | |
test_mod_add[compile-overhead] | 0.3267ms | 0.1674ms | 5.9736 KOps/s | 5.5667 KOps/s | |
test_mod_wrap[eager] | 0.3391ms | 0.2542ms | 3.9335 KOps/s | 3.9274 KOps/s | |
test_mod_wrap[compile] | 0.3891ms | 0.2913ms | 3.4327 KOps/s | 3.4401 KOps/s | |
test_mod_wrap[compile-overhead] | 7.2913ms | 3.7605ms | 265.9244 Ops/s | 272.2437 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.7976ms | 1.3741ms | 727.7638 Ops/s | 682.3845 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3993ms | 1.2826ms | 779.6945 Ops/s | 712.6672 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4041ms | 0.9469ms | 1.0561 KOps/s | 948.6836 Ops/s | |
test_seq_add[eager] | 0.1669ms | 0.1178ms | 8.4906 KOps/s | 8.1850 KOps/s | |
test_seq_add[compile] | 0.1353ms | 89.1401μs | 11.2183 KOps/s | 11.2440 KOps/s | |
test_seq_add[compile-overhead] | 0.1731ms | 0.1289ms | 7.7575 KOps/s | 7.6193 KOps/s | |
test_seq_wrap[eager] | 0.5358ms | 0.4243ms | 2.3567 KOps/s | 2.2830 KOps/s | |
test_seq_wrap[compile] | 0.4008ms | 0.3041ms | 3.2886 KOps/s | 3.2896 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2851ms | 0.2279ms | 4.3884 KOps/s | 4.3583 KOps/s | |
test_func_call_runtime[False-eager] | 0.8426ms | 0.7428ms | 1.3463 KOps/s | 1.3254 KOps/s | |
test_func_call_runtime[False-compile] | 0.8663ms | 0.7522ms | 1.3294 KOps/s | 1.3082 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4232ms | 0.3754ms | 2.6641 KOps/s | 2.6561 KOps/s | |
test_func_call_runtime[True-eager] | 1.0488ms | 0.9424ms | 1.0611 KOps/s | 1.0811 KOps/s | |
test_func_call_runtime[True-compile] | 0.8460ms | 0.7778ms | 1.2856 KOps/s | 1.2835 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4670ms | 0.3947ms | 2.5333 KOps/s | 2.5241 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8710ms | 0.7434ms | 1.3451 KOps/s | 1.3351 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8321ms | 0.7586ms | 1.3182 KOps/s | 1.3111 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4494ms | 0.3779ms | 2.6465 KOps/s | 2.6316 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.7112ms | 1.0232ms | 977.3596 Ops/s | 976.4847 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8784ms | 0.8074ms | 1.2385 KOps/s | 1.2342 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4969ms | 0.4202ms | 2.3796 KOps/s | 2.3518 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5328ms | 2.1110ms | 473.7064 Ops/s | 470.1542 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9189ms | 0.8292ms | 1.2059 KOps/s | 1.2040 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5024ms | 0.4231ms | 2.3633 KOps/s | 2.3478 KOps/s | |
test_distributed | 6.1174ms | 0.1683ms | 5.9432 KOps/s | 8.7768 KOps/s | |
test_tdmodule | 82.1010μs | 20.2718μs | 49.3297 KOps/s | 45.5214 KOps/s | |
test_tdmodule_dispatch | 56.9310μs | 37.1816μs | 26.8950 KOps/s | 26.3462 KOps/s | |
test_tdseq | 41.1700μs | 21.5465μs | 46.4113 KOps/s | 44.9875 KOps/s | |
test_tdseq_dispatch | 62.3820μs | 40.1423μs | 24.9114 KOps/s | 23.9652 KOps/s | |
test_instantiation_functorch | 1.9474ms | 1.5767ms | 634.2372 Ops/s | 619.9258 Ops/s | |
test_exec_functorch | 0.1864ms | 0.1475ms | 6.7804 KOps/s | 6.5549 KOps/s | |
test_exec_functional_call | 0.5301ms | 0.1411ms | 7.0873 KOps/s | 6.8340 KOps/s | |
test_exec_td_decorator | 0.5885ms | 0.1876ms | 5.3295 KOps/s | 5.1378 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.0921ms | 0.7061ms | 1.4161 KOps/s | 1.4293 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1149ms | 0.7055ms | 1.4175 KOps/s | 1.4348 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 1.0162ms | 0.6267ms | 1.5957 KOps/s | 1.6561 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 1.0272ms | 0.6242ms | 1.6020 KOps/s | 1.6559 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.6500ms | 19.5944ms | 51.0350 Ops/s | 51.6319 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.3280ms | 19.4774ms | 51.3416 Ops/s | 51.5592 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.8987ms | 19.4065ms | 51.5290 Ops/s | 52.0233 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.9493ms | 19.4021ms | 51.5409 Ops/s | 51.9634 Ops/s | |
test_to_module_speed[True] | 1.3871ms | 0.9799ms | 1.0205 KOps/s | 1.0224 KOps/s | |
test_to_module_speed[False] | 1.5083ms | 0.9437ms | 1.0596 KOps/s | 1.0437 KOps/s | |
test_tc_init | 63.9910μs | 38.9515μs | 25.6730 KOps/s | 25.6586 KOps/s | |
test_tc_init_nested | 0.4769ms | 75.9600μs | 13.1648 KOps/s | 12.8658 KOps/s | |
test_tc_first_layer_tensor | 56.1280μs | 0.6969μs | 1.4350 MOps/s | 1.4340 MOps/s | |
test_tc_first_layer_nontensor | 31.9000μs | 2.3457μs | 426.3113 KOps/s | 423.2274 KOps/s | |
test_tc_second_layer_tensor | 98.0242μs | 1.4102μs | 709.1175 KOps/s | 710.4230 KOps/s | |
test_tc_second_layer_nontensor | 24.6800μs | 3.0477μs | 328.1133 KOps/s | 327.0047 KOps/s | |
test_unbind | 0.2380s | 10.4666ms | 95.5420 Ops/s | 144.3081 Ops/s | |
test_full_like | 11.5390ms | 9.1486ms | 109.3061 Ops/s | 107.3914 Ops/s | |
test_zeros_like | 11.6290ms | 8.6977ms | 114.9728 Ops/s | 230.7705 Ops/s | |
test_ones_like | 4.8169ms | 4.2875ms | 233.2383 Ops/s | 231.3990 Ops/s | |
test_clone | 11.6004ms | 9.1467ms | 109.3291 Ops/s | 157.1471 Ops/s | |
test_squeeze | 59.0410μs | 9.9513μs | 100.4899 KOps/s | 100.6927 KOps/s | |
test_unsqueeze | 0.1187ms | 72.7062μs | 13.7540 KOps/s | 13.5185 KOps/s | |
test_split | 0.3135ms | 0.1696ms | 5.8958 KOps/s | 6.0613 KOps/s | |
test_permute | 0.6260ms | 0.1894ms | 5.2787 KOps/s | 5.4047 KOps/s | |
test_stack | 51.6825ms | 50.6503ms | 19.7432 Ops/s | 19.6597 Ops/s | |
test_cat | 53.1239ms | 50.7573ms | 19.7016 Ops/s | 19.7186 Ops/s |
Labels: bug, CLA Signed
Stack from ghstack (oldest at bottom):