-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Force log_prob to return a tensordict when kwargs are passed to ProbabilisticTensorDictSequential.log_prob #1146
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Dec 18, 2024
…d to ProbabilisticTensorDictSequential.log_prob ghstack-source-id: a7482a31fbb6b65562589cc3771e73a61ce0ed17 Pull Request resolved: #1146
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Dec 18, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 53.5790μs | 22.1892μs | 45.0670 KOps/s | 50.3731 KOps/s | |
test_plain_set_stack_nested | 56.9260μs | 21.9783μs | 45.4993 KOps/s | 49.8115 KOps/s | |
test_plain_set_nested_inplace | 69.4780μs | 23.8154μs | 41.9896 KOps/s | 45.8989 KOps/s | |
test_plain_set_stack_nested_inplace | 50.9550μs | 23.9101μs | 41.8233 KOps/s | 44.0462 KOps/s | |
test_items | 28.1420μs | 4.2233μs | 236.7797 KOps/s | 235.6818 KOps/s | |
test_items_nested | 0.7959ms | 0.4033ms | 2.4795 KOps/s | 2.4768 KOps/s | |
test_items_nested_locked | 0.8255ms | 0.4039ms | 2.4756 KOps/s | 2.4808 KOps/s | |
test_items_nested_leaf | 0.1455ms | 76.6681μs | 13.0432 KOps/s | 12.8960 KOps/s | |
test_items_stack_nested | 0.6081ms | 0.4075ms | 2.4542 KOps/s | 2.4667 KOps/s | |
test_items_stack_nested_leaf | 0.1392ms | 80.3848μs | 12.4402 KOps/s | 12.5665 KOps/s | |
test_items_stack_nested_locked | 0.8010ms | 0.4130ms | 2.4215 KOps/s | 2.4511 KOps/s | |
test_keys | 92.4720μs | 3.6045μs | 277.4323 KOps/s | 284.5814 KOps/s | |
test_keys_nested | 0.2803ms | 0.1669ms | 5.9923 KOps/s | 6.0938 KOps/s | |
test_keys_nested_locked | 1.7822ms | 0.1744ms | 5.7345 KOps/s | 5.8432 KOps/s | |
test_keys_nested_leaf | 0.2849ms | 0.1455ms | 6.8734 KOps/s | 7.0085 KOps/s | |
test_keys_stack_nested | 0.3387ms | 0.1653ms | 6.0498 KOps/s | 6.1227 KOps/s | |
test_keys_stack_nested_leaf | 0.3534ms | 0.1440ms | 6.9433 KOps/s | 7.1013 KOps/s | |
test_keys_stack_nested_locked | 0.2663ms | 0.1718ms | 5.8194 KOps/s | 5.9346 KOps/s | |
test_values | 6.1314μs | 1.0974μs | 911.2512 KOps/s | 976.4345 KOps/s | |
test_values_nested | 0.1641ms | 63.4527μs | 15.7598 KOps/s | 16.2551 KOps/s | |
test_values_nested_locked | 0.1137ms | 64.0611μs | 15.6101 KOps/s | 16.1125 KOps/s | |
test_values_nested_leaf | 0.1667ms | 72.3944μs | 13.8132 KOps/s | 13.9459 KOps/s | |
test_values_stack_nested | 0.1451ms | 64.8499μs | 15.4202 KOps/s | 15.6547 KOps/s | |
test_values_stack_nested_leaf | 0.1650ms | 72.8356μs | 13.7296 KOps/s | 14.0241 KOps/s | |
test_values_stack_nested_locked | 0.1160ms | 64.5711μs | 15.4868 KOps/s | 15.6869 KOps/s | |
test_membership | 18.1230μs | 0.8827μs | 1.1329 MOps/s | 1.1547 MOps/s | |
test_membership_nested | 31.0580μs | 2.9801μs | 335.5648 KOps/s | 346.9498 KOps/s | |
test_membership_nested_leaf | 30.8170μs | 2.9666μs | 337.0919 KOps/s | 342.5238 KOps/s | |
test_membership_stacked_nested | 22.9530μs | 2.9323μs | 341.0270 KOps/s | 344.6614 KOps/s | |
test_membership_stacked_nested_leaf | 27.5520μs | 2.9701μs | 336.6852 KOps/s | 343.9579 KOps/s | |
test_membership_nested_last | 32.7110μs | 4.5136μs | 221.5507 KOps/s | 228.0562 KOps/s | |
test_membership_nested_leaf_last | 29.9150μs | 4.4467μs | 224.8882 KOps/s | 227.5054 KOps/s | |
test_membership_stacked_nested_last | 34.7740μs | 5.7341μs | 174.3965 KOps/s | 193.2964 KOps/s | |
test_membership_stacked_nested_leaf_last | 38.6510μs | 5.7149μs | 174.9797 KOps/s | 191.1723 KOps/s | |
test_nested_getleaf | 58.2980μs | 10.9272μs | 91.5148 KOps/s | 91.9235 KOps/s | |
test_nested_get | 36.5280μs | 10.3528μs | 96.5924 KOps/s | 96.9270 KOps/s | |
test_stacked_getleaf | 35.6760μs | 10.8423μs | 92.2311 KOps/s | 92.2345 KOps/s | |
test_stacked_get | 40.8360μs | 10.3083μs | 97.0088 KOps/s | 97.5777 KOps/s | |
test_nested_getitemleaf | 37.8910μs | 11.3436μs | 88.1556 KOps/s | 89.1961 KOps/s | |
test_nested_getitem | 48.9210μs | 10.6309μs | 94.0650 KOps/s | 96.0844 KOps/s | |
test_stacked_getitemleaf | 35.0050μs | 11.3974μs | 87.7391 KOps/s | 90.4701 KOps/s | |
test_stacked_getitem | 37.8100μs | 10.7173μs | 93.3067 KOps/s | 97.0981 KOps/s | |
test_lock_nested | 4.8723ms | 0.4643ms | 2.1536 KOps/s | 2.1880 KOps/s | |
test_lock_stack_nested | 0.6696ms | 0.4234ms | 2.3618 KOps/s | 2.3392 KOps/s | |
test_unlock_nested | 0.7379ms | 0.3737ms | 2.6757 KOps/s | 2.6576 KOps/s | |
test_unlock_stack_nested | 0.6343ms | 0.3421ms | 2.9229 KOps/s | 2.8809 KOps/s | |
test_flatten_speed | 0.1996ms | 0.1011ms | 9.8919 KOps/s | 9.9795 KOps/s | |
test_unflatten_speed | 0.7243ms | 0.5394ms | 1.8540 KOps/s | 1.8743 KOps/s | |
test_common_ops | 1.6214ms | 0.8149ms | 1.2272 KOps/s | 1.3162 KOps/s | |
test_creation | 17.9430μs | 2.5142μs | 397.7381 KOps/s | 403.0386 KOps/s | |
test_creation_empty | 35.6260μs | 13.5132μs | 74.0015 KOps/s | 104.9215 KOps/s | |
test_creation_nested_1 | 37.6400μs | 16.5944μs | 60.2611 KOps/s | 81.7470 KOps/s | |
test_creation_nested_2 | 1.4016ms | 21.4065μs | 46.7148 KOps/s | 59.5620 KOps/s | |
test_clone | 66.4930μs | 13.5458μs | 73.8234 KOps/s | 72.7205 KOps/s | |
test_getitem[int] | 0.1941ms | 12.8137μs | 78.0416 KOps/s | 76.6752 KOps/s | |
test_getitem[slice_int] | 0.1373ms | 24.7030μs | 40.4809 KOps/s | 39.8926 KOps/s | |
test_getitem[range] | 0.2168ms | 50.3405μs | 19.8647 KOps/s | 19.9516 KOps/s | |
test_getitem[tuple] | 0.1390ms | 20.7670μs | 48.1532 KOps/s | 48.6160 KOps/s | |
test_getitem[list] | 0.1702ms | 44.2044μs | 22.6222 KOps/s | 21.8473 KOps/s | |
test_setitem_dim[int] | 52.6280μs | 24.9079μs | 40.1480 KOps/s | 38.6012 KOps/s | |
test_setitem_dim[slice_int] | 83.4650μs | 51.7423μs | 19.3265 KOps/s | 18.8539 KOps/s | |
test_setitem_dim[range] | 0.1048ms | 72.8798μs | 13.7212 KOps/s | 13.4436 KOps/s | |
test_setitem_dim[tuple] | 76.7620μs | 39.3745μs | 25.3971 KOps/s | 23.8396 KOps/s | |
test_setitem | 84.9180μs | 21.6394μs | 46.2120 KOps/s | 50.0552 KOps/s | |
test_set | 67.9260μs | 21.0303μs | 47.5504 KOps/s | 52.7402 KOps/s | |
test_set_shared | 1.1676ms | 0.1688ms | 5.9234 KOps/s | 5.7979 KOps/s | |
test_update | 0.2213ms | 24.9709μs | 40.0466 KOps/s | 48.9808 KOps/s | |
test_update_nested | 0.1272ms | 36.2706μs | 27.5706 KOps/s | 31.4965 KOps/s | |
test_update__nested | 0.3800ms | 34.4004μs | 29.0694 KOps/s | 28.5284 KOps/s | |
test_set_nested | 92.6920μs | 23.5093μs | 42.5364 KOps/s | 47.1224 KOps/s | |
test_set_nested_new | 0.1259ms | 28.5105μs | 35.0748 KOps/s | 37.9869 KOps/s | |
test_select | 0.2178ms | 44.7826μs | 22.3301 KOps/s | 23.4915 KOps/s | |
test_select_nested | 0.1278ms | 63.5101μs | 15.7455 KOps/s | 15.3319 KOps/s | |
test_exclude_nested | 0.1867ms | 82.9881μs | 12.0499 KOps/s | 11.9404 KOps/s | |
test_empty[True] | 0.8267ms | 0.4146ms | 2.4121 KOps/s | 2.4377 KOps/s | |
test_empty[False] | 6.4220μs | 1.4632μs | 683.4543 KOps/s | 717.4568 KOps/s | |
test_unbind_speed | 0.3912ms | 0.2700ms | 3.7035 KOps/s | 3.6323 KOps/s | |
test_unbind_speed_stack0 | 0.4074ms | 0.2650ms | 3.7735 KOps/s | 3.7147 KOps/s | |
test_unbind_speed_stack1 | 0.1020s | 0.7860ms | 1.2722 KOps/s | 1.5353 KOps/s | |
test_split | 1.7436ms | 1.5837ms | 631.4275 Ops/s | 561.5414 Ops/s | |
test_chunk | 99.0406ms | 1.7467ms | 572.4983 Ops/s | 561.2980 Ops/s | |
test_consolidate_njt[False-None] | 0.1024s | 8.7481ms | 114.3106 Ops/s | 124.0720 Ops/s | |
test_creation[device0] | 0.2286ms | 90.3925μs | 11.0629 KOps/s | 10.6664 KOps/s | |
test_creation_from_tensor | 3.6381ms | 94.0737μs | 10.6300 KOps/s | 10.5769 KOps/s | |
test_add_one[memmap_tensor0] | 0.1650ms | 4.7927μs | 208.6526 KOps/s | 193.5440 KOps/s | |
test_contiguous[memmap_tensor0] | 9.6880μs | 0.5324μs | 1.8783 MOps/s | 1.9028 MOps/s | |
test_stack[memmap_tensor0] | 47.2380μs | 3.4542μs | 289.5018 KOps/s | 287.3478 KOps/s | |
test_memmaptd_index | 1.1154ms | 0.2396ms | 4.1733 KOps/s | 4.0747 KOps/s | |
test_memmaptd_index_astensor | 0.7698ms | 0.3291ms | 3.0383 KOps/s | 3.0140 KOps/s | |
test_memmaptd_index_op | 1.1067ms | 0.6215ms | 1.6091 KOps/s | 1.7372 KOps/s | |
test_serialize_model | 0.1204s | 0.1128s | 8.8632 Ops/s | 7.7955 Ops/s | |
test_serialize_model_pickle | 0.4586s | 0.3946s | 2.5343 Ops/s | 2.5420 Ops/s | |
test_serialize_weights | 0.2180s | 0.1286s | 7.7733 Ops/s | 8.8836 Ops/s | |
test_serialize_weights_returnearly | 0.2626s | 0.1723s | 5.8024 Ops/s | 6.5266 Ops/s | |
test_serialize_weights_pickle | 0.5574s | 0.4288s | 2.3321 Ops/s | 2.5272 Ops/s | |
test_serialize_weights_filesystem | 0.1462s | 0.1388s | 7.2035 Ops/s | 7.1528 Ops/s | |
test_serialize_model_filesystem | 0.2394s | 0.1575s | 6.3493 Ops/s | 6.2160 Ops/s | |
test_reshape_pytree | 61.3040μs | 26.4612μs | 37.7911 KOps/s | 37.5257 KOps/s | |
test_reshape_td | 91.4290μs | 33.2640μs | 30.0625 KOps/s | 29.3349 KOps/s | |
test_view_pytree | 85.5390μs | 26.9645μs | 37.0858 KOps/s | 37.3013 KOps/s | |
test_view_td | 84.8480μs | 40.3569μs | 24.7789 KOps/s | 25.1460 KOps/s | |
test_unbind_pytree | 86.5100μs | 29.4340μs | 33.9744 KOps/s | 33.2930 KOps/s | |
test_unbind_td | 0.3369ms | 39.9440μs | 25.0350 KOps/s | 24.9680 KOps/s | |
test_split_pytree | 85.2380μs | 28.9991μs | 34.4838 KOps/s | 33.7221 KOps/s | |
test_split_td | 0.5407ms | 45.0307μs | 22.2071 KOps/s | 21.5652 KOps/s | |
test_add_pytree | 82.7540μs | 35.3708μs | 28.2719 KOps/s | 27.6480 KOps/s | |
test_add_td | 0.1630ms | 63.2383μs | 15.8132 KOps/s | 17.4011 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1416ms | 60.7840μs | 16.4517 KOps/s | 16.4657 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3741ms | 0.1716ms | 5.8277 KOps/s | 5.8335 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1186ms | 44.8314μs | 22.3058 KOps/s | 22.1681 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2598ms | 0.1191ms | 8.3990 KOps/s | 8.3839 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 77.4530μs | 25.7000μs | 38.9105 KOps/s | 39.2563 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1118ms | 58.9514μs | 16.9631 KOps/s | 17.0650 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1498ms | 78.9290μs | 12.6696 KOps/s | 12.8630 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1589ms | 67.1619μs | 14.8894 KOps/s | 14.9117 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1914ms | 0.1027ms | 9.7405 KOps/s | 9.5781 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3917ms | 0.2127ms | 4.7020 KOps/s | 4.6768 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1127ms | 44.1429μs | 22.6537 KOps/s | 22.4187 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4833ms | 65.1921μs | 15.3393 KOps/s | 15.2770 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1815ms | 0.1028ms | 9.7259 KOps/s | 9.8347 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4761ms | 0.2023ms | 4.9436 KOps/s | 4.9634 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3783ms | 0.2300ms | 4.3484 KOps/s | 4.2830 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1725ms | 0.1031ms | 9.7001 KOps/s | 9.5644 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1557ms | 59.3033μs | 16.8625 KOps/s | 16.7337 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1002ms | 45.3123μs | 22.0690 KOps/s | 21.8943 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2442ms | 0.1590ms | 6.2895 KOps/s | 6.2370 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2416ms | 0.1025ms | 9.7600 KOps/s | 9.8729 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 51.6760μs | 20.6102μs | 48.5196 KOps/s | 46.6758 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1308ms | 66.1000μs | 15.1286 KOps/s | 15.1500 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1550ms | 81.4622μs | 12.2756 KOps/s | 12.5756 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1372ms | 70.0715μs | 14.2711 KOps/s | 14.9347 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4251ms | 0.2042ms | 4.8974 KOps/s | 4.9011 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.4694ms | 1.3489ms | 741.3338 Ops/s | 748.6825 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2915ms | 0.2026ms | 4.9364 KOps/s | 5.0108 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3441ms | 0.7909ms | 1.2644 KOps/s | 1.2743 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.9141ms | 0.4620ms | 2.1644 KOps/s | 2.1868 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.0547ms | 2.7784ms | 359.9247 Ops/s | 396.3730 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 81.7120μs | 35.3043μs | 28.3251 KOps/s | 27.9934 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5408ms | 32.8440μs | 30.4470 KOps/s | 29.1746 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1239ms | 29.2398μs | 34.2000 KOps/s | 34.7864 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1903ms | 23.7083μs | 42.1793 KOps/s | 42.5778 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 80.4800μs | 29.3333μs | 34.0910 KOps/s | 33.1407 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 93.8240μs | 23.3010μs | 42.9166 KOps/s | 43.0716 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1336ms | 49.9739μs | 20.0105 KOps/s | 19.7373 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5768ms | 20.4563μs | 48.8848 KOps/s | 47.9735 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 95.7270μs | 42.6726μs | 23.4343 KOps/s | 22.9221 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 59.1600μs | 18.6938μs | 53.4936 KOps/s | 52.0176 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 96.5790μs | 43.5194μs | 22.9782 KOps/s | 22.3887 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 55.8940μs | 18.4667μs | 54.1515 KOps/s | 53.5740 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1125ms | 51.0208μs | 19.5998 KOps/s | 19.0586 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9950ms | 20.3466μs | 49.1482 KOps/s | 48.8358 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1101ms | 43.9120μs | 22.7728 KOps/s | 22.5193 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 88.8350μs | 18.3903μs | 54.3764 KOps/s | 54.3314 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1087ms | 43.9595μs | 22.7482 KOps/s | 22.5471 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.3421ms | 19.0059μs | 52.6153 KOps/s | 54.2406 KOps/s | |
test_mod_add[eager] | 97.6910μs | 34.9725μs | 28.5939 KOps/s | 30.7839 KOps/s | |
test_mod_add[compile] | 0.1563ms | 47.4592μs | 21.0707 KOps/s | 21.1748 KOps/s | |
test_mod_add[compile-overhead] | 0.1144ms | 46.5416μs | 21.4862 KOps/s | 21.3407 KOps/s | |
test_mod_wrap[eager] | 0.4281ms | 0.2178ms | 4.5912 KOps/s | 4.5606 KOps/s | |
test_mod_wrap[compile] | 0.4518ms | 0.2027ms | 4.9342 KOps/s | 4.8766 KOps/s | |
test_mod_wrap[compile-overhead] | 0.2926ms | 0.1982ms | 5.0446 KOps/s | 4.8834 KOps/s | |
test_mod_wrap_and_backward[eager] | 16.2745ms | 11.6671ms | 85.7115 Ops/s | 85.5771 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.4136ms | 11.0625ms | 90.3957 Ops/s | 78.5474 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.4505ms | 12.6298ms | 79.1777 Ops/s | 79.6752 Ops/s | |
test_seq_add[eager] | 0.2889ms | 0.1156ms | 8.6522 KOps/s | 8.8085 KOps/s | |
test_seq_add[compile] | 0.1309ms | 60.8213μs | 16.4416 KOps/s | 16.3906 KOps/s | |
test_seq_add[compile-overhead] | 0.1535ms | 58.9335μs | 16.9683 KOps/s | 16.7135 KOps/s | |
test_seq_wrap[eager] | 0.6802ms | 0.4350ms | 2.2989 KOps/s | 2.2856 KOps/s | |
test_seq_wrap[compile] | 0.4803ms | 0.2221ms | 4.5024 KOps/s | 4.4079 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4126ms | 0.2217ms | 4.5107 KOps/s | 4.4272 KOps/s | |
test_func_call_runtime[False-eager] | 0.8113ms | 0.5299ms | 1.8871 KOps/s | 1.8078 KOps/s | |
test_func_call_runtime[False-compile] | 0.5130ms | 0.4134ms | 2.4190 KOps/s | 2.3202 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8291ms | 0.4153ms | 2.4079 KOps/s | 2.3358 KOps/s | |
test_func_call_runtime[True-eager] | 1.2692ms | 0.7438ms | 1.3445 KOps/s | 1.3142 KOps/s | |
test_func_call_runtime[True-compile] | 0.5919ms | 0.4529ms | 2.2077 KOps/s | 2.1421 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6101ms | 0.4566ms | 2.1901 KOps/s | 2.1378 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.6577ms | 0.5346ms | 1.8707 KOps/s | 1.8162 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5286ms | 0.4116ms | 2.4297 KOps/s | 2.3435 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5776ms | 0.4125ms | 2.4242 KOps/s | 2.3461 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0729ms | 0.8842ms | 1.1310 KOps/s | 1.0930 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.5704ms | 0.4779ms | 2.0926 KOps/s | 2.0206 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8964ms | 0.4853ms | 2.0604 KOps/s | 2.0340 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6340ms | 1.8830ms | 531.0697 Ops/s | 516.9322 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.7121ms | 0.5102ms | 1.9601 KOps/s | 1.9193 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7262ms | 0.5121ms | 1.9529 KOps/s | 1.9498 KOps/s | |
test_distributed | 0.2828ms | 0.1233ms | 8.1072 KOps/s | 7.7863 KOps/s | |
test_tdmodule | 44.3120μs | 26.8256μs | 37.2779 KOps/s | 40.3613 KOps/s | |
test_tdmodule_dispatch | 74.9590μs | 50.0609μs | 19.9757 KOps/s | 22.0440 KOps/s | |
test_tdseq | 65.1910μs | 29.8380μs | 33.5143 KOps/s | 36.2948 KOps/s | |
test_tdseq_dispatch | 89.3050μs | 55.8641μs | 17.9006 KOps/s | 19.8994 KOps/s | |
test_instantiation_functorch | 2.2074ms | 1.5044ms | 664.7150 Ops/s | 649.4958 Ops/s | |
test_exec_functorch | 0.4233ms | 0.1754ms | 5.7028 KOps/s | 5.5649 KOps/s | |
test_exec_functional_call | 0.3082ms | 0.1662ms | 6.0156 KOps/s | 5.7698 KOps/s | |
test_exec_td_decorator | 0.4375ms | 0.2274ms | 4.3983 KOps/s | 4.2667 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8168ms | 0.6502ms | 1.5380 KOps/s | 1.5305 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8748ms | 0.6491ms | 1.5407 KOps/s | 1.5594 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8399ms | 0.5240ms | 1.9084 KOps/s | 1.8852 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6912ms | 0.5225ms | 1.9140 KOps/s | 1.9087 KOps/s | |
test_to_module_speed[True] | 2.0889ms | 1.3528ms | 739.1966 Ops/s | 740.4467 Ops/s | |
test_to_module_speed[False] | 2.5115ms | 1.3245ms | 755.0289 Ops/s | 762.9415 Ops/s | |
test_tc_init | 96.7600μs | 51.5645μs | 19.3932 KOps/s | 22.2833 KOps/s | |
test_tc_init_nested | 0.2189ms | 0.1005ms | 9.9485 KOps/s | 10.9302 KOps/s | |
test_tc_first_layer_tensor | 27.6010μs | 1.5259μs | 655.3562 KOps/s | 652.6953 KOps/s | |
test_tc_first_layer_nontensor | 33.6320μs | 4.6685μs | 214.2011 KOps/s | 214.6502 KOps/s | |
test_tc_second_layer_tensor | 43.1400μs | 2.8213μs | 354.4511 KOps/s | 352.4612 KOps/s | |
test_tc_second_layer_nontensor | 46.1460μs | 6.0070μs | 166.4711 KOps/s | 167.5294 KOps/s | |
test_unbind | 0.2065s | 13.4079ms | 74.5830 Ops/s | 77.5016 Ops/s | |
test_full_like | 7.7284ms | 6.9854ms | 143.1549 Ops/s | 85.5219 Ops/s | |
test_zeros_like | 3.0553ms | 2.7003ms | 370.3350 Ops/s | 143.3408 Ops/s | |
test_ones_like | 3.5846ms | 3.1643ms | 316.0252 Ops/s | 130.8095 Ops/s | |
test_clone | 5.3504ms | 4.8614ms | 205.7027 Ops/s | 108.6201 Ops/s | |
test_squeeze | 72.4540μs | 12.6574μs | 79.0052 KOps/s | 80.3386 KOps/s | |
test_unsqueeze | 0.1917ms | 92.5474μs | 10.8053 KOps/s | 10.8697 KOps/s | |
test_split | 0.3573ms | 0.1962ms | 5.0957 KOps/s | 5.0913 KOps/s | |
test_permute | 0.3356ms | 0.2116ms | 4.7252 KOps/s | 4.7095 KOps/s | |
test_stack | 27.6562ms | 23.4369ms | 42.6677 Ops/s | 42.2642 Ops/s | |
test_cat | 27.9690ms | 23.3047ms | 42.9098 Ops/s | 42.3350 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 34.4910μs | 11.4828μs | 87.0871 KOps/s | 78.2187 KOps/s | |
test_plain_set_stack_nested | 65.6930μs | 11.6132μs | 86.1088 KOps/s | 77.4860 KOps/s | |
test_plain_set_nested_inplace | 43.0130μs | 12.5535μs | 79.6590 KOps/s | 72.3825 KOps/s | |
test_plain_set_stack_nested_inplace | 0.1822ms | 12.5061μs | 79.9612 KOps/s | 72.5211 KOps/s | |
test_items | 0.1934ms | 2.9012μs | 344.6792 KOps/s | 342.7524 KOps/s | |
test_items_nested | 0.3916ms | 0.3600ms | 2.7780 KOps/s | 2.7915 KOps/s | |
test_items_nested_locked | 0.4079ms | 0.3596ms | 2.7812 KOps/s | 2.7836 KOps/s | |
test_items_nested_leaf | 0.1597ms | 58.2562μs | 17.1655 KOps/s | 17.1443 KOps/s | |
test_items_stack_nested | 0.4058ms | 0.3647ms | 2.7419 KOps/s | 2.7519 KOps/s | |
test_items_stack_nested_leaf | 0.1623ms | 59.7399μs | 16.7392 KOps/s | 16.4610 KOps/s | |
test_items_stack_nested_locked | 0.4129ms | 0.3635ms | 2.7511 KOps/s | 2.7640 KOps/s | |
test_keys | 33.0120μs | 3.4613μs | 288.9114 KOps/s | 288.9970 KOps/s | |
test_keys_nested | 0.1345ms | 81.5372μs | 12.2643 KOps/s | 12.3771 KOps/s | |
test_keys_nested_locked | 0.7659ms | 87.3105μs | 11.4534 KOps/s | 11.5944 KOps/s | |
test_keys_nested_leaf | 0.1352ms | 71.9691μs | 13.8949 KOps/s | 14.0044 KOps/s | |
test_keys_stack_nested | 0.1244ms | 82.4112μs | 12.1343 KOps/s | 12.1501 KOps/s | |
test_keys_stack_nested_leaf | 0.1195ms | 73.4164μs | 13.6209 KOps/s | 13.5119 KOps/s | |
test_keys_stack_nested_locked | 0.1351ms | 88.9390μs | 11.2437 KOps/s | 11.3278 KOps/s | |
test_values | 5.0473μs | 0.8497μs | 1.1769 MOps/s | 1.1728 MOps/s | |
test_values_nested | 81.7950μs | 34.5737μs | 28.9238 KOps/s | 29.1560 KOps/s | |
test_values_nested_locked | 99.6450μs | 36.4386μs | 27.4434 KOps/s | 27.9994 KOps/s | |
test_values_nested_leaf | 0.1826ms | 39.1558μs | 25.5390 KOps/s | 25.8149 KOps/s | |
test_values_stack_nested | 0.2090ms | 34.8440μs | 28.6993 KOps/s | 28.5270 KOps/s | |
test_values_stack_nested_leaf | 0.2334ms | 39.6782μs | 25.2028 KOps/s | 25.4110 KOps/s | |
test_values_stack_nested_locked | 70.9930μs | 36.6822μs | 27.2612 KOps/s | 27.3074 KOps/s | |
test_membership | 2.3201μs | 0.5067μs | 1.9734 MOps/s | 1.9798 MOps/s | |
test_membership_nested | 15.5010μs | 2.0182μs | 495.4920 KOps/s | 495.1415 KOps/s | |
test_membership_nested_leaf | 20.0660μs | 2.0167μs | 495.8671 KOps/s | 499.7909 KOps/s | |
test_membership_stacked_nested | 56.5030μs | 2.1150μs | 472.8241 KOps/s | 479.7154 KOps/s | |
test_membership_stacked_nested_leaf | 29.1010μs | 2.1021μs | 475.7169 KOps/s | 483.2556 KOps/s | |
test_membership_nested_last | 33.2520μs | 3.0776μs | 324.9260 KOps/s | 321.2040 KOps/s | |
test_membership_nested_leaf_last | 29.5820μs | 3.1438μs | 318.0867 KOps/s | 317.5767 KOps/s | |
test_membership_stacked_nested_last | 31.7310μs | 3.6610μs | 273.1500 KOps/s | 212.7867 KOps/s | |
test_membership_stacked_nested_leaf_last | 32.2020μs | 3.6441μs | 274.4189 KOps/s | 214.7199 KOps/s | |
test_nested_getleaf | 39.1120μs | 6.1093μs | 163.6855 KOps/s | 164.1866 KOps/s | |
test_nested_get | 32.5820μs | 5.8040μs | 172.2964 KOps/s | 176.5626 KOps/s | |
test_stacked_getleaf | 62.0730μs | 6.1494μs | 162.6169 KOps/s | 164.3402 KOps/s | |
test_stacked_get | 34.2420μs | 5.8078μs | 172.1810 KOps/s | 173.1613 KOps/s | |
test_nested_getitemleaf | 31.3120μs | 6.2693μs | 159.5082 KOps/s | 158.5850 KOps/s | |
test_nested_getitem | 31.8720μs | 5.9759μs | 167.3394 KOps/s | 167.1278 KOps/s | |
test_stacked_getitemleaf | 31.2020μs | 6.2624μs | 159.6843 KOps/s | 160.4982 KOps/s | |
test_stacked_getitem | 33.6720μs | 5.9442μs | 168.2306 KOps/s | 168.5256 KOps/s | |
test_lock_nested | 9.7327ms | 0.3895ms | 2.5677 KOps/s | 2.5754 KOps/s | |
test_lock_stack_nested | 0.4425ms | 0.3491ms | 2.8642 KOps/s | 2.8853 KOps/s | |
test_unlock_nested | 0.7180ms | 0.3170ms | 3.1543 KOps/s | 3.1300 KOps/s | |
test_unlock_stack_nested | 0.3995ms | 0.2883ms | 3.4688 KOps/s | 3.4992 KOps/s | |
test_flatten_speed | 0.1092ms | 75.1480μs | 13.3071 KOps/s | 13.3478 KOps/s | |
test_unflatten_speed | 0.3859ms | 0.3204ms | 3.1208 KOps/s | 3.0932 KOps/s | |
test_common_ops | 1.6579ms | 0.5771ms | 1.7329 KOps/s | 1.6062 KOps/s | |
test_creation | 0.1057ms | 1.7545μs | 569.9631 KOps/s | 561.6564 KOps/s | |
test_creation_empty | 48.5620μs | 6.8486μs | 146.0157 KOps/s | 105.9181 KOps/s | |
test_creation_nested_1 | 37.3020μs | 8.5745μs | 116.6254 KOps/s | 89.8712 KOps/s | |
test_creation_nested_2 | 63.2530μs | 11.3301μs | 88.2608 KOps/s | 71.3109 KOps/s | |
test_clone | 76.3940μs | 10.1796μs | 98.2357 KOps/s | 95.9733 KOps/s | |
test_getitem[int] | 1.7805ms | 11.0506μs | 90.4927 KOps/s | 90.4705 KOps/s | |
test_getitem[slice_int] | 0.1113ms | 21.1793μs | 47.2159 KOps/s | 47.0141 KOps/s | |
test_getitem[range] | 0.1457ms | 36.9335μs | 27.0757 KOps/s | 26.9479 KOps/s | |
test_getitem[tuple] | 0.1109ms | 18.4538μs | 54.1893 KOps/s | 52.5800 KOps/s | |
test_getitem[list] | 0.2230ms | 32.4635μs | 30.8038 KOps/s | 30.5951 KOps/s | |
test_setitem_dim[int] | 50.8330μs | 18.2308μs | 54.8522 KOps/s | 56.0910 KOps/s | |
test_setitem_dim[slice_int] | 70.4940μs | 37.6286μs | 26.5755 KOps/s | 26.7904 KOps/s | |
test_setitem_dim[range] | 94.6550μs | 51.0595μs | 19.5850 KOps/s | 19.1238 KOps/s | |
test_setitem_dim[tuple] | 60.0930μs | 30.8371μs | 32.4285 KOps/s | 30.9900 KOps/s | |
test_setitem | 84.8940μs | 13.9279μs | 71.7985 KOps/s | 65.0391 KOps/s | |
test_set | 94.8660μs | 13.1618μs | 75.9776 KOps/s | 67.5034 KOps/s | |
test_set_shared | 1.4938ms | 0.1479ms | 6.7626 KOps/s | 6.7893 KOps/s | |
test_update | 0.3804ms | 15.6940μs | 63.7185 KOps/s | 52.1748 KOps/s | |
test_update_nested | 93.1350μs | 21.1190μs | 47.3507 KOps/s | 41.0110 KOps/s | |
test_update__nested | 0.4824ms | 24.8186μs | 40.2923 KOps/s | 39.6074 KOps/s | |
test_set_nested | 80.4940μs | 14.6613μs | 68.2067 KOps/s | 60.9255 KOps/s | |
test_set_nested_new | 91.5950μs | 16.8707μs | 59.2743 KOps/s | 53.3406 KOps/s | |
test_select | 0.2293ms | 29.6433μs | 33.7344 KOps/s | 32.2647 KOps/s | |
test_select_nested | 0.2413ms | 43.8745μs | 22.7923 KOps/s | 22.5435 KOps/s | |
test_exclude_nested | 94.3750μs | 62.8253μs | 15.9172 KOps/s | 15.7001 KOps/s | |
test_empty[True] | 0.3842ms | 0.2892ms | 3.4572 KOps/s | 3.4725 KOps/s | |
test_empty[False] | 4.6892μs | 0.8312μs | 1.2031 MOps/s | 1.2014 MOps/s | |
test_to | 93.2250μs | 57.5474μs | 17.3770 KOps/s | 17.7084 KOps/s | |
test_to_nonblocking | 0.1981ms | 47.4605μs | 21.0702 KOps/s | 21.0740 KOps/s | |
test_unbind_speed | 0.2955ms | 0.2372ms | 4.2159 KOps/s | 4.2092 KOps/s | |
test_unbind_speed_stack0 | 0.3768ms | 0.2369ms | 4.2213 KOps/s | 4.1764 KOps/s | |
test_unbind_speed_stack1 | 0.7158ms | 0.6140ms | 1.6287 KOps/s | 1.6522 KOps/s | |
test_split | 95.7698ms | 1.6193ms | 617.5530 Ops/s | 615.2225 Ops/s | |
test_chunk | 99.0344ms | 1.6204ms | 617.1414 Ops/s | 612.1799 Ops/s | |
test_consolidate[False-None] | 0.1009s | 2.9823ms | 335.3148 Ops/s | 332.0126 Ops/s | |
test_consolidate[default-None] | 1.9052ms | 1.6998ms | 588.2889 Ops/s | 589.7314 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8726ms | 1.7400ms | 574.6985 Ops/s | 572.8154 Ops/s | |
test_consolidate_njt[False-None] | 6.7363ms | 6.5956ms | 151.6159 Ops/s | 151.5442 Ops/s | |
test_to[False-False-None] | 1.8494ms | 1.6914ms | 591.2099 Ops/s | 600.6610 Ops/s | |
test_to[True-False-None] | 1.5566ms | 1.3413ms | 745.5382 Ops/s | 741.1838 Ops/s | |
test_to[within-False-None] | 4.4675ms | 4.2294ms | 236.4375 Ops/s | 237.6263 Ops/s | |
test_to[True-default-None] | 5.5598ms | 5.2889ms | 189.0760 Ops/s | 188.7127 Ops/s | |
test_to_njt[False-False-None] | 7.0210ms | 6.8296ms | 146.4214 Ops/s | 145.3480 Ops/s | |
test_to_njt[True-False-None] | 5.9306ms | 5.5650ms | 179.6948 Ops/s | 179.7828 Ops/s | |
test_to_njt[within-False-None] | 12.3824ms | 12.2270ms | 81.7860 Ops/s | 80.8158 Ops/s | |
test_creation[device0] | 0.5361ms | 78.8507μs | 12.6822 KOps/s | 12.7532 KOps/s | |
test_creation_from_tensor | 0.5296ms | 82.0582μs | 12.1865 KOps/s | 12.0993 KOps/s | |
test_add_one[memmap_tensor0] | 0.2403ms | 6.2697μs | 159.4971 KOps/s | 156.8847 KOps/s | |
test_contiguous[memmap_tensor0] | 3.6537μs | 0.4125μs | 2.4244 MOps/s | 2.4163 MOps/s | |
test_stack[memmap_tensor0] | 0.2128ms | 4.6225μs | 216.3321 KOps/s | 209.5802 KOps/s | |
test_memmaptd_index | 1.9841ms | 0.2538ms | 3.9397 KOps/s | 3.8633 KOps/s | |
test_memmaptd_index_astensor | 0.9807ms | 0.3171ms | 3.1539 KOps/s | 3.1231 KOps/s | |
test_memmaptd_index_op | 0.9487ms | 0.5510ms | 1.8150 KOps/s | 1.6733 KOps/s | |
test_serialize_model | 0.1319s | 0.1305s | 7.6623 Ops/s | 7.6294 Ops/s | |
test_serialize_model_pickle | 1.3483s | 1.2134s | 0.8241 Ops/s | 0.8209 Ops/s | |
test_serialize_weights | 0.1311s | 0.1302s | 7.6783 Ops/s | 7.6882 Ops/s | |
test_serialize_weights_returnearly | 0.3337s | 54.9439ms | 18.2004 Ops/s | 15.5861 Ops/s | |
test_serialize_weights_pickle | 1.4002s | 1.2264s | 0.8154 Ops/s | 0.8211 Ops/s | |
test_reshape_pytree | 0.1592ms | 22.2274μs | 44.9895 KOps/s | 43.7700 KOps/s | |
test_reshape_td | 61.4540μs | 26.8891μs | 37.1898 KOps/s | 36.4837 KOps/s | |
test_view_pytree | 51.0830μs | 22.1740μs | 45.0979 KOps/s | 44.5274 KOps/s | |
test_view_td | 87.6250μs | 30.1251μs | 33.1949 KOps/s | 30.5854 KOps/s | |
test_unbind_pytree | 0.1487ms | 27.8709μs | 35.8797 KOps/s | 34.8655 KOps/s | |
test_unbind_td | 0.8464ms | 36.7153μs | 27.2366 KOps/s | 26.4494 KOps/s | |
test_split_pytree | 0.1480ms | 30.1305μs | 33.1890 KOps/s | 32.1021 KOps/s | |
test_split_td | 0.2009ms | 39.3501μs | 25.4129 KOps/s | 24.6636 KOps/s | |
test_add_pytree | 0.1822ms | 34.2780μs | 29.1732 KOps/s | 30.0866 KOps/s | |
test_add_td | 0.2101ms | 48.2735μs | 20.7153 KOps/s | 20.4069 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2658ms | 0.1197ms | 8.3568 KOps/s | 8.1339 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2790ms | 0.1305ms | 7.6649 KOps/s | 7.6042 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2230ms | 95.9686μs | 10.4201 KOps/s | 10.2060 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.3300ms | 0.1469ms | 6.8070 KOps/s | 6.7421 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1609ms | 23.3179μs | 42.8855 KOps/s | 43.4001 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1776ms | 29.3493μs | 34.0724 KOps/s | 33.1579 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3840ms | 65.1327μs | 15.3533 KOps/s | 15.1766 KOps/s | |
test_compile_copy_nested[pytree-eager] | 79.4540μs | 49.0630μs | 20.3820 KOps/s | 19.9642 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2675ms | 0.1414ms | 7.0735 KOps/s | 7.0063 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3995ms | 0.2160ms | 4.6302 KOps/s | 4.5935 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2260ms | 97.4457μs | 10.2621 KOps/s | 10.1864 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2357ms | 54.3602μs | 18.3958 KOps/s | 18.6018 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1757ms | 0.1344ms | 7.4420 KOps/s | 7.3611 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6233ms | 0.4657ms | 2.1474 KOps/s | 2.1364 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3868ms | 0.2577ms | 3.8808 KOps/s | 3.8467 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2848ms | 0.1416ms | 7.0613 KOps/s | 7.0529 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2219ms | 65.6058μs | 15.2425 KOps/s | 15.4580 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2403ms | 97.2314μs | 10.2847 KOps/s | 10.1804 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5451ms | 0.3948ms | 2.5331 KOps/s | 2.5372 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2454ms | 0.1341ms | 7.4563 KOps/s | 7.4456 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2092ms | 20.8551μs | 47.9500 KOps/s | 53.9267 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 88.2750μs | 31.7113μs | 31.5345 KOps/s | 32.0948 KOps/s | |
test_compile_copy_flat[pytree-compile] | 99.6360μs | 70.1006μs | 14.2652 KOps/s | 13.9199 KOps/s | |
test_compile_copy_flat[pytree-eager] | 80.2040μs | 51.6386μs | 19.3653 KOps/s | 18.8013 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6758ms | 0.3999ms | 2.5009 KOps/s | 2.1940 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8745ms | 2.6242ms | 381.0739 Ops/s | 388.5012 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6514ms | 0.4501ms | 2.2218 KOps/s | 2.2321 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7745ms | 2.5771ms | 388.0342 Ops/s | 388.8494 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2812ms | 0.1159ms | 8.6251 KOps/s | 8.9853 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5661ms | 80.7637μs | 12.3818 KOps/s | 12.9052 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2856ms | 0.1080ms | 9.2591 KOps/s | 9.7751 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2516ms | 69.3905μs | 14.4112 KOps/s | 15.1051 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2937ms | 0.1106ms | 9.0419 KOps/s | 9.6942 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2602ms | 69.4142μs | 14.4063 KOps/s | 15.1129 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.3127ms | 0.1021ms | 9.7947 KOps/s | 9.9744 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1423ms | 17.2539μs | 57.9578 KOps/s | 53.7358 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2720ms | 99.6251μs | 10.0376 KOps/s | 10.3490 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1497ms | 15.9427μs | 62.7246 KOps/s | 61.7833 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2827ms | 0.1002ms | 9.9779 KOps/s | 10.2616 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1244ms | 15.6586μs | 63.8627 KOps/s | 61.9275 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2728ms | 0.1052ms | 9.5063 KOps/s | 9.8691 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6960ms | 17.8945μs | 55.8832 KOps/s | 56.7415 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2546ms | 99.3711μs | 10.0633 KOps/s | 10.2974 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1369ms | 15.8150μs | 63.2311 KOps/s | 61.6860 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2222ms | 95.9184μs | 10.4255 KOps/s | 10.3139 KOps/s | |
test_compile_indexing[int-pytree-eager] | 48.6030μs | 15.7376μs | 63.5421 KOps/s | 61.2154 KOps/s | |
test_mod_add[eager] | 0.2108ms | 39.1981μs | 25.5115 KOps/s | 26.2216 KOps/s | |
test_mod_add[compile] | 0.1463ms | 83.8191μs | 11.9305 KOps/s | 12.2819 KOps/s | |
test_mod_add[compile-overhead] | 0.3240ms | 0.1665ms | 6.0074 KOps/s | 5.4864 KOps/s | |
test_mod_wrap[eager] | 0.4398ms | 0.2561ms | 3.9041 KOps/s | 4.0503 KOps/s | |
test_mod_wrap[compile] | 0.4863ms | 0.2784ms | 3.5919 KOps/s | 3.4915 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1078ms | 3.7244ms | 268.4968 Ops/s | 275.3967 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.7093ms | 1.3499ms | 740.7772 Ops/s | 697.2666 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4771ms | 1.2562ms | 796.0803 Ops/s | 728.7811 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3664ms | 0.9221ms | 1.0845 KOps/s | 970.2342 Ops/s | |
test_seq_add[eager] | 0.3003ms | 0.1179ms | 8.4842 KOps/s | 8.5146 KOps/s | |
test_seq_add[compile] | 0.5979ms | 90.8828μs | 11.0032 KOps/s | 11.1287 KOps/s | |
test_seq_add[compile-overhead] | 0.2640ms | 0.1294ms | 7.7280 KOps/s | 7.7116 KOps/s | |
test_seq_wrap[eager] | 0.5484ms | 0.4044ms | 2.4725 KOps/s | 2.3489 KOps/s | |
test_seq_wrap[compile] | 0.5015ms | 0.3064ms | 3.2640 KOps/s | 3.3043 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4083ms | 0.2278ms | 4.3897 KOps/s | 4.4163 KOps/s | |
test_func_call_runtime[False-eager] | 0.9415ms | 0.7373ms | 1.3564 KOps/s | 1.3728 KOps/s | |
test_func_call_runtime[False-compile] | 0.8894ms | 0.7366ms | 1.3576 KOps/s | 1.3351 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5057ms | 0.3617ms | 2.7648 KOps/s | 2.7486 KOps/s | |
test_func_call_runtime[True-eager] | 1.0984ms | 0.8774ms | 1.1397 KOps/s | 1.1225 KOps/s | |
test_func_call_runtime[True-compile] | 0.9063ms | 0.7526ms | 1.3287 KOps/s | 1.3039 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4973ms | 0.3857ms | 2.5925 KOps/s | 2.5945 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9611ms | 0.7165ms | 1.3957 KOps/s | 1.3830 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8883ms | 0.7385ms | 1.3541 KOps/s | 1.3346 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5015ms | 0.3650ms | 2.7399 KOps/s | 2.7210 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1142ms | 0.9695ms | 1.0315 KOps/s | 1.0025 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9205ms | 0.7818ms | 1.2791 KOps/s | 1.2587 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6325ms | 0.4105ms | 2.4362 KOps/s | 2.4282 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5136ms | 2.0144ms | 496.4296 Ops/s | 490.3833 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0242ms | 0.8223ms | 1.2161 KOps/s | 1.2296 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5551ms | 0.4117ms | 2.4287 KOps/s | 2.4121 KOps/s | |
test_distributed | 5.7234ms | 0.2998ms | 3.3359 KOps/s | 7.4103 KOps/s | |
test_tdmodule | 51.8830μs | 19.7034μs | 50.7526 KOps/s | 47.7680 KOps/s | |
test_tdmodule_dispatch | 0.2023ms | 35.1029μs | 28.4877 KOps/s | 27.3016 KOps/s | |
test_tdseq | 75.1940μs | 20.9274μs | 47.7842 KOps/s | 46.7695 KOps/s | |
test_tdseq_dispatch | 0.1582ms | 38.7413μs | 25.8122 KOps/s | 25.2797 KOps/s | |
test_instantiation_functorch | 1.7020ms | 1.5733ms | 635.5967 Ops/s | 640.6480 Ops/s | |
test_exec_functorch | 0.2365ms | 0.1420ms | 7.0416 KOps/s | 7.0422 KOps/s | |
test_exec_functional_call | 0.3232ms | 0.1315ms | 7.6019 KOps/s | 7.4933 KOps/s | |
test_exec_td_decorator | 0.4062ms | 0.1902ms | 5.2581 KOps/s | 5.4487 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8253ms | 0.6637ms | 1.5067 KOps/s | 1.4895 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8405ms | 0.6666ms | 1.5002 KOps/s | 1.4777 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8019ms | 0.5917ms | 1.6900 KOps/s | 1.7144 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8111ms | 0.5986ms | 1.6706 KOps/s | 1.7158 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.3622ms | 18.6550ms | 53.6050 Ops/s | 53.3398 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.4611ms | 18.6961ms | 53.4872 Ops/s | 53.2849 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 18.8643ms | 18.4753ms | 54.1264 Ops/s | 53.7772 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.2365ms | 18.5605ms | 53.8779 Ops/s | 53.7467 Ops/s | |
test_to_module_speed[True] | 1.0968ms | 0.9606ms | 1.0411 KOps/s | 1.0201 KOps/s | |
test_to_module_speed[False] | 1.3516ms | 0.9541ms | 1.0481 KOps/s | 1.0468 KOps/s | |
test_tc_init | 71.0240μs | 36.8684μs | 27.1235 KOps/s | 26.0737 KOps/s | |
test_tc_init_nested | 0.1390ms | 73.2858μs | 13.6452 KOps/s | 13.1872 KOps/s | |
test_tc_first_layer_tensor | 7.6376μs | 0.6903μs | 1.4487 MOps/s | 1.4123 MOps/s | |
test_tc_first_layer_nontensor | 29.2610μs | 2.3363μs | 428.0361 KOps/s | 425.1189 KOps/s | |
test_tc_second_layer_tensor | 10.5607μs | 1.4212μs | 703.6193 KOps/s | 700.4956 KOps/s | |
test_tc_second_layer_nontensor | 38.6720μs | 3.1060μs | 321.9536 KOps/s | 325.7942 KOps/s | |
test_unbind | 0.2443s | 10.6312ms | 94.0624 Ops/s | 143.9015 Ops/s | |
test_full_like | 11.9940ms | 10.1034ms | 98.9770 Ops/s | 99.0367 Ops/s | |
test_zeros_like | 9.4859ms | 7.2940ms | 137.0986 Ops/s | 112.8620 Ops/s | |
test_ones_like | 5.0588ms | 4.3642ms | 229.1362 Ops/s | 221.9504 Ops/s | |
test_clone | 7.5834ms | 7.0421ms | 142.0027 Ops/s | 139.6824 Ops/s | |
test_squeeze | 61.1940μs | 9.5538μs | 104.6709 KOps/s | 104.1985 KOps/s | |
test_unsqueeze | 0.2318ms | 73.8532μs | 13.5404 KOps/s | 14.0328 KOps/s | |
test_split | 0.3245ms | 0.1626ms | 6.1519 KOps/s | 6.1592 KOps/s | |
test_permute | 0.3507ms | 0.1809ms | 5.5279 KOps/s | 5.4709 KOps/s | |
test_stack | 52.7764ms | 52.1104ms | 19.1900 Ops/s | 19.2250 Ops/s | |
test_cat | 52.7344ms | 52.1113ms | 19.1897 Ops/s | 19.2543 Ops/s |
vmoens
added a commit
that referenced
this pull request
Dec 18, 2024
…d to ProbabilisticTensorDictSequential.log_prob ghstack-source-id: 326d0763c9bbb13b51daac91edca4f0e821adf62 Pull Request resolved: #1146
vmoens
added a commit
that referenced
this pull request
Dec 19, 2024
…d to ProbabilisticTensorDictSequential.log_prob ghstack-source-id: 326d0763c9bbb13b51daac91edca4f0e821adf62 Pull Request resolved: #1146
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):