Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Fix windows upload wheels #2507

Open
wants to merge 1 commit into
base: gh/vmoens/34/base
Choose a base branch
from
Open

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Oct 21, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Oct 21, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2507

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 6 Unrelated Failures

As of commit 6fa1826 with merge base 9f6c21f (image):

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Oct 21, 2024
ghstack-source-id: 3d8eff7ccdf32050aac82dd9656d7db9ee08093d
Pull Request resolved: #2507
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 21, 2024
@vmoens vmoens added the CI Has to do with CI setup (e.g. wheels & builds, tests...) label Oct 21, 2024
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 143. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}8$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4086s 0.4068s 2.4584 Ops/s 2.3473 Ops/s $\color{#35bf28}+4.74\%$
test_transformed 0.6734s 0.6002s 1.6661 Ops/s 1.7287 Ops/s $\color{#d91a1a}-3.62\%$
test_serial 1.4099s 1.3349s 0.7491 Ops/s 0.7546 Ops/s $\color{#d91a1a}-0.73\%$
test_parallel 1.3791s 1.3079s 0.7646 Ops/s 0.7580 Ops/s $\color{#35bf28}+0.87\%$
test_step_mdp_speed[True-True-True-True-True] 0.1962ms 28.3987μs 35.2128 KOps/s 35.1922 KOps/s $\color{#35bf28}+0.06\%$
test_step_mdp_speed[True-True-True-True-False] 50.7960μs 16.9676μs 58.9358 KOps/s 58.6693 KOps/s $\color{#35bf28}+0.45\%$
test_step_mdp_speed[True-True-True-False-True] 57.0670μs 15.8911μs 62.9283 KOps/s 60.9121 KOps/s $\color{#35bf28}+3.31\%$
test_step_mdp_speed[True-True-True-False-False] 46.4770μs 9.3976μs 106.4096 KOps/s 106.4997 KOps/s $\color{#d91a1a}-0.08\%$
test_step_mdp_speed[True-True-False-True-True] 77.3850μs 30.8896μs 32.3734 KOps/s 32.5838 KOps/s $\color{#d91a1a}-0.65\%$
test_step_mdp_speed[True-True-False-True-False] 0.6587ms 19.3915μs 51.5689 KOps/s 52.9332 KOps/s $\color{#d91a1a}-2.58\%$
test_step_mdp_speed[True-True-False-False-True] 71.2030μs 17.9561μs 55.6913 KOps/s 55.2094 KOps/s $\color{#35bf28}+0.87\%$
test_step_mdp_speed[True-True-False-False-False] 40.4660μs 11.5277μs 86.7476 KOps/s 87.7461 KOps/s $\color{#d91a1a}-1.14\%$
test_step_mdp_speed[True-False-True-True-True] 73.6390μs 33.1411μs 30.1740 KOps/s 29.9278 KOps/s $\color{#35bf28}+0.82\%$
test_step_mdp_speed[True-False-True-True-False] 70.7930μs 21.4536μs 46.6122 KOps/s 47.6495 KOps/s $\color{#d91a1a}-2.18\%$
test_step_mdp_speed[True-False-True-False-True] 46.8880μs 17.9834μs 55.6067 KOps/s 55.6254 KOps/s $\color{#d91a1a}-0.03\%$
test_step_mdp_speed[True-False-True-False-False] 59.8630μs 11.5369μs 86.6788 KOps/s 88.1797 KOps/s $\color{#d91a1a}-1.70\%$
test_step_mdp_speed[True-False-False-True-True] 93.6560μs 35.1049μs 28.4860 KOps/s 28.6397 KOps/s $\color{#d91a1a}-0.54\%$
test_step_mdp_speed[True-False-False-True-False] 70.0610μs 23.2992μs 42.9199 KOps/s 43.5498 KOps/s $\color{#d91a1a}-1.45\%$
test_step_mdp_speed[True-False-False-False-True] 50.5250μs 19.9637μs 50.0909 KOps/s 50.5568 KOps/s $\color{#d91a1a}-0.92\%$
test_step_mdp_speed[True-False-False-False-False] 73.9890μs 13.4612μs 74.2875 KOps/s 75.5684 KOps/s $\color{#d91a1a}-1.70\%$
test_step_mdp_speed[False-True-True-True-True] 85.5210μs 32.7550μs 30.5297 KOps/s 30.3331 KOps/s $\color{#35bf28}+0.65\%$
test_step_mdp_speed[False-True-True-True-False] 56.3550μs 21.3561μs 46.8251 KOps/s 47.4937 KOps/s $\color{#d91a1a}-1.41\%$
test_step_mdp_speed[False-True-True-False-True] 57.6580μs 21.0652μs 47.4717 KOps/s 47.3522 KOps/s $\color{#35bf28}+0.25\%$
test_step_mdp_speed[False-True-True-False-False] 38.0410μs 13.0883μs 76.4039 KOps/s 76.5521 KOps/s $\color{#d91a1a}-0.19\%$
test_step_mdp_speed[False-True-False-True-True] 72.7060μs 34.5226μs 28.9666 KOps/s 28.6696 KOps/s $\color{#35bf28}+1.04\%$
test_step_mdp_speed[False-True-False-True-False] 74.9010μs 23.0308μs 43.4200 KOps/s 43.4091 KOps/s $\color{#35bf28}+0.03\%$
test_step_mdp_speed[False-True-False-False-True] 2.7200ms 23.1278μs 43.2380 KOps/s 42.4525 KOps/s $\color{#35bf28}+1.85\%$
test_step_mdp_speed[False-True-False-False-False] 0.6252ms 15.1840μs 65.8588 KOps/s 66.2725 KOps/s $\color{#d91a1a}-0.62\%$
test_step_mdp_speed[False-False-True-True-True] 71.4040μs 36.7892μs 27.1819 KOps/s 27.3464 KOps/s $\color{#d91a1a}-0.60\%$
test_step_mdp_speed[False-False-True-True-False] 79.3490μs 25.4048μs 39.3627 KOps/s 40.2899 KOps/s $\color{#d91a1a}-2.30\%$
test_step_mdp_speed[False-False-True-False-True] 92.6910μs 23.3117μs 42.8970 KOps/s 43.4924 KOps/s $\color{#d91a1a}-1.37\%$
test_step_mdp_speed[False-False-True-False-False] 58.8000μs 14.9754μs 66.7760 KOps/s 66.6057 KOps/s $\color{#35bf28}+0.26\%$
test_step_mdp_speed[False-False-False-True-True] 90.7080μs 38.5561μs 25.9363 KOps/s 25.7638 KOps/s $\color{#35bf28}+0.67\%$
test_step_mdp_speed[False-False-False-True-False] 70.0540μs 27.0580μs 36.9577 KOps/s 37.2359 KOps/s $\color{#d91a1a}-0.75\%$
test_step_mdp_speed[False-False-False-False-True] 69.0300μs 25.0643μs 39.8973 KOps/s 40.0891 KOps/s $\color{#d91a1a}-0.48\%$
test_step_mdp_speed[False-False-False-False-False] 65.2930μs 17.0576μs 58.6250 KOps/s 58.4799 KOps/s $\color{#35bf28}+0.25\%$
test_values[generalized_advantage_estimate-True-True] 16.0097ms 10.1695ms 98.3329 Ops/s 105.1625 Ops/s $\textbf{\color{#d91a1a}-6.49\%}$
test_values[vec_generalized_advantage_estimate-True-True] 37.4947ms 35.3665ms 28.2754 Ops/s 29.9743 Ops/s $\textbf{\color{#d91a1a}-5.67\%}$
test_values[td0_return_estimate-False-False] 0.2257ms 0.1692ms 5.9088 KOps/s 5.9074 KOps/s $\color{#35bf28}+0.02\%$
test_values[td1_return_estimate-False-False] 24.5747ms 24.0445ms 41.5896 Ops/s 41.4998 Ops/s $\color{#35bf28}+0.22\%$
test_values[vec_td1_return_estimate-False-False] 37.7296ms 35.5746ms 28.1099 Ops/s 29.7777 Ops/s $\textbf{\color{#d91a1a}-5.60\%}$
test_values[td_lambda_return_estimate-True-False] 37.8334ms 34.8682ms 28.6795 Ops/s 28.6731 Ops/s $\color{#35bf28}+0.02\%$
test_values[vec_td_lambda_return_estimate-True-False] 37.4116ms 35.5488ms 28.1304 Ops/s 30.0475 Ops/s $\textbf{\color{#d91a1a}-6.38\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 12.2946ms 8.4631ms 118.1597 Ops/s 120.1198 Ops/s $\color{#d91a1a}-1.63\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.7129ms 2.1719ms 460.4340 Ops/s 460.3513 Ops/s $\color{#35bf28}+0.02\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5975ms 0.3563ms 2.8066 KOps/s 2.8288 KOps/s $\color{#d91a1a}-0.78\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 45.5381ms 44.5985ms 22.4223 Ops/s 22.7818 Ops/s $\color{#d91a1a}-1.58\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.8702ms 3.0352ms 329.4684 Ops/s 329.8689 Ops/s $\color{#d91a1a}-0.12\%$
test_dqn_speed[False-None] 5.8767ms 1.3428ms 744.7258 Ops/s 752.5098 Ops/s $\color{#d91a1a}-1.03\%$
test_dqn_speed[False-backward] 1.8798ms 1.8022ms 554.8816 Ops/s 554.6386 Ops/s $\color{#35bf28}+0.04\%$
test_dqn_speed[True-None] 0.5725ms 0.4564ms 2.1909 KOps/s 2.1541 KOps/s $\color{#35bf28}+1.71\%$
test_dqn_speed[True-backward] 0.9375ms 0.8776ms 1.1394 KOps/s 778.1648 Ops/s $\textbf{\color{#35bf28}+46.43\%}$
test_dqn_speed[reduce-overhead-None] 0.7925ms 0.4607ms 2.1704 KOps/s 2.1219 KOps/s $\color{#35bf28}+2.29\%$
test_dqn_speed[reduce-overhead-backward] 0.9591ms 0.8761ms 1.1414 KOps/s 1.1405 KOps/s $\color{#35bf28}+0.08\%$
test_ddpg_speed[False-None] 3.4748ms 2.7979ms 357.4160 Ops/s 357.2395 Ops/s $\color{#35bf28}+0.05\%$
test_ddpg_speed[False-backward] 4.0745ms 3.9201ms 255.0967 Ops/s 254.8861 Ops/s $\color{#35bf28}+0.08\%$
test_ddpg_speed[True-None] 1.1814ms 1.0022ms 997.7644 Ops/s 989.2806 Ops/s $\color{#35bf28}+0.86\%$
test_ddpg_speed[True-backward] 1.9520ms 1.8771ms 532.7275 Ops/s 530.1423 Ops/s $\color{#35bf28}+0.49\%$
test_ddpg_speed[reduce-overhead-None] 1.5098ms 1.0021ms 997.8753 Ops/s 1.0003 KOps/s $\color{#d91a1a}-0.24\%$
test_ddpg_speed[reduce-overhead-backward] 2.0832ms 1.8955ms 527.5742 Ops/s 522.9666 Ops/s $\color{#35bf28}+0.88\%$
test_sac_speed[False-None] 9.0939ms 7.9098ms 126.4256 Ops/s 126.8898 Ops/s $\color{#d91a1a}-0.37\%$
test_sac_speed[False-backward] 10.8971ms 10.5838ms 94.4842 Ops/s 94.6243 Ops/s $\color{#d91a1a}-0.15\%$
test_sac_speed[True-None] 2.4482ms 1.8572ms 538.4420 Ops/s 540.8475 Ops/s $\color{#d91a1a}-0.44\%$
test_sac_speed[True-backward] 4.3634ms 3.5730ms 279.8743 Ops/s 283.5629 Ops/s $\color{#d91a1a}-1.30\%$
test_sac_speed[reduce-overhead-None] 3.3892ms 1.8516ms 540.0670 Ops/s 539.3981 Ops/s $\color{#35bf28}+0.12\%$
test_sac_speed[reduce-overhead-backward] 4.3826ms 3.5567ms 281.1593 Ops/s 277.8429 Ops/s $\color{#35bf28}+1.19\%$
test_redq_speed[False-None] 19.6479ms 13.4875ms 74.1426 Ops/s 80.9101 Ops/s $\textbf{\color{#d91a1a}-8.36\%}$
test_redq_speed[False-backward] 25.7401ms 22.1161ms 45.2159 Ops/s 46.1665 Ops/s $\color{#d91a1a}-2.06\%$
test_redq_speed[True-None] 6.3622ms 4.5580ms 219.3940 Ops/s 223.2604 Ops/s $\color{#d91a1a}-1.73\%$
test_redq_speed[True-backward] 13.0342ms 11.9037ms 84.0076 Ops/s 82.7807 Ops/s $\color{#35bf28}+1.48\%$
test_redq_speed[reduce-overhead-None] 5.3758ms 4.5792ms 218.3797 Ops/s 215.7432 Ops/s $\color{#35bf28}+1.22\%$
test_redq_speed[reduce-overhead-backward] 13.0371ms 12.1263ms 82.4656 Ops/s 83.5908 Ops/s $\color{#d91a1a}-1.35\%$
test_redq_deprec_speed[False-None] 14.5298ms 12.5726ms 79.5379 Ops/s 78.9328 Ops/s $\color{#35bf28}+0.77\%$
test_redq_deprec_speed[False-backward] 19.4491ms 18.1850ms 54.9904 Ops/s 54.5958 Ops/s $\color{#35bf28}+0.72\%$
test_redq_deprec_speed[True-None] 4.6479ms 3.5691ms 280.1803 Ops/s 279.8502 Ops/s $\color{#35bf28}+0.12\%$
test_redq_deprec_speed[True-backward] 8.6290ms 7.9837ms 125.2555 Ops/s 124.3933 Ops/s $\color{#35bf28}+0.69\%$
test_redq_deprec_speed[reduce-overhead-None] 4.0697ms 3.5762ms 279.6291 Ops/s 280.7713 Ops/s $\color{#d91a1a}-0.41\%$
test_redq_deprec_speed[reduce-overhead-backward] 9.5726ms 8.0994ms 123.4656 Ops/s 124.2293 Ops/s $\color{#d91a1a}-0.61\%$
test_td3_speed[False-None] 8.0121ms 7.7984ms 128.2313 Ops/s 128.6548 Ops/s $\color{#d91a1a}-0.33\%$
test_td3_speed[False-backward] 12.3870ms 10.2561ms 97.5027 Ops/s 97.7586 Ops/s $\color{#d91a1a}-0.26\%$
test_td3_speed[True-None] 1.8925ms 1.7515ms 570.9379 Ops/s 565.9981 Ops/s $\color{#35bf28}+0.87\%$
test_td3_speed[True-backward] 3.4535ms 3.3638ms 297.2826 Ops/s 279.7976 Ops/s $\textbf{\color{#35bf28}+6.25\%}$
test_td3_speed[reduce-overhead-None] 1.9308ms 1.7499ms 571.4690 Ops/s 563.7591 Ops/s $\color{#35bf28}+1.37\%$
test_td3_speed[reduce-overhead-backward] 4.3795ms 3.4013ms 294.0061 Ops/s 297.8467 Ops/s $\color{#d91a1a}-1.29\%$
test_cql_speed[False-None] 37.2475ms 35.3941ms 28.2533 Ops/s 27.8802 Ops/s $\color{#35bf28}+1.34\%$
test_cql_speed[False-backward] 48.3532ms 45.8453ms 21.8125 Ops/s 22.0298 Ops/s $\color{#d91a1a}-0.99\%$
test_cql_speed[True-None] 16.6773ms 15.5855ms 64.1621 Ops/s 64.2063 Ops/s $\color{#d91a1a}-0.07\%$
test_cql_speed[True-backward] 23.4270ms 22.1499ms 45.1470 Ops/s 44.3410 Ops/s $\color{#35bf28}+1.82\%$
test_cql_speed[reduce-overhead-None] 16.5244ms 15.5545ms 64.2901 Ops/s 63.4469 Ops/s $\color{#35bf28}+1.33\%$
test_cql_speed[reduce-overhead-backward] 23.6545ms 22.3495ms 44.7437 Ops/s 46.0217 Ops/s $\color{#d91a1a}-2.78\%$
test_a2c_speed[False-None] 7.9681ms 7.1051ms 140.7445 Ops/s 140.0349 Ops/s $\color{#35bf28}+0.51\%$
test_a2c_speed[False-backward] 16.0000ms 14.1197ms 70.8229 Ops/s 70.8427 Ops/s $\color{#d91a1a}-0.03\%$
test_a2c_speed[True-None] 3.9489ms 3.3301ms 300.2883 Ops/s 300.5295 Ops/s $\color{#d91a1a}-0.08\%$
test_a2c_speed[True-backward] 10.1238ms 9.7954ms 102.0883 Ops/s 103.4568 Ops/s $\color{#d91a1a}-1.32\%$
test_a2c_speed[reduce-overhead-None] 3.7528ms 3.3428ms 299.1465 Ops/s 299.8286 Ops/s $\color{#d91a1a}-0.23\%$
test_a2c_speed[reduce-overhead-backward] 10.2009ms 9.7223ms 102.8561 Ops/s 103.6082 Ops/s $\color{#d91a1a}-0.73\%$
test_ppo_speed[False-None] 9.2189ms 7.3568ms 135.9290 Ops/s 136.0712 Ops/s $\color{#d91a1a}-0.10\%$
test_ppo_speed[False-backward] 15.9364ms 14.4165ms 69.3650 Ops/s 70.0189 Ops/s $\color{#d91a1a}-0.93\%$
test_ppo_speed[True-None] 4.4913ms 3.7471ms 266.8726 Ops/s 269.9324 Ops/s $\color{#d91a1a}-1.13\%$
test_ppo_speed[True-backward] 10.7637ms 9.6115ms 104.0425 Ops/s 104.7276 Ops/s $\color{#d91a1a}-0.65\%$
test_ppo_speed[reduce-overhead-None] 4.0400ms 3.7203ms 268.7930 Ops/s 268.9217 Ops/s $\color{#d91a1a}-0.05\%$
test_ppo_speed[reduce-overhead-backward] 10.4685ms 9.5993ms 104.1744 Ops/s 104.6026 Ops/s $\color{#d91a1a}-0.41\%$
test_reinforce_speed[False-None] 8.6289ms 6.5082ms 153.6533 Ops/s 155.1621 Ops/s $\color{#d91a1a}-0.97\%$
test_reinforce_speed[False-backward] 10.5220ms 9.6801ms 103.3044 Ops/s 103.5567 Ops/s $\color{#d91a1a}-0.24\%$
test_reinforce_speed[True-None] 3.6903ms 2.6678ms 374.8442 Ops/s 378.3473 Ops/s $\color{#d91a1a}-0.93\%$
test_reinforce_speed[True-backward] 9.8522ms 8.5533ms 116.9140 Ops/s 117.4864 Ops/s $\color{#d91a1a}-0.49\%$
test_reinforce_speed[reduce-overhead-None] 3.2443ms 2.6499ms 377.3680 Ops/s 379.9329 Ops/s $\color{#d91a1a}-0.68\%$
test_reinforce_speed[reduce-overhead-backward] 9.2752ms 8.5615ms 116.8020 Ops/s 117.3000 Ops/s $\color{#d91a1a}-0.42\%$
test_iql_speed[False-None] 33.2904ms 31.8415ms 31.4056 Ops/s 31.0827 Ops/s $\color{#35bf28}+1.04\%$
test_iql_speed[False-backward] 46.4789ms 44.7947ms 22.3241 Ops/s 22.0272 Ops/s $\color{#35bf28}+1.35\%$
test_iql_speed[True-None] 12.6952ms 10.6741ms 93.6847 Ops/s 95.3040 Ops/s $\color{#d91a1a}-1.70\%$
test_iql_speed[True-backward] 22.8179ms 21.6590ms 46.1701 Ops/s 46.7935 Ops/s $\color{#d91a1a}-1.33\%$
test_iql_speed[reduce-overhead-None] 11.9125ms 10.6126ms 94.2277 Ops/s 96.0751 Ops/s $\color{#d91a1a}-1.92\%$
test_iql_speed[reduce-overhead-backward] 22.6923ms 21.6502ms 46.1889 Ops/s 45.7583 Ops/s $\color{#35bf28}+0.94\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.2540ms 4.7664ms 209.8030 Ops/s 211.8363 Ops/s $\color{#d91a1a}-0.96\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.7707ms 0.4779ms 2.0924 KOps/s 2.1005 KOps/s $\color{#d91a1a}-0.38\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6896ms 0.4539ms 2.2030 KOps/s 2.2086 KOps/s $\color{#d91a1a}-0.25\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.0441ms 4.6518ms 214.9713 Ops/s 217.8090 Ops/s $\color{#d91a1a}-1.30\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.7696ms 0.4709ms 2.1237 KOps/s 2.1430 KOps/s $\color{#d91a1a}-0.90\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6836ms 0.4488ms 2.2284 KOps/s 2.2348 KOps/s $\color{#d91a1a}-0.29\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.2958ms 1.5863ms 630.3905 Ops/s 635.0875 Ops/s $\color{#d91a1a}-0.74\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.1736ms 1.5356ms 651.1988 Ops/s 653.9704 Ops/s $\color{#d91a1a}-0.42\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.9988ms 4.7992ms 208.3681 Ops/s 207.0425 Ops/s $\color{#35bf28}+0.64\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.1636ms 0.6146ms 1.6269 KOps/s 1.6207 KOps/s $\color{#35bf28}+0.39\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9420ms 0.5881ms 1.7003 KOps/s 1.6986 KOps/s $\color{#35bf28}+0.10\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 4.9042ms 4.6897ms 213.2328 Ops/s 209.7563 Ops/s $\color{#35bf28}+1.66\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.6558ms 0.4761ms 2.1005 KOps/s 2.0537 KOps/s $\color{#35bf28}+2.28\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 7.3810ms 0.4655ms 2.1482 KOps/s 2.1591 KOps/s $\color{#d91a1a}-0.51\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.0367ms 4.6395ms 215.5412 Ops/s 213.5465 Ops/s $\color{#35bf28}+0.93\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.1719ms 0.4757ms 2.1021 KOps/s 2.1348 KOps/s $\color{#d91a1a}-1.53\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6585ms 0.4522ms 2.2114 KOps/s 2.1767 KOps/s $\color{#35bf28}+1.59\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.1134ms 4.8084ms 207.9706 Ops/s 208.5532 Ops/s $\color{#d91a1a}-0.28\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.5836ms 0.6142ms 1.6281 KOps/s 1.6482 KOps/s $\color{#d91a1a}-1.22\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7979ms 0.5886ms 1.6990 KOps/s 1.7048 KOps/s $\color{#d91a1a}-0.34\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 8.4490ms 4.2677ms 234.3190 Ops/s 250.4588 Ops/s $\textbf{\color{#d91a1a}-6.44\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 8.0693ms 2.2876ms 437.1366 Ops/s 461.8849 Ops/s $\textbf{\color{#d91a1a}-5.36\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 3.8540ms 1.2292ms 813.5612 Ops/s 752.3575 Ops/s $\textbf{\color{#35bf28}+8.13\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.3638s 11.4093ms 87.6474 Ops/s 37.6370 Ops/s $\textbf{\color{#35bf28}+132.88\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 4.7984ms 2.2526ms 443.9331 Ops/s 423.8300 Ops/s $\color{#35bf28}+4.74\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.3742ms 1.3047ms 766.4444 Ops/s 713.0531 Ops/s $\textbf{\color{#35bf28}+7.49\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 5.9296ms 4.3854ms 228.0310 Ops/s 212.0353 Ops/s $\textbf{\color{#35bf28}+7.54\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 6.0253ms 2.4186ms 413.4634 Ops/s 416.5488 Ops/s $\color{#d91a1a}-0.74\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.6227ms 1.4794ms 675.9696 Ops/s 715.9084 Ops/s $\textbf{\color{#d91a1a}-5.58\%}$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 143. Improved: $\large\color{#35bf28}16$. Worsened: $\large\color{#d91a1a}7$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7581s 0.7491s 1.3350 Ops/s 1.3560 Ops/s $\color{#d91a1a}-1.55\%$
test_transformed 1.0771s 0.9994s 1.0006 Ops/s 1.0222 Ops/s $\color{#d91a1a}-2.11\%$
test_serial 2.2420s 2.1618s 0.4626 Ops/s 0.4649 Ops/s $\color{#d91a1a}-0.50\%$
test_parallel 2.0658s 1.9880s 0.5030 Ops/s 0.4999 Ops/s $\color{#35bf28}+0.63\%$
test_step_mdp_speed[True-True-True-True-True] 0.1816ms 38.6488μs 25.8740 KOps/s 25.8450 KOps/s $\color{#35bf28}+0.11\%$
test_step_mdp_speed[True-True-True-True-False] 51.3410μs 23.1125μs 43.2666 KOps/s 43.6861 KOps/s $\color{#d91a1a}-0.96\%$
test_step_mdp_speed[True-True-True-False-True] 61.5010μs 20.5985μs 48.5473 KOps/s 48.8137 KOps/s $\color{#d91a1a}-0.55\%$
test_step_mdp_speed[True-True-True-False-False] 53.6910μs 12.2641μs 81.5390 KOps/s 81.7009 KOps/s $\color{#d91a1a}-0.20\%$
test_step_mdp_speed[True-True-False-True-True] 74.1020μs 41.2427μs 24.2467 KOps/s 24.1384 KOps/s $\color{#35bf28}+0.45\%$
test_step_mdp_speed[True-True-False-True-False] 55.5710μs 25.2523μs 39.6004 KOps/s 39.3410 KOps/s $\color{#35bf28}+0.66\%$
test_step_mdp_speed[True-True-False-False-True] 66.2920μs 24.0214μs 41.6296 KOps/s 42.7470 KOps/s $\color{#d91a1a}-2.61\%$
test_step_mdp_speed[True-True-False-False-False] 39.4110μs 15.0623μs 66.3910 KOps/s 66.8289 KOps/s $\color{#d91a1a}-0.66\%$
test_step_mdp_speed[True-False-True-True-True] 79.9510μs 44.5240μs 22.4598 KOps/s 22.2990 KOps/s $\color{#35bf28}+0.72\%$
test_step_mdp_speed[True-False-True-True-False] 60.6510μs 28.2193μs 35.4368 KOps/s 35.3574 KOps/s $\color{#35bf28}+0.22\%$
test_step_mdp_speed[True-False-True-False-True] 52.9010μs 23.6491μs 42.2849 KOps/s 42.7734 KOps/s $\color{#d91a1a}-1.14\%$
test_step_mdp_speed[True-False-True-False-False] 38.9510μs 15.0504μs 66.4433 KOps/s 66.6902 KOps/s $\color{#d91a1a}-0.37\%$
test_step_mdp_speed[True-False-False-True-True] 78.6010μs 47.0955μs 21.2334 KOps/s 21.1599 KOps/s $\color{#35bf28}+0.35\%$
test_step_mdp_speed[True-False-False-True-False] 56.6910μs 30.8443μs 32.4210 KOps/s 32.4929 KOps/s $\color{#d91a1a}-0.22\%$
test_step_mdp_speed[True-False-False-False-True] 54.4710μs 25.9583μs 38.5233 KOps/s 37.7694 KOps/s $\color{#35bf28}+2.00\%$
test_step_mdp_speed[True-False-False-False-False] 41.7810μs 17.6685μs 56.5979 KOps/s 55.5332 KOps/s $\color{#35bf28}+1.92\%$
test_step_mdp_speed[False-True-True-True-True] 74.1620μs 44.8922μs 22.2756 KOps/s 22.1920 KOps/s $\color{#35bf28}+0.38\%$
test_step_mdp_speed[False-True-True-True-False] 55.4010μs 28.3619μs 35.2585 KOps/s 34.9611 KOps/s $\color{#35bf28}+0.85\%$
test_step_mdp_speed[False-True-True-False-True] 73.1310μs 28.5129μs 35.0718 KOps/s 34.5574 KOps/s $\color{#35bf28}+1.49\%$
test_step_mdp_speed[False-True-True-False-False] 44.3000μs 17.3965μs 57.4829 KOps/s 56.3942 KOps/s $\color{#35bf28}+1.93\%$
test_step_mdp_speed[False-True-False-True-True] 84.9320μs 46.7347μs 21.3974 KOps/s 21.1213 KOps/s $\color{#35bf28}+1.31\%$
test_step_mdp_speed[False-True-False-True-False] 61.6210μs 31.3208μs 31.9277 KOps/s 32.9057 KOps/s $\color{#d91a1a}-2.97\%$
test_step_mdp_speed[False-True-False-False-True] 3.1534ms 31.9554μs 31.2936 KOps/s 31.3926 KOps/s $\color{#d91a1a}-0.32\%$
test_step_mdp_speed[False-True-False-False-False] 52.2710μs 20.6018μs 48.5393 KOps/s 48.8994 KOps/s $\color{#d91a1a}-0.74\%$
test_step_mdp_speed[False-False-True-True-True] 78.4010μs 50.1058μs 19.9578 KOps/s 20.0903 KOps/s $\color{#d91a1a}-0.66\%$
test_step_mdp_speed[False-False-True-True-False] 63.9810μs 34.0419μs 29.3756 KOps/s 30.0306 KOps/s $\color{#d91a1a}-2.18\%$
test_step_mdp_speed[False-False-True-False-True] 65.6020μs 30.5990μs 32.6808 KOps/s 32.1186 KOps/s $\color{#35bf28}+1.75\%$
test_step_mdp_speed[False-False-True-False-False] 46.4410μs 20.1711μs 49.5759 KOps/s 50.2719 KOps/s $\color{#d91a1a}-1.38\%$
test_step_mdp_speed[False-False-False-True-True] 82.9120μs 51.8316μs 19.2932 KOps/s 19.2553 KOps/s $\color{#35bf28}+0.20\%$
test_step_mdp_speed[False-False-False-True-False] 64.8310μs 36.4992μs 27.3979 KOps/s 28.0060 KOps/s $\color{#d91a1a}-2.17\%$
test_step_mdp_speed[False-False-False-False-True] 62.6620μs 34.1750μs 29.2611 KOps/s 29.7389 KOps/s $\color{#d91a1a}-1.61\%$
test_step_mdp_speed[False-False-False-False-False] 56.3910μs 23.1392μs 43.2166 KOps/s 44.8311 KOps/s $\color{#d91a1a}-3.60\%$
test_values[generalized_advantage_estimate-True-True] 25.3762ms 25.0450ms 39.9281 Ops/s 40.6874 Ops/s $\color{#d91a1a}-1.87\%$
test_values[vec_generalized_advantage_estimate-True-True] 99.0969ms 2.8763ms 347.6733 Ops/s 313.1596 Ops/s $\textbf{\color{#35bf28}+11.02\%}$
test_values[td0_return_estimate-False-False] 86.7210μs 66.4906μs 15.0397 KOps/s 15.2072 KOps/s $\color{#d91a1a}-1.10\%$
test_values[td1_return_estimate-False-False] 56.0590ms 55.7568ms 17.9350 Ops/s 18.2293 Ops/s $\color{#d91a1a}-1.61\%$
test_values[vec_td1_return_estimate-False-False] 1.2939ms 1.0787ms 927.0087 Ops/s 932.9208 Ops/s $\color{#d91a1a}-0.63\%$
test_values[td_lambda_return_estimate-True-False] 90.2283ms 88.7109ms 11.2726 Ops/s 11.5208 Ops/s $\color{#d91a1a}-2.15\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2681ms 1.0758ms 929.5457 Ops/s 935.5877 Ops/s $\color{#d91a1a}-0.65\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.8769ms 24.6868ms 40.5075 Ops/s 40.9577 Ops/s $\color{#d91a1a}-1.10\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0345ms 0.7616ms 1.3130 KOps/s 1.3441 KOps/s $\color{#d91a1a}-2.31\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7828ms 0.6672ms 1.4987 KOps/s 1.5092 KOps/s $\color{#d91a1a}-0.69\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5217ms 1.4785ms 676.3387 Ops/s 680.4539 Ops/s $\color{#d91a1a}-0.60\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7197ms 0.6823ms 1.4656 KOps/s 1.4743 KOps/s $\color{#d91a1a}-0.59\%$
test_dqn_speed[False-None] 6.5919ms 1.3304ms 751.6428 Ops/s 735.7617 Ops/s $\color{#35bf28}+2.16\%$
test_dqn_speed[False-backward] 1.9413ms 1.8620ms 537.0554 Ops/s 533.7860 Ops/s $\color{#35bf28}+0.61\%$
test_dqn_speed[True-None] 0.9453ms 0.5602ms 1.7851 KOps/s 1.7259 KOps/s $\color{#35bf28}+3.43\%$
test_dqn_speed[True-backward] 1.0413ms 1.0063ms 993.7570 Ops/s 967.4076 Ops/s $\color{#35bf28}+2.72\%$
test_dqn_speed[reduce-overhead-None] 0.6853ms 0.5757ms 1.7369 KOps/s 1.7390 KOps/s $\color{#d91a1a}-0.12\%$
test_dqn_speed[reduce-overhead-backward] 1.0468ms 1.0092ms 990.9143 Ops/s 981.5254 Ops/s $\color{#35bf28}+0.96\%$
test_ddpg_speed[False-None] 3.0524ms 2.6849ms 372.4566 Ops/s 365.0872 Ops/s $\color{#35bf28}+2.02\%$
test_ddpg_speed[False-backward] 4.2864ms 3.9538ms 252.9217 Ops/s 253.0487 Ops/s $\color{#d91a1a}-0.05\%$
test_ddpg_speed[True-None] 1.4910ms 1.2444ms 803.5821 Ops/s 745.3896 Ops/s $\textbf{\color{#35bf28}+7.81\%}$
test_ddpg_speed[True-backward] 2.2972ms 2.2302ms 448.3934 Ops/s 366.4170 Ops/s $\textbf{\color{#35bf28}+22.37\%}$
test_ddpg_speed[reduce-overhead-None] 1.5070ms 1.2547ms 797.0331 Ops/s 777.8032 Ops/s $\color{#35bf28}+2.47\%$
test_ddpg_speed[reduce-overhead-backward] 2.3240ms 2.2585ms 442.7622 Ops/s 451.4353 Ops/s $\color{#d91a1a}-1.92\%$
test_sac_speed[False-None] 7.9557ms 7.6271ms 131.1112 Ops/s 128.1762 Ops/s $\color{#35bf28}+2.29\%$
test_sac_speed[False-backward] 11.2461ms 10.8244ms 92.3837 Ops/s 91.1922 Ops/s $\color{#35bf28}+1.31\%$
test_sac_speed[True-None] 2.4064ms 2.0309ms 492.3974 Ops/s 488.8467 Ops/s $\color{#35bf28}+0.73\%$
test_sac_speed[True-backward] 4.0609ms 3.9580ms 252.6529 Ops/s 231.6162 Ops/s $\textbf{\color{#35bf28}+9.08\%}$
test_sac_speed[reduce-overhead-None] 2.6054ms 2.0647ms 484.3313 Ops/s 490.3121 Ops/s $\color{#d91a1a}-1.22\%$
test_sac_speed[reduce-overhead-backward] 4.2222ms 3.9584ms 252.6262 Ops/s 253.1817 Ops/s $\color{#d91a1a}-0.22\%$
test_redq_speed[False-None] 11.5962ms 9.9603ms 100.3987 Ops/s 95.8881 Ops/s $\color{#35bf28}+4.70\%$
test_redq_speed[False-backward] 18.3133ms 17.2029ms 58.1296 Ops/s 55.9483 Ops/s $\color{#35bf28}+3.90\%$
test_redq_speed[True-None] 3.8186ms 3.5593ms 280.9568 Ops/s 276.5976 Ops/s $\color{#35bf28}+1.58\%$
test_redq_speed[True-backward] 8.9828ms 8.5258ms 117.2908 Ops/s 107.8923 Ops/s $\textbf{\color{#35bf28}+8.71\%}$
test_redq_speed[reduce-overhead-None] 3.7241ms 3.4963ms 286.0180 Ops/s 279.3345 Ops/s $\color{#35bf28}+2.39\%$
test_redq_speed[reduce-overhead-backward] 8.8998ms 8.4554ms 118.2676 Ops/s 116.5038 Ops/s $\color{#35bf28}+1.51\%$
test_redq_deprec_speed[False-None] 10.9746ms 10.5430ms 94.8492 Ops/s 93.2207 Ops/s $\color{#35bf28}+1.75\%$
test_redq_deprec_speed[False-backward] 15.6918ms 15.2886ms 65.4082 Ops/s 64.7408 Ops/s $\color{#35bf28}+1.03\%$
test_redq_deprec_speed[True-None] 3.3768ms 3.1840ms 314.0697 Ops/s 305.6809 Ops/s $\color{#35bf28}+2.74\%$
test_redq_deprec_speed[True-backward] 7.2117ms 7.0286ms 142.2758 Ops/s 140.0384 Ops/s $\color{#35bf28}+1.60\%$
test_redq_deprec_speed[reduce-overhead-None] 3.3360ms 3.1733ms 315.1301 Ops/s 304.3459 Ops/s $\color{#35bf28}+3.54\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.2602ms 7.0589ms 141.6644 Ops/s 139.1514 Ops/s $\color{#35bf28}+1.81\%$
test_td3_speed[False-None] 7.9238ms 7.5721ms 132.0637 Ops/s 130.0925 Ops/s $\color{#35bf28}+1.52\%$
test_td3_speed[False-backward] 10.6332ms 10.3538ms 96.5826 Ops/s 94.9284 Ops/s $\color{#35bf28}+1.74\%$
test_td3_speed[True-None] 1.9350ms 1.9078ms 524.1770 Ops/s 523.1381 Ops/s $\color{#35bf28}+0.20\%$
test_td3_speed[True-backward] 3.8169ms 3.6927ms 270.8064 Ops/s 263.8615 Ops/s $\color{#35bf28}+2.63\%$
test_td3_speed[reduce-overhead-None] 1.9339ms 1.8975ms 527.0005 Ops/s 521.4170 Ops/s $\color{#35bf28}+1.07\%$
test_td3_speed[reduce-overhead-backward] 3.7798ms 3.6735ms 272.2172 Ops/s 269.0432 Ops/s $\color{#35bf28}+1.18\%$
test_cql_speed[False-None] 29.3625ms 25.3645ms 39.4251 Ops/s 39.7153 Ops/s $\color{#d91a1a}-0.73\%$
test_cql_speed[False-backward] 39.0133ms 34.8534ms 28.6916 Ops/s 29.2528 Ops/s $\color{#d91a1a}-1.92\%$
test_cql_speed[True-None] 11.2065ms 10.8208ms 92.4147 Ops/s 92.5663 Ops/s $\color{#d91a1a}-0.16\%$
test_cql_speed[True-backward] 16.8731ms 16.6028ms 60.2309 Ops/s 60.7399 Ops/s $\color{#d91a1a}-0.84\%$
test_cql_speed[reduce-overhead-None] 11.1487ms 10.8843ms 91.8753 Ops/s 92.2892 Ops/s $\color{#d91a1a}-0.45\%$
test_cql_speed[reduce-overhead-backward] 16.9206ms 16.5642ms 60.3712 Ops/s 61.0470 Ops/s $\color{#d91a1a}-1.11\%$
test_a2c_speed[False-None] 5.6113ms 5.3561ms 186.7033 Ops/s 185.5709 Ops/s $\color{#35bf28}+0.61\%$
test_a2c_speed[False-backward] 11.9419ms 11.6439ms 85.8821 Ops/s 84.5695 Ops/s $\color{#35bf28}+1.55\%$
test_a2c_speed[True-None] 3.3196ms 3.0191ms 331.2294 Ops/s 331.3734 Ops/s $\color{#d91a1a}-0.04\%$
test_a2c_speed[True-backward] 8.8132ms 8.5316ms 117.2119 Ops/s 108.1328 Ops/s $\textbf{\color{#35bf28}+8.40\%}$
test_a2c_speed[reduce-overhead-None] 3.1806ms 3.0286ms 330.1853 Ops/s 327.1372 Ops/s $\color{#35bf28}+0.93\%$
test_a2c_speed[reduce-overhead-backward] 8.7657ms 8.4991ms 117.6601 Ops/s 118.7880 Ops/s $\color{#d91a1a}-0.95\%$
test_ppo_speed[False-None] 5.8196ms 5.5395ms 180.5221 Ops/s 174.2414 Ops/s $\color{#35bf28}+3.60\%$
test_ppo_speed[False-backward] 12.3878ms 12.0459ms 83.0157 Ops/s 81.2422 Ops/s $\color{#35bf28}+2.18\%$
test_ppo_speed[True-None] 3.6209ms 3.4550ms 289.4321 Ops/s 285.7002 Ops/s $\color{#35bf28}+1.31\%$
test_ppo_speed[True-backward] 8.3636ms 8.1667ms 122.4484 Ops/s 122.2084 Ops/s $\color{#35bf28}+0.20\%$
test_ppo_speed[reduce-overhead-None] 3.6146ms 3.4481ms 290.0151 Ops/s 286.1752 Ops/s $\color{#35bf28}+1.34\%$
test_ppo_speed[reduce-overhead-backward] 8.4948ms 8.2426ms 121.3213 Ops/s 121.1171 Ops/s $\color{#35bf28}+0.17\%$
test_reinforce_speed[False-None] 4.9882ms 4.4380ms 225.3288 Ops/s 220.3451 Ops/s $\color{#35bf28}+2.26\%$
test_reinforce_speed[False-backward] 7.8178ms 7.2839ms 137.2891 Ops/s 138.1718 Ops/s $\color{#d91a1a}-0.64\%$
test_reinforce_speed[True-None] 2.3515ms 2.2153ms 451.4091 Ops/s 448.9679 Ops/s $\color{#35bf28}+0.54\%$
test_reinforce_speed[True-backward] 7.4281ms 7.0727ms 141.3883 Ops/s 118.1327 Ops/s $\textbf{\color{#35bf28}+19.69\%}$
test_reinforce_speed[reduce-overhead-None] 2.4498ms 2.2278ms 448.8765 Ops/s 445.3594 Ops/s $\color{#35bf28}+0.79\%$
test_reinforce_speed[reduce-overhead-backward] 7.4984ms 7.1333ms 140.1884 Ops/s 141.7125 Ops/s $\color{#d91a1a}-1.08\%$
test_iql_speed[False-None] 20.8861ms 19.4845ms 51.3228 Ops/s 50.4789 Ops/s $\color{#35bf28}+1.67\%$
test_iql_speed[False-backward] 30.5193ms 29.7255ms 33.6411 Ops/s 33.3283 Ops/s $\color{#35bf28}+0.94\%$
test_iql_speed[True-None] 7.1689ms 6.7265ms 148.6649 Ops/s 148.4977 Ops/s $\color{#35bf28}+0.11\%$
test_iql_speed[True-backward] 15.9689ms 15.3784ms 65.0262 Ops/s 63.7760 Ops/s $\color{#35bf28}+1.96\%$
test_iql_speed[reduce-overhead-None] 7.0541ms 6.7490ms 148.1710 Ops/s 149.5718 Ops/s $\color{#d91a1a}-0.94\%$
test_iql_speed[reduce-overhead-backward] 15.7876ms 15.2892ms 65.4057 Ops/s 64.9228 Ops/s $\color{#35bf28}+0.74\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.4348ms 6.2151ms 160.8983 Ops/s 159.6188 Ops/s $\color{#35bf28}+0.80\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.2952s 0.4484ms 2.2300 KOps/s 2.8197 KOps/s $\textbf{\color{#d91a1a}-20.91\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4203ms 0.2197ms 4.5507 KOps/s 3.4479 KOps/s $\textbf{\color{#35bf28}+31.98\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.4889ms 6.2318ms 160.4674 Ops/s 163.5400 Ops/s $\color{#d91a1a}-1.88\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.0946ms 0.2775ms 3.6040 KOps/s 3.0663 KOps/s $\textbf{\color{#35bf28}+17.53\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5091ms 0.2525ms 3.9610 KOps/s 3.2433 KOps/s $\textbf{\color{#35bf28}+22.13\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.4525ms 1.2372ms 808.3069 Ops/s 716.7932 Ops/s $\textbf{\color{#35bf28}+12.77\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.5577ms 1.2535ms 797.7779 Ops/s 738.5569 Ops/s $\textbf{\color{#35bf28}+8.02\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5060ms 6.4005ms 156.2373 Ops/s 159.6125 Ops/s $\color{#d91a1a}-2.11\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.9189ms 0.4260ms 2.3472 KOps/s 2.1338 KOps/s $\textbf{\color{#35bf28}+10.00\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6481ms 0.4002ms 2.4990 KOps/s 2.2122 KOps/s $\textbf{\color{#35bf28}+12.97\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.3648ms 6.2639ms 159.6449 Ops/s 163.5341 Ops/s $\color{#d91a1a}-2.38\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.9291ms 0.3409ms 2.9336 KOps/s 4.1246 KOps/s $\textbf{\color{#d91a1a}-28.88\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5639ms 0.2934ms 3.4077 KOps/s 4.5859 KOps/s $\textbf{\color{#d91a1a}-25.69\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 9.8467ms 6.2093ms 161.0479 Ops/s 162.5700 Ops/s $\color{#d91a1a}-0.94\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8068ms 0.2716ms 3.6813 KOps/s 4.2029 KOps/s $\textbf{\color{#d91a1a}-12.41\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4573ms 0.2156ms 4.6390 KOps/s 4.6432 KOps/s $\color{#d91a1a}-0.09\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.8102ms 6.4341ms 155.4227 Ops/s 154.2962 Ops/s $\color{#35bf28}+0.73\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.4025ms 0.4144ms 2.4129 KOps/s 2.5751 KOps/s $\textbf{\color{#d91a1a}-6.30\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7857ms 0.3954ms 2.5289 KOps/s 2.7201 KOps/s $\textbf{\color{#d91a1a}-7.03\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.9799ms 5.3088ms 188.3671 Ops/s 184.2846 Ops/s $\color{#35bf28}+2.22\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.2499ms 1.9913ms 502.1869 Ops/s 453.1663 Ops/s $\textbf{\color{#35bf28}+10.82\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 8.4390ms 1.2620ms 792.4026 Ops/s 799.8761 Ops/s $\color{#d91a1a}-0.93\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4078s 13.4513ms 74.3422 Ops/s 180.2077 Ops/s $\textbf{\color{#d91a1a}-58.75\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.5270ms 2.0706ms 482.9571 Ops/s 479.7881 Ops/s $\color{#35bf28}+0.66\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.0976ms 1.1989ms 834.0707 Ops/s 816.7118 Ops/s $\color{#35bf28}+2.13\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.2180ms 5.5333ms 180.7235 Ops/s 176.9021 Ops/s $\color{#35bf28}+2.16\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 9.8762ms 2.1755ms 459.6738 Ops/s 404.5056 Ops/s $\textbf{\color{#35bf28}+13.64\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.9109ms 1.3736ms 727.9931 Ops/s 730.5017 Ops/s $\color{#d91a1a}-0.34\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants