[CI] Fix windows compile tests #2511

Merged: 1 commit, Oct 22, 2024

vmoens (Contributor) commented Oct 22, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot bot commented Oct 22, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2511

Note: Links to docs will display an error until the docs builds have been completed.

❌ 9 New Failures, 4 Unrelated Failures

As of commit 6c71b70 with merge base baba52b (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Oct 22, 2024
ghstack-source-id: 2ab8ae3907a9b0c2acd4d383d929074c5a4a022e
Pull Request resolved: #2511
@facebook-github-bot added the `CLA Signed` label (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) Oct 22, 2024
@vmoens added the `Tests` (incomplete or broken unit tests) and `CI` (has to do with CI setup, e.g. wheels & builds, tests...) labels Oct 22, 2024
@vmoens merged commit 6c71b70 into gh/vmoens/35/base Oct 22, 2024
50 of 59 checks passed
vmoens added a commit that referenced this pull request Oct 22, 2024
ghstack-source-id: 2ab8ae3907a9b0c2acd4d383d929074c5a4a022e
Pull Request resolved: #2511
@vmoens deleted the gh/vmoens/35/head branch October 22, 2024 06:53
@vmoens changed the title from "[CI] Fix winndows compile tests" to "[CI] Fix windows compile tests" Oct 22, 2024

⚠ Warning: Result of CPU Benchmark Tests

Total Benchmarks: 143. Improved: 9. Worsened: 9.

Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4246s 0.4222s 2.3685 Ops/s 2.2784 Ops/s $\color{#35bf28}+3.95\%$
test_transformed 0.7209s 0.6251s 1.5996 Ops/s 1.6664 Ops/s $\color{#d91a1a}-4.01\%$
test_serial 1.4551s 1.3700s 0.7299 Ops/s 0.7356 Ops/s $\color{#d91a1a}-0.77\%$
test_parallel 1.4432s 1.3396s 0.7465 Ops/s 0.7456 Ops/s $\color{#35bf28}+0.11\%$
test_step_mdp_speed[True-True-True-True-True] 95.2050μs 28.3006μs 35.3349 KOps/s 34.4044 KOps/s $\color{#35bf28}+2.70\%$
test_step_mdp_speed[True-True-True-True-False] 46.4560μs 17.0533μs 58.6396 KOps/s 56.8620 KOps/s $\color{#35bf28}+3.13\%$
test_step_mdp_speed[True-True-True-False-True] 77.3840μs 15.8769μs 62.9844 KOps/s 60.5817 KOps/s $\color{#35bf28}+3.97\%$
test_step_mdp_speed[True-True-True-False-False] 30.4270μs 9.3882μs 106.5164 KOps/s 103.0498 KOps/s $\color{#35bf28}+3.36\%$
test_step_mdp_speed[True-True-False-True-True] 76.7930μs 30.8581μs 32.4064 KOps/s 31.6212 KOps/s $\color{#35bf28}+2.48\%$
test_step_mdp_speed[True-True-False-True-False] 63.4480μs 19.2926μs 51.8334 KOps/s 51.0917 KOps/s $\color{#35bf28}+1.45\%$
test_step_mdp_speed[True-True-False-False-True] 79.0070μs 18.0420μs 55.4263 KOps/s 53.8478 KOps/s $\color{#35bf28}+2.93\%$
test_step_mdp_speed[True-True-False-False-False] 62.8570μs 11.5656μs 86.4636 KOps/s 83.5473 KOps/s $\color{#35bf28}+3.49\%$
test_step_mdp_speed[True-False-True-True-True] 92.4830μs 32.8747μs 30.4185 KOps/s 29.9680 KOps/s $\color{#35bf28}+1.50\%$
test_step_mdp_speed[True-False-True-True-False] 82.4530μs 21.5310μs 46.4446 KOps/s 46.3764 KOps/s $\color{#35bf28}+0.15\%$
test_step_mdp_speed[True-False-True-False-True] 55.6330μs 18.2001μs 54.9448 KOps/s 53.9032 KOps/s $\color{#35bf28}+1.93\%$
test_step_mdp_speed[True-False-True-False-False] 59.0000μs 11.5814μs 86.3456 KOps/s 84.1387 KOps/s $\color{#35bf28}+2.62\%$
test_step_mdp_speed[True-False-False-True-True] 75.0190μs 35.1159μs 28.4771 KOps/s 27.9065 KOps/s $\color{#35bf28}+2.04\%$
test_step_mdp_speed[True-False-False-True-False] 73.6570μs 23.3752μs 42.7803 KOps/s 42.2845 KOps/s $\color{#35bf28}+1.17\%$
test_step_mdp_speed[True-False-False-False-True] 50.2840μs 20.1122μs 49.7210 KOps/s 48.6376 KOps/s $\color{#35bf28}+2.23\%$
test_step_mdp_speed[True-False-False-False-False] 67.8270μs 13.4275μs 74.4738 KOps/s 71.0634 KOps/s $\color{#35bf28}+4.80\%$
test_step_mdp_speed[False-True-True-True-True] 94.0850μs 33.0516μs 30.2558 KOps/s 29.7160 KOps/s $\color{#35bf28}+1.82\%$
test_step_mdp_speed[False-True-True-True-False] 51.6370μs 21.2420μs 47.0766 KOps/s 45.4075 KOps/s $\color{#35bf28}+3.68\%$
test_step_mdp_speed[False-True-True-False-True] 78.7170μs 21.0079μs 47.6012 KOps/s 45.4025 KOps/s $\color{#35bf28}+4.84\%$
test_step_mdp_speed[False-True-True-False-False] 41.8580μs 13.1509μs 76.0402 KOps/s 72.3874 KOps/s $\textbf{\color{#35bf28}+5.05\%}$
test_step_mdp_speed[False-True-False-True-True] 91.6410μs 34.7922μs 28.7421 KOps/s 29.2415 KOps/s $\color{#d91a1a}-1.71\%$
test_step_mdp_speed[False-True-False-True-False] 58.9000μs 23.5587μs 42.4472 KOps/s 42.6515 KOps/s $\color{#d91a1a}-0.48\%$
test_step_mdp_speed[False-True-False-False-True] 2.7171ms 22.8500μs 43.7636 KOps/s 42.4341 KOps/s $\color{#35bf28}+3.13\%$
test_step_mdp_speed[False-True-False-False-False] 61.2440μs 15.2148μs 65.7256 KOps/s 64.1269 KOps/s $\color{#35bf28}+2.49\%$
test_step_mdp_speed[False-False-True-True-True] 89.4660μs 36.9887μs 27.0352 KOps/s 27.0415 KOps/s $\color{#d91a1a}-0.02\%$
test_step_mdp_speed[False-False-True-True-False] 84.4070μs 25.3446μs 39.4562 KOps/s 38.8092 KOps/s $\color{#35bf28}+1.67\%$
test_step_mdp_speed[False-False-True-False-True] 50.8250μs 23.1831μs 43.1349 KOps/s 42.6734 KOps/s $\color{#35bf28}+1.08\%$
test_step_mdp_speed[False-False-True-False-False] 67.2650μs 15.2271μs 65.6725 KOps/s 64.0670 KOps/s $\color{#35bf28}+2.51\%$
test_step_mdp_speed[False-False-False-True-True] 0.1000ms 38.7230μs 25.8244 KOps/s 25.2402 KOps/s $\color{#35bf28}+2.31\%$
test_step_mdp_speed[False-False-False-True-False] 82.6240μs 27.2564μs 36.6887 KOps/s 36.2994 KOps/s $\color{#35bf28}+1.07\%$
test_step_mdp_speed[False-False-False-False-True] 84.4100μs 24.4162μs 40.9564 KOps/s 39.3922 KOps/s $\color{#35bf28}+3.97\%$
test_step_mdp_speed[False-False-False-False-False] 56.3350μs 16.7817μs 59.5889 KOps/s 56.8032 KOps/s $\color{#35bf28}+4.90\%$
test_values[generalized_advantage_estimate-True-True] 10.9896ms 9.7392ms 102.6783 Ops/s 102.7688 Ops/s $\color{#d91a1a}-0.09\%$
test_values[vec_generalized_advantage_estimate-True-True] 53.4865ms 41.8559ms 23.8915 Ops/s 29.8092 Ops/s $\textbf{\color{#d91a1a}-19.85\%}$
test_values[td0_return_estimate-False-False] 0.2460ms 0.1897ms 5.2717 KOps/s 5.5734 KOps/s $\textbf{\color{#d91a1a}-5.41\%}$
test_values[td1_return_estimate-False-False] 27.5666ms 24.4549ms 40.8915 Ops/s 41.3230 Ops/s $\color{#d91a1a}-1.04\%$
test_values[vec_td1_return_estimate-False-False] 41.4247ms 36.4217ms 27.4561 Ops/s 29.6940 Ops/s $\textbf{\color{#d91a1a}-7.54\%}$
test_values[td_lambda_return_estimate-True-False] 36.3798ms 34.9750ms 28.5918 Ops/s 28.5887 Ops/s $\color{#35bf28}+0.01\%$
test_values[vec_td_lambda_return_estimate-True-False] 38.4052ms 36.5475ms 27.3617 Ops/s 29.7484 Ops/s $\textbf{\color{#d91a1a}-8.02\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 11.3986ms 8.2815ms 120.7517 Ops/s 119.3308 Ops/s $\color{#35bf28}+1.19\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.5555ms 1.9843ms 503.9579 Ops/s 519.1866 Ops/s $\color{#d91a1a}-2.93\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5667ms 0.3629ms 2.7556 KOps/s 2.8255 KOps/s $\color{#d91a1a}-2.47\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 50.5121ms 48.0377ms 20.8170 Ops/s 23.1236 Ops/s $\textbf{\color{#d91a1a}-9.98\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.9996ms 3.0740ms 325.3061 Ops/s 326.0520 Ops/s $\color{#d91a1a}-0.23\%$
test_dqn_speed[False-None] 6.0551ms 1.3859ms 721.5711 Ops/s 725.1467 Ops/s $\color{#d91a1a}-0.49\%$
test_dqn_speed[False-backward] 1.9506ms 1.8425ms 542.7309 Ops/s 535.4416 Ops/s $\color{#35bf28}+1.36\%$
test_dqn_speed[True-None] 0.7368ms 0.4676ms 2.1385 KOps/s 2.1349 KOps/s $\color{#35bf28}+0.17\%$
test_dqn_speed[True-backward] 0.9910ms 0.9040ms 1.1062 KOps/s 1.1010 KOps/s $\color{#35bf28}+0.48\%$
test_dqn_speed[reduce-overhead-None] 0.7182ms 0.4739ms 2.1101 KOps/s 2.1138 KOps/s $\color{#d91a1a}-0.18\%$
test_dqn_speed[reduce-overhead-backward] 0.9512ms 0.8894ms 1.1243 KOps/s 1.1297 KOps/s $\color{#d91a1a}-0.47\%$
test_ddpg_speed[False-None] 3.5097ms 2.8302ms 353.3317 Ops/s 349.8132 Ops/s $\color{#35bf28}+1.01\%$
test_ddpg_speed[False-backward] 4.1909ms 3.9291ms 254.5143 Ops/s 251.3689 Ops/s $\color{#35bf28}+1.25\%$
test_ddpg_speed[True-None] 1.7382ms 1.0165ms 983.7685 Ops/s 985.7464 Ops/s $\color{#d91a1a}-0.20\%$
test_ddpg_speed[True-backward] 2.2048ms 1.9465ms 513.7457 Ops/s 523.6933 Ops/s $\color{#d91a1a}-1.90\%$
test_ddpg_speed[reduce-overhead-None] 1.3396ms 1.0187ms 981.6806 Ops/s 993.6318 Ops/s $\color{#d91a1a}-1.20\%$
test_ddpg_speed[reduce-overhead-backward] 1.9985ms 1.9100ms 523.5594 Ops/s 523.2638 Ops/s $\color{#35bf28}+0.06\%$
test_sac_speed[False-None] 9.1001ms 8.1035ms 123.4041 Ops/s 122.3497 Ops/s $\color{#35bf28}+0.86\%$
test_sac_speed[False-backward] 11.3428ms 10.8449ms 92.2095 Ops/s 91.9784 Ops/s $\color{#35bf28}+0.25\%$
test_sac_speed[True-None] 2.1520ms 1.8622ms 537.0135 Ops/s 532.8788 Ops/s $\color{#35bf28}+0.78\%$
test_sac_speed[True-backward] 3.7204ms 3.5755ms 279.6781 Ops/s 247.2042 Ops/s $\textbf{\color{#35bf28}+13.14\%}$
test_sac_speed[reduce-overhead-None] 2.1515ms 1.8775ms 532.6355 Ops/s 525.0863 Ops/s $\color{#35bf28}+1.44\%$
test_sac_speed[reduce-overhead-backward] 3.6964ms 3.5780ms 279.4862 Ops/s 271.3250 Ops/s $\color{#35bf28}+3.01\%$
test_redq_speed[False-None] 20.4029ms 13.4693ms 74.2429 Ops/s 74.0183 Ops/s $\color{#35bf28}+0.30\%$
test_redq_speed[False-backward] 32.4186ms 23.0016ms 43.4752 Ops/s 43.7563 Ops/s $\color{#d91a1a}-0.64\%$
test_redq_speed[True-None] 6.7176ms 5.0104ms 199.5850 Ops/s 179.0234 Ops/s $\textbf{\color{#35bf28}+11.49\%}$
test_redq_speed[True-backward] 14.8277ms 12.7146ms 78.6497 Ops/s 78.2232 Ops/s $\color{#35bf28}+0.55\%$
test_redq_speed[reduce-overhead-None] 6.3103ms 5.0359ms 198.5726 Ops/s 193.2499 Ops/s $\color{#35bf28}+2.75\%$
test_redq_speed[reduce-overhead-backward] 13.1226ms 12.6592ms 78.9940 Ops/s 79.2001 Ops/s $\color{#d91a1a}-0.26\%$
test_redq_deprec_speed[False-None] 14.9391ms 13.3485ms 74.9150 Ops/s 72.2538 Ops/s $\color{#35bf28}+3.68\%$
test_redq_deprec_speed[False-backward] 19.5807ms 18.7613ms 53.3012 Ops/s 50.1774 Ops/s $\textbf{\color{#35bf28}+6.23\%}$
test_redq_deprec_speed[True-None] 4.3543ms 3.7410ms 267.3097 Ops/s 258.0248 Ops/s $\color{#35bf28}+3.60\%$
test_redq_deprec_speed[True-backward] 8.6867ms 8.2886ms 120.6471 Ops/s 112.5734 Ops/s $\textbf{\color{#35bf28}+7.17\%}$
test_redq_deprec_speed[reduce-overhead-None] 4.5059ms 3.7052ms 269.8940 Ops/s 255.8072 Ops/s $\textbf{\color{#35bf28}+5.51\%}$
test_redq_deprec_speed[reduce-overhead-backward] 8.8054ms 8.3158ms 120.2532 Ops/s 115.0049 Ops/s $\color{#35bf28}+4.56\%$
test_td3_speed[False-None] 8.7915ms 8.1127ms 123.2630 Ops/s 123.9217 Ops/s $\color{#d91a1a}-0.53\%$
test_td3_speed[False-backward] 18.0497ms 10.8290ms 92.3442 Ops/s 94.7501 Ops/s $\color{#d91a1a}-2.54\%$
test_td3_speed[True-None] 2.0655ms 1.7853ms 560.1208 Ops/s 559.9670 Ops/s $\color{#35bf28}+0.03\%$
test_td3_speed[True-backward] 4.1686ms 3.5966ms 278.0442 Ops/s 269.8287 Ops/s $\color{#35bf28}+3.04\%$
test_td3_speed[reduce-overhead-None] 1.8830ms 1.7821ms 561.1390 Ops/s 560.8252 Ops/s $\color{#35bf28}+0.06\%$
test_td3_speed[reduce-overhead-backward] 3.5091ms 3.4006ms 294.0625 Ops/s 288.9318 Ops/s $\color{#35bf28}+1.78\%$
test_cql_speed[False-None] 38.7540ms 36.2566ms 27.5812 Ops/s 27.0159 Ops/s $\color{#35bf28}+2.09\%$
test_cql_speed[False-backward] 0.3322s 53.9254ms 18.5441 Ops/s 21.0372 Ops/s $\textbf{\color{#d91a1a}-11.85\%}$
test_cql_speed[True-None] 17.9486ms 15.9942ms 62.5225 Ops/s 62.1482 Ops/s $\color{#35bf28}+0.60\%$
test_cql_speed[True-backward] 29.4957ms 23.1563ms 43.1849 Ops/s 43.6782 Ops/s $\color{#d91a1a}-1.13\%$
test_cql_speed[reduce-overhead-None] 17.3710ms 15.9921ms 62.5307 Ops/s 62.1911 Ops/s $\color{#35bf28}+0.55\%$
test_cql_speed[reduce-overhead-backward] 24.8246ms 22.7755ms 43.9069 Ops/s 43.4766 Ops/s $\color{#35bf28}+0.99\%$
test_a2c_speed[False-None] 8.7175ms 7.3100ms 136.7994 Ops/s 135.1420 Ops/s $\color{#35bf28}+1.23\%$
test_a2c_speed[False-backward] 18.0238ms 15.1209ms 66.1335 Ops/s 67.7573 Ops/s $\color{#d91a1a}-2.40\%$
test_a2c_speed[True-None] 4.0197ms 3.3447ms 298.9768 Ops/s 293.9224 Ops/s $\color{#35bf28}+1.72\%$
test_a2c_speed[True-backward] 11.4523ms 10.1948ms 98.0895 Ops/s 98.3384 Ops/s $\color{#d91a1a}-0.25\%$
test_a2c_speed[reduce-overhead-None] 3.9377ms 3.3737ms 296.4077 Ops/s 298.3618 Ops/s $\color{#d91a1a}-0.65\%$
test_a2c_speed[reduce-overhead-backward] 10.4754ms 10.0241ms 99.7593 Ops/s 101.1126 Ops/s $\color{#d91a1a}-1.34\%$
test_ppo_speed[False-None] 8.8961ms 7.5848ms 131.8423 Ops/s 131.6502 Ops/s $\color{#35bf28}+0.15\%$
test_ppo_speed[False-backward] 15.6048ms 15.2172ms 65.7152 Ops/s 67.0598 Ops/s $\color{#d91a1a}-2.00\%$
test_ppo_speed[True-None] 4.1120ms 3.7419ms 267.2406 Ops/s 255.4394 Ops/s $\color{#35bf28}+4.62\%$
test_ppo_speed[True-backward] 10.5415ms 9.8432ms 101.5934 Ops/s 98.9611 Ops/s $\color{#35bf28}+2.66\%$
test_ppo_speed[reduce-overhead-None] 4.5249ms 3.7373ms 267.5710 Ops/s 264.5685 Ops/s $\color{#35bf28}+1.13\%$
test_ppo_speed[reduce-overhead-backward] 10.4080ms 9.7363ms 102.7084 Ops/s 96.7938 Ops/s $\textbf{\color{#35bf28}+6.11\%}$
test_reinforce_speed[False-None] 7.7948ms 6.5309ms 153.1182 Ops/s 150.9175 Ops/s $\color{#35bf28}+1.46\%$
test_reinforce_speed[False-backward] 11.3584ms 9.9209ms 100.7973 Ops/s 99.5172 Ops/s $\color{#35bf28}+1.29\%$
test_reinforce_speed[True-None] 3.4823ms 2.6900ms 371.7456 Ops/s 367.5662 Ops/s $\color{#35bf28}+1.14\%$
test_reinforce_speed[True-backward] 11.8006ms 9.0237ms 110.8193 Ops/s 113.1031 Ops/s $\color{#d91a1a}-2.02\%$
test_reinforce_speed[reduce-overhead-None] 3.3343ms 2.6868ms 372.1900 Ops/s 366.5562 Ops/s $\color{#35bf28}+1.54\%$
test_reinforce_speed[reduce-overhead-backward] 10.2065ms 8.8845ms 112.5551 Ops/s 111.9488 Ops/s $\color{#35bf28}+0.54\%$
test_iql_speed[False-None] 33.7518ms 32.3441ms 30.9176 Ops/s 30.0946 Ops/s $\color{#35bf28}+2.73\%$
test_iql_speed[False-backward] 47.3147ms 45.3832ms 22.0346 Ops/s 21.6866 Ops/s $\color{#35bf28}+1.60\%$
test_iql_speed[True-None] 12.2503ms 10.9417ms 91.3933 Ops/s 88.7721 Ops/s $\color{#35bf28}+2.95\%$
test_iql_speed[True-backward] 23.0589ms 22.1063ms 45.2360 Ops/s 43.7377 Ops/s $\color{#35bf28}+3.43\%$
test_iql_speed[reduce-overhead-None] 11.6379ms 10.8853ms 91.8666 Ops/s 88.8799 Ops/s $\color{#35bf28}+3.36\%$
test_iql_speed[reduce-overhead-backward] 23.3194ms 22.4661ms 44.5114 Ops/s 44.2656 Ops/s $\color{#35bf28}+0.56\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5089ms 5.0587ms 197.6797 Ops/s 192.1538 Ops/s $\color{#35bf28}+2.88\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7291ms 0.4925ms 2.0304 KOps/s 2.0432 KOps/s $\color{#d91a1a}-0.62\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7671ms 0.4616ms 2.1664 KOps/s 2.1582 KOps/s $\color{#35bf28}+0.38\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.3172ms 4.9147ms 203.4722 Ops/s 200.3691 Ops/s $\color{#35bf28}+1.55\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.1763ms 0.4857ms 2.0587 KOps/s 2.0592 KOps/s $\color{#d91a1a}-0.03\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 1.2923ms 0.4762ms 2.0999 KOps/s 2.1666 KOps/s $\color{#d91a1a}-3.08\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 3.0417ms 1.6073ms 622.1620 Ops/s 621.4570 Ops/s $\color{#35bf28}+0.11\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.8472ms 1.5445ms 647.4662 Ops/s 645.3689 Ops/s $\color{#35bf28}+0.32\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.8039ms 4.9699ms 201.2127 Ops/s 196.2712 Ops/s $\color{#35bf28}+2.52\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.9830ms 0.6225ms 1.6063 KOps/s 1.5785 KOps/s $\color{#35bf28}+1.76\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.1285ms 0.6077ms 1.6456 KOps/s 1.6725 KOps/s $\color{#d91a1a}-1.61\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.7594ms 4.9382ms 202.5010 Ops/s 200.4922 Ops/s $\color{#35bf28}+1.00\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.2037ms 0.5156ms 1.9393 KOps/s 2.0538 KOps/s $\textbf{\color{#d91a1a}-5.57\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6000ms 0.4635ms 2.1573 KOps/s 2.0782 KOps/s $\color{#35bf28}+3.81\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.4763ms 4.8252ms 207.2461 Ops/s 206.2956 Ops/s $\color{#35bf28}+0.46\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.0495ms 0.4916ms 2.0341 KOps/s 2.0226 KOps/s $\color{#35bf28}+0.57\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7069ms 0.4667ms 2.1428 KOps/s 2.1488 KOps/s $\color{#d91a1a}-0.28\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.6685ms 4.9882ms 200.4719 Ops/s 193.2841 Ops/s $\color{#35bf28}+3.72\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.8053ms 0.6344ms 1.5763 KOps/s 1.5669 KOps/s $\color{#35bf28}+0.60\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8180ms 0.5959ms 1.6782 KOps/s 1.6136 KOps/s $\color{#35bf28}+4.00\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.5784ms 4.2585ms 234.8272 Ops/s 222.8423 Ops/s $\textbf{\color{#35bf28}+5.38\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.7760ms 2.3531ms 424.9670 Ops/s 437.4877 Ops/s $\color{#d91a1a}-2.86\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.2766ms 1.2839ms 778.8871 Ops/s 749.4606 Ops/s $\color{#35bf28}+3.93\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4281s 12.7130ms 78.6598 Ops/s 224.6571 Ops/s $\textbf{\color{#d91a1a}-64.99\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.3250ms 2.3594ms 423.8361 Ops/s 435.4836 Ops/s $\color{#d91a1a}-2.67\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 1.9430ms 1.2595ms 793.9550 Ops/s 727.9041 Ops/s $\textbf{\color{#35bf28}+9.07\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 6.0930ms 4.4680ms 223.8155 Ops/s 214.3720 Ops/s $\color{#35bf28}+4.41\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 6.3674ms 2.4432ms 409.3043 Ops/s 421.6996 Ops/s $\color{#d91a1a}-2.94\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.5006ms 1.5094ms 662.5174 Ops/s 700.5298 Ops/s $\textbf{\color{#d91a1a}-5.43\%}$
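For readers checking the table by hand, the columns appear to be related as follows: `Ops` is the reciprocal of `Mean`, and `Change` is the relative difference between `Ops` and `Ops on Repo HEAD`. The sketch below recomputes both for a few rows copied from the CPU table. The function names are hypothetical, and the 5% cutoff for counting a benchmark as improved or worsened is an assumption inferred from which rows are rendered in bold, not something the benchmark bot states here.

```python
def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean run time (Ops = 1 / Mean)."""
    return 1.0 / mean_seconds


def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percentage change of the PR's throughput relative to repo HEAD."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a benchmark as improved, worsened, or unchanged.

    threshold_pct=5.0 is an assumption inferred from the bold rows in the
    table; the actual cutoff used by the benchmark bot is not given here.
    """
    if change_pct >= threshold_pct:
        return "improved"
    if change_pct <= -threshold_pct:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table above.
rows = {
    "test_simple": (2.3685, 2.2784),
    "test_values[vec_generalized_advantage_estimate-True-True]": (23.8915, 29.8092),
    "test_sac_speed[True-backward]": (279.6781, 247.2042),
}

for name, (ops_pr, ops_head) in rows.items():
    change = relative_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Under that assumed threshold, the CPU table contains 9 rows at or above +5% and 9 rows at or below -5%, which is consistent with the "Improved: 9. Worsened: 9." summary.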


⚠ Warning: Result of GPU Benchmark Tests

Total Benchmarks: 143. Improved: 15. Worsened: 4.

Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7196s 0.7192s 1.3904 Ops/s 1.3891 Ops/s $\color{#35bf28}+0.09\%$
test_transformed 1.0509s 0.9766s 1.0240 Ops/s 1.0463 Ops/s $\color{#d91a1a}-2.13\%$
test_serial 2.1894s 2.1106s 0.4738 Ops/s 0.4789 Ops/s $\color{#d91a1a}-1.07\%$
test_parallel 2.1404s 2.0482s 0.4882 Ops/s 0.4971 Ops/s $\color{#d91a1a}-1.78\%$
test_step_mdp_speed[True-True-True-True-True] 0.1816ms 39.2884μs 25.4528 KOps/s 25.7202 KOps/s $\color{#d91a1a}-1.04\%$
test_step_mdp_speed[True-True-True-True-False] 47.2220μs 23.2446μs 43.0207 KOps/s 43.0495 KOps/s $\color{#d91a1a}-0.07\%$
test_step_mdp_speed[True-True-True-False-True] 63.8930μs 21.4235μs 46.6777 KOps/s 46.9296 KOps/s $\color{#d91a1a}-0.54\%$
test_step_mdp_speed[True-True-True-False-False] 39.9920μs 12.4508μs 80.3158 KOps/s 79.7943 KOps/s $\color{#35bf28}+0.65\%$
test_step_mdp_speed[True-True-False-True-True] 71.9540μs 42.2521μs 23.6674 KOps/s 23.7665 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[True-True-False-True-False] 61.1130μs 25.8020μs 38.7567 KOps/s 38.9059 KOps/s $\color{#d91a1a}-0.38\%$
test_step_mdp_speed[True-True-False-False-True] 52.5930μs 24.3004μs 41.1516 KOps/s 41.7896 KOps/s $\color{#d91a1a}-1.53\%$
test_step_mdp_speed[True-True-False-False-False] 48.8730μs 15.2058μs 65.7645 KOps/s 65.0887 KOps/s $\color{#35bf28}+1.04\%$
test_step_mdp_speed[True-False-True-True-True] 68.5730μs 44.9040μs 22.2697 KOps/s 22.7092 KOps/s $\color{#d91a1a}-1.94\%$
test_step_mdp_speed[True-False-True-True-False] 57.1620μs 28.6201μs 34.9405 KOps/s 35.0079 KOps/s $\color{#d91a1a}-0.19\%$
test_step_mdp_speed[True-False-True-False-True] 53.2220μs 24.2038μs 41.3159 KOps/s 41.6258 KOps/s $\color{#d91a1a}-0.74\%$
test_step_mdp_speed[True-False-True-False-False] 39.7020μs 15.0077μs 66.6326 KOps/s 66.5727 KOps/s $\color{#35bf28}+0.09\%$
test_step_mdp_speed[True-False-False-True-True] 76.6740μs 47.1375μs 21.2145 KOps/s 21.4758 KOps/s $\color{#d91a1a}-1.22\%$
test_step_mdp_speed[True-False-False-True-False] 96.5750μs 30.4657μs 32.8238 KOps/s 32.4194 KOps/s $\color{#35bf28}+1.25\%$
test_step_mdp_speed[True-False-False-False-True] 58.6830μs 26.6631μs 37.5051 KOps/s 37.8619 KOps/s $\color{#d91a1a}-0.94\%$
test_step_mdp_speed[True-False-False-False-False] 48.0620μs 17.8385μs 56.0584 KOps/s 55.9527 KOps/s $\color{#35bf28}+0.19\%$
test_step_mdp_speed[False-True-True-True-True] 0.1008ms 44.0843μs 22.6838 KOps/s 22.5634 KOps/s $\color{#35bf28}+0.53\%$
test_step_mdp_speed[False-True-True-True-False] 60.2330μs 28.6476μs 34.9069 KOps/s 35.2075 KOps/s $\color{#d91a1a}-0.85\%$
test_step_mdp_speed[False-True-True-False-True] 57.6330μs 28.5741μs 34.9967 KOps/s 33.9738 KOps/s $\color{#35bf28}+3.01\%$
test_step_mdp_speed[False-True-True-False-False] 44.2720μs 17.4442μs 57.3257 KOps/s 55.4850 KOps/s $\color{#35bf28}+3.32\%$
test_step_mdp_speed[False-True-False-True-True] 82.3940μs 47.8612μs 20.8937 KOps/s 20.9161 KOps/s $\color{#d91a1a}-0.11\%$
test_step_mdp_speed[False-True-False-True-False] 67.6330μs 31.2973μs 31.9517 KOps/s 31.9100 KOps/s $\color{#35bf28}+0.13\%$
test_step_mdp_speed[False-True-False-False-True] 3.2909ms 31.5506μs 31.6952 KOps/s 31.9576 KOps/s $\color{#d91a1a}-0.82\%$
test_step_mdp_speed[False-True-False-False-False] 44.8020μs 20.2593μs 49.3600 KOps/s 49.3209 KOps/s $\color{#35bf28}+0.08\%$
test_step_mdp_speed[False-False-True-True-True] 82.9650μs 49.9542μs 20.0183 KOps/s 19.8589 KOps/s $\color{#35bf28}+0.80\%$
test_step_mdp_speed[False-False-True-True-False] 61.1630μs 33.6265μs 29.7385 KOps/s 29.5648 KOps/s $\color{#35bf28}+0.59\%$
test_step_mdp_speed[False-False-True-False-True] 62.6030μs 32.2011μs 31.0548 KOps/s 32.2027 KOps/s $\color{#d91a1a}-3.56\%$
test_step_mdp_speed[False-False-True-False-False] 49.0830μs 20.4487μs 48.9030 KOps/s 49.0135 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[False-False-False-True-True] 85.5540μs 51.6912μs 19.3457 KOps/s 19.2967 KOps/s $\color{#35bf28}+0.25\%$
test_step_mdp_speed[False-False-False-True-False] 69.0830μs 36.1451μs 27.6662 KOps/s 27.8798 KOps/s $\color{#d91a1a}-0.77\%$
test_step_mdp_speed[False-False-False-False-True] 55.9820μs 33.3494μs 29.9855 KOps/s 29.5716 KOps/s $\color{#35bf28}+1.40\%$
test_step_mdp_speed[False-False-False-False-False] 54.1730μs 22.8833μs 43.7000 KOps/s 44.4720 KOps/s $\color{#d91a1a}-1.74\%$
test_values[generalized_advantage_estimate-True-True] 24.3824ms 23.6007ms 42.3715 Ops/s 41.9631 Ops/s $\color{#35bf28}+0.97\%$
test_values[vec_generalized_advantage_estimate-True-True] 93.6725ms 2.7429ms 364.5782 Ops/s 357.2061 Ops/s $\color{#35bf28}+2.06\%$
test_values[td0_return_estimate-False-False] 82.7440μs 62.6851μs 15.9527 KOps/s 15.3617 KOps/s $\color{#35bf28}+3.85\%$
test_values[td1_return_estimate-False-False] 53.0789ms 52.5244ms 19.0388 Ops/s 18.8373 Ops/s $\color{#35bf28}+1.07\%$
test_values[vec_td1_return_estimate-False-False] 1.3090ms 1.0497ms 952.6092 Ops/s 952.1151 Ops/s $\color{#35bf28}+0.05\%$
test_values[td_lambda_return_estimate-True-False] 83.8790ms 83.1147ms 12.0316 Ops/s 11.5132 Ops/s $\color{#35bf28}+4.50\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3275ms 1.0468ms 955.2675 Ops/s 949.9064 Ops/s $\color{#35bf28}+0.56\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.8056ms 24.0223ms 41.6280 Ops/s 39.1028 Ops/s $\textbf{\color{#35bf28}+6.46\%}$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 0.9969ms 0.7147ms 1.3992 KOps/s 1.3968 KOps/s $\color{#35bf28}+0.17\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7171ms 0.6360ms 1.5723 KOps/s 1.5570 KOps/s $\color{#35bf28}+0.98\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.4809ms 1.4491ms 690.0809 Ops/s 688.5035 Ops/s $\color{#35bf28}+0.23\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.6919ms 0.6502ms 1.5379 KOps/s 1.5271 KOps/s $\color{#35bf28}+0.71\%$
test_dqn_speed[False-None] 6.6490ms 1.2990ms 769.8208 Ops/s 758.0189 Ops/s $\color{#35bf28}+1.56\%$
test_dqn_speed[False-backward] 1.9382ms 1.8248ms 548.0104 Ops/s 542.2139 Ops/s $\color{#35bf28}+1.07\%$
test_dqn_speed[True-None] 1.1706ms 0.5773ms 1.7322 KOps/s 1.8093 KOps/s $\color{#d91a1a}-4.26\%$
test_dqn_speed[True-backward] 1.5185ms 1.1928ms 838.3689 Ops/s 911.0420 Ops/s $\textbf{\color{#d91a1a}-7.98\%}$
test_dqn_speed[reduce-overhead-None] 0.9327ms 0.5644ms 1.7717 KOps/s 1.7358 KOps/s $\color{#35bf28}+2.07\%$
test_dqn_speed[reduce-overhead-backward] 1.0718ms 0.9947ms 1.0053 KOps/s 977.8660 Ops/s $\color{#35bf28}+2.80\%$
test_ddpg_speed[False-None] 2.9223ms 2.6864ms 372.2391 Ops/s 368.6385 Ops/s $\color{#35bf28}+0.98\%$
test_ddpg_speed[False-backward] 3.9313ms 3.8465ms 259.9763 Ops/s 257.7526 Ops/s $\color{#35bf28}+0.86\%$
test_ddpg_speed[True-None] 1.3016ms 1.2324ms 811.4198 Ops/s 794.9210 Ops/s $\color{#35bf28}+2.08\%$
test_ddpg_speed[True-backward] 2.2261ms 2.1771ms 459.3230 Ops/s 412.6902 Ops/s $\textbf{\color{#35bf28}+11.30\%}$
test_ddpg_speed[reduce-overhead-None] 1.5798ms 1.2523ms 798.5302 Ops/s 798.8116 Ops/s $\color{#d91a1a}-0.04\%$
test_ddpg_speed[reduce-overhead-backward] 2.2435ms 2.1937ms 455.8542 Ops/s 454.0432 Ops/s $\color{#35bf28}+0.40\%$
test_sac_speed[False-None] 8.0433ms 7.4460ms 134.3001 Ops/s 132.7131 Ops/s $\color{#35bf28}+1.20\%$
test_sac_speed[False-backward] 11.0157ms 10.5147ms 95.1046 Ops/s 94.4015 Ops/s $\color{#35bf28}+0.74\%$
test_sac_speed[True-None] 2.3910ms 2.0174ms 495.6923 Ops/s 490.1966 Ops/s $\color{#35bf28}+1.12\%$
test_sac_speed[True-backward] 4.2852ms 3.9162ms 255.3508 Ops/s 222.4287 Ops/s $\textbf{\color{#35bf28}+14.80\%}$
test_sac_speed[reduce-overhead-None] 2.3693ms 2.0195ms 495.1649 Ops/s 484.8210 Ops/s $\color{#35bf28}+2.13\%$
test_sac_speed[reduce-overhead-backward] 3.9928ms 3.9101ms 255.7452 Ops/s 253.1944 Ops/s $\color{#35bf28}+1.01\%$
test_redq_speed[False-None] 14.6828ms 10.0530ms 99.4725 Ops/s 99.5768 Ops/s $\color{#d91a1a}-0.10\%$
test_redq_speed[False-backward] 17.3583ms 16.6604ms 60.0226 Ops/s 60.0770 Ops/s $\color{#d91a1a}-0.09\%$
test_redq_speed[True-None] 3.7614ms 3.5281ms 283.4414 Ops/s 284.8301 Ops/s $\color{#d91a1a}-0.49\%$
test_redq_speed[True-backward] 8.8259ms 8.4617ms 118.1793 Ops/s 118.5804 Ops/s $\color{#d91a1a}-0.34\%$
test_redq_speed[reduce-overhead-None] 3.8717ms 3.4642ms 288.6711 Ops/s 284.0154 Ops/s $\color{#35bf28}+1.64\%$
test_redq_speed[reduce-overhead-backward] 8.8283ms 8.4824ms 117.8907 Ops/s 119.3478 Ops/s $\color{#d91a1a}-1.22\%$
test_redq_deprec_speed[False-None] 10.8292ms 10.4418ms 95.7687 Ops/s 96.3872 Ops/s $\color{#d91a1a}-0.64\%$
test_redq_deprec_speed[False-backward] 15.5903ms 15.1234ms 66.1225 Ops/s 67.3092 Ops/s $\color{#d91a1a}-1.76\%$
test_redq_deprec_speed[True-None] 3.5603ms 3.2130ms 311.2372 Ops/s 300.7175 Ops/s $\color{#35bf28}+3.50\%$
test_redq_deprec_speed[True-backward] 7.8270ms 7.0198ms 142.4547 Ops/s 140.7557 Ops/s $\color{#35bf28}+1.21\%$
test_redq_deprec_speed[reduce-overhead-None] 3.3668ms 3.1984ms 312.6592 Ops/s 303.8319 Ops/s $\color{#35bf28}+2.91\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.2035ms 7.0083ms 142.6880 Ops/s 140.1459 Ops/s $\color{#35bf28}+1.81\%$
test_td3_speed[False-None] 7.5925ms 7.3908ms 135.3029 Ops/s 132.4589 Ops/s $\color{#35bf28}+2.15\%$
test_td3_speed[False-backward] 10.3543ms 10.1553ms 98.4710 Ops/s 96.6295 Ops/s $\color{#35bf28}+1.91\%$
test_td3_speed[True-None] 1.9899ms 1.8913ms 528.7376 Ops/s 515.8884 Ops/s $\color{#35bf28}+2.49\%$
test_td3_speed[True-backward] 3.7923ms 3.6654ms 272.8223 Ops/s 220.4612 Ops/s $\textbf{\color{#35bf28}+23.75\%}$
test_td3_speed[reduce-overhead-None] 1.9054ms 1.8817ms 531.4474 Ops/s 527.7487 Ops/s $\color{#35bf28}+0.70\%$
test_td3_speed[reduce-overhead-backward] 3.7821ms 3.6842ms 271.4302 Ops/s 271.3239 Ops/s $\color{#35bf28}+0.04\%$
test_cql_speed[False-None] 27.4775ms 24.5423ms 40.7459 Ops/s 40.9170 Ops/s $\color{#d91a1a}-0.42\%$
test_cql_speed[False-backward] 38.7951ms 34.3154ms 29.1414 Ops/s 29.6919 Ops/s $\color{#d91a1a}-1.85\%$
test_cql_speed[True-None] 11.1403ms 10.8296ms 92.3396 Ops/s 93.5935 Ops/s $\color{#d91a1a}-1.34\%$
test_cql_speed[True-backward] 16.8875ms 16.6322ms 60.1243 Ops/s 61.0109 Ops/s $\color{#d91a1a}-1.45\%$
test_cql_speed[reduce-overhead-None] 11.4994ms 10.8906ms 91.8220 Ops/s 94.0530 Ops/s $\color{#d91a1a}-2.37\%$
test_cql_speed[reduce-overhead-backward] 17.1649ms 16.6273ms 60.1420 Ops/s 61.2755 Ops/s $\color{#d91a1a}-1.85\%$
test_a2c_speed[False-None] 5.8459ms 5.2501ms 190.4719 Ops/s 192.0875 Ops/s $\color{#d91a1a}-0.84\%$
test_a2c_speed[False-backward] 12.7993ms 11.5684ms 86.4423 Ops/s 86.8246 Ops/s $\color{#d91a1a}-0.44\%$
test_a2c_speed[True-None] 3.3461ms 3.0101ms 332.2126 Ops/s 328.5256 Ops/s $\color{#35bf28}+1.12\%$
test_a2c_speed[True-backward] 8.7282ms 8.5135ms 117.4606 Ops/s 116.3671 Ops/s $\color{#35bf28}+0.94\%$
test_a2c_speed[reduce-overhead-None] 3.1848ms 3.0409ms 328.8544 Ops/s 326.7876 Ops/s $\color{#35bf28}+0.63\%$
test_a2c_speed[reduce-overhead-backward] 8.8854ms 8.3935ms 119.1393 Ops/s 104.5063 Ops/s $\textbf{\color{#35bf28}+14.00\%}$
test_ppo_speed[False-None] 7.3343ms 5.4657ms 182.9588 Ops/s 179.8795 Ops/s $\color{#35bf28}+1.71\%$
test_ppo_speed[False-backward] 12.2637ms 11.8968ms 84.0560 Ops/s 83.9504 Ops/s $\color{#35bf28}+0.13\%$
test_ppo_speed[True-None] 3.7317ms 3.4584ms 289.1511 Ops/s 285.9591 Ops/s $\color{#35bf28}+1.12\%$
test_ppo_speed[True-backward] 8.3677ms 8.2315ms 121.4847 Ops/s 123.0402 Ops/s $\color{#d91a1a}-1.26\%$
test_ppo_speed[reduce-overhead-None] 3.7845ms 3.4082ms 293.4130 Ops/s 288.7676 Ops/s $\color{#35bf28}+1.61\%$
test_ppo_speed[reduce-overhead-backward] 8.3326ms 8.1288ms 123.0194 Ops/s 121.3992 Ops/s $\color{#35bf28}+1.33\%$
test_reinforce_speed[False-None] 4.6977ms 4.3236ms 231.2911 Ops/s 225.4900 Ops/s $\color{#35bf28}+2.57\%$
test_reinforce_speed[False-backward] 7.4003ms 7.1167ms 140.5149 Ops/s 140.4518 Ops/s $\color{#35bf28}+0.04\%$
test_reinforce_speed[True-None] 2.5703ms 2.1994ms 454.6689 Ops/s 433.7722 Ops/s $\color{#35bf28}+4.82\%$
test_reinforce_speed[True-backward] 7.4930ms 7.0014ms 142.8285 Ops/s 142.7222 Ops/s $\color{#35bf28}+0.07\%$
test_reinforce_speed[reduce-overhead-None] 2.6151ms 2.2029ms 453.9422 Ops/s 448.0480 Ops/s $\color{#35bf28}+1.32\%$
test_reinforce_speed[reduce-overhead-backward] 7.1894ms 7.0370ms 142.1057 Ops/s 141.5807 Ops/s $\color{#35bf28}+0.37\%$
test_iql_speed[False-None] 21.4572ms 19.1562ms 52.2024 Ops/s 50.9575 Ops/s $\color{#35bf28}+2.44\%$
test_iql_speed[False-backward] 30.1352ms 29.5884ms 33.7970 Ops/s 33.3650 Ops/s $\color{#35bf28}+1.29\%$
test_iql_speed[True-None] 8.8154ms 6.8396ms 146.2084 Ops/s 147.5920 Ops/s $\color{#d91a1a}-0.94\%$
test_iql_speed[True-backward] 15.6726ms 15.2702ms 65.4870 Ops/s 62.3277 Ops/s $\textbf{\color{#35bf28}+5.07\%}$
test_iql_speed[reduce-overhead-None] 7.1529ms 6.7414ms 148.3364 Ops/s 147.7770 Ops/s $\color{#35bf28}+0.38\%$
test_iql_speed[reduce-overhead-backward] 15.6751ms 15.3005ms 65.3575 Ops/s 63.5346 Ops/s $\color{#35bf28}+2.87\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5288ms 6.3502ms 157.4765 Ops/s 157.8092 Ops/s $\color{#d91a1a}-0.21\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.2992s 0.4446ms 2.2494 KOps/s 3.6578 KOps/s $\textbf{\color{#d91a1a}-38.50\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5007ms 0.2568ms 3.8943 KOps/s 3.6933 KOps/s $\textbf{\color{#35bf28}+5.44\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3984ms 6.1815ms 161.7721 Ops/s 166.3042 Ops/s $\color{#d91a1a}-2.73\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.8007ms 0.2319ms 4.3125 KOps/s 2.9084 KOps/s $\textbf{\color{#35bf28}+48.28\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4944ms 0.2658ms 3.7619 KOps/s 3.5361 KOps/s $\textbf{\color{#35bf28}+6.39\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.4410ms 1.2144ms 823.4186 Ops/s 744.9448 Ops/s $\textbf{\color{#35bf28}+10.53\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.3726ms 1.1571ms 864.2004 Ops/s 765.1860 Ops/s $\textbf{\color{#35bf28}+12.94\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.4565ms 6.3384ms 157.7681 Ops/s 160.1609 Ops/s $\color{#d91a1a}-1.49\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.2307ms 0.3756ms 2.6625 KOps/s 2.2587 KOps/s $\textbf{\color{#35bf28}+17.88\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6719ms 0.4028ms 2.4825 KOps/s 2.2040 KOps/s $\textbf{\color{#35bf28}+12.64\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.4195ms 6.2375ms 160.3216 Ops/s 165.7774 Ops/s $\color{#d91a1a}-3.29\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.8059ms 0.3166ms 3.1588 KOps/s 2.9821 KOps/s $\textbf{\color{#35bf28}+5.93\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6296ms 0.3449ms 2.8992 KOps/s 4.7013 KOps/s $\textbf{\color{#d91a1a}-38.33\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 9.7031ms 6.1832ms 161.7273 Ops/s 167.9606 Ops/s $\color{#d91a1a}-3.71\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9269ms 0.2605ms 3.8394 KOps/s 3.5033 KOps/s $\textbf{\color{#35bf28}+9.59\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4358ms 0.2499ms 4.0023 KOps/s 3.9242 KOps/s $\color{#35bf28}+1.99\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5309ms 6.3592ms 157.2520 Ops/s 157.8292 Ops/s $\color{#d91a1a}-0.37\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.6012ms 0.4155ms 2.4067 KOps/s 2.2972 KOps/s $\color{#35bf28}+4.77\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7381ms 0.4119ms 2.4277 KOps/s 2.4451 KOps/s $\color{#d91a1a}-0.71\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.7700ms 5.2044ms 192.1467 Ops/s 188.4277 Ops/s $\color{#35bf28}+1.97\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 6.9708ms 2.0110ms 497.2712 Ops/s 493.6257 Ops/s $\color{#35bf28}+0.74\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 8.6045ms 1.2260ms 815.6763 Ops/s 825.7148 Ops/s $\color{#d91a1a}-1.22\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4105s 13.3837ms 74.7178 Ops/s 186.2069 Ops/s $\textbf{\color{#d91a1a}-59.87\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.8279ms 2.0048ms 498.8113 Ops/s 490.5046 Ops/s $\color{#35bf28}+1.69\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.3592ms 1.2180ms 821.0289 Ops/s 815.7058 Ops/s $\color{#35bf28}+0.65\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.1000ms 5.4628ms 183.0574 Ops/s 180.9981 Ops/s $\color{#35bf28}+1.14\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.9667ms 2.0840ms 479.8489 Ops/s 477.9396 Ops/s $\color{#35bf28}+0.40\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 7.0758ms 1.3551ms 737.9682 Ops/s 724.2975 Ops/s $\color{#35bf28}+1.89\%$
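
The last column of each row appears to be the relative difference between the two throughput columns. The sketch below reproduces that arithmetic for three rows copied from the table. It assumes the first Ops/s value is the measurement for this PR and the second is the baseline; the column header is not visible in this part of the page, so that reading is inferred from the numbers. The function name `relative_change` is made up for illustration and is not part of the benchmark tooling.

```python
def relative_change(ops_pr: float, ops_base: float) -> float:
    """Percent change in throughput of the PR run relative to the baseline run.

    Assumption: both values use the same unit (Ops/s or KOps/s), as they do
    within each row of the table.
    """
    return (ops_pr / ops_base - 1.0) * 100.0


# (PR throughput, baseline throughput) copied from three rows of the table.
rows = {
    "test_cql_speed[True-None]": (92.3396, 93.5935),
    "test_a2c_speed[reduce-overhead-backward]": (119.1393, 104.5063),
    "test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400]": (74.7178, 186.2069),
}

for name, (ops_pr, ops_base) in rows.items():
    # Format with an explicit sign and two decimals, like the table does.
    print(f"{name}: {relative_change(ops_pr, ops_base):+.2f}%")
```

Under that assumption the three rows come out at -1.34%, +14.00%, and -59.87%, matching the reported values. Large swings on sub-millisecond replay-buffer benchmarks may reflect runner noise rather than a real regression, so a rerun is a reasonable check before acting on them.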

Labels: CI, CLA Signed, Tests