-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Super slow on Mac MPS #16
Comments
I'm looking into this. There aren't any more graph break but even collecting data is slower on MPS.
and there are several interesting things in the profile:
I raised an issue about this: pytorch/pytorch#140706 I'll keep you posted, but working with an MPS backend may not be a suitable option for the time being! |
Thanks! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi, a follow up on #15: I compared cpu vs mps and compile vs no compile on halfcheetah for 100k steps using SAC. It shows that mps is significantly slower than cpu, and
aot_eager
backend makes compile slower and much more so for cpu, tho the defaultinductor
backend makes compile quite a bit faster for cpu but doesn't work for mps.Code change is the following:
The text was updated successfully, but these errors were encountered: