-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Expert vs Ours? #2
Comments
It's been a while since I've seen it but I don't think the videos show the expert, instead they only show the policy trained through DAgger. I think one is a training rollout while another is an evaluation rollout. You can see what they log in the code here, and it looks like one is labeled It's a good idea to look through other parts of the code and understand it because if you end up ever applying what you learn here in another area, you will likely need to use similar infrastructure to what they have provided for you here (e.g. replay buffers, logging, etc.) in addition to implementing the algorithms themselves (I usually find RL algorithms themselves to be quite short in terms of lines of code; infrastructure/data pipelines end up needing many more lines). In addition, they sometimes contain decent starter hyperparameters that you can use on your own projects. Hope this helps. Vincent |
I've taken a look at the tensorboard and I see what you are talking about now. Unfortunately, I'm not sure but I'll take a look at it and I'll let you know. |
Hi Vincent,
In tensorboard, under the "Images" tab, we are able to visualize our results on the Ant or the Humanoid. We see a split screen (e.g. 2 ants), one on the left and one on the right.
How do we know which is the expert and which is ours (DAgger)?
(PS: Where were you able to find this info? I tried googling for hours and could not find it.)
Thanks!
The text was updated successfully, but these errors were encountered: