Strong diverging behaviors regarding realtime_mode being true or false upon inference #83

MarcoMeter · 2019-04-18T05:54:53Z

I trained a model which can get a mean reward of 7 for solving seed 34 if realtime_mode is disabled.
If I set realtime_mode to true to observe the agent playing, the mean reward for multiple episodes is 1.7.

Did anybody else observe such a huge difference on version 1.3?

KarolisRam · 2019-04-18T08:10:01Z

I noticed that a deterministic policy can give different results on same seed with different realtime_mode settings back in 1.2, so different scores are not surprising.
If i remember correctly, doing 1 or 2 steps forward with realtime_mode=True would often snap the agent back to the original position repeatedly, while realtime_mode=False worked fine. I made one of my agents stutter step to combat this :)

MarcoMeter · 2019-04-18T08:28:49Z

I guess I'll implement a video recorder to observe the agent's performance during inference.
I'm using a stochastic policy.

awjuliani · 2019-04-18T17:30:04Z

Hi all,

Thanks for bringing this to our attention. There should be no differences between the two modes, but clearly that is not the case. We will look into this.

kwea123 · 2019-04-19T03:39:33Z

I noticed that sometimes when the communication between python and unity becomes slow (for example when the policy network runs slowly), it results in unexpected behavior, for example warping (suddenly appears in a totally different position).

BerkayDemirel · 2019-07-01T14:09:21Z

Anyone managed to find a solution / work around for this bug? Tried stutter stepping but it seems to make it even worse, showing walking animation but snapping back to the same place.

nmichlo · 2019-11-16T16:00:41Z

I have just noticed that in realtime mode doors open instantly, while when realtime mode is not enabled doors take multiple steps to actually open, often up to a second of in-game time.

Additionally, as mentioned by others when realtime mode is not enabled, animations do not play properly.

EDIT: workaround at the moment is totally ignoring realtime mode and rather using a custom window to handle interaction/visualisation.

awjuliani self-assigned this Apr 18, 2019

awjuliani added the bug Something isn't working label Apr 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strong diverging behaviors regarding realtime_mode being true or false upon inference #83

Strong diverging behaviors regarding realtime_mode being true or false upon inference #83

MarcoMeter commented Apr 18, 2019

KarolisRam commented Apr 18, 2019

MarcoMeter commented Apr 18, 2019

awjuliani commented Apr 18, 2019

kwea123 commented Apr 19, 2019

BerkayDemirel commented Jul 1, 2019

nmichlo commented Nov 16, 2019 •

edited

Loading

Strong diverging behaviors regarding realtime_mode being true or false upon inference #83

Strong diverging behaviors regarding realtime_mode being true or false upon inference #83

Comments

MarcoMeter commented Apr 18, 2019

KarolisRam commented Apr 18, 2019

MarcoMeter commented Apr 18, 2019

awjuliani commented Apr 18, 2019

kwea123 commented Apr 19, 2019

BerkayDemirel commented Jul 1, 2019

nmichlo commented Nov 16, 2019 • edited Loading

nmichlo commented Nov 16, 2019 •

edited

Loading