recalculation of reward after action update #1

soldierofhell · 2021-11-20T22:04:58Z

Hi @homangab,
Thanks for the effort to boost the performance of CEM optimization in the right (gradient) direction.
However reading the code I don't see (probably should be somwhere here) an update of rewards after the actions are updated by optimizer.

gradcem/mpc/gradcem.py

Line 44 in 02a8b36

returns = self.env.rollout(actions)

should be calculated once again. Am I wrong?

Regards,

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

recalculation of reward after action update #1

recalculation of reward after action update #1

soldierofhell commented Nov 20, 2021

recalculation of reward after action update #1

recalculation of reward after action update #1

Comments

soldierofhell commented Nov 20, 2021