Skip to content

Latest commit

 

History

History
19 lines (16 loc) · 831 Bytes

README.md

File metadata and controls

19 lines (16 loc) · 831 Bytes

CircleCI

Environments + agents for Open AI Gym

  • Solving partial state/multi-agent control problems with RL
  • Implementations of continuous control PAAC and A3C
  • Finance/trading environments

Swarm environment

Learning to stop a swarm of locusts from "rolling"

Random After 24 hours
Random Learned

To train

python scripts/train_paac_conv.py -d /gpu:0 --height=84 --clip_norm=1

Acknowledgements

Bibs and bobs borrowed from https://github.com/Alfredvc/paac and https://github.com/dennybritz/reinforcement-learning/