Momentum-Based Policy Gradient Methods

Authors: Feihu Huang, Shangqian Gao, Pei Jian and Huang Heng

PyTorch Implementation of Momentum-Based Policy Gradient Methods (ICML 2020).

Code uploaded.

Requirements

pytorch 1.1.0
garage 2019.10.1
mujuco
gym
If you do not install mujuco, then only CartPole environment is available.

Usage

To run IS-MBPG

python MBPG_test.py --env CartPole

To run IS-MBPG*

python MBPG_test.py --env CartPole --IS_MBPG_star True

To run HA-MBPG

python MBPG_HA_test.py --env CartPole

To run different environments change --env to one of the followings: "CartPole", "Walker", "Hopper" or "HalfCheetah". If you want to use our algorithms on different enviroment, you need to implement it by yourself, but it should be pretty straightforward.

Citation

@InProceedings{huang2020accelerated,
  author    = {Huang, Feihu and Gao, Shangqian and Pei, Jian and Huang, Heng},
  title     = {Momentum-Based Policy Gradient Methods},
  booktitle = {Proceedings of the 37th International Conference on Machine Learning},
  year      = {2020},}

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Algorithms		Algorithms
Policy		Policy
init		init
log		log
MBPG_HA_test.py		MBPG_HA_test.py
MBPG_test.py		MBPG_test.py
README.md		README.md
__init__.py		__init__.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Momentum-Based Policy Gradient Methods

Requirements

Usage

Citation

About

Releases

Packages

Languages

gaosh/MBPG

Folders and files

Latest commit

History

Repository files navigation

Momentum-Based Policy Gradient Methods

Requirements

Usage

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages