Overview

The purpose of this repository is to provide meaningful baselines for a variety of reinforcement learning approaches. The approaches are evaluated with an emphasis on average performance across multiple agents.
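The averaging described above can be sketched as follows. This is a minimal illustration, not the repository's evaluation code: `run_agent` is a hypothetical stand-in that returns a placeholder score, and the 0 to 500 score range and the use of the seed as the agent index are assumptions made only for the example.

```python
import random
import statistics


def run_agent(seed: int) -> float:
    """Hypothetical stand-in for training and evaluating one agent.

    Returns a placeholder score from a seeded random generator. A real run
    would train an agent and return its measured evaluation performance.
    """
    rng = random.Random(seed)
    return rng.uniform(0.0, 500.0)  # assumed score range, for illustration only


def mean_performance(num_agents: int = 30) -> tuple[float, float]:
    """Return the mean and sample standard deviation over independent agents."""
    scores = [run_agent(seed) for seed in range(num_agents)]
    return statistics.mean(scores), statistics.stdev(scores)


mean_score, std_score = mean_performance()
print(f"mean={mean_score:.1f} std={std_score:.1f} over 30 agents")
```

Reporting the spread alongside the mean makes it easier to tell whether a difference between two agent types is larger than the run-to-run variation.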

Q-learning algorithms

Recipes

OpenAI cartpole v1
Description: See https://gym.openai.com/envs/CartPole-v1/ for more information.
Agent description	Representative parameters	Mean performance across thirty agents
SGD with feedforward ANN
Colorado State Univ cartpole swing-up and balance task
An inverted pendulum on a cart, initially developed by Chuck Anderson ([email protected]). An evaluation episode begins with the pole pointing down and the cart in the center of the track, with both the cart and the pole at zero velocity.
Agent description	Representative parameters	Mean performance across thirty agents
SGD with feedforward ANN
Adam with feedforward ANN
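
The starting conditions of an evaluation episode in the swing-up task can be sketched as below. The `CartPoleState` layout, the field names, the angle convention (radians measured from upright, so pointing down is pi), and the `is_balanced` tolerance are all assumptions for illustration; they are not taken from the repository's code.

```python
import math
from dataclasses import dataclass


@dataclass
class CartPoleState:
    """Hypothetical state layout; field names and units are assumptions."""
    cart_position: float  # 0.0 is the center of the track
    cart_velocity: float
    pole_angle: float     # radians from upright (assumed), so pi points down
    pole_velocity: float


def initial_evaluation_state() -> CartPoleState:
    """Pole pointing down, cart centered, cart and pole at zero velocity."""
    return CartPoleState(
        cart_position=0.0,
        cart_velocity=0.0,
        pole_angle=math.pi,
        pole_velocity=0.0,
    )


def is_balanced(state: CartPoleState, tolerance: float = 0.1) -> bool:
    """True when the pole is within `tolerance` radians of upright."""
    # Wrap the angle into [-pi, pi] so full rotations count as upright.
    wrapped = math.atan2(math.sin(state.pole_angle), math.cos(state.pole_angle))
    return abs(wrapped) < tolerance


start = initial_evaluation_state()
print(start, is_balanced(start))
```

Starting from the hanging position means an agent has to swing the pole up before it can balance it, which is what separates this task from the OpenAI cartpole task above.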

Cartpole

Other stuff:

How to create package

python3 setup.py sdist bdist_wheel

python3 -m twine upload dist/*

To create a private release


python3 setup.py sdist bdist_wheel

danelliottster/Qlearners
