The steps followed for the technical assessment solution are described below:
- The Stable Baselines module was installed using Anaconda to support TensorFlow 1.8.
- The DQN agent was trained with the Stable Baselines DQN algorithm using `learning_rate = 0.003` and `time_steps = 500000` (see the sketch after this list).
- After the learning stage, the agent was tested by loading the `l_lander_dqn.zip` file. Below is a GIF with the recorded result.
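As a rough illustration, a training run with those settings might look like the sketch below. This assumes Stable Baselines 2.x on TensorFlow 1.x and the standard `LunarLander-v2` environment; only the learning rate, the number of timesteps, and the save name come from the description above.

```python
# Minimal training sketch (Stable Baselines 2.x / TF 1.x assumed).
import gym
from stable_baselines import DQN

env = gym.make("LunarLander-v2")                          # assumed environment id
model = DQN("MlpPolicy", env, learning_rate=0.003, verbose=1)
model.learn(total_timesteps=500000)
model.save("l_lander_dqn")                                # written to l_lander_dqn.zip
```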
The Python file `lunar_lander_dqn.py` checks whether a previous training model exists (`l_lander_dqn.zip`). If it exists, the test environment is run with the saved model; if not, the agent is trained with the DQN algorithm (Stable Baselines).
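A minimal sketch of that load-or-train logic is shown below, again assuming gym's `LunarLander-v2` and the Stable Baselines 2.x API; it is an illustration, not the actual contents of `lunar_lander_dqn.py`.

```python
import os
import gym
from stable_baselines import DQN

MODEL_FILE = "l_lander_dqn"            # saved as l_lander_dqn.zip
env = gym.make("LunarLander-v2")       # assumed environment id

if os.path.exists(MODEL_FILE + ".zip"):
    # A previous model exists: load it and run a rendered test episode.
    model = DQN.load(MODEL_FILE, env=env)
    obs = env.reset()
    done = False
    while not done:
        action, _states = model.predict(obs)
        obs, reward, done, info = env.step(action)
        env.render()
else:
    # No saved model: train from scratch and save it for the next run.
    model = DQN("MlpPolicy", env, learning_rate=0.003, verbose=1)
    model.learn(total_timesteps=500000)
    model.save(MODEL_FILE)
```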
The Lunar Lander environment was customized in the Python file `lunar_launcher_env.py` and is now named `LunarLauncherEnv`. The goal of the Lunar Launcher environment is to reach the top center of the screen (between the flags). It generates random terrains, simulating random starting launch angles. If `set_random_x_pos=False`, the agent always starts at the center of the ground (see image below).
If `set_random_x_pos=True`, the agent starts at a random ground position along the x axis (see image below).
To make the environment workable for an agent, some changes were necessary, described below (a simplified environment sketch follows the list):
- Friction was increased to keep a fixed starting position
- The ground collision detector was removed; when the agent reaches the top, the episode ends (game over)
- Main engine power was decreased to promote use of the right and left engines, forcing the agent to fly in order to reach the correct launch angle and position
- The observation space was reduced to 6 values; observations related to ground contact detection were removed
- The reward system was changed to promote movement and reaching the top center
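A highly simplified sketch of how those changes might be reflected in a `gym.Env` subclass is shown below. It only illustrates the interface (6-value observations, the `set_random_x_pos` flag, top-reaching termination, a distance-based reward); everything except the names `LunarLauncherEnv` and `set_random_x_pos` is an assumption, and the placeholder dynamics stand in for the Box2D physics of the real `lunar_launcher_env.py`.

```python
import numpy as np
import gym
from gym import spaces


class LunarLauncherEnvSketch(gym.Env):
    """Toy stand-in for LunarLauncherEnv; the real class runs a Box2D simulation."""

    def __init__(self, set_random_x_pos=False):
        super(LunarLauncherEnvSketch, self).__init__()
        self.set_random_x_pos = set_random_x_pos
        # 6 observations: x, y, vx, vy, angle, angular velocity (assumed layout);
        # the two ground-contact flags of the original LunarLander are removed.
        self.observation_space = spaces.Box(-np.inf, np.inf, shape=(6,), dtype=np.float32)
        # Same 4 discrete actions as LunarLander: noop, left, main, right engine.
        self.action_space = spaces.Discrete(4)
        self.state = np.zeros(6, dtype=np.float32)

    def reset(self):
        # Start at the ground center, or at a random x position when requested.
        x = np.random.uniform(-0.8, 0.8) if self.set_random_x_pos else 0.0
        self.state = np.array([x, 0.0, 0.0, 0.0, 0.0, 0.0], dtype=np.float32)
        return self.state.copy()

    def step(self, action):
        # Placeholder dynamics: a weakened main engine and side engines, standing
        # in for the tuned Box2D physics of the real environment.
        x, y, vx, vy, angle, vang = self.state
        vy += 0.02 if action == 2 else -0.01                          # main engine vs. gravity
        vx += 0.03 if action == 1 else (-0.03 if action == 3 else 0.0)  # side engines
        x, y = x + vx, y + vy
        self.state = np.array([x, y, vx, vy, angle, vang], dtype=np.float32)
        # Reward shaping (assumed form): move toward the top center at (0, 1).
        reward = -np.sqrt(x ** 2 + (1.0 - y) ** 2)
        # No ground-collision check; the episode ends when the agent reaches the top.
        done = bool(y >= 1.0)
        return self.state.copy(), reward, done, {}
```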
The Lunar Launcher environment was tested with the DQN algorithm in the Python file `using_custom_env_lunarlauncher.py`. Below is a GIF with the recorded result:
The folder `gym-lunarlauncher` was added so the environment can be installed with `pip3 install -e gym-lunarlauncher`; however, I haven't tested this yet because I am using an Anaconda virtual Python environment, but I hope it works.
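For reference, a pip-installable Gym environment folder of this kind typically contains a `setup.py` and a package `__init__.py` that registers the environment. The sketch below shows a common minimal layout; the package name, environment id, and entry point are assumptions, not necessarily the actual contents of `gym-lunarlauncher`.

```python
# gym-lunarlauncher/setup.py (assumed minimal contents)
from setuptools import setup, find_packages

setup(
    name="gym_lunarlauncher",
    version="0.0.1",
    packages=find_packages(),
    install_requires=["gym", "box2d-py"],
)
```

```python
# gym-lunarlauncher/gym_lunarlauncher/__init__.py (assumed): register the
# custom environment so it can be created with gym.make() after installation.
from gym.envs.registration import register

register(
    id="LunarLauncher-v0",  # hypothetical environment id
    entry_point="gym_lunarlauncher.envs:LunarLauncherEnv",
)
```

With a layout like this, after `pip3 install -e gym-lunarlauncher` the environment could be created with `gym.make("LunarLauncher-v0")` (again assuming that id).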