feature(cxy): add averaged-dqn policy #683

Mossforest · 2023-07-08T13:12:48Z

Description

Adds a new policy, averaged-dqn, and a partial implementation of ensemble-dqn.
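For context, a minimal sketch of the idea behind averaged-dqn as it is usually described: the TD target uses the mean of the Q-value estimates from the K most recent networks instead of a single target network. This is pure Python for illustration and is not the code in this PR; the function names `averaged_q` and `td_target` are hypothetical.

```python
from typing import List, Sequence


def averaged_q(q_estimates: Sequence[Sequence[float]]) -> List[float]:
    """Average per-action Q-values over K estimates (one inner sequence per network)."""
    k = len(q_estimates)
    return [sum(per_action) / k for per_action in zip(*q_estimates)]


def td_target(reward: float, gamma: float, done: bool,
              next_q_estimates: Sequence[Sequence[float]]) -> float:
    """One-step TD target using the averaged next-state Q-values."""
    if done:
        # No bootstrapping from a terminal state.
        return reward
    return reward + gamma * max(averaged_q(next_q_estimates))
```

Here `next_q_estimates` stands in for the outputs of the K stored networks on the next state; how those networks are stored and refreshed is left out of the sketch.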

Related Issue

TODO

Ensemble-dqn: a dedicated Q-learning network is still needed
Experiments still need debugging

Check List

merge the latest version source branch/repo, and resolve all the conflicts
pass style check
pass all the tests

PaParaZz1 · 2023-07-10T05:37:51Z

ding/policy/__init__.py

@@ -53,3 +53,5 @@

 # new-type policy
 from .ppof import PPOFPolicy
+
+from .averaged_dqn import AveragedDQNPolicy, EnsembleDQNPolicy


Move this import to the old-type policy section.

PaParaZz1 · 2023-07-10T05:38:17Z

ding/policy/averaged_dqn.py

+class AveragedDQNPolicy(DQNPolicy):
+    """
+    Overview:
+        Policy class of Averaged_DQN algorithm.


Don't use an underscore in the algorithm name here.

PaParaZz1 · 2023-07-10T05:40:56Z

ding/policy/averaged_dqn.py

+
+        # use model_wrapper for specialized demands of different modes
+        self._target_model_list = copy.deepcopy(self._prime_model_list)
+        if 'target_update_freq' in self._cfg.learn: 


Use only one type of target update here.
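To illustrate the distinction, here is a minimal sketch of the two common kinds of target update: a hard copy every fixed number of steps, and a soft (momentum) update applied every step. It is pure Python over plain lists, not the policy's actual code; `hard_update`, `soft_update`, and the `theta` parameter are hypothetical names, and reading the comment as "keep a single one of these" is an assumption.

```python
from typing import List


def hard_update(target: List[float], online: List[float], step: int, freq: int) -> List[float]:
    """Copy the online parameters into the target every `freq` steps."""
    if step % freq == 0:
        return list(online)
    return target


def soft_update(target: List[float], online: List[float], theta: float) -> List[float]:
    """Move each target parameter a fraction `theta` toward the online one."""
    return [(1.0 - theta) * t + theta * o for t, o in zip(target, online)]
```

Supporting only one of the two in `_init_learn` removes the branch on `'target_update_freq' in self._cfg.learn` and keeps the config surface smaller.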

PaParaZz1 · 2023-07-10T05:55:31Z

ding/policy/averaged_dqn.py

+
+
+    def _state_dict_learn(self) -> Dict[str, Any]:
+            """


Fix the indentation.

feature(cxy): add averaged-dqn policy

7968925

PaParaZz1 added the algo label (Add new algorithm or improve old one) on Jul 10, 2023

PaParaZz1 requested changes Jul 10, 2023
