
[BugFix] Vectorized priority update in replay buffers #1598

Merged: 6 commits merged into pytorch:main from fix_prioritised_buffer on Oct 4, 2023

Conversation

@matteobettini (Contributor):

This is a patch for #1574.

It fixes the core problem highlighted in that issue, but some points will still need attention in a future refactoring of this class, as I am not sure it is compatible with all the cases it aims to support. I am happy to elaborate on this if needed.

Signed-off-by: Matteo Bettini <[email protected]>
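
A minimal sketch of the idea behind the fix, to make the "vectorized priority update" concrete. The priority key "td_error" and both helper names are assumptions for illustration, not the actual torchrl internals; this only shows the loop-vs-tensor-op contrast.

import torch
from tensordict import TensorDict

# Hypothetical priority key, chosen for the example only.
PRIORITY_KEY = "td_error"

def priorities_per_item(data: TensorDict) -> torch.Tensor:
    # Loop-based path: iterate over the stacked data and extract one
    # scalar priority per element with a Python list comprehension.
    return torch.tensor(
        [td.get(PRIORITY_KEY).mean().item() for td in data],
        dtype=torch.float,
        device=data.device,
    )

def priorities_vectorized(data: TensorDict) -> torch.Tensor:
    # Vectorized path: read the priority entry once, then reduce over
    # all non-batch dims, handling the whole batch in one tensor op.
    p = data.get(PRIORITY_KEY)
    return p.reshape(data.shape[0], -1).mean(-1).to(torch.float)

data = TensorDict({PRIORITY_KEY: torch.rand(8, 1)}, batch_size=[8])
assert torch.allclose(priorities_per_item(data), priorities_vectorized(data))

Both paths produce one priority per sampled item; the vectorized one avoids the per-element Python loop that #1574 flagged.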
@facebook-github-bot added the CLA Signed label on Oct 3, 2023
@vmoens added the bug and performance labels on Oct 3, 2023
@vmoens (Contributor) left a comment:


LGTM, see my few minor comments

Two resolved review comments on torchrl/data/replay_buffers/replay_buffers.py (outdated).
Comment on lines 778 to 782
# Builds the priority tensor with a Python loop, one priority per item:
priority = torch.tensor(
    [self._get_priority_item(td) for td in data],
    dtype=torch.float,
    device=data.device,
)
Contributor:

I guess we assume that the stack dim is 0, but it might not be (?)
I think we can assume the priority can be stacked, no? At the end of the day it's supposed to be one priority per item. Maybe I'm missing something.

@matteobettini (Contributor, Author), Oct 3, 2023:

This was the previous treatment, so I am not entirely sure what was going on or why things were this way.

My guess is that it assumes 0 as the stack dim because expand stacks on 0, and also because that is the dim of the indices and the priority.

Are you suggesting we completely remove this and always go vectorized? I am happy to try.

@matteobettini (Contributor, Author):

Maybe it was this way because the priority can have different shapes along the stack dim?

That is the only explanation I can come up with.
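
As a plain-PyTorch illustration of the stack-dim discussion above (the shapes and the mean reduction are assumptions for the example, not the torchrl code): one priority per item is obtained by reducing over every dim except the stack dim, which is why the stack-dim-0 assumption matters.

import torch

# Assume the sampled batch has its stack dim at 0 and the priority entry
# carries extra trailing dims (e.g. a per-step TD error).
batch_size, time_steps, extra = 8, 5, 1
td_error = torch.rand(batch_size, time_steps, extra)

# Vectorized: flatten everything after the stack dim and reduce, which
# yields exactly one priority per sampled item.
priority = td_error.reshape(batch_size, -1).mean(-1)
print(priority.shape)  # torch.Size([8])

# If the stack dim were not 0, it would need to be moved there first:
stack_dim = 1
td_error_t = torch.rand(time_steps, batch_size, extra)
priority_t = td_error_t.movedim(stack_dim, 0).reshape(batch_size, -1).mean(-1)
print(priority_t.shape)  # torch.Size([8])

If items had genuinely different priority shapes along the stack dim (the heterogeneous case raised above), this single reshape would not apply and a per-item reduction would still be needed.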

Another resolved review comment on torchrl/data/replay_buffers/replay_buffers.py (outdated).
@vmoens (Contributor) left a comment:

LGTM thanks!

@vmoens merged commit 3d2c161 into pytorch:main on Oct 4, 2023
55 of 59 checks passed
@matteobettini deleted the fix_prioritised_buffer branch on October 4, 2023 at 07:38
@vmoens added a commit to hyerra/rl that referenced this pull request on Oct 10, 2023