[BugFix] Avoid `reshape(-1)` for inputs to `DreamerActorLoss` #2496
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2496
Note: Links to docs will display an error until the docs builds have been completed.
❌ 3 New Failures, 6 Unrelated Failures as of commit bca6b79 with merge base d894358:
NEW FAILURES - The following jobs have failed.
BROKEN TRUNK - The following jobs failed but were present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
- loss_td, fake_data = loss_module(tensordict)
+ # NOTE: Input is reshaped because GRUCell (which is part of the
+ # RSSMPrior module in `mb_env`) expects input to be either 1D or 2D
+ loss_td, fake_data = loss_module(tensordict.reshape(-1))
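For context, a minimal sketch of why the flatten is needed. The shapes below are made up for illustration, not the test's actual batch dims:

# Hypothetical shapes to illustrate the constraint in the NOTE above:
# nn.GRUCell only accepts 1D (input_size,) or 2D (batch, input_size) inputs.
import torch
from tensordict import TensorDict

cell = torch.nn.GRUCell(input_size=4, hidden_size=4)
x_3d = torch.randn(2, 5, 4)           # e.g. (batch, time, features) -- made-up sizes

try:
    cell(x_3d)                        # rejected: GRUCell expects 1D or 2D input
except (RuntimeError, ValueError) as err:
    print(err)

# tensordict.reshape(-1) flattens the leading batch dims of the whole TensorDict,
# so every entry becomes 2D and the GRUCell inside RSSMPrior accepts it.
td = TensorDict({"x": x_3d}, batch_size=[2, 5])
td_flat = td.reshape(-1)              # batch_size [2, 5] -> [10]; "x" is now (10, 4)
print(cell(td_flat["x"]).shape)       # torch.Size([10, 4])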
I'm not sure if there is a better way to fix this test. I suppose it could be possible to just reshape the direct input to the GRUCell?
If we need to reshape, we should reshape, but another option here would be to use vmap, like:

if tensordict.ndim > 1:
    loss_td, fake_data = vmap(loss_module, (0,))(tensordict)

(GRU works with vmap as long as you are using the Python-only version in torchrl.modules.)
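A slightly fuller sketch of that option, assuming the `loss_module` and `tensordict` from the test and the Python-only GRU in torchrl.modules; the else branch is an assumption about the desired fallback, not part of the suggestion above:

from torch import vmap

if tensordict.ndim > 1:
    # Map the loss over the leading batch dim; each call then sees a tensordict
    # with one fewer batch dimension, so the GRUCell in RSSMPrior gets <= 2D inputs.
    loss_td, fake_data = vmap(loss_module, (0,))(tensordict)
else:
    loss_td, fake_data = loss_module(tensordict)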
If I try using `VmapModule`, I get this error:
File "/home/endoplasm/develop/torchrl-1/test/test_cost.py", line 10338, in test_dreamer_actor
loss_td, fake_data = VmapModule(loss_module, (0,))(tensordict)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/endoplasm/develop/torchrl-1/torchrl/modules/tensordict_module/common.py", line 454, in __init__
self.in_keys = module.in_keys
^^^^^^^^^^^^^^
File "/home/endoplasm/develop/torchrl-1/torchrl/objectives/common.py", line 441, in __getattr__
return super().__getattr__(item)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/endoplasm/miniconda/envs/torchrl-1/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1931, in __getattr__
raise AttributeError(
AttributeError: 'DreamerActorLoss' object has no attribute 'in_keys'
I'll probably just leave the reshape for now, but I would like to understand this.
Indeed, if I try to access `loss_module.in_keys` directly, I also get the above error. But I can access the `in_keys` of the actor model and world model within the loss module:
print(loss_module.actor_model.in_keys)
print(loss_module.model_based_env.world_model.in_keys)
['state', 'belief']
['state', 'belief', 'action']
So I'm wondering what would be the right way to make `VmapModule` and `DreamerActorLoss` compatible. Would we want to add an `in_keys` attribute to `DreamerActorLoss` that returns a combined list of the keys in the actor model and world model?
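For illustration, a rough sketch of one way that combined list could be built. This is a hypothetical helper, not actual torchrl code; a real fix would presumably expose it as an `in_keys` property on the loss:

def combined_in_keys(loss_module):
    # Actor-model keys first, then any world-model keys not already present.
    keys = list(loss_module.actor_model.in_keys)
    for key in loss_module.model_based_env.world_model.in_keys:
        if key not in keys:
            keys.append(key)
    return keys

# With the values printed above, this would give ['state', 'belief', 'action'].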
Oh yeah, `DreamerActorLoss` should have `in_keys`! All losses should. Dreamer hasn't received much love lately, as you can see. Let's take care of that in a separate PR then.
The Dreamer implementation (in the examples workflow) is failing.
Should be fixed now.
LGTM thanks!
Description

Avoid reshaping inputs to `DreamerActorLoss`.

Motivation and Context
Follow-up to #2494
Types of changes

What types of changes does your code introduce? Remove all that do not apply:

Checklist

Go over all the following points, and put an `x` in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!