Posterior computation #2

Open
sai-prasanna opened this issue May 24, 2024 · 1 comment

Comments

@sai-prasanna

In your posterior, you use the stochastic state of the prior. But in the original RSSM, only the deterministic state and the observation embedding are used. Since the prior's stochastic state is just a function of the deterministic state, it carries no extra information to condition on. And using the stochastic state sample might hurt the posterior computation because of the sampling noise.

I am checking in case there is some other deeper reason to use it.

x = torch.cat([prior_state['stoch'], prior_state['deter'], obs_embed], dim=-1)
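
For comparison, here is a minimal sketch of the difference being discussed, assuming hypothetical tensor shapes and the dictionary layout from the snippet above; the original RSSM builds the posterior input from the deterministic state and the observation embedding only:

import torch

# Hypothetical sizes, purely for illustration.
batch, stoch_dim, deter_dim, embed_dim = 16, 32, 200, 1024
prior_state = {
    'stoch': torch.randn(batch, stoch_dim),   # sampled stochastic state of the prior
    'deter': torch.randn(batch, deter_dim),   # deterministic (recurrent) state
}
obs_embed = torch.randn(batch, embed_dim)     # encoder output for the current observation

# Original RSSM posterior input: deterministic state + observation embedding only.
x_rssm = torch.cat([prior_state['deter'], obs_embed], dim=-1)

# This repository additionally concatenates the prior's stochastic sample.
x_repo = torch.cat([prior_state['stoch'], prior_state['deter'], obs_embed], dim=-1)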

@mazpie (Owner) commented Jun 6, 2024

Hi @sai-prasanna,
you're right that there is a subtle difference from the original RSSM.

However, I would not expect any major differences as the information to condition upon is contained in the deterministic state, as you pointed out.

The stochastic state might either be helpful (it is a noisier estimate of the state) or be ignored by the network if it doesn't contain any useful information (e.g. if you just concatenate random noise to the inputs of a network, the network quickly learns to ignore it).

I hope this answers your question!
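
As an illustration of the last point (not from the repository), a small sketch: a linear model trained on inputs where uninformative noise features are concatenated to informative ones drives the weights on the noise features toward zero. All names and sizes here are made up for the example.

import torch

torch.manual_seed(0)

# Toy data: the target depends only on the "signal" features;
# the "noise" features are concatenated but carry no information.
n, d_signal, d_noise = 4096, 8, 8
signal = torch.randn(n, d_signal)
noise = torch.randn(n, d_noise)
true_w = torch.randn(d_signal, 1)
y = signal @ true_w

x = torch.cat([signal, noise], dim=-1)
model = torch.nn.Linear(d_signal + d_noise, 1)
opt = torch.optim.Adam(model.parameters(), lr=1e-2)

for _ in range(2000):
    opt.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    loss.backward()
    opt.step()

w = model.weight.detach().squeeze()
print('mean |weight| on signal dims:', w[:d_signal].abs().mean().item())
print('mean |weight| on noise dims: ', w[d_signal:].abs().mean().item())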
