Specify Module's PyTree-Representation for jit/grad seperately. I.e. How to freeze state.variables #22

simon-bachhuber · 2022-10-15T12:25:27Z

Disclaimer: I have not used oryx yet. Further, not an issue but rather just a question/discussion.

Suppose i want to define some recurrent network but its initial hidden state is not a parameter, i.e. it should be exposed to jax.jit but not to jax.grad. How can this be done?

E.g.

# syntax might be slightly wrong, think of it as pseudo-code
def network_def(x):
  s = state.variable(..., name="hidden-state")
  p = state.variable(..., name="parameters")
  s, y = f(s, p, x)
  state.assign(s, name="hidden-state")
  return y 

network = state.init(network_def)(x)

@jax.jit # <- this should "see" hidden-state
@jax.grad # <- this should not "see" hidden-state
def loss_fn(network, x, y):
  ...

Is there an elegant way of doing that?
Thank you!

Also, are all jax-transformations supported? Readme mentions jit, grad, vmap. What about pmap,scan (and all the others) ?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specify Module's PyTree-Representation for jit/grad seperately. I.e. How to freeze state.variables #22

Specify Module's PyTree-Representation for jit/grad seperately. I.e. How to freeze state.variables #22

simon-bachhuber commented Oct 15, 2022

Specify Module's PyTree-Representation for jit/grad seperately. I.e. How to freeze state.variables #22

Specify Module's PyTree-Representation for jit/grad seperately. I.e. How to freeze state.variables #22

Comments

simon-bachhuber commented Oct 15, 2022