Hi, I am trying to implement a VAE by following this example. I built a compute graph mimicking theirs (see below). There are two loss functions, `klloss` and `sigmoidcrossentropyloss` (please ignore `l2loss`). How should I optimize using both loss functions? I can see two options:
1. Add them together to create a final loss and optimize over it. (At first this did not make much sense to me, as `klloss` should not depend on the decoder.) Update: it turned out this is the right way to optimize, i.e., use the total loss.
2. Optimize the two functions independently. Would the following be correct steps?
Any thoughts? Thanks.
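
For anyone wondering why the total loss works even though `klloss` does not depend on the decoder, here is a minimal sketch in plain Python. It is not the graph from the example: the scalar `encoder`, `decoder`, the weights, and the finite-difference `grad` helper are all hypothetical stand-ins chosen to keep it self-contained. It only checks that the KL term contributes a zero gradient to the decoder weights, so optimizing the sum updates the decoder exactly as the reconstruction loss alone would, while the encoder receives gradients from both terms.

```python
import math

def encoder(x, enc_w):
    # Hypothetical two-weight encoder: returns the latent mean and log-variance.
    return enc_w[0] * x, enc_w[1] * x

def decoder(z, dec_w):
    # Hypothetical one-weight decoder: returns a logit for reconstructing x.
    return dec_w[0] * z

def kl_loss(x, enc_w):
    # KL(N(mu, sigma^2) || N(0, 1)) for a scalar latent; uses only the encoder.
    mu, logvar = encoder(x, enc_w)
    return 0.5 * (math.exp(logvar) + mu * mu - 1.0 - logvar)

def recon_loss(x, enc_w, dec_w, eps):
    # Sigmoid cross-entropy between the decoder logit and the target x.
    mu, logvar = encoder(x, enc_w)
    z = mu + math.exp(0.5 * logvar) * eps  # reparameterization trick
    logit = decoder(z, dec_w)
    return max(logit, 0.0) - logit * x + math.log1p(math.exp(-abs(logit)))

def total_loss(x, enc_w, dec_w, eps):
    return kl_loss(x, enc_w) + recon_loss(x, enc_w, dec_w, eps)

def grad(f, w, h=1e-6):
    # Central finite-difference gradient of f with respect to the list w.
    g = []
    for i in range(len(w)):
        up, dn = list(w), list(w)
        up[i] += h
        dn[i] -= h
        g.append((f(up) - f(dn)) / (2 * h))
    return g

# Arbitrary illustrative values (assumptions, not taken from the example).
x, eps = 0.8, 0.3
enc_w, dec_w = [0.5, -0.2], [1.5]

# Decoder gradients: the KL term ignores the decoder weights entirely.
kl_dec_grad = grad(lambda d: kl_loss(x, enc_w), dec_w)
recon_dec_grad = grad(lambda d: recon_loss(x, enc_w, d, eps), dec_w)
total_dec_grad = grad(lambda d: total_loss(x, enc_w, d, eps), dec_w)

# Encoder gradients: both terms contribute.
kl_enc_grad = grad(lambda e: kl_loss(x, e), enc_w)
recon_enc_grad = grad(lambda e: recon_loss(x, e, dec_w, eps), enc_w)
total_enc_grad = grad(lambda e: total_loss(x, e, dec_w, eps), enc_w)

print("decoder grad from KL:   ", kl_dec_grad)
print("decoder grad from recon:", recon_dec_grad)
print("decoder grad from total:", total_dec_grad)
print("encoder grad from total:", total_enc_grad)
```

In a real framework the same thing should happen automatically: backpropagating the summed loss sends the KL gradient only to the encoder, so a single optimizer over the total loss is usually simpler than running two separate optimization steps.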