You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I observed that during the training process, firstly, based on the Lora structure, we infer denoised_latents from randomly initialized latents,
Then, based on denoised_latents and the frozen SD structure, continue to predict noise? denoised_latents is already the denoised image, what is the principle of predicting noise again? Why not predict noise for randomly initialized latents?
The text was updated successfully, but these errors were encountered:
I observed that during the training process, firstly, based on the Lora structure, we infer denoised_latents from randomly initialized latents,
Then, based on denoised_latents and the frozen SD structure, continue to predict noise? denoised_latents is already the denoised image, what is the principle of predicting noise again? Why not predict noise for randomly initialized latents?
The text was updated successfully, but these errors were encountered: