Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mask guidance, inpaiting and outpaiting #49

Open
sahil02235 opened this issue Jul 25, 2024 · 7 comments
Open

Mask guidance, inpaiting and outpaiting #49

sahil02235 opened this issue Jul 25, 2024 · 7 comments

Comments

@sahil02235
Copy link

Thanks for the awesome paper. Even the codebase is very easy to use.
Can you please do some initial experiments on mask guidance image generation, inpainting and outpainting. It will really help the community.

@daiyixiang666
Copy link

The autoregreesive model acutally is not really good for mask guidance image generation in my opinion

@sahil02235
Copy link
Author

@daiyixiang666 theoretically i don't understand why it should perform badly, depends on how you are doing the conditioning.

@iFighting
Copy link
Collaborator

The autoregreesive model acutally is not really good for mask guidance image generation in my opinion

I think autoregreesive model perform better when align with language model

@daiyixiang666
Copy link

If the mask is just like casual mask I think it will be great, but I do not think we always has the casual mask in real life

@sahil02235
Copy link
Author

@iFighting While it's true that autoregressive models can generate high-quality images without relying on text-based or mask-based conditioning, it's important to acknowledge that diffusion-based models like DiT have also demonstrated impressive results. However, diffusion models do face challenges when it comes to text-based and mask-based conditioning.
Given that autoregressive models can generate high-quality images and handle text-based alignment effectively, a promising avenue for future research could be exploring different types of conditioning within these models. This would involve testing the accuracy of these new conditioning methods and addressing any limitations that arise.

@sahil02235
Copy link
Author

For control generation (reference): https://arxiv.org/pdf/2406.09750

@lxa9867
Copy link

lxa9867 commented Jul 27, 2024

We are releasing code recently. If you are interested in controllable AR generation, please keep an eye on https://github.com/lxa9867/ControlVAR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants