Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

generate images with arbitrary resolutions #84

Open
Leiii-Cao opened this issue Sep 1, 2024 · 2 comments
Open

generate images with arbitrary resolutions #84

Leiii-Cao opened this issue Sep 1, 2024 · 2 comments

Comments

@Leiii-Cao
Copy link

Is it true that a VAE model can only generate images with the same dimensions as the training data? For example, if the model was trained on 256x256 images, is there any way to use a checkpoint from that model to generate images with arbitrary resolutions, such as 352x275?

@iFighting
Copy link
Contributor

iFighting commented Nov 29, 2024

Is it true that a VAE model can only generate images with the same dimensions as the training data? For example, if the model was trained on 256x256 images, is there any way to use a checkpoint from that model to generate images with arbitrary resolutions, such as 352x275?

@Leiii-Cao
In fact, this is not the case, and we will soon release work on T2I based on VAR to support arbitrary resolution generation.

Also, VAE is a CNN structure, so it can be reconstructed at any resolution

@JeyesHan
Copy link

@Leiii-Cao
Powered by a CNN structure, VAE could encode and decode images with arbitrary resolution images. However, VAR only generates square images. Our recent work Infinity (text-to-image model for VAR) could generates images with various aspect ratios. Please check https://github.com/FoundationVision/Infinity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants