generate images with arbitrary resolutions #84

Leiii-Cao · 2024-09-01T12:09:24Z

Is it true that a VAE model can only generate images with the same dimensions as the training data? For example, if the model was trained on 256x256 images, is there any way to use a checkpoint from that model to generate images with arbitrary resolutions, such as 352x275?

iFighting · 2024-11-29T08:40:21Z

Is it true that a VAE model can only generate images with the same dimensions as the training data? For example, if the model was trained on 256x256 images, is there any way to use a checkpoint from that model to generate images with arbitrary resolutions, such as 352x275?

@Leiii-Cao
In fact, this is not the case, and we will soon release work on T2I based on VAR to support arbitrary resolution generation.

Also, VAE is a CNN structure, so it can be reconstructed at any resolution

JeyesHan · 2024-12-13T09:02:56Z

@Leiii-Cao
Powered by a CNN structure, VAE could encode and decode images with arbitrary resolution images. However, VAR only generates square images. Our recent work Infinity (text-to-image model for VAR) could generates images with various aspect ratios. Please check https://github.com/FoundationVision/Infinity

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

generate images with arbitrary resolutions #84

generate images with arbitrary resolutions #84

Leiii-Cao commented Sep 1, 2024

iFighting commented Nov 29, 2024 •

edited

Loading

JeyesHan commented Dec 13, 2024

generate images with arbitrary resolutions #84

generate images with arbitrary resolutions #84

Comments

Leiii-Cao commented Sep 1, 2024

iFighting commented Nov 29, 2024 • edited Loading

JeyesHan commented Dec 13, 2024

iFighting commented Nov 29, 2024 •

edited

Loading