Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DeepSVG for text-conditioned vector generation. #32

Open
nd7141 opened this issue Mar 24, 2023 · 1 comment
Open

DeepSVG for text-conditioned vector generation. #32

nd7141 opened this issue Mar 24, 2023 · 1 comment

Comments

@nd7141
Copy link

nd7141 commented Mar 24, 2023

I wonder if it's possible to adapt DeepSVG to replace the VAE block in stable diffusion to generate vector graphics?

I see a couple of problems.

  1. The latent embedding size in DeepSVG (256) does not match latent embedding size of SD (64).
  2. diffusers library expects bin file instead of pth. There is a script to convert it to diffusers but it seems to use AutoencoderKL, which I'm not sure the right architecture.

I wonder if you know an easy way to adopt DeepSVG for diffusers library?

@PranavSudersan
Copy link

PranavSudersan commented Mar 29, 2023

That's a great idea. Although I wonder if training a text based LM over the SVG source code dataset would be a better way to go about this, I don't know.
Edit: I managed to find a project called VectorFusion which generates SVG from text description using the diffusion model. The authors have a paper on arXiv but they have not published their code unfortunately. The main author has an old github repository which does something similar, but I haven't tried it yet.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants