DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

This repository contains our official implementation of the NeurIPS 2023 paper: DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models, which can generate high-quality vector sketches based on text prompts.

Our project page can be found here.

DiffSketcher Rendering Process:


Prompt: Macaw full color, ultra detailed, realistic, insanely beautiful	Prompt: Very detailed masterpiece painting of baby yoda hoding a lightsaber	Prompt: Sailboat sailing in the sea on a clear day

🆕 Update

[01/2024] 🔥 We released the SVGDreamer. SVGDreamer is a novel text-guided vector graphics synthesis method. This method considers both the editing of vector graphics and the quality of the synthesis.
[12/2023] 🔥 We released the PyTorch-SVGRender. Pytorch-SVGRender is the go-to library for state-of-the-art differentiable rendering methods for image vectorization.
[11/2023] We thank @camenduru for implementing the DiffSketcher-colab.
[10/2023] We released the DiffSketcher code.
[10/2023] We released the VectorFusion code.

TODO

Add a webUI demo.
Add support for colorful results and oil painting.

🔧 Installation

Create a new conda environment:

conda create --name diffsketcher python=3.10
conda activate diffsketcher

Install pytorch and the following libraries:

conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.6 -c pytorch -c nvidia
pip install omegaconf BeautifulSoup4
pip install opencv-python scikit-image matplotlib visdom wandb
pip install triton numba
pip install numpy scipy timm scikit-fmm einops
pip install accelerate transformers safetensors datasets

Install CLIP:

pip install ftfy regex tqdm
pip install git+https://github.com/openai/CLIP.git

Install diffusers:

pip install diffusers==0.20.2

Install xformers (require python=3.10):

conda install xformers -c xformers

Install diffvg:

git clone https://github.com/BachiLi/diffvg.git
cd diffvg
git submodule update --init --recursive
conda install -y -c anaconda cmake
conda install -y -c conda-forge ffmpeg
pip install svgwrite svgpathtools cssutils torch-tools
python setup.py install

🔥 Quickstart

Example:

Preview:

Script:

python run_painterly_render.py \ 
  -c diffsketcher.yaml \
  -eval_step 10 -save_step 10 \
  -update "token_ind=4 num_paths=96 sds.warmup=1000 num_iter=1500" \ 
  -pt "a photo of Sydney opera house" \ 
  -respath ./workdir/sydney_opera_house \ 
  -d 8019 \
  --download

-c a.k.a --config: configuration file, saving in DiffSketcher/config/.
-eval_step: the step size used to eval the method (too frequent calls will result in longer times).
-save_step: the step size used to save the result (too frequent calls will result in longer times).
-update: a tool for editing the hyper-params of the configuration file, so you don't need to create a new yaml.
-pt a.k.a --prompt: text prompt.
-respath a.k.a --results_path: the folder to save results.
-d a.k.a --seed: random seed.
--download: download models from huggingface automatically when you first run them.

crucial:

-update "token_ind=2" indicates the index of cross-attn maps to init strokes.
-update "num_paths=96" indicates the number of strokes.

optional:

-npt, a.k.a --negative_prompt: negative text prompt.
-mv, a.k.a --make_video: make a video of the rendering process (it will take much longer).
-frame_freq, a.k.a --video_frame_freq: control video frame.
Note: Download U2Net model and place in checkpoint/ dir if xdog_intersec=True
add enable_xformers=True in -update to enable xformers for speeding up.
add gradient_checkpoint=True in -update to use gradient checkpoint for low VRAM.

Another example

Preview:

Script:

python run_painterly_render.py \ 
  -c diffsketcher-width.yaml \
  -eval_step 10 -save_step 10 \
  -update "token_ind=4 num_paths=48 num_iter=500" \ 
  -pt "a photo of Sydney opera house" \ 
  -respath ./workdir/sydney_opera_house \ 
  -d 8019 \
  --download

More Sketch Examples

check the run.md for more scripts.

Oil Painting

Preview:

Script:

python run_painterly_render.py \
  -c diffsketcher-color.yaml \
  -eval_step 10 -save_step 10 \
  -update "token_ind=4 num_paths=2000 num_iter=1500" \
  -pt "portrait of latin woman having a spiritual awaking, eyes closed, slight smile, illuminating lights, oil painting, by Van Gogh" \
  -respath ./workdir/portrait_woman_oil_painting \
  -d 676955

Colorful Results

Preview:

Script:

python run_painterly_render.py \
  -c diffsketcher-color.yaml \
  -eval_step 10 -save_step 10 \
  -update "token_ind=2 num_paths=128 num_iter=1500" \
  -pt "A horse is drinking water by the lake" \
  -respath ./workdir/draw_horse_rgba \
  -d 998

📚 Acknowledgement

The project is built based on the following repository:

We gratefully thank the authors for their wonderful works.

📎 Citation

If you use this code for your research, please cite the following work:

@inproceedings{xing2023diffsketcher,
    title={DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models},
    author={XiMing Xing and Chuang Wang and Haitao Zhou and Jing Zhang and Qian Yu and Dong Xu},
    booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
    year={2023},
    url={https://openreview.net/forum?id=CY1xatvEQj}
}

©️ Licence

This work is licensed under a MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
checkpoint		checkpoint
config		config
img		img
libs		libs
methods		methods
pipelines		pipelines
.gitignore		.gitignore
CITATION.bib		CITATION.bib
LICENSE		LICENSE
README.md		README.md
run.md		run.md
run_painterly_render.py		run_painterly_render.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

🆕 Update

TODO

🔧 Installation

🔥 Quickstart

Example:

Another example

More Sketch Examples

Oil Painting

Colorful Results

📚 Acknowledgement

📎 Citation

©️ Licence

About

Releases

Packages

Languages

License

Yhc-777/DiffSketcher

Folders and files

Latest commit

History

Repository files navigation

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

🆕 Update

TODO

🔧 Installation

🔥 Quickstart

Example:

Another example

More Sketch Examples

Oil Painting

Colorful Results

📚 Acknowledgement

📎 Citation

©️ Licence

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages