Official repo for Composer: Creative and Controllable Image Synthesis with Composable Conditions.
See Project Page for more examples.
Composer is a large (5 billion parameters) controllable diffusion model trained on billions of (text, image) pairs. It can exponentially expand the control space through composition, leading to an enormous number of ways to generate and manipulate images, i.e., making the infinite use of finite means.
- Release training and inference code.
- Release pretrained models.
- Release Gradio UI.
- A light-weighted Latent-Composer built upon Stable Diffusion 2.1.
@article{lhhuang2023composer,
title={Composer: Creative and Controllable Image Synthesis with Composable Conditions},
author={Huang, Lianghua and Chen, Di and Liu, Yu and Yujun, Shen and Zhao, Deli and Jingren, Zhou},
booktitle={arXiv preprint arxiv:2302.09778},
year={2023}
}