Pretraining is All You Need for Image-to-Image Translation
Paper
•
2205.12952
•
Published
svjack/Stable-Diffusion-FineTuned-zh-v0 is a Chinese-specific latent text-to-image diffusion model capable of generating images given any Chinese text input.
This model was trained by using a powerful text-to-image model, diffusers For more information about our training method, see train_zh_model.py. With the help of a good baseline model Taiyi-Stable-Diffusion-1B-Chinese-v0.1 from IDEA-CCNL
Firstly, install our package as follows. This package is modified 🤗's Diffusers library to run Chinese Stable Diffusion.
diffusers==0.6.0
transformers
torch
datasets
accelerate
sentencepiece
Run this command to log in with your HF Hub token if you haven't before:
huggingface-cli login
Running the pipeline with the LMSDiscreteScheduler scheduler:
from diffusers import StableDiffusionPipeline
pipeline = StableDiffusionPipeline.from_pretrained("svjack/Stable-Diffusion-FineTuned-zh-v2")
pipeline.safety_checker = lambda images, clip_input: (images, False)
pipeline = pipeline.to("cuda")
prompt = '女孩们打开了另一世界的大门'
image = pipeline(prompt, guidance_scale=7.5).images[0]