RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

MIT License

Stars: 1.7K

Ecosystems: Jupyter Notebook

Time to Close Issues: 5 days

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to gene...

Let us democratise high-resolution generation! (CVPR 2024)

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"