Latte

Latte: Latent Diffusion Transformer for Video Generation.

APACHE-2.0 License

Stars

1.7K

View Code on GitHub

Ecosystems: Python

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

3 months

Badges

Extracted from project README

Related Projects

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

29 May 2022 7,818

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

29 Feb 2024 1,624

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

12 Oct 2023 2,138

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...

23 Feb 2024 1,061

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by ...

21 Aug 2023 4,880

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on di...

06 Nov 2023 2,650

TexForce

Official PyTorch codes for "Enhancing Diffusion Models with Text-Encoder Reinforcement Learning",...

27 Nov 2023 46

prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

02 Mar 2023 1,295

IF

DeepFloyd-IF (Imagen Free)

20 Jan 2023 7,656

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vi...

19 Mar 2023 36,628

DiffBIR

Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

28 Aug 2023 3,289

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多...

22 Nov 2023 5,641

TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video ...

02 Apr 2021 1,518

LGM

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

06 Feb 2024 1,590

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...

06 Jul 2023 1,448