High Resolution Depth Maps for Stable Diffusion WebUI
MIT License
Official code base of the BEVDet series .
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
A concise but complete implementation of CLIP with various experimental improvements from recent ...
DALL·E Mini - Generate images from a text prompt
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution
Segment-Anything + 3D. Let's lift anything to 3D.
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular...
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable...
CVPR 2024 论文和开源项目合集
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example...
TRI-ML Monocular Depth Estimation Repository