🎯 ML-based positioning method from mmWave transmissions - with high accuracy and energy efficiency
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
[CVPR 2024] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transfo...
PyTorch implementation, based on YOLOv4, of the paper "Complex-YOLO: Real-time 3D Object Detec...
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
Implementation of Open-Set Likelihood Maximization for Few-Shot Learning
CVPR2023 - Activating More Pixels in Image Super-Resolution Transformer Arxiv - HAT: Hybrid Atten...
Deep Feature Flow for Video Recognition
The official repo for the CVPR 2021 paper ViPNAS: Efficient Video Pose Estimation via Neural Architecture S...
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregress...
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by ...
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".