Chain-of-Hindsight, A Scalable RLHF Method
[ECCV 2022] New benchmark for evaluating pre-trained models; New supervised contrastive learning fr...
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022)
Generic image compressor for machine learning. PyTorch code for our paper "Lossy compression for ...
Embed arbitrary modalities (images, audio, documents, etc.) into large language models.
Geom3D: Geometric Modeling on 3D Structures, NeurIPS 2023
Evaluating text-to-image/video/3D models with VQAScore
Code for EMNLP 2018 paper "Commonsense for Generative Multi-Hop Question Answering Tasks"
[ICCV 2021] Official implementation of "The Surprising Effectiveness of Visual Odometry Technique...
Shape and pose estimation using discretized signed distance fields
Bamboo: 4 times larger than ImageNet; 2 times larger than Objects365; Built by active learning.
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Train vision models using JAX and 🤗 transformers