Language Quantized AutoEncoders
Chain-of-Hindsight, A Scalable RLHF Method
Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding...
Instruction Following Agents with Multimodal Transforemrs
Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"
Fine-tuned LLaMa2 13B model designed for ReAct-style and Tree-Of-Thoughts style prompting.
本项目基于PaddleDetection目标检测开发套件,选取1.3M超轻量PPYOLO tiny进行项目开发,并部署于windows端。
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation
Hybrid Discriminative-Generative Training via Contrastive Learning
We present NoticIA, a dataset consisting of 850 Spanish news articles featuring prominent clickba...
Geom3D: Geometric Modeling on 3D Structures, NeurIPS 2023