[Neurips 2023] Generating Mario Levels with GPT2. Code for the paper "MarioGPT: Open-Ended Text2Level Generation through Large Language Models" https://arxiv.org/abs/2302.05981
MIT License
Ongoing research training transformer models at scale
utilities for decoding deep representations (like sentence embeddings) back to text
SGLang is a structured generation language designed for large language models (LLMs). It makes yo...
GLM (General Language Model)
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
My python journey
Mamba state-space model
Using LLMs and pre-trained caption models for super-human performance on image captioning.
VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one pl...
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"