RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.
LLaMA: Open and Efficient Foundation Language Models
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 20...
Home of StarCoder: fine-tuning & inference!
Some preliminary explorations of Mamba's context scaling.
Reproduce the results of the "Neuroevolution of Self-Interpretable Agents" paper.
Pipeline for training language models using PyTorch.
Unofficial PyTorch implementation of "Masked Autoencoders Are Scalable Vision Learners"
TensorFlow implementation of contextualized word representations from bi-directional language models
Home of StarCoder2!
Experiments for XLM-V Transformers Integration
Code for the paper "Fine-tune BERT for Extractive Summarization"
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training