rotobart

Pre-training BART in Flax on The Pile dataset

MIT License

Stars

View Code on GitHub

Ecosystems: Python

Statistics for this project are still being loaded, please check back later.

Related Projects

BertSum

Code for paper Fine-tune BERT for Extractive Summarization

25 Mar 2019 1,464

video-transformers

Easiest way of fine-tuning HuggingFace video classification models

12 Aug 2022 131

Complex-YOLOv4-Pytorch

The PyTorch Implementation based on YOLOv4 of the paper: "Complex-YOLO: Real-time 3D Object Detec...

03 Jul 2020 1,234

minGPT-flax

GPT implementation in Flax

30 Sep 2021 16

fairseq_mmt

17 Nov 2021 4

starcoder

Home of StarCoder: fine-tuning & inference!

24 Apr 2023 7,267

starcoder2

Home of StarCoder2!

08 Dec 2023 1,732

pytorch-fabric-demo

15 Mar 2023 24

JaraConverse-TransformersBased

This JaraConverse model is a cutting-edge Transformer-based supervised Language Model (LLM) speci...

02 Aug 2024 5

GPT2

An implementation of training for GPT2, supports TPUs

13 May 2019 1,419

RWKV-v2-RNN-Pile

RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.

07 Apr 2022 49

CogIE

CogIE: An Information Extraction Toolkit for Bridging Text and CogNet. ACL 2021

21 Mar 2021 67

bilm-tf

Tensorflow implementation of contextualized word representations from bi-directional language models

29 Sep 2017 1,620

jax-lm-training

generative language model training on top of the JAX and Huggingface 🤗

05 May 2022 8

magicoder

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

10 Nov 2023 1,966