Pre-training BART in Flax on The Pile dataset
MIT License
Statistics for this project are still being loaded, please check back later.
Code for paper Fine-tune BERT for Extractive Summarization
Easiest way of fine-tuning HuggingFace video classification models
The PyTorch Implementation based on YOLOv4 of the paper: "Complex-YOLO: Real-time 3D Object Detec...
GPT implementation in Flax
Home of StarCoder: fine-tuning & inference!
Home of StarCoder2!
This JaraConverse model is a cutting-edge Transformer-based supervised Language Model (LLM) speci...
An implementation of training for GPT2, supports TPUs
RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.
CogIE: An Information Extraction Toolkit for Bridging Text and CogNet. ACL 2021
Tensorflow implementation of contextualized word representations from bi-directional language models
generative language model training on top of the JAX and Huggingface 🤗
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct