Token-free Language Modeling with ByGPT5 & Friends!
APACHE-2.0 License
A curated list of useful Python packages for data geeks
hmBench: Fine-Tuning, Evaluating & Benchmarking of Historic Language Models on NER Datasets
PyTorch Seq2Seq framework
We present NoticIA, a dataset consisting of 850 Spanish news articles featuring prominent clickba...
Implementations of Transformer models for training from scratch.
Data repository for pretrained NLP models and NLP corpora.
Find better generation parameters for your LLM
Asian language BART models (En, Ja, Ko, Zh, ECJK)
Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022)
LLM Inference benchmark
Fine-tuning experiments for the GPT-2 model by OpenAI.
Geom3D: Geometric Modeling on 3D Structures, NeurIPS 2023