summarize_from_feedback_details

MIT License

Stars

105

View Code on GitHub View on X

Ecosystems: Python

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

N/A

Related Projects

llmsearch

Find better generation parameters for your LLM

30 Mar 2023 27

LightningConversation

Lightning implementation of seq2seq dialog model

23 Apr 2020 6

indic_eval

A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse ra...

26 Mar 2024 31

hrq-vae

Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022)

11 Oct 2021 51

chain-of-hindsight

Chain-of-Hindsight, A Scalable RLHF Method

20 Feb 2023 211

uniformers

Token-free Language Modeling with ByGPT5 & Friends!

17 Jun 2022 9

MT-SFT-ShareGPT

18 Aug 2024 3

ProteinDT

05 Feb 2023 41

lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

12 Jun 2023 146

NoticIA

We present NoticIA, a dataset consisting of 850 Spanish news articles featuring prominent clickba...

02 Mar 2024 3

pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

20 Feb 2023 175

Twin-Merging

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

03 Jun 2024 18

RL4LMs

A modular RL library to fine-tune language models to human preferences

18 Aug 2022 2,183

ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

14 Jan 2022 626