self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

MIT License

Downloads

1.9K

Stars

1.3K

Committers

View Code on GitHub

Ecosystems: Python

Bot releases are hidden (Show)

self-rewarding-lm-pytorch - 0.2.12 Latest Release

Published by lucidrains 6 months ago

What's Changed

Fixed deep copy, shallow copy error and label mask error. by @Control-derek in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/29

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.11...0.2.12

self-rewarding-lm-pytorch - 0.2.11

Published by lucidrains 7 months ago

What's Changed

Solves the problem that some variables are not declared by @Control-derek in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/28

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.10...0.2.11

self-rewarding-lm-pytorch - 0.2.10

Published by lucidrains 7 months ago

What's Changed

Solves the problem that some variables are not declared by @Control-derek in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/27

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.9...0.2.10

self-rewarding-lm-pytorch - 0.2.9

Published by lucidrains 7 months ago

What's Changed

add self. by @Control-derek in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/26

New Contributors

@Control-derek made their first contribution in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/26

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.8...0.2.9

self-rewarding-lm-pytorch - 0.2.8

Published by lucidrains 8 months ago

What's Changed

Fix TypeError for is_valid_reward in SelfRewardDPOConfig by @ViswanathaReddyGajjala in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/19

New Contributors

@ViswanathaReddyGajjala made their first contribution in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/19

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.7...0.2.8

self-rewarding-lm-pytorch - 0.2.7

Published by lucidrains 8 months ago

What's Changed

Update self_rewarding_lm_pytorch.py by @unaidedelf8777 in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/17

New Contributors

@unaidedelf8777 made their first contribution in https://github.com/lucidrains/self-rewarding-lm-pytorch/pull/17

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.5...0.2.7

self-rewarding-lm-pytorch - 0.2.5

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.4...0.2.5

self-rewarding-lm-pytorch - 0.2.4

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.3...0.2.4

self-rewarding-lm-pytorch - 0.2.3

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.2...0.2.3

self-rewarding-lm-pytorch - 0.2.2

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.1...0.2.2

self-rewarding-lm-pytorch - 0.2.1

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.0...0.2.1

self-rewarding-lm-pytorch - 0.2.0

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.1.1...0.2.0

self-rewarding-lm-pytorch - 0.1.1

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.1.0...0.1.1

self-rewarding-lm-pytorch - 0.1.0

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.0.42...0.1.0

self-rewarding-lm-pytorch - 0.0.42

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.0.41...0.0.42

self-rewarding-lm-pytorch - 0.0.41

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.0.40...0.0.41

self-rewarding-lm-pytorch - 0.0.40

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.0.39...0.0.40

self-rewarding-lm-pytorch - 0.0.39

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.0.38...0.0.39

self-rewarding-lm-pytorch - 0.0.38

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.0.37...0.0.38

self-rewarding-lm-pytorch - 0.0.37

Published by lucidrains 9 months ago

Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.0.36...0.0.37

Package Rankings

Top 21.42% on Pypi.org

Related Projects

nuwa-pytorch

Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch

28 Nov 2021 540

MOSS-RLHF

MOSS-RLHF

05 Jul 2023 1,274

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

29 Jan 2024 5,590

ICLR2024-Papers-with-Code

ICLR 2024 论文和开源项目合集

14 May 2024 104

open-instruct

09 Jun 2023 1,214

x-clip

A concise but complete implementation of CLIP with various experimental improvements from recent ...

01 Dec 2021 681

GLM

GLM (General Language Model)

18 Mar 2021 3,170

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

19 Apr 2023 1,269

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features fr...

24 Oct 2020 4,226

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vi...

19 Mar 2023 36,628

DPO-ST

[ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Re...

09 Sep 2022 2,399

DALLE-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

05 Jan 2021 5,563

phenaki-pytorch

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 min...

29 Sep 2022 746

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architectu...

09 Dec 2022 7,595