PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

MIT License

Downloads
1.4K
Stars
7.6K
Committers
5

Bot releases are hidden (Show)

PaLM-rlhf-pytorch - 0.2.1 Latest Release

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.2.0

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.1.4

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.1.2

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.1.1

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.1.0

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.68

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.67

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.66

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.65

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.64

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.63

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.62

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.61

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.60

Published by lucidrains over 1 year ago

PaLM-rlhf-pytorch - 0.0.59

Published by lucidrains almost 2 years ago

PaLM-rlhf-pytorch - 0.0.58

Published by lucidrains almost 2 years ago

PaLM-rlhf-pytorch - 0.0.57

Published by lucidrains almost 2 years ago

PaLM-rlhf-pytorch - 0.0.56

Published by lucidrains almost 2 years ago

PaLM-rlhf-pytorch - 0.0.55

Published by lucidrains almost 2 years ago

Package Rankings
Top 4.34% on Pypi.org
Related Projects