ReST-EM-pytorch

Implementations and explorations into the ReST𝐸𝑀 algorithm in the new deepmind paper "Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models"

MIT License

Stars
39

No README available, please check again later.

Related Projects