Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
MIT License
Bot releases are hidden (Show)
Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.11...0.2.12
Published by lucidrains 7 months ago
Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.10...0.2.11
Published by lucidrains 7 months ago
Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.9...0.2.10
Published by lucidrains 7 months ago
Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.8...0.2.9
Published by lucidrains 8 months ago
Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.7...0.2.8
Published by lucidrains 8 months ago
Full Changelog: https://github.com/lucidrains/self-rewarding-lm-pytorch/compare/0.2.5...0.2.7
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago
Published by lucidrains 9 months ago