Self-Rewarding-Language-Models
Self-Rewarding-Language-Models/M0/train
About

This repository is an effort to reproduce the "Self-Rewarding Language Models" paper (https://arxiv.org/abs/2401.10020) by the team at [Meta.ai](http://meta.ai/), using a smaller model that the community can fine-tune.

25 commits
1 contributor
27 downloads
34.3 MB
0 stars
Folder contents
3 text files > 99%
Contributors
Ox Data Bot 🤖
@oxbot