dpo.jsonl 6.5 MB
adding dpo dataset based off of the judgements
2 years ago

combined_responses_llama_405b.jsonl 12.5 MB
Judging Llama 405B Responses
2 years ago

generic_thoughts.jsonl 6.3 MB
adding specific and generic thoughts
2 years ago

specific_thoughts.jsonl 8.3 MB
adding specific and generic thoughts
2 years ago

combined_responses.jsonl 13.7 MB
combining the better outputs
2 years ago