Collections/ox/llm-feedback

LLM-Feedback

Datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO.

This dataset contains 33K cleaned conversations with pairwise human preferences. It is collected from 13K unique IP addresses on the Chatbot Arena from April to June 2023. Each sample includes a question ID, two model names, their full conversation text in OpenAI API JSON format, the user vote, the anonymized user ID, the detected language tag, the OpenAI moderation API tag, the additional toxic tag, and the timestamp.

41.6 mb
12
Updated: 1 year ago

19.1 mb
12
Updated: 1 year ago

519.5 mb
12
Updated: 1 year ago

183.5 mb
22
Updated: 1 year ago

19.5 mb
22
Updated: 1 year ago

24.2 mb
22
Updated: 1 year ago

26 mb
22
Updated: 1 year ago

321.7 mb
12
Updated: 1 year ago