Featured Datasets
0
276.9 mb
21
test very large graph from an energy based social network simulation from https://synthasaizer.com/
618.3 mb
11
Public
0
4.5 gb
10K21
Public
0
59.5 mb
1
Public
0
Example dataset constructed from the steps and prompts in the "Thinking LLMs: General Instruction Following With Thought Generation" paper.
189.3 mb
11
View all featured repositories
Featured Collections
Some of the Oxen team's favorite collections.
Visual LLMs
This collection is datasets for understanding of images with large language models
a collection by datasets
LLM-Feedback
Datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO.
a collection by ox
Multimodal
List of datasets that cross modalities, combinations of text, image, audio, video etc.
a collection by ox
Browse all collections