
ox
User Account
Bessie
ox's Repositories
public
1
This is an aggregate dataset composed of Dolly HHRLHF (derived from the Databricks Dolly-15k and the Anthropic Helpful and Harmless (HH-RLHF) datasets), combined with Competition Math, Duorc, CoT GSM8k, Qasper, Quality, Summ Screen FD, and Spider. The intent was to create a permissively licensed instruction-following dataset with a large number of long-form samples.
233.5 mb
2
public
19
239.9 mb
3312100K
88.9 mb
15K
2
public
2
This is an automated nightly crawl of dad jokes from the r/dadjokes subreddit.
7.5 mb
11
public
13
5.9 gb
132
The PIQA dataset introduces the task of physical commonsense reasoning and a corresponding benchmark dataset, Physical Interaction: Question Answering (PIQA).
public
40
Dataset of black-and-white 44x44 pixel images for emotion detection.
59.9 mb
36K
54