Bessie
ox
User account
ox's Repositories
Displaying Page 3 of 12 (120 total Repositories)
Public
0
Combining instruct data with SQUAD data to see if we can get a more generic model
179.7 mb
1
Public
1
This is an aggregate dataset, comprised of Dolly HHRLHF (derived from the Databricks Dolly-15k and the Anthropic Helpful and Harmless (HH-RLHF) datasets), combined with Competition Math, Duorc, CoT GSM8k, Qasper, Quality, Summ Screen FD and Spider. The intention was to create a permissively-licensed instruction-following dataset with a large number of longform samples.
233.5 mb
2
Public
19
239.9 mb
3312100K
88.9 mb
5K21
Public
2
This is an automated nightly crawl of a dad joke dataset from r/dadjokes reddit
7.5 mb
11
Public
13
5.9 gb
132
The PIQA dataset introduces the task of physical commonsense reasoning and a corresponding benchmark dataset Physical Interaction: Question Answering or PIQA.
Public
2
2.3 gb
14.9K2