Bessie
ox
User account
ox's Repositories
Displaying Page 2 of 12 (113 total Repositories)
Public
1
This is an aggregate dataset composed of Dolly HHRLHF (derived from the Databricks Dolly-15k and the Anthropic Helpful and Harmless (HH-RLHF) datasets), combined with Competition Math, Duorc, CoT GSM8k, Qasper, Quality, Summ Screen FD, and Spider. The intention was to create a permissively licensed instruction-following dataset with a large number of long-form samples.
233.5 mb
2
Public
19
239.9 mb
2100K331
88.9 mb
5K12
Public
2
This dataset is an automated nightly crawl of dad jokes from the r/dadjokes subreddit.
7.5 mb
11
Public
13
5.9 gb
132
The PIQA dataset introduces the task of physical commonsense reasoning and a corresponding benchmark dataset, Physical Interaction: Question Answering (PIQA).
Public
1
2.3 gb
14.9K2