Bessie
Bessie
ox
User account
ox's Repositories
Displaying Page 6 of 15 (148 total Repositories)

🐱 Cats vs Dogs 🐶 which is better? Contribute your cat or dog to make the worlds largest cat and dog repository.

53.4 mb
119
Updated: 1 year ago

Combining instruct data with SQUAD data to see if we can get a more generic model

179.7 mb
1
Updated: 1 year ago

This is an aggregate dataset, comprised of Dolly HHRLHF (derived from the Databricks Dolly-15k and the Anthropic Helpful and Harmless (HH-RLHF) datasets), combined with Competition Math, Duorc, CoT GSM8k, Qasper, Quality, Summ Screen FD and Spider. The intention was to create a permissively-licensed instruction-following dataset with a large number of longform samples.

233.5 mb
2
Updated: 1 year ago

239.9 mb
100K2331
Updated: 2 years ago
public
16

88.9 mb
5K21
Updated: 2 years ago

This is a crawl of the top hacker news posts daily

7.4 mb
1
Updated: 2 years ago

This is an automated nightly crawl of a dad joke dataset from r/dadjokes reddit

7.5 mb
11
Updated: 2 years ago

5.9 gb
132
Updated: 2 years ago

Creating a dataset to fine-tune Mamba 🐍

302.8 mb
121
Updated: 2 years ago
public
0

The PIQA dataset introduces the task of physical commonsense reasoning and a corresponding benchmark dataset Physical Interaction: Question Answering or PIQA.