Bessie
Bessie
ox
User account
ox's Repositories
Displaying Page 1 of 11 (107 total Repositories)
Updated: 5 days ago

🐱 Cats vs Dogs 🐶 which is better? Contribute your cat or dog to make the worlds largest cat and dog repository.

13.3 mb
1
00
BitNet
Public
Updated: 2 weeks ago

20.6 gb
3298
10
Updated: 3 weeks ago

Combining instruct data with SQUAD data to see if we can get a more generic model

179.7 mb
1
00
Updated: 3 weeks ago

This is an aggregate dataset, comprised of Dolly HHRLHF (derived from the Databricks Dolly-15k and the Anthropic Helpful and Harmless (HH-RLHF) datasets), combined with Competition Math, Duorc, CoT GSM8k, Qasper, Quality, Summ Screen FD and Spider. The intention was to create a permissively-licensed instruction-following dataset with a large number of longform samples.

233.5 mb
2
10
SQuAD
Public
Updated: 3 weeks ago

Question answering dataset.

145.4 mb
29
61
Updated: 3 months ago

STL10
Public
Updated: 3 months ago

Updated: 3 months ago

This is a crawl of the top hacker news posts daily

7.4 mb
1
30
Updated: 3 months ago

This is an automated nightly crawl of a dad joke dataset from r/dadjokes reddit

7.5 mb
11
20
Updated: 3 months ago

5.9 gb
132
130