Bessie
ox
User account
ox's Repositories
Displaying Page 1 of 11 (108 total Repositories)
SEDD_baby_names
Public2.4 kB
1
90
CatsVsDogs
Public🐱 Cats vs Dogs 🐶 which is better? Contribute your cat or dog to make the worlds largest cat and dog repository.
48.3 mb
171
00
BitNet
Public20.6 gb
8329
10
InstructQA
PublicCombining instruct data with SQUAD data to see if we can get a more generic model
179.7 mb
1
00
mosaicml-instruct-v3
PublicThis is an aggregate dataset, comprised of Dolly HHRLHF (derived from the Databricks Dolly-15k and the Anthropic Helpful and Harmless (HH-RLHF) datasets), combined with Competition Math, Duorc, CoT GSM8k, Qasper, Quality, Summ Screen FD and Spider. The intention was to create a permissively-licensed instruction-following dataset with a large number of longform samples.
233.5 mb
2
10
IMDB-Movie-Reviews
Public239.9 mb
2331100K
STL10
Public88.9 mb
25K1
reddit_dad_jokes
PublicThis is an automated nightly crawl of a dad joke dataset from r/dadjokes reddit
7.5 mb
11
20