datasets
Organization Account
datasets's Repositories
Displaying Page 12 of 15 (150 total Repositories)
winogrande
Public 790.3 kB
23
00
cifar100
Public 134.4 MB
60K22
cifar10
Public 136.1 MB
60K22
idl-wds
Public 3.2 GB
7K1017K
20
pdfa-eng-words
Public 3.9 GB
7.1K1110
00
arxiv_papers
Public
A dataset of arXiv papers to build on for fine-tuning an LLM
35.7 GB
122K23K
20
Pexels
Public 13.5 GB
1K19972
00
This repository is an effort to recreate the "Self-Rewarding Language Models" paper by the team at [Meta.ai](http://meta.ai/), but using a smaller model that the community can fine-tune. https://arxiv.org/abs/2401.10020
34.3 MB
132
130
Embeddings for Wikipedia. This dataset contains the wikimedia/wikipedia dataset dump of English Wikipedia from 2023-11-01.
90.1 GB
4151
10