datasets
Organization Account
datasets's Repositories
public
0

578.3 mb
23K12
Updated: 2 years ago

This dataset contains the arrival and departure events for buses up to the most recent completed month of 2022. Due to data collection issues, data is not guaranteed to be complete for any stop or date.

3.8 gb
1
Updated: 2 years ago
21

BabyLM Challenge 2024 - Sample efficient pretraining on a developmentally plausible corpus.

418.7 mb
224
Updated: 2 years ago
public
2

The QA bAbI tasks are a set of proxy tasks that evaluate reading comprehension via question answering.

2.8 mb
1
Updated: 2 years ago

898.7 kB
32
Updated: 2 years ago
public
2

3.2 gb
7K1107K
Updated: 2 years ago

3.9 gb
11107.1K
Updated: 2 years ago

A dataset of Arxiv Papers to build on top of for fine tuning an LLM

35.7 gb
23K122K
Updated: 2 years ago