datasets Repositories

datasets

Organization Account

Repositories

datasets's Repositories

Pexels

public

13.5 gb

1K12997

Updated: 2 years ago

Self-Rewarding-Language-Models

public

This repository is an effort to recreate the "Self-Rewarding Language Models" paper by the team at [Meta.ai](http://meta.ai/) but with using a smaller model that is able to be fine tuned by the community. https://arxiv.org/abs/2401.10020

34.3 mb

132

Updated: 2 years ago

Cohere-wikipedia-2023-11-embed-multilingual-v3

public

Embeddings for Wikipedia This dataset contains the wikimedia/wikipedia dataset dump from 2023-11-01 from Wikipedia in english

90.1 gb

4151

Updated: 2 years ago

farm-animals

public

A dataset of images of farm animals and classified by type. Includes Bovine, Chicken, Duck, Goose, Pig, Mule, Owl, Turkey, and Sheep.

1.1 gb

143.3K

Updated: 2 years ago

sample

public

Sample datasets for download. Includes files in CSV, Parquet, Arrow, JSON, and TSV formats.

362.9 mb

136

Updated: 2 years ago

dolly-15k

public

12.9 mb

Updated: 2 years ago

MT-Bench

public

This dataset contains 3.3K expert-level pairwise human preferences for model responses generated by 6 models in response to 80 MT-bench questions. The annotators are mostly graduate students with expertise in the topic areas of each of the questions.

161.6 mb

Updated: 2 years ago

mmlu

public

Measuring Massive Multitask Language Understanding | ICLR 2021

10 mb

Updated: 2 years ago

diff-examples

public

This repository is a supplement to our diff documentation https://docs.oxen.ai/concepts/diffs to demonstrate how Oxen diffs work.

2.7 kB

Updated: 2 years ago

HRWSI

public

Dataset containing RGB images paired with depthmaps.

7.6 gb

1283K

Updated: 2 years ago