undefined (datasets) Repositories

Organization Account

Repositories

datasets's Repositories

Displaying Page 1 of 18 (179 total Repositories)

MarimoNotebooks

public

Here are example Marimo Notebooks to get started with Oxen.ai

1.1 mb

Updated: 1 month ago

The SMS Spam Collection is a set of SMS tagged messages that have been collected for SMS Spam research. It contains one set of SMS messages in English of 5,574 messages, tagged according being ham (legitimate) or spam. The original data can be found here: https://archive.ics.uci.edu/ml/datasets/SMS+Spam+Collection

13.9 mb

Updated: 3 months ago

GettingStarted

public

A starter repository that highlights some key features that you can get started with.

15.2 mb

1610

Updated: 3 months ago

SimpleQA

public

5.3 mb

Updated: 3 months ago

Synthetic-Persona-Customer-Support

public

This repository is an example of how to generate synthetic fine tuning data with random personas. The final output is "prompt", "response" pairs for customer support tickets.

1.3 mb

321

Updated: 6 months ago

Image-CoT-1m

public

This repository is 1 million images collected from different sources to run chain of thought reasoning on

147 gb

184K1.2M3

Updated: 7 months ago

ChartQA

public

A Benchmark for Question Answering about Charts with Visual and Logical Reasoning

976 mb

21K27

Updated: 7 months ago

HallusionBench

public

An Advanced Diagnostic Suite for Entangled Language Hallucination & Visual Illusion in Large Vision-Language Models

163.9 mb

524121

Updated: 7 months ago

MathVista

public

MathVista is a consolidated Mathematical reasoning benchmark within Visual contexts.

1.2 gb

6.1K2

Updated: 7 months ago

CLEVR

public

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

20 gb

8100K

Updated: 7 months ago