datasets
Organization Account
datasets's Repositories
Displaying Page 1 of 18 (180 total Repositories)

This is a cleaned version of the bitext/Bitext-customer-support-llm-chatbot-training-dataset dataset for customer support intent classification.

9.6 mb
22
Updated: 1 week ago

Here are example Marimo Notebooks to get started with Oxen.ai

1.1 mb
7
Updated: 2 months ago
public
3

The SMS Spam Collection is a set of SMS tagged messages that have been collected for SMS Spam research. It contains one set of SMS messages in English of 5,574 messages, tagged according being ham (legitimate) or spam. The original data can be found here: https://archive.ics.uci.edu/ml/datasets/SMS+Spam+Collection

13.9 mb
21
Updated: 3 months ago

A starter repository that highlights some key features that you can get started with.

15.2 mb
6110
Updated: 4 months ago
public
1

5.3 mb
32
Updated: 4 months ago

This repository is an example of how to generate synthetic fine tuning data with random personas. The final output is "prompt", "response" pairs for customer support tickets.

1.3 mb
312
Updated: 6 months ago

This repository is 1 million images collected from different sources to run chain of thought reasoning on

147 gb
31.2M184K
Updated: 7 months ago
public
2

A Benchmark for Question Answering about Charts with Visual and Logical Reasoning

976 mb
721K2
Updated: 7 months ago

An Advanced Diagnostic Suite for Entangled Language Hallucination & Visual Illusion in Large Vision-Language Models

163.9 mb
125241
Updated: 7 months ago
public
0

MathVista is a consolidated Mathematical reasoning benchmark within Visual contexts.

1.2 gb
6.1K2
Updated: 7 months ago