Bessie
Bessie
ox
User account
ox's Repositories
Displaying Page 7 of 13 (126 total Repositories)

This dataset contains 404,290 questions pairs from Quora, and if they are duplicates of eachother.

72.6 mb
Updated: 1 year ago
Public
2

TriviaQA is a reading comprehension dataset containing over 650K question-answer-evidence triples. TriviaQA includes 95K question-answer pairs authored by trivia enthusiasts and independently gathered evidence documents, six per question on average, that provide high quality distant supervision for answering the questions.

10.3 gb
Updated: 2 years ago
Public
0

AGIEval is a human-centric benchmark specifically designed to evaluate the general abilities of foundation models in tasks pertinent to human cognition and problem-solving. T

9.2 mb
122
Updated: 2 years ago
Public
0

MMLU (Massive Multitask Language Understanding) is a new benchmark designed to measure knowledge acquired during pre-training by evaluating models exclusively in zero-shot and few-shot settings. This makes the benchmark more challenging and more similar to how we evaluate humans. The benchmark covers 57 subjects across STEM, the humanities, the social sciences, and more. It ranges in difficulty from an elementary level to an advanced professional level, and it tests both world knowledge and problem solving ability. Subjects range from traditional areas, such as mathematics and history, to more specialized areas like law and ethics. The granularity and breadth of the subjects makes the benchmark ideal for identifying a model’s blind spots.

166 mb
2179
Updated: 2 years ago
70

Cats vs Dogs

100.2 mb
28.1K2
Updated: 2 years ago

Repository of images of cats and dogs for object detection.

141.3 mb
28.1K
Updated: 2 years ago

688.8 mb
2.7K1
Updated: 2 years ago

Empty
Empty Repository
Public
0