Bessie
Bessie
ox
User account
ox's Repositories
Displaying Page 6 of 12 (113 total Repositories)
Public
0

MMLU (Massive Multitask Language Understanding) is a new benchmark designed to measure knowledge acquired during pre-training by evaluating models exclusively in zero-shot and few-shot settings. This makes the benchmark more challenging and more similar to how we evaluate humans. The benchmark covers 57 subjects across STEM, the humanities, the social sciences, and more. It ranges in difficulty from an elementary level to an advanced professional level, and it tests both world knowledge and problem solving ability. Subjects range from traditional areas, such as mathematics and history, to more specialized areas like law and ethics. The granularity and breadth of the subjects makes the benchmark ideal for identifying a model’s blind spots.

166 mb
2179
Updated: 1 year ago
60

Cats vs Dogs

100.2 mb
8.1K22
Updated: 1 year ago

Repository of images of cats and dogs for object detection.

141.3 mb
8.1K2
Updated: 1 year ago

688.8 mb
2.7K1
Updated: 1 year ago

Empty
Empty Repository
Public
0

258.2 mb
7.5K1
Updated: 1 year ago
Public
0