main
train.jsonl
text → text
4o_mini_evals
You are an expert at evaluating coding question responses. You will receive a coding question and a response and your only job is to answer if the answer is "Bad", "Good", "Great". You MUST only answer with one word. Here is your prompt: {prompt} Here is your response: {response}
Jul 4, 2025, 7:57 PM UTC
Jul 4, 2025, 7:57 PM UTC
5 row sample
3191 tokens$ 0.0005
5 rows processed, 3191 tokens used ($0.0005)
Estimated cost for all 3000 rows: $0.2885Sample Results completed
4 columns, 1-5 of 3000 rows