Evaluations/6aec7d41-1454-4579-80f4-f4a40946bbb5
main
train.jsonl
text → text
QwenQwen/Qwen3 Coder 480B (A35B) Instruct
Fireworks AI Fireworks AI
prediction
You are an expert programmer tasked with evaluating a response to a programming question. Answer "good" if the answer is correct and "bad" if incorrect. Only answer with the one word
Aug 2, 2025, 1:59 AM UTC
Aug 2, 2025, 1:59 AM UTC
5 row sample
235 tokens$ 0.0001
5 rows processed, 235 tokens used ($0.0001)
Estimated cost for all 3000 rows: $0.0716
Sample Results completed
4 columns, 1-5 of 3000 rows