ox/Text2SQL
Text2SQL/Commits
20 branches
Judging GPT-4.1
Bessie committed 2 months ago
#f886d923e755bac16d1ed45ec788a7ce
Evaluate GPT-4.1 on the validation set
Bessie committed 2 months ago
#276e9e173b783e5eff970bf1f74730b
Extract accuracy working
ox committed 2 months ago
#eb43d812b4b59621a5f1b7eedb9bcdcb
Add extract-accuracy.py notebook
Bessie committed 2 months ago
#803b4cd396f82c4f6100fc5cc367b927
LLM as a judge on the Qwen3-0.6B results
Bessie committed 2 months ago
#54e6b2000dcc07ce5fa794944ddf94e
saving eval.py
ox committed 2 months ago
#cd98f803d4c3b247fb191f30864986b6
Adding 99 results for Qwen/Qwen3-0.6B
ox committed 2 months ago
#ab9f96505f0ed194cf859e54c3c7d4d6
Adding 79 results for Qwen/Qwen3-0.6B
ox committed 2 months ago
#edd896d135b9815e007cc08dccc8aefc
Adding 59 results for Qwen/Qwen3-0.6B
ox committed 2 months ago
#9c571dc0565dccd5906b08f19b0789f7
Adding 39 results for Qwen/Qwen3-0.6B
ox committed 2 months ago
#40afa6d56954b9592e9d049c64dfc50d
Adding 19 results for Qwen/Qwen3-0.6B
ox committed 2 months ago
#8c37404010751c0996b8251f4479e680
Update eval.py to save raw prompt with prediction
Bessie committed 2 months ago
#c46ab8b7b8f210fe9dd5f34e29caa940
Adding 99 results for Qwen/Qwen3-0.6B
Bessie committed 2 months ago
#90bf9712148c4b60778e664372ecc6bb
Adding 79 results for Qwen/Qwen3-0.6B
Bessie committed 2 months ago
#e21e36aff7324b0908cf6de3bc211d23
Adding 59 results for Qwen/Qwen3-0.6B
Bessie committed 2 months ago
#4a806255c804cee8da1705572f5e2f0e
Copyright © 2025 Oxen.ai, All Rights Reserved