Models/Deepseek R1 Distill Llama 70B
DeepSeekDeepSeek / Deepseek R1 Distill Llama 70B
texttext
Input: $0.59 / Output: $0.79

Deepseek R1 Distill Llama 70B is an LLM that excels in math, code, and reasoning tasks, achieving performance comparable to frontier models like OpenAI-o1 through advanced distillation techniques based on Llama-3.3-70B-Instruct.

Some other noteworthy features of Deepseek R1 Distill Llama 70B include strong performance on mathematical benchmarks with 94.5% pass rate on MATH-500 and 70.0% on AIME 2024, as well as competitive coding abilities with a CodeForces Rating of 1633.

MetricValue
Parameter Count70 billion
Mixture of ExpertsYes
Active Parameter Count37 billion
Context Length128K-131K tokens
MultilingualYes
Quantized*Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

DeepSeek models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInference providerInputOutputInputOutput
Fireworks AI
texttext$3.00$8.00
DeepSeek
texttext$0.55$2.19
Together.ai
texttext$7.00$7.00
Groq
texttext$0.59$0.79
Fireworks AI
texttext$0.75$3.00
DeepSeek
texttext$0.27$1.10
Together.ai
texttext$1.25$1.25
See all models available on Oxen.ai