Models/Deepseek R1 Distill Llama 70B
DeepSeekDeepSeek / Deepseek R1 Distill Llama 70B
texttext
Input: $0.59 / Output: $0.79

Deepseek R1 Distill Llama 70B is an LLM that excels in math, code, and reasoning tasks, achieving performance comparable to frontier models like OpenAI-o1 through advanced distillation techniques based on Llama-3.3-70B-Instruct.

Some other noteworthy features of Deepseek R1 Distill Llama 70B include strong performance on mathematical benchmarks with 94.5% pass rate on MATH-500 and 70.0% on AIME 2024, as well as competitive coding abilities with a CodeForces Rating of 1633.

MetricValue
Parameter Count70 billion
Mixture of ExpertsYes
Active Parameter Count37 billion
Context Length128K-131K tokens
MultilingualYes
Quantized*Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

DeepSeek models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInputOutputInputOutput
texttext$3.00$8.00
texttext$0.55$2.19
texttext$0.59$0.79
texttext$0.27$1.10
texttext$0.75$3.00
See all models available on Oxen.ai