DeepSeek / Deepseek R1 Distill Llama 70B
texttext
Input: $0.59 / Output: $0.79
Deepseek R1 Distill Llama 70B is an LLM that excels in math, code, and reasoning tasks, achieving performance comparable to frontier models like OpenAI-o1 through advanced distillation techniques based on Llama-3.3-70B-Instruct.
Some other noteworthy features of Deepseek R1 Distill Llama 70B include strong performance on mathematical benchmarks with 94.5% pass rate on MATH-500 and 70.0% on AIME 2024, as well as competitive coding abilities with a CodeForces Rating of 1633.
Metric | Value |
---|---|
Parameter Count | 70 billion |
Mixture of Experts | Yes |
Active Parameter Count | 37 billion |
Context Length | 128K-131K tokens |
Multilingual | Yes |
Quantized* | Unknown |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
DeepSeek models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
![]() | text | text | $3.00 | $8.00 | |
text | text | $0.55 | $2.19 | ||
text | text | $7.00 | $7.00 | ||
![]() | text | text | $0.59 | $0.79 | |
![]() | text | text | $0.75 | $3.00 | |
text | text | $0.27 | $1.10 | ||
text | text | $1.25 | $1.25 |