DeepSeek / Deepseek R1 (FP8)
Released: 1/20/2025texttext
Input: $7.00 / Output: $7.00
Deepseek R1 is an LLM that excels in step-by-step reasoning, code generation, and numerical problem solving, enabled by its reinforcement learning fine-tuning, chain-of-thought capabilities, and efficient Mixture of Experts architecture.
Some other noteworthy features of Deepseek R1 include its ability to produce structured and transparent responses, and its use of FP8 mixed precision and Multihead Latent Attention to optimize inference speed and resource use.
Metric | Value |
---|---|
Parameter Count | 671 billion |
Mixture of Experts | Yes |
Active Parameter Count | 37 billion |
Context Length | Unknown |
Multilingual | Unknown |
Quantized* | Yes |
Precision* | FP8 |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
DeepSeek models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
![]() | text | text | $3.00 | $8.00 | |
text | text | $0.55 | $2.19 | ||
text | text | $7.00 | $7.00 | ||
![]() | text | text | $0.59 | $0.79 | |
text | text | $0.27 | $1.10 | ||
![]() | text | text | $0.75 | $3.00 | |
text | text | $1.25 | $1.25 |