DeepSeek / Deepseek R1
Released: 1/20/2025texttext
Input: $0.55 / Output: $2.19
Deepseek R1 is a 671B LLM designed for complex reasoning and problem-solving tasks. It excels in advanced reasoning across multiple domains including mathematics, code, and real-time decision-making, offering performance comparable to OpenAI-o1 in these areas.
Some other noteworthy features of Deepseek R1 include its Multi-Layer Attention (MLA) mechanism that enhances its ability to process and understand complex inputs, and its reinforcement learning training approach that incorporated cold-start data.
Metric | Value |
---|---|
Parameter Count | 671 billion |
Mixture of Experts | Yes |
Active Parameter Count | Unknown |
Context Length | Unknown |
Multilingual | Unknown |
Quantized* | Unknown |
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.
DeepSeek models available on Oxen.ai
Modality | Price (1M tokens) | ||||
---|---|---|---|---|---|
Model | Inference provider | Input | Output | Input | Output |
![]() | text | text | $3.00 | $8.00 | |
text | text | $0.55 | $2.19 | ||
text | text | $7.00 | $7.00 | ||
![]() | text | text | $0.59 | $0.79 | |
text | text | $0.27 | $1.10 | ||
![]() | text | text | $0.75 | $3.00 | |
text | text | $1.25 | $1.25 |