Models/Gemini 1.5 Flash - 8B
GoogleGoogle / Gemini 1.5 Flash - 8B
Released: 9/24/2024
texttext
Input: $0.04 / Output: $0.15

Gemini 1.5 Flash - 8B is a Multimodal LLM designed for high-volume, cost-effective applications. It excels in transcription, handling long contexts, and tasks requiring efficient processing of multimodal data. The model is particularly suited for applications where cost-effectiveness and speed are prioritized over complex reasoning.

Some noteworthy use cases of Gemini 1.5 Flash - 8B include handling high-volume multimodal tasks, long-context summarization, and transcription tasks.

MetricValue
Parameter CountUnknown
Mixture of ExpertsNo
Context Length1,048,576 tokens
MultilingualYes
Quantized*Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Google models available on Oxen.ai
ModalityPrice (1M tokens)
ModelInputOutputInputOutput
texttext$0.08$0.30
texttext$0.04$0.15
texttext$1.25$5.00
text, imagetext$0.10$0.40
text, imagetext$0.08$0.30
texttext$1.25$5.00
text, imagetext$0.15$3.50
text, imagetext$2.50$5.00
text, imagetext$1.25$10.00
texttext$0.20$0.20
texttext$0.20$0.40
imageimageN/AN/A
textembeddings$0.02$0.02
See all models available on Oxen.ai