Google / Gemini 1.5 Flash - 8B
Released: 9/24/2024
Modality: text in, text out
Price per 1M tokens: $0.038 input / $0.15 output

Gemini 1.5 Flash - 8B is a multimodal LLM designed for high-volume, cost-effective applications. It excels at transcription, long-context handling, and efficient processing of multimodal data. The model is best suited to applications that prioritize cost and speed over complex reasoning.

Noteworthy use cases of Gemini 1.5 Flash - 8B include high-volume multimodal tasks, long-context summarization, and transcription.
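
To make the cost side of that trade-off concrete, here is a minimal sketch of a per-request cost estimate. It assumes the listed prices ($0.038 input, $0.15 output) are in USD per 1M tokens; the helper name `estimate_cost` and the workload figures are hypothetical and chosen only for illustration.

```python
from decimal import Decimal

# Prices listed on this page, ASSUMED to be USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.038")
OUTPUT_PRICE_PER_M = Decimal("0.15")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Hypothetical workload: 10,000 requests of 8,000 input / 500 output tokens.
    per_request = estimate_cost(8_000, 500)
    print(f"per request: ${per_request}")
    print(f"10,000 requests: ${per_request * 10_000}")
```

`Decimal` is used instead of `float` so that small per-token amounts are not distorted by binary rounding when summed over many requests.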

Metric: Value
Parameter Count: Unknown
Mixture of Experts: No
Context Length: 1,048,576 tokens
Multilingual: Yes
Quantized*: Unknown

*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.

Google models available on Oxen.ai

Model    Inference provider    Input modality    Output modality    Input price (1M tokens)    Output price (1M tokens)
Google   Google                text              text               $0.08                      $0.30
Google   Google                text              text               $0.04                      $0.15
Google   Google                text              text               $1.25                      $5.00
Google   Google                text, image       text               $0.10                      $0.40
Google   Google                text, image       text               $0.08                      $0.30
Google   Google                text              text               $1.25                      $5.00
Google   Google                text, image       text               $0.15                      $3.50
Google   Google                text, image       text               $2.50                      $5.00
Google   Google                text, image       text               $1.25                      $10.00
Groq     Groq                  text              text               $0.20                      $0.20
Google   Google                text              text               $0.20                      $0.40
Google   Google                text              embeddings         $0.02                      $0.02