Try Bifrost Enterprise free for 14 days.
Request access

gemini-1.5-flash Cost Calculator - Google Gemini

Calculate the cost of using gemini-1.5-flash from Google Gemini for your AI applications

Pricing data last updated:

gemini-1.5-flash Cost Calculator

Mode: Embedding

Max: 8,192 tokens

Cost Breakdown

Input Cost$0.00007500
Total Cost$0.00007500

Pricing Details

Input: $0.0000000750 per token
[ WE'RE OPEN SOURCE ]

Scale with the Fastest LLM Gateway

Built for enterprise-grade reliability, governance, and scale. Deploy in seconds.

Model Specifications

Limits

Max Input Tokens8,192
Max Tokens8,192

About gemini-1.5-flash

gemini-1.5-flash is an embedding model from Google Gemini, one of 4 embedding models they offer. It is priced at $0.07 per 1M input tokens and $0.0000 per 1M output tokens, ranking 23 out of 96 embedding models by cost and cheaper than 76% of models in this category. Its 8K-token context window is in the top 49% among embedding models.

Pricing Information

Input Cost$0.07 per 1M tokens

Note: Use the interactive calculator above to estimate costs for your specific usage patterns.

Technical Specifications

Maximum Input Tokens8,192
Maximum Total Tokens8,192

Pro Tip

Use the maximum token limits shown above to understand the model's capacity. This model can handle up to 8,192 input tokens.

How gemini-1.5-flash Pricing Compares

At $0.07 per 1M input tokens and $0.0000 per 1M output tokens, gemini-1.5-flash ranks 23 out of 96 embedding models by input cost. It is more affordable compared to the median of $0.10 for embedding models, and is cheaper than 76% of models in this category.

gemini-1.5-flash is one of 4 Google Gemini embedding models, its 8K-token context window places it in the top 49% of embedding models.

ModelProviderInput / 1M tokensOutput / 1M tokensvs gemini-1.5-flash
qwen3-embedding-0.6bNovita$0.07$0.0000-7%
qwen3-embedding-8bNovita$0.07$0.0000-7%
amazon.nova-2-multimodal-embeddings-v1:0AWS Bedrock$0.14$0.0000+80%

Alternatives to gemini-1.5-flash

Similar embedding models from other providers

Novita
qwen3-embedding-0.6b
$0.07/1M input
-7% vs gemini-1.5-flash
Novita
qwen3-embedding-8b
$0.07/1M input
-7% vs gemini-1.5-flash
AWS Bedrock
amazon.nova-2-multimodal-embeddings-v1:0
$0.14/1M input
+80% vs gemini-1.5-flash
AWS Bedrock
amazon.titan-embed-text-v1
$0.10/1M input
+33% vs gemini-1.5-flash
Azure
ada
$0.10/1M input
+33% vs gemini-1.5-flash

Frequently Asked Questions

Is gemini-1.5-flash cheaper than qwen3-embedding-0.6b?

No. gemini-1.5-flash costs $0.07 per 1M input tokens while qwen3-embedding-0.6b costs $0.07 per 1M input tokens, making qwen3-embedding-0.6b 7% more affordable for input. However, gemini-1.5-flash may offer different capabilities or performance characteristics that justify the price difference.

How does gemini-1.5-flash pricing compare to the average embedding model?

gemini-1.5-flash input pricing is $0.07 per 1M tokens, which is 25% below the median of $0.10 for embedding models. It ranks 23 out of 96 embedding models by input cost, making it cheaper than 76% of models in this category. For output, it costs $0.0000 per 1M tokens compared to the median of $0.02.

What makes gemini-1.5-flash different from other Google Gemini models?

Among Google Gemini's 4 embedding models, gemini-1.5-flash ranks 1 by input cost.

What are the best alternatives to gemini-1.5-flash?

The most comparable embedding models to gemini-1.5-flash are: qwen3-embedding-0.6b from Novita ($0.07/1M input tokens); qwen3-embedding-8b from Novita ($0.07/1M input tokens); amazon.nova-2-multimodal-embeddings-v1:0 from AWS Bedrock ($0.14/1M input tokens); amazon.titan-embed-text-v1 from AWS Bedrock ($0.10/1M input tokens). These alternatives were selected based on similar capabilities, pricing, and provider diversity. You can compare any of these models in detail using the Bifrost Model Library.

How do I calculate gemini-1.5-flash costs?

gemini-1.5-flash is priced based on input and output tokens. Use the interactive calculator at the top of this page to estimate costs for your specific workload. Enter your expected input and output tokens volume and the calculator will show the total cost breakdown. For reference, processing 1M input tokens costs $0.07 and generating 1M output tokens costs $0.0000.