Skip to content
Models/google/Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite

Google Gemini·google·gemini-3.1-flash-lite-preview·

Google Gemini 3.1 Flash Lite preview, lightweight multimodal, ultra cost-efficient

Context Window
1.0M
Input price / 1M tokens
$0.251M tokens
Output price / 1M tokens
$1.501M tokens
Cached input / 1M tokens
$0.031M tokens
Max Completion
66K
Input Modalities
text, image
Output Modalities
text
Function callingChatVisionStreaming

Description

Google Gemini 3.1 Flash Lite preview, lightweight multimodal, ultra cost-efficient

Available Providers

AllToken can route requests to the providers below based on route priority and policy.

ProviderContext LengthInput PriceOutput PriceCached / MLatency p50Throughput

Best For

Google Gemini 3.1 Flash Lite preview, lightweight multimodal, ultra cost-efficient

How To Use This Model

Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.

curl https://api.alltoken.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3.1-flash-lite-preview",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'
Supported Parameters
temperaturetop_pmax_tokenstools
API Key Setup
Smart Routing

Let the platform choose the best provider path automatically.

Default Model

If a request does not specify a model, default the key to gemini-3.1-flash-lite-preview.

Forced Model

Always override incoming requests and lock the key to gemini-3.1-flash-lite-preview.