
Gemma 3 4B
Google's Gemma 3 4B is a dense, open-weight model: lightweight, general-purpose, and edge-friendly.
Context Window: 33K
Input price / 1M tokens: Free
Output price / 1M tokens: Free
Cached input / 1M tokens: Free
Max Completion: 4K
Input Modalities: text
Output Modalities: text
Chat, Streaming
Available Providers
AllToken can route requests to the providers below based on route priority and policy.
Provider | Context Length | Input Price | Output Price | Cached / M | Latency p50 | Throughput
Best For
Lightweight, general-purpose workloads and edge deployments.
How To Use This Model
Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.
curl https://api.alltoken.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemma-3-4b",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'
Supported Parameters
temperature, top_p, max_tokens
API Key Setup
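Once a key is set up, the request above can be parameterized. The following is a minimal sketch, not an official client: it reads the key from an environment variable and sets the three supported parameters explicitly. The names `build_payload`, `send_chat`, and `ALLTOKEN_API_KEY` are hypothetical, and the parameter values in the usage note are illustrative only. It assumes the parameters sit at the top level of the JSON body next to `model`, which the page does not state.

```shell
#!/bin/sh
# Hypothetical helper: print a chat-completions JSON body for gemma-3-4b.
# $1 = user prompt (this simple sketch does not escape quotes or backslashes)
# $2 = temperature, $3 = top_p, $4 = max_tokens
build_payload() {
  printf '{"model":"gemma-3-4b","messages":[{"role":"user","content":"%s"}],' "$1"
  printf '"temperature":%s,"top_p":%s,"max_tokens":%s}\n' "$2" "$3" "$4"
}

# Hypothetical helper: send the request with curl.
# ALLTOKEN_API_KEY is an assumed environment variable name, not a documented one.
send_chat() {
  curl -s https://api.alltoken.ai/v1/chat/completions \
    -H "Authorization: Bearer ${ALLTOKEN_API_KEY}" \
    -H "Content-Type: application/json" \
    -d "$(build_payload "$@")"
}
```

A call might look like `send_chat "Hello!" 0.7 0.9 256`. Keeping `max_tokens` within the 4K Max Completion limit listed for this model is probably the safe choice, since the page does not say how larger values are handled.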
Smart Routing
Let the platform choose the best provider path automatically.
Default Model
If a request does not specify a model, the key falls back to gemma-3-4b.
Forced Model
Always override the model named in incoming requests and lock the key to gemma-3-4b.
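The difference between the Default Model and Forced Model settings can be sketched as a small resolution function. This is an illustration of the behavior described above, not AllToken's actual implementation: `resolve_model` is a hypothetical name, and giving the forced setting precedence when both are configured is an assumption.

```shell
#!/bin/sh
# Hypothetical sketch of how a key's settings might pick the effective model.
# $1 = model named in the incoming request (may be empty)
# $2 = the key's Default Model setting (may be empty)
# $3 = the key's Forced Model setting (may be empty)
resolve_model() {
  requested="$1"
  default="$2"
  forced="$3"
  if [ -n "$forced" ]; then
    # Forced Model: always override whatever the request asked for.
    printf '%s\n' "$forced"
  elif [ -n "$requested" ]; then
    # No forced model: honor the model named in the request.
    printf '%s\n' "$requested"
  else
    # Request named no model: fall back to the key's default.
    printf '%s\n' "$default"
  fi
}
```

Under these assumptions, a request naming another model is left alone when only a default is set, but is rewritten to gemma-3-4b when the key has a forced model.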