Gemini 3.5 Fast API
低延迟 Gemini,适合代码 Agent 和聊天应用。
Run gemini-3.5-fast through a drop-in OpenAI-compatible endpoint. Coding Plan price is $0.90 input / $5.40 output per million tokens, billed through one Hypereal API key.
Independent third-party API aggregator. Not affiliated with or endorsed by any model provider; model names are trademarks of their respective owners.
Gemini 3.5 Fast pricing
Hypereal public token pricing
几分钟即可集成
标准 REST API,支持任何编程语言。一个 API 密钥即可访问所有模型。
- 所有模型统一端点
- Bearer token 身份验证
- JSON 请求与响应
- 异步任务 Webhook 回调
- 提供 Python 和 Node.js SDK
curl https://api.hypereal.cloud/v1/chat/completions \
-H "Authorization: Bearer hyp-..." \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-3.5-fast",
"messages": [
{"role": "user", "content": "Generate unit tests for this parser."}
]
}'为什么选择 Gemini 3.5 Fast
Coding Plan eligible
Spend Coding Plan credits on Gemini 3.5 Fast alongside Claude Opus, Sonnet, and GPT-5.5 — one prepaid pool, one API key.
Built for latency and volume
Sub-second first token — best for quick code review, chat, test generation, and iterative agent loops.
Current metered pricing
$0.90 input / $5.40 output per million tokens through Hypereal Credits. No Google Cloud project or separate billing account required.
常见问题
Is Gemini 3.5 Fast included in the Coding Plan?
Yes. It is eligible for Coding Plan credits, so prepaid coding credits spend on it just like Claude Opus, Sonnet, and GPT-5.5.
What is the model ID?
Use gemini-3.5-fast in chat, /v1/chat/completions, and any OpenAI-compatible SDK call.
When should I use Fast instead of Thinking?
Use Fast for latency-sensitive coding loops, tests, and chat. Use Thinking for deeper review or multi-step reasoning. Both cost the same per token.
用 Coding Credits 调用 Gemini 3.5 Fast。
创建 API Key,把 base_url 指向 Hypereal,即可在 OpenAI 兼容工具里调用 gemini-3.5-fast。
