Gemini 3.5 Fast API
低延迟 Gemini,适合代码 Agent 和聊天应用。
通过 OpenAI 兼容接口调用 gemini-3.5-fast。Coding Plan 价格为每百万 token 输入 $0.90 / 输出 $5.40,比 Google 官方价低 40%,一个 Hypereal API Key 即可使用。
Gemini 3.5 Fast pricing
Coding Plan price vs Google official token pricing
几分钟即可集成
标准 REST API,支持任何编程语言。一个 API 密钥即可访问所有模型。
- 所有模型统一端点
- Bearer token 身份验证
- JSON 请求与响应
- 异步任务 Webhook 回调
- 提供 Python 和 Node.js SDK
curl https://api.hypereal.cloud/v1/chat/completions \
-H "Authorization: Bearer hyp-..." \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-3.5-fast",
"messages": [
{"role": "user", "content": "Generate unit tests for this parser."}
]
}'为什么选择 Gemini 3.5 Fast
Coding Plan eligible
Spend Coding Plan credits on Gemini 3.5 Fast alongside Claude Opus, Sonnet, and GPT-5.5 — one prepaid pool, one API key.
Built for latency and volume
Sub-second first token — best for quick code review, chat, test generation, and iterative agent loops.
40% off official pricing
$0.90 input / $5.40 output per million tokens vs Google official $1.50 / $9.00. A flat 40% off, no tiers.
消耗的是哪种点数?
一个 API 密钥两种点数都能用。路由由你调用的模型决定,而非密钥。
Claude Opus 4.7、Sonnet 4.6、GPT-5.5、Gemini 3.5 Thinking、Gemini 3.5 Fast 优先消耗 Coding Credits(按先到期先扣),余额不足时再回落到 General Credits。
图像、视频、音频、3D 以及其他所有 LLM 仅消耗 General Credits。Coding Credits 会保留给代码开发工作负载。
常见问题
Is Gemini 3.5 Fast included in the Coding Plan?
Yes. It is eligible for Coding Plan credits, so prepaid coding credits spend on it just like Claude Opus, Sonnet, and GPT-5.5.
What is the model ID?
Use gemini-3.5-fast in chat, /v1/chat/completions, and any OpenAI-compatible SDK call.
When should I use Fast instead of Thinking?
Use Fast for latency-sensitive coding loops, tests, and chat. Use Thinking for deeper review or multi-step reasoning. Both cost the same per token.
用 Coding Credits 调用 Gemini 3.5 Fast。
创建 API Key,把 base_url 指向 Hypereal,即可在 OpenAI 兼容工具里调用 gemini-3.5-fast。

