Gemini 3.5 Fast API
Low-latency Gemini for coding agents and chat apps.
Run gemini-3.5-fast through a drop-in OpenAI-compatible endpoint. Coding Plan price is $0.90 input / $5.40 output per million tokens — a flat 40% below Google official — billed through one Hypereal API key.
Independent third-party API aggregator. Not affiliated with or endorsed by any model provider; model names are trademarks of their respective owners.
Gemini 3.5 Fast pricing
Coding Plan price vs Google official token pricing
Integruokite per kelias minutes
Standartinė REST API, veikianti su bet kuria kalba. Vienas API raktas suteikia prieigą prie visų modelių.
- Vienas galutinis taškas visiems modeliams
- Autentifikavimas naudojant Bearer žetoną
- JSON užklausa ir atsakymas
- Webhook iškvietimai asinchroninėms užduotims
- Galimi Python ir Node.js SDK
curl https://api.hypereal.cloud/v1/chat/completions \
-H "Authorization: Bearer hyp-..." \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-3.5-fast",
"messages": [
{"role": "user", "content": "Generate unit tests for this parser."}
]
}'Kodėl Gemini 3.5 Fast
Coding Plan eligible
Spend Coding Plan credits on Gemini 3.5 Fast alongside Claude Opus, Sonnet, and GPT-5.5 — one prepaid pool, one API key.
Built for latency and volume
Sub-second first token — best for quick code review, chat, test generation, and iterative agent loops.
40% off official pricing
$0.90 input / $5.40 output per million tokens vs Google official $1.50 / $9.00. A flat 40% off, no tiers.
Kokie kreditai sunaudojami?
Vienas API raktas veikia abiem. Maršrutizavimas nustatomas pagal modelį, kurį kviečiate, o ne pagal raktą.
Claude, GPT, and Gemini coding models spend the same Hypereal Credits wallet. Your selected top-up amount can reduce charges on these coding models.
Video, image, audio, 3D, GPU, training, and other model APIs use the prices shown on their product pages.
Dažniausiai užduodami klausimai
Is Gemini 3.5 Fast included in the Coding Plan?
Yes. It is eligible for Coding Plan credits, so prepaid coding credits spend on it just like Claude Opus, Sonnet, and GPT-5.5.
What is the model ID?
Use gemini-3.5-fast in chat, /v1/chat/completions, and any OpenAI-compatible SDK call.
When should I use Fast instead of Thinking?
Use Fast for latency-sensitive coding loops, tests, and chat. Use Thinking for deeper review or multi-step reasoning. Both cost the same per token.
Use Gemini 3.5 Fast with Coding Credits.
Create an API key, set base_url to Hypereal, and call gemini-3.5-fast from OpenAI-compatible tools.

