01 · Start in 90 seconds
Quickstart
Mint a key, point your client at hypereal.build, ship. Auth and request shapes are OpenAI-compatible — most SDKs work by changing only the base URL.
Top up at least $2 (200 credits) and create a key at /manage-api-keys. Keys start with ck_.
Base URL: https://hypereal.build/api/v1
The auth header is Authorization: Bearer ck_.... Request bodies are the same OpenAI shapes you already know.
curl https://hypereal.build/api/v1/chat/completions \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-5.5",
"messages": [{"role": "user", "content": "Say hi in one word."}]
}'

import OpenAI from 'openai';
const client = new OpenAI({
apiKey: process.env.NEOCLOUD_API_KEY, // ck_...
baseURL: 'https://hypereal.build/api/v1',
});
const completion = await client.chat.completions.create({
model: 'gpt-5.5',
messages: [{ role: 'user', content: 'Say hi in one word.' }],
});
console.log(completion.choices[0].message.content);

02
Authentication
Every request needs a ck_-prefixed key. Three accepted header forms cover all SDKs.
Authorization: Bearer ck_... — used by the OpenAI SDK, Codex CLI and Cursor.
x-api-key: ck_... — used by the Anthropic SDK and Claude Code on /v1/messages.
x-goog-api-key: ck_... — Google Gemini SDK / native shape, accepted on /v1/gemini. ?key=ck_... also works.
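All three forms carry the same ck_ key. A minimal sketch with plain fetch, reusing the endpoints and header names documented above (model IDs come from the lists later in this page):

// Three equivalent ways to authenticate, per the list above.
const key = process.env.NEOCLOUD_API_KEY!; // ck_...

// OpenAI-style bearer token (chat/completions, responses):
await fetch('https://hypereal.build/api/v1/chat/completions', {
  method: 'POST',
  headers: { 'Authorization': `Bearer ${key}`, 'Content-Type': 'application/json' },
  body: JSON.stringify({ model: 'gpt-5.5', messages: [{ role: 'user', content: 'hi' }] }),
});

// Anthropic-style x-api-key (messages):
await fetch('https://hypereal.build/api/v1/messages', {
  method: 'POST',
  headers: { 'x-api-key': key, 'anthropic-version': '2023-06-01', 'Content-Type': 'application/json' },
  body: JSON.stringify({ model: 'claude-haiku-4-5', max_tokens: 64, messages: [{ role: 'user', content: 'hi' }] }),
});

// Google-style x-goog-api-key (gemini):
await fetch('https://hypereal.build/api/v1/gemini', {
  method: 'POST',
  headers: { 'x-goog-api-key': key, 'Content-Type': 'application/json' },
  body: JSON.stringify({ model: 'gemini-3.1-fast', contents: [{ role: 'user', parts: [{ text: 'hi' }] }] }),
});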
03 · OpenAI-compatible
Chat Completions
The workhorse endpoint. OpenAI Chat Completions wire format. Use it for GPT, Gemini, Qwen, DeepSeek, GLM and every non-Anthropic LLM.
/api/v1/chat/completions

Request body
model — any non-Anthropic model ID; for Anthropic models use /v1/messages.
messages — OpenAI-style message objects (role, content).
stream — defaults to false. SSE stream when true; usage is included in the final chunk.
Pricing

Billed per token at each model's input/output rate. 100 credits = $1.00. Minimum balance to call the endpoint is 200 credits ($2.00).
curl https://hypereal.build/api/v1/chat/completions \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-5.5",
"messages": [
{"role": "system", "content": "You are a terse assistant."},
{"role": "user", "content": "Two-line haiku about caches."}
],
"stream": true,
"max_tokens": 256
}'

import OpenAI from 'openai';
const client = new OpenAI({
apiKey: process.env.NEOCLOUD_API_KEY,
baseURL: 'https://hypereal.build/api/v1',
});
const stream = await client.chat.completions.create({
model: 'gpt-5.5',
stream: true,
messages: [{ role: 'user', content: 'Stream me a haiku.' }],
});
for await (const chunk of stream) {
process.stdout.write(chunk.choices[0]?.delta?.content ?? '');
}
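The streaming loop above discards the usage block; per the request-body notes, the final SSE chunk carries it. A minimal sketch that captures it (field names follow the standard OpenAI usage object; treat the exact shape as an assumption here):

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: process.env.NEOCLOUD_API_KEY,
  baseURL: 'https://hypereal.build/api/v1',
});

// Stream a completion and capture the usage block from the final chunk.
const stream = await client.chat.completions.create({
  model: 'gpt-5.5',
  stream: true,
  messages: [{ role: 'user', content: 'Stream me a haiku.' }],
});

let usage: { prompt_tokens?: number; completion_tokens?: number } | null = null;
for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content ?? '');
  if (chunk.usage) usage = chunk.usage; // arrives on the final chunk, per the notes above
}
console.log('\nbilled:', usage?.prompt_tokens, 'in /', usage?.completion_tokens, 'out');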
OpenAI and provider-compatible models

gpt-5, gpt-5.1, gpt-5.2, gpt-5.3, gpt-5.4, gpt-5.5, gpt-5.5-instant, gpt-5.5-pro, gpt-5.4-mini, gpt-5.4-nano, gpt-5.4-official, gpt-5.4-pro-official, gpt-5.2-official, gpt-5-pro-official, gpt-realtime-1.5-official, gpt-audio-1.5-official, glm-5, qwen3.5-plus, qwen3.5-flash, qwen3-max, deepseek-v3.2, kimi-k2.5, MiniMax-M2.5, nano-banana-2

04 · Anthropic-compatible
Messages
Anthropic /v1/messages wire format with extended thinking, multi-upstream failover and 15-second SSE keepalives. Use it for Claude Code, OpenCode, OpenClaw and the official Anthropic SDK.
/api/v1/messages

Request body
model — claude-opus-4-6, claude-sonnet-4-6, or claude-haiku-4-5. Older Anthropic IDs (claude-sonnet-4-5-20250929, claude-3-5-sonnet-20241022, claude-3-5-haiku-20241022) auto-alias to the latest equivalents.
thinking — budget_tokens caps the reasoning trace. The endpoint sends 15s SSE pings to keep long thinking streams from being closed.

curl https://hypereal.build/api/v1/messages \
-H "x-api-key: ck_..." \
-H "anthropic-version: 2023-06-01" \
-H "Content-Type: application/json" \
-d '{
"model": "claude-opus-4-6",
"max_tokens": 1024,
"messages": [
{"role": "user", "content": "Plan a 3-step refactor of a Next.js app."}
],
"thinking": {"type": "enabled", "budget_tokens": 4000}
}'

import Anthropic from '@anthropic-ai/sdk';
const client = new Anthropic({
apiKey: process.env.NEOCLOUD_API_KEY, // ck_...
baseURL: 'https://hypereal.build/api/v1',
});
const msg = await client.messages.create({
model: 'claude-sonnet-4-6',
max_tokens: 1024,
messages: [{ role: 'user', content: 'Hello, Claude.' }],
});
console.log(msg.content);

Anthropic models
claude-opus-4-6, claude-sonnet-4-6, claude-haiku-4-5

05 · OpenAI Responses API
Responses
OpenAI's newer Responses API (used by Codex CLI's `wire_api = responses` mode and the OpenAI Agents SDK). Auth is the same as chat/completions; the request body uses `input` instead of `messages`.
/api/v1/responses

Notes
- Anthropic models return 400 — they belong on /v1/messages.
- Both streaming and non-streaming report usage in response.usage.input_tokens/output_tokens.
- Some upstreams always emit SSE — the endpoint detects this and streams transparently even when stream: false.
- Multi-upstream failover. Set a long client timeout (300s+).
curl https://hypereal.build/api/v1/responses \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-5.1-codex",
"input": "Write a TypeScript function that debounces a callback.",
"stream": true
}'

import OpenAI from 'openai';
const client = new OpenAI({
apiKey: process.env.NEOCLOUD_API_KEY,
baseURL: 'https://hypereal.build/api/v1',
});
const response = await client.responses.create({
model: 'gpt-5-codex',
input: 'Refactor this file into smaller modules.',
});
console.log(response.output_text);
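The notes above call for a long client timeout so multi-upstream failover can finish. With the OpenAI SDK that is a constructor option; a minimal sketch, assuming only what the notes state:

import OpenAI from 'openai';

// Per the notes above: a generous timeout for long-running responses.
const client = new OpenAI({
  apiKey: process.env.NEOCLOUD_API_KEY,
  baseURL: 'https://hypereal.build/api/v1',
  timeout: 300_000, // ms; satisfies the 300s+ guidance
});

const response = await client.responses.create({
  model: 'gpt-5.1-codex',
  input: 'Summarize the tradeoffs of debounce vs throttle.',
});
console.log(response.output_text);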
Codex-tuned models

gpt-5-codex, gpt-5-codex-mini, gpt-5.1-codex, gpt-5.1-codex-mini, gpt-5.1-codex-max, gpt-5.2-codex, gpt-5.3-codex, gpt-5.3-codex-spark, gpt-5.3-codex-official

06 · Codex CLI / Codex Desktop
Codex CLI
Codex points its `wire_api = responses` provider at /api/v1/codex/responses. The CLI appends `/responses` to the base URL, so configure the base URL as shown below.
/api/v1/codex/responses

# ~/.codex/config.toml
model_provider = "hypereal"
model = "gpt-5-codex"

[model_providers.hypereal]
name = "Hypereal"
base_url = "https://hypereal.build/api/v1/codex"
wire_api = "responses"
env_key = "NEOCLOUD_API_KEY"
Then export your key:

export NEOCLOUD_API_KEY=ck_...
Run codex as usual. Whatever Codex sends — full reasoning streams, tool calls, file edits — is proxied unchanged. Billing is on the standard input_tokens / output_tokens usage block.
The same setup works for OpenCode, Claude Code (use /v1/messages), Cursor (use /v1/chat/completions) and Gemini CLI (use /v1/gemini).
07
Image generation
OpenAI-compatible /images/generations shape. Synchronous — the endpoint returns image URLs (or base64) once the upstream completes. Billed per image; `n` is clamped to 1–10.
/api/v1/images/generations

Request body
prompt — required; some models also take image inputs (image, reference_images).
size — e.g. 1024x1024, 1536x1024. Provider-dependent.
If your balance is below creditsPerGeneration × n, the endpoint returns 402.

curl https://hypereal.build/api/v1/images/generations \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "nano_banana_pro",
"prompt": "isometric studio shot of a tiny cyberpunk apartment, neon rim light",
"n": 1,
"size": "1024x1024"
}'

const res = await fetch('https://hypereal.build/api/v1/images/generations', {
method: 'POST',
headers: {
'Authorization': `Bearer ${process.env.NEOCLOUD_API_KEY}`,
'Content-Type': 'application/json',
},
body: JSON.stringify({
model: 'gemini-3-pro-image-preview',
prompt: 'a chrome teapot floating over the ocean at sunset',
n: 1,
}),
});
const { data } = await res.json();
console.log(data[0].url); // or data[0].b64_json depending on the model

Image models
gpt-image-2, gpt-4o-image, nano_banana, nano_banana_2, gemini-3.1-flash-image-preview, gemini-2.5-flash-image-preview, flux-kontext-pro, flux-2-pro, doubao-seedream-4-0, doubao-seedream-4-5, doubao-seedream-5-0, gemini-3.1-flash-image-preview-official, flux-kontext-max, gemini-2.5-flash-image-official, nano_banana_pro, gemini-3-pro-image-preview, flux-2-flex, gemini-3-pro-image-preview-official, gemini-3-pro-image-preview-4K, gemini-3.1-fast-imagen, gemini-3.1-thinking-imagen

08 · long-running
Video generation
Synchronous long-poll endpoint — keep the connection open until the clip is ready. Set your HTTP client timeout to 600s. Billed per second (most models) or per clip (Veo, Vidu, Grok).
/api/v1/video/generations

Request body
duration — seconds; billed for per_second models.
aspect_ratio — 16:9, 9:16, 1:1. Provider-dependent.
Keyframes via image_url, last_image_url or image — see that model's upstream docs.

curl https://hypereal.build/api/v1/video/generations \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "doubao-seedance-2-0",
"prompt": "drone shot flying over a foggy forest at dawn, cinematic",
"duration": 5,
"aspect_ratio": "16:9",
"image_url": "https://example.com/keyframe.jpg"
}'

const res = await fetch('https://hypereal.build/api/v1/video/generations', {
method: 'POST',
headers: {
'Authorization': `Bearer ${process.env.NEOCLOUD_API_KEY}`,
'Content-Type': 'application/json',
},
body: JSON.stringify({
model: 'kling-v3',
prompt: 'a cat walking on the moon',
duration: 5,
aspect_ratio: '16:9',
}),
});
// Long-running: connection stays open until the upstream returns the clip.
// Set a generous timeout (300+ seconds).
const data = await res.json();
console.log(data); // contains url(s) to the rendered mp4
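Neither example above enforces the 600s budget. One way to do it with plain fetch is AbortSignal.timeout (available in Node 18+); a minimal sketch:

// Long-poll with an explicit 600s cap, per the guidance above.
const res = await fetch('https://hypereal.build/api/v1/video/generations', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${process.env.NEOCLOUD_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({ model: 'kling-v3', prompt: 'a cat walking on the moon', duration: 5 }),
  signal: AbortSignal.timeout(600_000), // abort if the clip is not ready in 600s
});
console.log(await res.json());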
Video models

wan2.6-flash, kling-2-6, MiniMax-Hailuo-02, doubao-seedance-1-0-pro-fast, MiniMax-Hailuo-2.3, wan2.6, kling-video-o1, kling-v3-omni, kling-v3, kling-v3-video, doubao-seedance-1-0-pro-quality, doubao-seedance-2-0, doubao-seedance-2-0-fast, doubao-seedance-1-5-pro, Veo3.1-fast-official, Veo3.1-quality-official, veo3.1-fast, veo3.1-quality, vidu-q3-pro, grok-video-3

09 · Fish Audio
Audio — TTS, voice cloning, ASR
Three model IDs share one endpoint. The body and response shape depend on which one you call. The provider is Fish Audio (called directly, not via ToAPI), billed per request.
/api/v1/audio/generations

text — required for audio-tts and audio-clone.
audio — required for audio-asr (input) and audio-clone (reference voice ≥ 10s).
Response: data: [{ url }] for TTS / clone; text (+ optional segments, duration) for ASR.

curl https://hypereal.build/api/v1/audio/generations \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "audio-tts",
"text": "Welcome to Hypereal. One key, every model.",
"voice_id": "en_male_calm"
}'

curl https://hypereal.build/api/v1/audio/generations \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "audio-clone",
"text": "This is my cloned voice.",
"audio": "https://example.com/reference-30s.mp3"
}'

curl https://hypereal.build/api/v1/audio/generations \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "audio-asr",
"audio": "https://example.com/recording.mp3"
}'
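The curl calls above have a plain-fetch TypeScript equivalent. A sketch of the TTS call, reusing the voice_id placeholder from the first example and the data: [{ url }] response shape from the notes:

// Text-to-speech via fetch; the response carries a URL to the rendered audio.
const res = await fetch('https://hypereal.build/api/v1/audio/generations', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${process.env.NEOCLOUD_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'audio-tts',
    text: 'Welcome to Hypereal. One key, every model.',
    voice_id: 'en_male_calm',
  }),
});
const { data } = await res.json();
console.log(data[0].url); // link to the generated audio file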
Audio models

audio-tts, audio-clone, audio-asr

10 · Google native shape
Gemini
Accepts both Gemini-native (`contents` / `generationConfig` / `systemInstruction`) and OpenAI shapes on one endpoint. The endpoint converts to OpenAI internally before forwarding. For most code, /v1/chat/completions with a Gemini model ID is simpler.
/api/v1/gemini

generationConfig — temperature, maxOutputTokens, etc. Messages go in contents.
Auth header: x-goog-api-key: ck_..., ?key=ck_..., or Authorization: Bearer ck_... all work.
curl "https://hypereal.build/api/v1/gemini" \
-H "x-goog-api-key: ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-3.1-pro",
"contents": [
{"role": "user", "parts": [{"text": "Outline a launch plan."}]}
],
"generationConfig": {"temperature": 0.6, "maxOutputTokens": 2048}
}'

// The /v1/gemini endpoint accepts both Gemini-native and OpenAI shapes.
// For SDK use, the OpenAI client + /v1/chat/completions is simpler.
const res = await fetch('https://hypereal.build/api/v1/gemini', {
method: 'POST',
headers: {
'x-goog-api-key': process.env.NEOCLOUD_API_KEY!,
'Content-Type': 'application/json',
},
body: JSON.stringify({
model: 'gemini-3.1-fast',
contents: [{ role: 'user', parts: [{ text: 'Hi' }] }],
}),
});
console.log(await res.json());

Gemini models
gemini-3-pro-official, gemini-3-pro-preview-official, gemini-3-flash-official, gemini-3-flash-preview-official, gemini-3.1-pro, gemini-3.1-pro-preview-official, gemini-3.1-fast, gemini-3.1-thinking, gemini-3.1-flash-lite-preview-official, gemini-2.5-pro-official, gemini-2.5-flash-official, gemini-2.5-flash-lite-official, gemini-2.0-flash-official, gemini-2.0-flash-lite-official, gemini-2.0-flash-vip, gemini-2.5-flash-vip, gemini-2.5-pro-vip, gemini-3-flash-preview-vip

11
Errors and rate limits
All errors are JSON of the form { error: { type, message } }. Rate limits are evaluated per user, not per key — multiple keys share the same quota.
Invalid key — malformed (no ck_ prefix), expired, or inactive.
Rate limited — X-RateLimit-Limit, X-RateLimit-Remaining, and X-RateLimit-Reset headers are returned on rate-limit responses.
Bad request — missing model, unknown model ID (the response includes available_models), or the wrong endpoint for the format (e.g. an Anthropic model on /chat/completions).
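A minimal sketch of consuming that envelope and the rate-limit headers from TypeScript; the handling logic is illustrative, not prescribed by the API:

// Inspect the { error: { type, message } } envelope and rate-limit headers.
const res = await fetch('https://hypereal.build/api/v1/chat/completions', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${process.env.NEOCLOUD_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'gpt-5.5',
    messages: [{ role: 'user', content: 'hi' }],
  }),
});

if (!res.ok) {
  const { error } = await res.json(); // { error: { type, message } }
  console.error(`${res.status} ${error.type}: ${error.message}`);
  // Rate-limit responses include these headers, per the list above:
  console.error('limit:', res.headers.get('X-RateLimit-Limit'));
  console.error('remaining:', res.headers.get('X-RateLimit-Remaining'));
  console.error('resets at:', res.headers.get('X-RateLimit-Reset'));
}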
12
Pricing and credits
One unit: 100 credits = $1.00 USD. LLMs bill per token at each model's input / output rate. Media models bill per image, per second or per clip.
LLMs
Tokens × per-MTok rate. Streaming requests are billed from the final usage chunk (worked sketch after this list).
Images
Flat rate per generation × the actual n returned.
Video and audio
Per second (most video), per clip (Veo, Vidu, Grok), or per request (Fish Audio).
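A worked sketch of the LLM arithmetic. The per-MTok rates below are invented placeholders, not published prices:

// Hypothetical rates for illustration only; real rates are per model.
const INPUT_CREDITS_PER_MTOK = 300;   // placeholder: 300 credits per 1M input tokens
const OUTPUT_CREDITS_PER_MTOK = 900;  // placeholder: 900 credits per 1M output tokens

// usage block as returned in the response / final streaming chunk:
const usage = { prompt_tokens: 12_000, completion_tokens: 4_000 };

const credits =
  (usage.prompt_tokens / 1_000_000) * INPUT_CREDITS_PER_MTOK +
  (usage.completion_tokens / 1_000_000) * OUTPUT_CREDITS_PER_MTOK;

console.log(credits.toFixed(2), 'credits');    // 7.20 credits
console.log('$' + (credits / 100).toFixed(4)); // $0.0720 (100 credits = $1.00)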
Claude, GPT, Gemini and selected image models (GPT Image 2, Nano Banana) are priced below the direct providers. Video, audio and the remaining media models bill at standard rates.

