v1StableClaude / GPT / Gemini 低於原廠

Hypereal API 參考

一把 ck_前綴的 API 金鑰。OpenAI 相容 REST。可直接放進 Claude Code、Codex CLI、Cursor、OpenAI SDK、Anthropic SDK,或用 curl 直接呼叫。對話、圖像、影片、音訊、程式代理 — 全在同一個 base URL 之下。

01 · 90 秒上手

快速開始

建一把金鑰、把客戶端指向 hypereal.build,即可上線。驗證與請求格式都與 OpenAI 相容 — 多數 SDK 只要更換 base URL 就能直接使用。

1. 取得金鑰

至少儲值 $2(200 額度),於下列頁面建立金鑰 /manage-api-keys。金鑰開頭為 ck_。

2. 設定客戶端

Base URL: https://hypereal.build/api/v1

3. 送出請求

驗證標頭為 Authorization: Bearer ck_...。沿用你已熟悉的 OpenAI 請求格式即可。

curlbash

curl https://hypereal.build/api/v1/chat/completions \
  -H "Authorization: Bearer ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "messages": [{"role": "user", "content": "Say hi in one word."}]
  }'

Node — OpenAI SDKts

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: process.env.NEOCLOUD_API_KEY, // ck_...
  baseURL: 'https://hypereal.build/api/v1',
});

const completion = await client.chat.completions.create({
  model: 'gpt-5.5',
  messages: [{ role: 'user', content: 'Say hi in one word.' }],
});

console.log(completion.choices[0].message.content);

驗證

每次請求都需要一把 ck_ 開頭的金鑰。我們接受三種標頭格式,涵蓋所有 SDK。

Authorization

header

必填Bearer ck_... — OpenAI SDK、Codex CLI 與 Cursor 使用此格式。

x-api-key

header

必填ck_... — Anthropic SDK 與 Claude Code 在 /v1/messages上使用。

x-goog-api-key

header

必填ck_... — Google Gemini SDK / 原生格式, /v1/gemini.?key=ck_... 也可使用。

金鑰綁定到使用者身上,計入你可在 /manage-api-keys中設定的每把金鑰花費上限。頻率限制以每位 使用者為單位計算,而非每把金鑰。

03 · OpenAI 相容

Chat Completions

主力端點。沿用 OpenAI Chat Completions 連線格式。適用於 GPT、Gemini、Qwen、DeepSeek、GLM,以及所有非 Anthropic 的 LLM。

POST/api/v1/chat/completions

請求內容

model

string

必填任何非 Anthropic 模型 ID。請見下方表格。Anthropic 模型會回傳 400 — 請改用 /v1/messages 。

messages

Message[]

必填標準 OpenAI 訊息陣列(role、 content)。

stream

boolean

選填預設為 false。設為 true時走 SSE 串流;最終 chunk 會包含用量資訊。

max_tokens

number

選填原樣轉發給上游,套用各供應商的預設值。

temperature, top_p, tools, …

any

選填其他 OpenAI 參數會原封不動轉發。

計價

依各模型的輸入 / 輸出費率按 token 計費。100 額度 = $1.00。呼叫此端點所需的最低餘額為 200 額度($2.00)。

curl — 串流bash

curl https://hypereal.build/api/v1/chat/completions \
  -H "Authorization: Bearer ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "messages": [
      {"role": "system", "content": "You are a terse assistant."},
      {"role": "user", "content": "Two-line haiku about caches."}
    ],
    "stream": true,
    "max_tokens": 256
  }'

Node — OpenAI SDK 串流ts

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: process.env.NEOCLOUD_API_KEY,
  baseURL: 'https://hypereal.build/api/v1',
});

const stream = await client.chat.completions.create({
  model: 'gpt-5.5',
  stream: true,
  messages: [{ role: 'user', content: 'Stream me a haiku.' }],
});

for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content ?? '');
}

OpenAI 與相容供應商模型

模型 ID

標籤

輸入 / 輸出

gpt-5

GPT-5· OpenAI

$0.440 / $3.45 per MTok

gpt-5.1

GPT-5.1· OpenAI

$0.440 / $3.45 per MTok

gpt-5.2

GPT-5.2· OpenAI

$0.610 / $4.83 per MTok

gpt-5.3

GPT-5.3· OpenAI

$0.100 / $0.390 per MTok

gpt-5.4

GPT-5.4· OpenAI

$0.130 / $0.730 per MTok

gpt-5.5

GPT-5.5· OpenAI

$0.250 / $1.45 per MTok

gpt-5.5-instant

GPT-5.5 Instant· OpenAI

$0.250 / $1.45 per MTok

gpt-5.5-pro

GPT-5.5 Pro· OpenAI

$1.45 / $8.70 per MTok

gpt-5.4-mini

GPT-5.4 Mini· OpenAI

$0.040 / $0.220 per MTok

gpt-5.4-nano

GPT-5.4 Nano· OpenAI

$0.010 / $0.070 per MTok

gpt-5.4-official

GPT-5.4 (Official)· OpenAI

$2.30 / $13.80 per MTok

gpt-5.4-pro-official

GPT-5.4 Pro (Official)· OpenAI

$27.60 / $165.60 per MTok

gpt-5.2-official

GPT-5.2 (Official)· OpenAI

$1.61 / $12.88 per MTok

gpt-5-pro-official

GPT-5 Pro (Official)· OpenAI

$13.80 / $110.40 per MTok

gpt-realtime-1.5-official

GPT Realtime 1.5 (Official)· OpenAI

$3.68 / $14.72 per MTok

gpt-audio-1.5-official

GPT Audio 1.5 (Official)· OpenAI

$2.30 / $9.20 per MTok

glm-5

GLM-5· Zhipu AI

$0.460 / $2.07 per MTok

qwen3.5-plus

Qwen 3.5 Plus· Alibaba

$0.460 / $2.76 per MTok

qwen3.5-flash

Qwen 3.5 Flash· Alibaba

$0.140 / $1.38 per MTok

qwen3-max

Qwen 3 Max· Alibaba

$0.810 / $3.22 per MTok

deepseek-v3.2

DeepSeek V3.2· DeepSeek

$0.460 / $1.84 per MTok

kimi-k2.5

Kimi K2.5· Moonshot

$0.460 / $2.42 per MTok

MiniMax-M2.5

MiniMax M2.5· MiniMax

$0.250 / $0.970 per MTok

nano-banana-2

Nano Banana 2· Nano Banana

$0.010 / $0.010 per MTok

04 · Anthropic 相容

Messages

Anthropic /v1/messages 連線格式,支援 extended thinking、多上游故障轉移,以及 15 秒 SSE keepalive。Claude Code、OpenCode、OpenClaw 與官方 Anthropic SDK 皆可使用。

POST/api/v1/messages

請求內容

model

string

必填claude-opus-4-6, claude-sonnet-4-6, 或 claude-haiku-4-5。較舊的 Anthropic ID(claude-sonnet-4-5-20250929, claude-3-5-sonnet-20241022, claude-3-5-haiku-20241022)會自動別名到對應的最新版本。

messages

Message[]

必填Anthropic 格式的訊息,包含 image 與 tool_use 區塊。

max_tokens

number

必填Anthropic 規格要求必填。

thinking

{ type: "enabled" | "adaptive", budget_tokens?: number }

選填Extended thinking。 budget_tokens 限制 reasoning trace 上限。端點每 15 秒送出 SSE ping,避免代理伺服器在 thinking 串流過久時中斷連線。

stream, system, tools, …

any

選填與 Anthropic SDK 相同,參數原樣轉發。

故障轉移到備援上游時,簽章失效的舊 thinking 區塊會自動過濾 — 你不必自行處理。

curl — extended thinkingbash

curl https://hypereal.build/api/v1/messages \
  -H "x-api-key: ck_..." \
  -H "anthropic-version: 2023-06-01" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-opus-4-6",
    "max_tokens": 1024,
    "messages": [
      {"role": "user", "content": "Plan a 3-step refactor of a Next.js app."}
    ],
    "thinking": {"type": "enabled", "budget_tokens": 4000}
  }'

Node — Anthropic SDKts

import Anthropic from '@anthropic-ai/sdk';

const client = new Anthropic({
  apiKey: process.env.NEOCLOUD_API_KEY, // ck_...
  baseURL: 'https://hypereal.build/api/v1',
});

const msg = await client.messages.create({
  model: 'claude-sonnet-4-6',
  max_tokens: 1024,
  messages: [{ role: 'user', content: 'Hello, Claude.' }],
});

console.log(msg.content);

Anthropic 模型

模型 ID

標籤

輸入 / 輸出

claude-opus-4-6

Claude Opus 4.6· Anthropic

$1.73 / $8.63 per MTok

claude-sonnet-4-6

Claude Sonnet 4.6· Anthropic

$1.04 / $5.18 per MTok

claude-haiku-4-5

Claude Haiku 4.5· Anthropic

$0.350 / $1.73 per MTok

05 · OpenAI Responses API

Responses

OpenAI 較新的 Responses API(Codex CLI 的 `wire_api = responses` 模式與 OpenAI Agents SDK 都使用)。驗證方式與 chat/completions 相同;請求內容以 `input` 取代 `messages`。

POST/api/v1/responses

備註

Anthropic 模型會回傳 400 — 它們屬於 /v1/messages。
串流與非串流皆依response.usage.input_tokens / output_tokens計費。
部分上游一律回 SSE — 端點會自動偵測並透明串流回客戶端,即使你設定 stream:false也是如此。
支援多上游故障轉移。請將客戶端 timeout 設長(300 秒以上)。

curlbash

curl https://hypereal.build/api/v1/responses \
  -H "Authorization: Bearer ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.1-codex",
    "input": "Write a TypeScript function that debounces a callback.",
    "stream": true
  }'

Node — OpenAI SDK responses.createts

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: process.env.NEOCLOUD_API_KEY,
  baseURL: 'https://hypereal.build/api/v1',
});

const response = await client.responses.create({
  model: 'gpt-5-codex',
  input: 'Refactor this file into smaller modules.',
});

console.log(response.output_text);

Codex 調校過的模型

模型 ID

標籤

輸入 / 輸出

gpt-5-codex

GPT-5 Codex· OpenAI

$0.440 / $3.45 per MTok

gpt-5-codex-mini

GPT-5 Codex Mini· OpenAI

$0.090 / $0.690 per MTok

gpt-5.1-codex

GPT-5.1 Codex· OpenAI

$0.440 / $3.45 per MTok

gpt-5.1-codex-mini

GPT-5.1 Codex Mini· OpenAI

$0.090 / $0.690 per MTok

gpt-5.1-codex-max

GPT-5.1 Codex Max· OpenAI

$0.440 / $3.45 per MTok

gpt-5.2-codex

GPT-5.2 Codex· OpenAI

$0.610 / $4.83 per MTok

gpt-5.3-codex

GPT-5.3 Codex· OpenAI

$0.610 / $4.83 per MTok

gpt-5.3-codex-spark

GPT-5.3 Codex Spark· OpenAI

$0.610 / $4.83 per MTok

gpt-5.3-codex-official

GPT-5.3 Codex (Official)· OpenAI

$1.61 / $12.88 per MTok

06 · Codex CLI / Codex Desktop

Codex CLI

Codex 將 `wire_api = responses` 供應商指向 /api/v1/codex/responses。CLI 會在 base URL 後自動補上 `/responses`,因此請依下方方式設定 base URL。

POST/api/v1/codex/responses

~/.codex/config.tomltoml

# ~/.codex/config.toml
model_provider = "hypereal"
model = "gpt-5-codex"

[model_providers.hypereal]
name = "Hypereal"
base_url = "https://hypereal.build/api/v1/codex"
wire_api = "responses"
env_key = "NEOCLOUD_API_KEY"

接著匯出你的金鑰:
export NEOCLOUD_API_KEY=ck_...

照常執行 codex 。Codex 送出的所有內容 — 完整 reasoning 串流、tool calls、檔案編輯 — 都會原封不動被代理。計費依標準的 input_tokens / output_tokens 用量區塊。

同樣的設定也適用於 OpenCode、Claude Code(用 /v1/messages)、Cursor(用 /v1/chat/completions),以及 Gemini CLI(用 /v1/gemini)。

圖像生成

OpenAI 相容的 /images/generations 格式。同步呼叫 — 上游完成時,端點會回傳圖片 URL(或 base64)。按張計費;`n` 限制在 1–10 之間。

POST/api/v1/images/generations

請求內容

model

string

必填圖像模型 ID — 請見表格。

prompt

string

必填文字提示。對於支援編輯的模型,請透過該模型原生參數提供參考圖片(例如 image、 reference_images)。

number

選填圖片張數,1–10(預設 1)。

size

string

選填原樣轉發,例如 1024x1024、 1536x1024。實際支援值取決於供應商。

quality, style, …

any

選填其他參數會直接傳給上游。

級別要求:圖像生成需要 Starter 級別(累計儲值 $19.99 以上)。如果餘額不足以支付預估的 creditsPerGeneration × n,端點會回傳 402。

curlbash

curl https://hypereal.build/api/v1/images/generations \
  -H "Authorization: Bearer ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nano_banana_pro",
    "prompt": "isometric studio shot of a tiny cyberpunk apartment, neon rim light",
    "n": 1,
    "size": "1024x1024"
  }'

Node — fetchts

const res = await fetch('https://hypereal.build/api/v1/images/generations', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${process.env.NEOCLOUD_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'gemini-3-pro-image-preview',
    prompt: 'a chrome teapot floating over the ocean at sunset',
    n: 1,
  }),
});

const { data } = await res.json();
console.log(data[0].url); // or data[0].b64_json depending on the model

圖像模型

模型 ID

標籤

價格

gpt-image-2

GPT Image 2· OpenAI

$0.060 / image

gpt-4o-image

GPT-4o Image· OpenAI

$0.012 / image

nano_banana

Nano Banana· Nano Banana

$0.024 / image

nano_banana_2

Nano Banana 2· Nano Banana

$0.050 / image

gemini-3.1-flash-image-preview

Gemini 3.1 Flash Image· Google

$0.050 / image

gemini-2.5-flash-image-preview

Gemini 2.5 Flash Image· Google

$0.024 / image

flux-kontext-pro

Flux Kontext Pro· Flux

$0.040 / image

flux-2-pro

Flux 2 Pro· Flux

$0.050 / image

doubao-seedream-4-0

Doubao Seedream 4.0· ByteDance

$0.057 / image

doubao-seedream-4-5

Doubao Seedream 4.5· ByteDance

$0.071 / image

doubao-seedream-5-0

Doubao Seedream 5.0· ByteDance

$0.063 / image

gemini-3.1-flash-image-preview-official

Gemini 3.1 Flash Image (Official)· Google

$0.064 / image

flux-kontext-max

Flux Kontext Max· Flux

$0.080 / image

gemini-2.5-flash-image-official

Gemini 2.5 Flash Image (Official)· Google

$0.098 / image

nano_banana_pro

Nano Banana Pro· Nano Banana

$0.100 / image

gemini-3-pro-image-preview

Gemini 3 Pro Image· Google

$0.100 / image

flux-2-flex

Flux 2 Flex· Flux

$0.140 / image

gemini-3-pro-image-preview-official

Gemini 3 Pro Image (Official)· Google

$0.216 / image

gemini-3-pro-image-preview-4K

Gemini 3 Pro Image 4K· Google

$0.190 / image

gemini-3.1-fast-imagen

Gemini 3.1 Fast Imagen· Google

$0.020 / image

gemini-3.1-thinking-imagen

Gemini 3.1 Thinking Imagen· Google

$0.020 / image

08 · 長時間任務

影片生成

同步 long-poll 端點 — 請保持連線開啟,直到影片完成。請將 HTTP 客戶端 timeout 設為 600 秒。多數模型按秒計費,Veo、Vidu、Grok 則按支計費。

POST/api/v1/video/generations

請求內容

model

string

必填影片模型 ID — 請見表格。

prompt

string

必填描述影片的文字提示。

duration

number

選填秒數,1–60(預設 5)。僅對下列模型有意義: per_second 。

aspect_ratio

string

選填例如 16:9、 9:16、 1:1。實際支援值取決於供應商。

image_url

string

選填image-to-video 模型的首格畫面。部分模型也接受 last_image_url 或 image — 詳見該模型的上游文件。

提醒:這是單一長時間 POST,沒有 job-id 輪詢機制;上游完成時,回應 body 會直接帶上影片 URL。請使用伺服端執行環境(Node、延長 duration 的 edge);瀏覽器與多數 CDN 會在 5 秒影片渲染完成前 timeout。

curl — 文字 + 圖像生成影片bash

curl https://hypereal.build/api/v1/video/generations \
  -H "Authorization: Bearer ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seedance-2-0",
    "prompt": "drone shot flying over a foggy forest at dawn, cinematic",
    "duration": 5,
    "aspect_ratio": "16:9",
    "image_url": "https://example.com/keyframe.jpg"
  }'

Node — fetchts

const res = await fetch('https://hypereal.build/api/v1/video/generations', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${process.env.NEOCLOUD_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'kling-v3',
    prompt: 'a cat walking on the moon',
    duration: 5,
    aspect_ratio: '16:9',
  }),
});

// Long-running: connection stays open until the upstream returns the clip.
// Set a generous timeout (300+ seconds).
const data = await res.json();
console.log(data); // contains url(s) to the rendered mp4

影片模型

模型 ID

標籤

價格

wan2.6-flash

WAN 2.6 Flash· Alibaba

$0.060 / sec

kling-2-6

Kling 2.6· Kuaishou

$0.074 / sec

MiniMax-Hailuo-02

MiniMax Hailuo 02· MiniMax

$0.080 / sec

doubao-seedance-1-0-pro-fast

Doubao Seedance Pro Fast· ByteDance

$0.083 / sec

MiniMax-Hailuo-2.3

MiniMax Hailuo 2.3· MiniMax

$0.098 / sec

wan2.6

WAN 2.6· Alibaba

$0.100 / sec

kling-video-o1

Kling Video O1· Kuaishou

$0.134 / sec

kling-v3-omni

Kling V3 Omni· Kuaishou

$0.134 / sec

kling-v3

Kling V3· Kuaishou

$0.134 / sec

kling-v3-video

Kling V3 Video· Kuaishou

$0.134 / sec

doubao-seedance-1-0-pro-quality

Doubao Seedance Pro Quality· ByteDance

$0.208 / sec

doubao-seedance-2-0

Doubao Seedance 2.0· ByteDance

$0.200 / sec

doubao-seedance-2-0-fast

Doubao Seedance 2.0 Fast· ByteDance

$0.105 / sec

doubao-seedance-1-5-pro

Doubao Seedance 1.5 Pro· ByteDance

$0.216 / sec

Veo3.1-fast-official

Veo 3.1 Fast· Google

$0.160 / sec

Veo3.1-quality-official

Veo 3.1 Quality· Google

$0.320 / sec

veo3.1-fast

Veo 3.1 Fast· Google

$0.160 / clip

veo3.1-quality

Veo 3.1 Quality· Google

$1.20 / clip

vidu-q3-pro

Vidu Q3 Pro· Vidu

$0.020 / clip

grok-video-3

Grok Video 3· xAI

$0.160 / clip

09 · Fish Audio

音訊 — TTS、聲音複製、ASR

三個模型 ID 共用同一個端點,請求與回應格式取決於你呼叫哪一個。供應商為 Fish Audio(直連,不經 ToAPI),按次計費。

POST/api/v1/audio/generations

model

"audio-tts" | "audio-clone" | "audio-asr"

必填決定要執行的操作。

text

string

選填下列模式必填: audio-tts 與 audio-clone。

audio

string (URL)

選填下列模式必填: audio-asr (輸入)與 audio-clone (參考音檔,需 ≥ 10 秒)。

voice_id, format, sample_rate, …

any

選填其他 Fish Audio 參數會原樣轉發。

回應格式: data: [{ url }] 用於 TTS / 聲音複製, text (可附加 segments、 duration)用於 ASR。

TTSbash

curl https://hypereal.build/api/v1/audio/generations \
  -H "Authorization: Bearer ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "audio-tts",
    "text": "Welcome to Hypereal. One key, every model.",
    "voice_id": "en_male_calm"
  }'

聲音複製bash

curl https://hypereal.build/api/v1/audio/generations \
  -H "Authorization: Bearer ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "audio-clone",
    "text": "This is my cloned voice.",
    "audio": "https://example.com/reference-30s.mp3"
  }'

ASR(語音 → 文字)bash

curl https://hypereal.build/api/v1/audio/generations \
  -H "Authorization: Bearer ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "audio-asr",
    "audio": "https://example.com/recording.mp3"
  }'

音訊模型

模型 ID

標籤

價格

audio-tts

Text to Speech· Fish Audio

$0.020 / request

audio-clone

Voice Clone· Fish Audio

$0.020 / request

audio-asr

Speech Recognition· Fish Audio

$0.010 / request

10 · Google 原生格式

Gemini

同一端點同時接受 Gemini 原生格式(`contents` / `generationConfig` / `systemInstruction`)與 OpenAI 格式。端點會在內部先轉成 OpenAI 格式再轉發。多數情況下,直接用 /v1/chat/completions 搭配 Gemini 模型 ID 更簡單。

POST/api/v1/gemini

model

string

必填任何 Gemini 模型 ID — 請見表格。

contents

Content[]

選填Gemini 原生訊息陣列。

systemInstruction

Content

選填選填的系統訊息,使用 Gemini 格式。

generationConfig

object

選填temperature、 maxOutputTokens 等。

messages

Message[]

選填OpenAI 格式,可作為下列的替代寫法: contents。

驗證標頭: x-goog-api-key: ck_...、 ?key=ck_...,或 Authorization: Bearer ck_... 都可使用。

curl — Gemini 原生bash

curl "https://hypereal.build/api/v1/gemini" \
  -H "x-goog-api-key: ck_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3.1-pro",
    "contents": [
      {"role": "user", "parts": [{"text": "Outline a launch plan."}]}
    ],
    "generationConfig": {"temperature": 0.6, "maxOutputTokens": 2048}
  }'

Node — fetchts

// The /v1/gemini endpoint accepts both Gemini-native and OpenAI shapes.
// For SDK use, the OpenAI client + /v1/chat/completions is simpler.
const res = await fetch('https://hypereal.build/api/v1/gemini', {
  method: 'POST',
  headers: {
    'x-goog-api-key': process.env.NEOCLOUD_API_KEY!,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'gemini-3.1-fast',
    contents: [{ role: 'user', parts: [{ text: 'Hi' }] }],
  }),
});

console.log(await res.json());

Gemini 模型

模型 ID

標籤

輸入 / 輸出

gemini-3-pro-official

Gemini 3 Pro· Google

$1.84 / $11.04 per MTok

gemini-3-pro-preview-official

Gemini 3 Pro Preview· Google

$1.84 / $11.04 per MTok

gemini-3-flash-official

Gemini 3 Flash· Google

$0.460 / $2.76 per MTok

gemini-3-flash-preview-official

Gemini 3 Flash Preview· Google

$0.460 / $2.76 per MTok

gemini-3.1-pro

Gemini 3.1 Pro· Google

$0.010 / $0.010 per MTok

gemini-3.1-pro-preview-official

Gemini 3.1 Pro Preview· Google

$1.84 / $11.04 per MTok

gemini-3.1-fast

Gemini 3.1 Fast· Google

$0.580 / $3.45 per MTok

gemini-3.1-thinking

Gemini 3.1 Thinking· Google

$0.580 / $3.45 per MTok

gemini-3.1-flash-lite-preview-official

Gemini 3.1 Flash Lite Preview· Google

$0.230 / $1.38 per MTok

gemini-2.5-pro-official

Gemini 2.5 Pro· Google

$1.15 / $9.20 per MTok

gemini-2.5-flash-official

Gemini 2.5 Flash· Google

$0.280 / $2.30 per MTok

gemini-2.5-flash-lite-official

Gemini 2.5 Flash Lite· Google

$0.100 / $0.370 per MTok

gemini-2.0-flash-official

Gemini 2.0 Flash· Google

$0.140 / $0.560 per MTok

gemini-2.0-flash-lite-official

Gemini 2.0 Flash Lite· Google

$0.070 / $0.280 per MTok

gemini-2.0-flash-vip

Gemini 2.0 Flash VIP· Google

$0.050 / $0.210 per MTok

gemini-2.5-flash-vip

Gemini 2.5 Flash VIP· Google

$0.110 / $0.870 per MTok

gemini-2.5-pro-vip

Gemini 2.5 Pro VIP· Google

$0.440 / $3.45 per MTok

gemini-3-flash-preview-vip

Gemini 3 Flash Preview VIP· Google

$0.180 / $1.04 per MTok

錯誤與頻率限制

所有錯誤都是 { error: { type, message } } 形式的 JSON。頻率限制以每位使用者為單位,而非每把金鑰 — 多把金鑰共用同一份配額。

401 authentication_error

JSON

選填金鑰缺失、格式錯誤(沒有 ck_ 前綴)、過期或停用。

402 insufficient_credits

JSON

選填餘額不足 200 額度($2),或預估費用超過餘額。

403 access_denied

JSON

選填你目前的累計儲值級別未解鎖該模型(圖像 / 影片 / 音訊需 $19.99 以上;部分旗艦 LLM 需更高級別)。

429 rate_limit_error / spending_limit_error

JSON

選填達到每位使用者每小時上限(對話 1000/h、圖像 500/h、影片與音訊 200/h),或你自行設定的金鑰花費上限。回應會帶上 X-RateLimit-Limit、 X-RateLimit-Remaining,以及 X-RateLimit-Reset 標頭。

400 invalid_request_error

JSON

選填缺少 model、未知的模型 ID(回應會包含 available_models),或在錯誤的端點上呼叫(例如把 Anthropic 模型送到 /chat/completions)。

502 api_error

JSON

選填該模型的所有上游皆失敗。錯誤訊息會帶上最後一個上游的錯誤字串。

計價與額度

單一單位:100 額度 = $1.00 美元。LLM 依每個模型的輸入 / 輸出費率按 token 計費;媒體模型按張、按秒或按支計費。

LLM

Tokens × 每百萬 token 費率。串流請求依最終用量 chunk 計費。

圖像

每次生成固定費率 × 實際回傳的 n 。

影片與音訊

按秒(多數影片)、按支(Veo、Vidu、Grok),或按次(Fish Audio)計費。

Claude、GPT、Gemini,以及精選圖像模型(GPT Image 2、Nano Banana)售價低於原廠。影片、音訊與其他媒體模型以標準價計費。

import OpenAI from 'openai'; const client = new OpenAI({ apiKey: process.env.NEOCLOUD_API_KEY, // ck_... baseURL: 'https://hypereal.build/api/v1', }); const completion = await client.chat.completions.create({ model: 'gpt-5.5', messages: [{ role: 'user', content: 'Say hi in one word.' }], }); console.log(completion.choices[0].message.content);

curl https://hypereal.build/api/v1/chat/completions \ -H "Authorization: Bearer ck_..." \ -H "Content-Type: application/json" \ -d '{ "model": "gpt-5.5", "messages": [ {"role": "system", "content": "You are a terse assistant."}, {"role": "user", "content": "Two-line haiku about caches."} ], "stream": true, "max_tokens": 256 }'

import OpenAI from 'openai'; const client = new OpenAI({ apiKey: process.env.NEOCLOUD_API_KEY, baseURL: 'https://hypereal.build/api/v1', }); const stream = await client.chat.completions.create({ model: 'gpt-5.5', stream: true, messages: [{ role: 'user', content: 'Stream me a haiku.' }], }); for await (const chunk of stream) { process.stdout.write(chunk.choices[0]?.delta?.content ?? ''); }

curl https://hypereal.build/api/v1/messages \ -H "x-api-key: ck_..." \ -H "anthropic-version: 2023-06-01" \ -H "Content-Type: application/json" \ -d '{ "model": "claude-opus-4-6", "max_tokens": 1024, "messages": [ {"role": "user", "content": "Plan a 3-step refactor of a Next.js app."} ], "thinking": {"type": "enabled", "budget_tokens": 4000} }'

import Anthropic from '@anthropic-ai/sdk'; const client = new Anthropic({ apiKey: process.env.NEOCLOUD_API_KEY, // ck_... baseURL: 'https://hypereal.build/api/v1', }); const msg = await client.messages.create({ model: 'claude-sonnet-4-6', max_tokens: 1024, messages: [{ role: 'user', content: 'Hello, Claude.' }], }); console.log(msg.content);

curl https://hypereal.build/api/v1/responses \ -H "Authorization: Bearer ck_..." \ -H "Content-Type: application/json" \ -d '{ "model": "gpt-5.1-codex", "input": "Write a TypeScript function that debounces a callback.", "stream": true }'

import OpenAI from 'openai'; const client = new OpenAI({ apiKey: process.env.NEOCLOUD_API_KEY, baseURL: 'https://hypereal.build/api/v1', }); const response = await client.responses.create({ model: 'gpt-5-codex', input: 'Refactor this file into smaller modules.', }); console.log(response.output_text);

# ~/.codex/config.toml model_provider = "hypereal" model = "gpt-5-codex" [model_providers.hypereal] name = "Hypereal" base_url = "https://hypereal.build/api/v1/codex" wire_api = "responses" env_key = "NEOCLOUD_API_KEY"

curl https://hypereal.build/api/v1/images/generations \ -H "Authorization: Bearer ck_..." \ -H "Content-Type: application/json" \ -d '{ "model": "nano_banana_pro", "prompt": "isometric studio shot of a tiny cyberpunk apartment, neon rim light", "n": 1, "size": "1024x1024" }'

curl https://hypereal.build/api/v1/video/generations \ -H "Authorization: Bearer ck_..." \ -H "Content-Type: application/json" \ -d '{ "model": "doubao-seedance-2-0", "prompt": "drone shot flying over a foggy forest at dawn, cinematic", "duration": 5, "aspect_ratio": "16:9", "image_url": "https://example.com/keyframe.jpg" }'

curl https://hypereal.build/api/v1/audio/generations \ -H "Authorization: Bearer ck_..." \ -H "Content-Type: application/json" \ -d '{ "model": "audio-tts", "text": "Welcome to Hypereal. One key, every model.", "voice_id": "en_male_calm" }'

curl https://hypereal.build/api/v1/audio/generations \ -H "Authorization: Bearer ck_..." \ -H "Content-Type: application/json" \ -d '{ "model": "audio-clone", "text": "This is my cloned voice.", "audio": "https://example.com/reference-30s.mp3" }'

curl https://hypereal.build/api/v1/audio/generations \ -H "Authorization: Bearer ck_..." \ -H "Content-Type: application/json" \ -d '{ "model": "audio-asr", "audio": "https://example.com/recording.mp3" }'

curl "https://hypereal.build/api/v1/gemini" \ -H "x-goog-api-key: ck_..." \ -H "Content-Type: application/json" \ -d '{ "model": "gemini-3.1-pro", "contents": [ {"role": "user", "parts": [{"text": "Outline a launch plan."}]} ], "generationConfig": {"temperature": 0.6, "maxOutputTokens": 2048} }'

// The /v1/gemini endpoint accepts both Gemini-native and OpenAI shapes. // For SDK use, the OpenAI client + /v1/chat/completions is simpler. const res = await fetch('https://hypereal.build/api/v1/gemini', { method: 'POST', headers: { 'x-goog-api-key': process.env.NEOCLOUD_API_KEY!, 'Content-Type': 'application/json', }, body: JSON.stringify({ model: 'gemini-3.1-fast', contents: [{ role: 'user', parts: [{ text: 'Hi' }] }], }), }); console.log(await res.json());