Enterprise-grade coding and LLM API
Use one Hypereal API key for coding agents, IDE integrations, internal tools, and production LLM workloads. The Enterprise API is OpenAI-compatible, Anthropic-native, and exposes a curated model set for teams that want predictable model IDs, billing, and usage logs.
Use it with Claude Code, coding agents, review bots, IDE tools, and internal automation that already speak OpenAI or Anthropic APIs.
Claude Opus 4.8, Claude Sonnet 4.7, Claude Haiku, GPT-5.5, DeepSeek, Qwen, and Kimi are exposed behind stable Hypereal model IDs.
Hypereal API keys keep spending limits, model scoping, usage logs, and credit billing in one account-level control plane.
Call chat completions
Use the managed base path for the curated Enterprise model catalog and stable Hypereal model IDs.
curl https://api.hypereal.cloud/v1/managed/chat/completions \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-d '{
"model": "claude-sonnet-4-7",
"messages": [
{
"role": "system",
"content": "You are a senior software engineer."
},
{
"role": "user",
"content": "Review this TypeScript function for correctness."
}
],
"temperature": 0.2,
"max_tokens": 1200
}'import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.HYPEREAL_API_KEY,
baseURL: "https://api.hypereal.cloud/v1/managed",
});
const completion = await client.chat.completions.create({
model: "claude-sonnet-4-7",
messages: [
{ role: "user", content: "Write a migration checklist for this PR." },
],
});
console.log(completion.choices[0]?.message?.content);Use the Anthropic-native endpoint
Claude Code and Anthropic SDK clients should point at the Hypereal API root because they append the native messages path themselves. Raw HTTP clients can call the managed messages path directly. Tool use, thinking blocks, streaming, and prompt cache fields are preserved.
export ANTHROPIC_BASE_URL="https://api.hypereal.cloud" export ANTHROPIC_AUTH_TOKEN="ck_..." export ANTHROPIC_API_KEY="" export ANTHROPIC_DEFAULT_OPUS_MODEL="claude-opus-4-8" export ANTHROPIC_DEFAULT_SONNET_MODEL="claude-sonnet-4-7" export ANTHROPIC_DEFAULT_HAIKU_MODEL="claude-haiku-latest" export CLAUDE_CODE_SUBAGENT_MODEL="claude-sonnet-4-7"
curl https://api.hypereal.cloud/v1/managed/messages \
-H "anthropic-api-key: ck_..." \
-H "anthropic-version: 2023-06-01" \
-H "Content-Type: application/json" \
-d '{
"model": "claude-sonnet-4-7",
"messages": [
{ "role": "user", "content": "Review this diff." }
],
"tools": [],
"max_tokens": 1200
}'Supported Enterprise models
Prices are shown per one million tokens and billed through Hypereal Credits.
| Model ID | Name | Context | Input | Cache read | Cache write | Output |
|---|---|---|---|---|---|---|
| claude-opus-4-8 | Claude Opus 4.8 | 1M | $5.75 | $0.575 | $7.19 | $28.75 |
| claude-sonnet-4-7 | Claude Sonnet 4.7 | 1M | $3.45 | $0.345 | $4.31 | $17.25 |
| claude-haiku-latest | Claude Haiku Latest | 200k | $1.15 | $0.115 | $1.44 | $5.75 |
| gpt-5-5 | GPT-5.5 | 1M | $5.75 | $0.575 | n/a | $34.50 |
| deepseek-v4-pro | DeepSeek V4 Pro | 1M | $0.5002 | $0.0042 | n/a | $1.00 |
| qwen3-7-max | Qwen3.7 Max | 1M | $1.44 | $0.2875 | $1.80 | $4.31 |
| qwen3-7-plus | Qwen3.7 Plus | 1M | $0.46 | $0.092 | $0.575 | $1.84 |
| kimi-latest | Kimi Latest | 256k | $0.7866 | $0.1656 | n/a | $3.93 |
curl https://api.hypereal.cloud/v1/managed/models \ -H "Authorization: Bearer ck_..."
Request and response shape
The Enterprise API accepts the OpenAI chat completions request shape, including streaming, tools, structured outputs, temperature, and max token controls when supported by the selected model.
{
"model": "claude-sonnet-4-7",
"messages": [
{ "role": "user", "content": "Refactor this function." }
],
"stream": true,
"max_tokens": 2000
}{
"hypereal": {
"billing": {
"model": "claude-sonnet-4-7",
"credits_charged": 12,
"balance_before": 1000,
"balance_after": 988
}
}
}Tools and caching
The managed endpoint preserves OpenAI-compatible tool calls, structured outputs, reasoning controls, streaming chunks, and prompt-cache fields supported by the selected model. For long coding sessions, send stable project context with cache controls and keep a consistent session ID.
const completion = await client.chat.completions.create({
model: "claude-sonnet-4-7",
messages: [{ role: "user", content: "Find the changed files." }],
tools: [
{
type: "function",
function: {
name: "list_changed_files",
description: "List changed files in the current repository.",
parameters: { type: "object", properties: {} },
},
},
],
tool_choice: "auto",
});curl https://api.hypereal.cloud/v1/managed/chat/completions \
-H "Authorization: Bearer ck_..." \
-H "Content-Type: application/json" \
-H "X-Hypereal-Cache: true" \
-H "X-Session-Id: coding-agent-session-123" \
-d '{
"model": "claude-sonnet-4-7",
"cache_control": { "type": "ephemeral" },
"messages": [
{ "role": "system", "content": "Stable project context..." },
{ "role": "user", "content": "Continue the refactor." }
],
"max_tokens": 1200
}'Use the managed path for OpenAI-compatible chat completions:/v1/managed/chat/completions. Use/v1/managed/messagesfor direct Anthropic-native requests. Claude Code should usehttps://api.hypereal.cloudas its base URL.
