A unified AI model gateway for individuals and teams — one stable entry point to aggregate, manage, and route many large-model capabilities, compatible with OpenAI, Claude, and Gemini call formats.
Base URL https://api.tuftech.org
Quick start
Three steps to get connected.
Sign in to the console, open “Token Management”, then create and copy an API Key.Key shown once
Set the client Base URL to https://api.tuftech.org.
Pick the path for your client: OpenAI-compatible use /v1/chat/completions or /v1/responses, Claude-compatible use /v1/messages.
Keep your token safe — never put it in a public repo, web frontend, or chat screenshot. If a Key leaks, disable and recreate it in the console immediately.
Endpoints
All traffic goes through https://api.tuftech.org; choose the path that matches your client.
Main line
https://api.tuftech.org
Unified API endpoint, compatible with OpenAI / Claude / Gemini interfaces.
OpenAI compatible
https://api.tuftech.org
Common paths: /v1/chat/completions, /v1/responses, /v1/embeddings, /v1/images/generations.
Claude / Gemini / Veo
https://api.tuftech.org
Claude uses /v1/messages; Gemini via compatible clients; Veo video submits and polls tasks via /v1/videos.
Actual charge = model base price × the group ratio of your token. The table below lists user-selectable groups, sorted by ratio ascending; the rate-limit column is each group's “requests / successes” cap per 1 minute(s).
Group
Ratio
Rate limit (req/success)
Use case
codex-plus
0.4x
120/100
Codex / OpenAI coding
aws-kiro
0.7x
90/75
AWS-Kiro
codex-pro
0.7x
120/100
Codex Pro
gemini-ultra
1x
120/100
Gemini Ultra
veo
1.3x
20/10
Veo video generation
cc-max-chat
1.3x
120/100
CC-Max Chat
cc-max
2.6x
120/100
Claude Code Max
cc-aws-p
5.2x
60/50
CC-AWS-P official key
Lower ratio is cheaper. Ratios are taken from the console GroupRatio config; the usage log is authoritative for billing.
Rate limits
Rate limiting is counted per user account; multiple API Keys under one user share the quota, and different users don't affect each other.
Platform default: up to 120 requests and 100 successful requests per 1 minute(s).
Each group can override the default — see the rate-limit column in “Groups & ratios” above (e.g. veo is clearly lower, to control high-cost tasks).
Hitting the limit returns 429; just retry shortly.
Veo video models
The Veo group includes veo_3_1, veo_3_1-fast-fl, veo_3_1-fast-fl-hd, billed per call.
Model
Per-call price
veo_3_1
$2.5
veo_3_1-fast-fl
$0.8
veo_3_1-fast-fl-hd
$1.6
After submitting a task you get a task_id; poll status with /v1/videos/{task_id}. Failed or cancelled tasks are refunded per platform logic.
Troubleshooting
On a failed call, check the usage log first: model name, group, status code, error message, request time, charge record.
Code
Meaning
401
Wrong Key or disabled token
402
Insufficient balance
429
Rate limited
503
No available account for the channel, or upstream down
If you can't resolve it, contact an admin with the log screenshot or key details.
FAQ
9 entries.
How do I start using the TUFTech API?
After signing in, open “Token Management” to create an API Key. Set the client Base URL to https://api.tuftech.org; OpenAI-compatible clients usually use /v1/chat/completions or /v1/responses, Claude-compatible clients use /v1/messages. The API Key is shown only once — never put it in a public repo, web frontend, or chat screenshot.
Which group should I choose?
For Codex / OpenAI coding, prefer codex-pro, codex-plus, codex-plus-test; for Claude Code / Anthropic, use cc-max, cc-max-chat, cc-aws-p, aws-kiro; for Gemini use gemini-ultra; for video generation use veo. Different groups map to different available models, speed, and stability.
How is model pricing calculated?
The model marketplace shows each model's base price; token models are usually priced separately for input, output, and cache read/write, while video models are priced per call. Actual charges also multiply by the group ratio of the token you use. See the “Groups & ratios” table for current multipliers. The usage log shows the real charge per call and is the primary basis for reconciling your bill.
How is gpt-5.5 billed?
gpt-5.5 is priced per the model marketplace: input $5 / 1M tokens, output $30 / 1M tokens, cache read $0.5 / 1M tokens. The console price page and usage log are authoritative.
How do I use and pay for Veo video models?
The Veo group includes veo_3_1, veo_3_1-fast-fl, veo_3_1-fast-fl-hd. Video models are billed per call — current per-call prices are $2.5, $0.8, and $1.6 respectively. After submitting a task you get a task_id; poll status with /v1/videos/{task_id}. Failed or cancelled tasks are refunded per platform logic.
Is the rate limit per user or platform-wide?
Rate limiting is counted per user account; multiple API Keys under one user share the limit, and different users don't affect each other. The default is up to 120 requests and 100 successful requests per minute; groups can override this (e.g. veo is lower, suited to controlling high-cost tasks).
How much are the signup and invite bonuses?
New users get 8.88 credit on successful registration; when registering via an invite, the inviter and invitee each get 5 credit. Whether a bonus landed can be checked in your balance, logs, or invite records.
How do I troubleshoot a failed call?
Check the usage log first: model name, group, status code, error message, request time, and charge record. 401 is usually a wrong Key or a disabled token; 402 is insufficient balance; 429 is rate limiting; 503 usually means no available account for that channel or the upstream is down. Bring the log screenshot or key details when contacting an admin.
How do email verification, the privacy policy, and the user agreement affect me?
This site has email verification enabled, along with a privacy policy and user agreement. Registration, login, credits, logs, and model forwarding are handled as described on those pages. When calling models, don't submit sensitive personal information, trade secrets, or unauthorized data unrelated to your purpose.
TUFTech · Unified AI model gateway · Console · Status
This page is generated from backend config; the console price page and usage log are authoritative. Last updated: 2026-06-22 05:23 UTC