TUFTech Docs

A unified AI model gateway for individuals and teams — one stable entry point to aggregate, manage, and route many large-model capabilities, compatible with OpenAI, Claude, and Gemini call formats.

Base URL https://api.tuftech.org

Quick start

Three steps to get connected.

Sign in to the console, open “Token Management”, then create and copy an API Key. The Key is shown only once.
Set the client Base URL to https://api.tuftech.org.
Pick the path for your client: OpenAI-compatible clients use /v1/chat/completions or /v1/responses; Claude-compatible clients use /v1/messages.

curl https://api.tuftech.org/v1/chat/completions \
  -H "Authorization: Bearer $TUFTECH_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "messages": [{"role": "user", "content": "Hello"}]
  }'

Keep your token safe — never put it in a public repo, web frontend, or chat screenshot. If a Key leaks, disable and recreate it in the console immediately.
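The curl call above can also be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the helper names build_chat_request and send are illustrative, and the JSON response is returned as-is because its exact shape is not described on this page.

```python
import json
import urllib.request

BASE_URL = "https://api.tuftech.org"


def build_chat_request(api_key: str, model: str, content: str) -> urllib.request.Request:
    """Build (but do not send) a POST to /v1/chat/completions."""
    payload = {"model": model, "messages": [{"role": "user", "content": content}]}
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON body (raises HTTPError on 4xx/5xx)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())
```

A typical call would be send(build_chat_request(os.environ["TUFTECH_API_KEY"], "gpt-5.5", "Hello")) after importing os, which keeps the Key in the environment rather than in source code.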

Endpoints

All traffic goes through https://api.tuftech.org; choose the path that matches your client.

Main line

https://api.tuftech.org

Unified API endpoint, compatible with OpenAI / Claude / Gemini interfaces.

OpenAI compatible

https://api.tuftech.org

Common paths: /v1/chat/completions, /v1/responses, /v1/embeddings, /v1/images/generations.

Claude / Gemini / Veo

https://api.tuftech.org

Claude uses /v1/messages; Gemini is reached through Gemini-compatible clients; Veo video tasks are submitted and polled via /v1/videos.

Common paths: /v1/chat/completions · /v1/responses · /v1/messages · /v1/embeddings · /v1/images/generations · /v1/videos

Groups & ratios

Actual charge = model base price × the group ratio of your token. The table below lists the user-selectable groups in ascending order of ratio; the rate-limit column gives each group's cap on requests and on successful requests per minute.

Group	Ratio	Rate limit per minute (requests/successes)	Use case
`codex-plus`	0.4x	120/100	Codex / OpenAI coding
`aws-kiro`	0.7x	90/75	AWS-Kiro
`codex-pro`	0.7x	120/100	Codex Pro
`gemini-ultra`	1x	120/100	Gemini Ultra
`veo`	1.3x	20/10	Veo video generation
`cc-max-chat`	1.3x	120/100	CC-Max Chat
`cc-max`	2.6x	120/100	Claude Code Max
`cc-aws-p`	5.2x	60/50	CC-AWS-P official key

Lower ratio is cheaper. Ratios are taken from the console GroupRatio config; the usage log is authoritative for billing.
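The charge formula can be worked through in a few lines. This is a minimal sketch: the ratios are copied from the table above and may lag behind the console, the function name actual_charge is illustrative, and the $10 base price is a made-up figure chosen only to keep the arithmetic easy to follow.

```python
from decimal import Decimal

# Ratios copied from the "Groups & ratios" table; the console is authoritative.
GROUP_RATIOS = {
    "codex-plus": Decimal("0.4"),
    "aws-kiro": Decimal("0.7"),
    "codex-pro": Decimal("0.7"),
    "gemini-ultra": Decimal("1"),
    "veo": Decimal("1.3"),
    "cc-max-chat": Decimal("1.3"),
    "cc-max": Decimal("2.6"),
    "cc-aws-p": Decimal("5.2"),
}


def actual_charge(base_price: Decimal, group: str) -> Decimal:
    """Actual charge = model base price x group ratio."""
    return base_price * GROUP_RATIOS[group]


# Hypothetical $10 base price: $4 through codex-plus, $26 through cc-max.
cheap = actual_charge(Decimal("10"), "codex-plus")
costly = actual_charge(Decimal("10"), "cc-max")
```

Decimal is used instead of float so that ratios such as 0.7 and 2.6 multiply without binary rounding noise.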

Rate limits

Rate limiting is counted per user account; multiple API Keys under one user share the quota, and different users don't affect each other.

Platform default: up to 120 requests and 100 successful requests per minute.
Each group can override the default; see the rate-limit column in “Groups & ratios” above (for example, veo is capped at 20/10 to control high-cost tasks).
Exceeding the limit returns HTTP 429; wait briefly, then retry.
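One common way to retry after a 429 is exponential backoff with jitter. The sketch below assumes nothing about the gateway beyond the 429 status itself: this page does not say whether a Retry-After header is sent, so the delays are chosen client-side. The name call_with_retry and the send callable, which returns a (status, body) pair, are illustrative.

```python
import random
import time
from typing import Callable, Tuple


def call_with_retry(
    send: Callable[[], Tuple[int, str]],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str]:
    """Call send() and retry with exponential backoff while it returns 429."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt < max_attempts - 1:
            # Delay doubles each attempt (1s, 2s, 4s, ...) plus up to 0.5s of
            # jitter so that concurrent clients do not retry in lockstep.
            sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.5))
    # Still rate limited after the last attempt: hand the 429 back to the caller.
    return status, body
```

Because the limit is counted per user account over a one-minute window, every API Key under that account should share the same backoff rather than retrying independently.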

Veo video models

The Veo group includes veo_3_1, veo_3_1-fast-fl, veo_3_1-fast-fl-hd, billed per call.

Model	Per-call price
`veo_3_1`	$2.5
`veo_3_1-fast-fl`	$0.8
`veo_3_1-fast-fl-hd`	$1.6

After submitting a task you get a task_id; poll status with /v1/videos/{task_id}. Failed or cancelled tasks are refunded per platform logic.
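The submit-then-poll flow can be sketched as a small polling loop. Only the path /v1/videos/{task_id} and the Bearer header come from this page; the top-level "status" field and the terminal status strings below are assumptions, so check a real response before relying on them. The function names are illustrative.

```python
import json
import time
import urllib.request
from typing import Callable, Dict

BASE_URL = "https://api.tuftech.org"

# ASSUMPTION: placeholder status strings; the real values are not documented here.
TERMINAL_STATUSES = {"completed", "failed", "cancelled"}


def fetch_task(task_id: str, api_key: str) -> Dict:
    """GET /v1/videos/{task_id} and decode the JSON body."""
    request = urllib.request.Request(
        f"{BASE_URL}/v1/videos/{task_id}",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read())


def poll_task(
    fetch: Callable[[], Dict],
    interval: float = 10.0,
    timeout: float = 900.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict:
    """Call fetch() every `interval` seconds until a terminal status or timeout."""
    waited = 0.0
    while True:
        task = fetch()
        # ASSUMPTION: the response carries a top-level "status" field.
        if task.get("status") in TERMINAL_STATUSES:
            return task
        if waited >= timeout:
            raise TimeoutError(f"task still running after {waited:.0f}s")
        sleep(interval)
        waited += interval
```

A call such as poll_task(lambda: fetch_task(task_id, api_key)) polls six times per minute at the default interval, which stays well under the veo group's cap of 20 requests per minute.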

Troubleshooting

On a failed call, check the usage log first: model name, group, status code, error message, request time, charge record.

Code	Meaning
`401`	Wrong Key or disabled token
`402`	Insufficient balance
`429`	Rate limited
`503`	No available account for the channel, or upstream down

If you can't resolve it, contact an admin and include a screenshot of the log entry or its essential details.
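Client code can turn the status-code table into a first-line triage message. In the sketch below the meanings come from the table above, while the suggested actions and the name explain_status are illustrative and not an official runbook.

```python
# Meanings are taken from the status-code table; the suggested actions are
# illustrative assumptions, not official guidance.
STATUS_HINTS = {
    401: ("Wrong Key or disabled token", "Check the Key and its status in Token Management."),
    402: ("Insufficient balance", "Check your balance in the console."),
    429: ("Rate limited", "Wait briefly and retry."),
    503: (
        "No available account for the channel, or upstream down",
        "Retry later; if it persists, contact an admin.",
    ),
}


def explain_status(code: int) -> str:
    """Return a one-line explanation for an HTTP status code."""
    meaning, action = STATUS_HINTS.get(
        code,
        ("Not listed in the table", "Check the usage log for the error message."),
    )
    return f"{code}: {meaning}. {action}"
```

Logging the result of explain_status next to the raw response keeps the usage log and the client-side log easy to cross-reference.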

FAQ


How do I start using the TUFTech API?

After signing in, open “Token Management” to create an API Key. Set the client Base URL to https://api.tuftech.org; OpenAI-compatible clients usually use /v1/chat/completions or /v1/responses, Claude-compatible clients use /v1/messages. The API Key is shown only once — never put it in a public repo, web frontend, or chat screenshot.

Which group should I choose?

For Codex / OpenAI coding, prefer codex-pro, codex-plus, codex-plus-test; for Claude Code / Anthropic, use cc-max, cc-max-chat, cc-aws-p, aws-kiro; for Gemini use gemini-ultra; for video generation use veo. Different groups map to different available models, speed, and stability.

How is model pricing calculated?

The model marketplace shows each model's base price; token models are usually priced separately for input, output, and cache read/write, while video models are priced per call. The actual charge is the base price multiplied by the group ratio of the token you use. See the “Groups & ratios” table for current multipliers. The usage log shows the real charge per call and is the primary basis for reconciling your bill.

How is gpt-5.5 billed?

gpt-5.5 is priced per the model marketplace: input $5 / 1M tokens, output $30 / 1M tokens, cache read $0.5 / 1M tokens. The console price page and usage log are authoritative.
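As a worked example, the per-token prices above can be combined with a group ratio to estimate the charge for one call. This is a rough sketch under two assumptions that this page does not confirm: cache-read tokens are billed separately from ordinary input tokens, and the model is reachable through a group with ratio 0.4. The name estimate_charge is illustrative; the usage log remains the authority.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# gpt-5.5 base prices in USD per 1M tokens, as quoted above.
GPT_5_5_PRICES = {
    "input": Decimal("5"),
    "output": Decimal("30"),
    "cache_read": Decimal("0.5"),
}


def estimate_charge(
    input_tokens: int,
    output_tokens: int,
    cache_read_tokens: int,
    group_ratio: Decimal,
) -> Decimal:
    """Estimate the USD charge: per-token base price x group ratio."""
    base = (
        Decimal(input_tokens) * GPT_5_5_PRICES["input"]
        + Decimal(output_tokens) * GPT_5_5_PRICES["output"]
        # ASSUMPTION: cached tokens are not also counted as input tokens.
        + Decimal(cache_read_tokens) * GPT_5_5_PRICES["cache_read"]
    ) / MILLION
    return base * group_ratio


# 10,000 input + 2,000 output tokens: base $0.11, times ratio 0.4 gives $0.044.
example = estimate_charge(10_000, 2_000, 0, Decimal("0.4"))
```

The breakdown is 10,000 × $5 / 1M = $0.05 for input and 2,000 × $30 / 1M = $0.06 for output, so output tokens dominate even at a fifth of the input volume.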

How do I use and pay for Veo video models?

The Veo group includes veo_3_1, veo_3_1-fast-fl, veo_3_1-fast-fl-hd. Video models are billed per call — current per-call prices are $2.5, $0.8, and $1.6 respectively. After submitting a task you get a task_id; poll status with /v1/videos/{task_id}. Failed or cancelled tasks are refunded per platform logic.

Is the rate limit per user or platform-wide?

Rate limiting is counted per user account; multiple API Keys under one user share the limit, and different users don't affect each other. The default is up to 120 requests and 100 successful requests per minute; groups can override this (e.g. veo is lower, suited to controlling high-cost tasks).

How much are the signup and invite bonuses?

New users get 8.88 in credit on successful registration; when someone registers via an invite, the inviter and the invitee each get 5 in credit. To confirm that a bonus was applied, check your balance, logs, or invite records.

How do I troubleshoot a failed call?

Check the usage log first: model name, group, status code, error message, request time, and charge record. 401 is usually a wrong Key or a disabled token; 402 is insufficient balance; 429 is rate limiting; 503 usually means no available account for that channel or the upstream is down. When contacting an admin, include a screenshot of the log entry or its essential details.

How do email verification, the privacy policy, and the user agreement affect me?

This site has email verification enabled, along with a privacy policy and user agreement. Registration, login, credits, logs, and model forwarding are handled as described on those pages. When calling models, don't submit sensitive personal information, trade secrets, data you are not authorized to share, or data unrelated to your purpose.

This page is generated from backend config; the console price page and usage log are authoritative. Last updated: 2026-06-22 05:23 UTC