Model Center

Customer AI model catalog

The public BatchIn catalog prioritizes real live models, limited previews, request-access routes, and relay lanes. No filler, no fake cards.

27 listed models

Qwen

qwen3-next-80b-a3b

Qwen3-Next-80B-A3B

Available, Relay

Featured

Low-cost Qwen route for chat, batch generation, and API consolidation

Release: 2026
Max Output: 32K
Context: 256K
Pricing: $0.126 / $0.126

Apache-2.0, Open source, featured, qwen

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Qwen

qwen3-coder-30b-a3b

Qwen3-Coder-30B-A3B

Available, Relay

Lower-cost Qwen coder lane for tool-using apps, copilots, and internal agents

Release: 2026
Max Output: 32K
Context: 256K
Pricing: $0.182 / $0.182

Apache-2.0, Open source, qwen, coder

TTFT: 160ms
Throughput: 120 tok/s

View model detail

DeepSeek

deepseek-v4-flash

DeepSeek V4 Flash

Available, Asia Lane

Asia / Batch routes

Fast production DeepSeek route with standard, Asia, and batch pricing lanes

Release: 2026
Max Output: 64K
Context: 256K
Pricing: $0.182 / $0.182

Proprietary, featured, deepseek, asia

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Qwen

qwen3.5-397b

Qwen3.5-397B

Available, Relay

Large Qwen route for premium general reasoning with a public batch discount

Release: 2026
Max Output: 64K
Context: 256K
Pricing: $1.050 / $1.050

Qwen, qwen

TTFT: 420ms
Throughput: 58 tok/s

View model detail

MiniMax

minimax-m2.5

MiniMax M2.5

Available, Pass-through

MiniMax pass-through route kept for compatibility and procurement continuity

Release: 2026
Max Output: 64K
Context: 256K
Pricing: $0.840 / $0.840

MiniMax, minimax, pass-through

TTFT: 160ms
Throughput: 120 tok/s

View model detail

MiniMax

minimax-m2-7

MiniMax M2.7

Available, Pass-through

Featured pass-through

Newest MiniMax public lane with transparent pass-through pricing

Release: 2026
Max Output: 64K
Context: 256K
Pricing: $0.840 / $0.840

Open-source (Non-commercial), Open source, featured, minimax, pass-through

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Qwen

qwen-3-6-plus

Qwen 3.6 Plus

Available, Pass-through

Higher-end Qwen route offered as a pass-through lane for customers who need the latest proprietary capability

Release: 2026
Max Output: 64K
Context: 256K
Pricing: $1.365 / $1.365

Qwen, qwen, pass-through

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Moonshot

kimi-k2-6

Kimi K2.6

Available, Pass-through

Featured pass-through

Latest featured Kimi route for customers who need the newest Moonshot capability

Release: 2026
Max Output: 64K
Context: 256K
Pricing: $2.800 / $2.800

Moonshot, featured, kimi, pass-through

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Moonshot

kimi-k2-5

Kimi K2.5

Available, Relay

Previous Kimi production lane with standard and batch pricing

Release: 2026
Max Output: 64K
Context: 256K
Pricing: $1.540 / $1.540

Kimi, kimi

TTFT: 160ms
Throughput: 120 tok/s

View model detail

StepFun

step-3-5-flash

Step 3.5 Flash

Available, Relay

Fast economical route for broad low-latency text workloads

Release: 2026
Max Output: 16K
Context: 128K
Pricing: $0.098 / $0.098

StepFun, flash

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Xiaomi

mimo-v2-flash

MiMo-V2-Flash

Available, Relay

Entry MiMo route for cost-sensitive high-volume tasks

Release: 2026
Max Output: 32K
Context: 128K
Pricing: $0.133 / $0.133

Apache-2.0, Open source, mimo

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Xiaomi

mimo-v2-pro

MiMo-V2-Pro

Limited preview, Private Preview

Premium MiMo route for stronger multi-step reasoning and structured responses

Release: 2026
Max Output: 32K
Context: 128K
Pricing: No public self-serve price

Xiaomi, mimo

TTFT: 160ms
Throughput: 120 tok/s

View model detail

OpenAI OSS

gpt-oss-120b

GPT-OSS-120B

Available, Relay

Open-weight GPT-OSS route for low-cost general inference and experimentation

Release: 2026
Max Output: 32K
Context: 128K
Pricing: $0.126 / $0.126

Apache-2.0, Open source, oss, featured

TTFT: 310ms
Throughput: 72 tok/s

View model detail

Mistral

devstral-2

Devstral 2

Available, Relay

Developer-focused route for coding assistants and software workflows

Release: 2026
Max Output: 32K
Context: 128K
Pricing: $0.252 / $0.252

Apache-2.0, Open source, coding

TTFT: 160ms
Throughput: 120 tok/s

View model detail

NVIDIA

nemotron-3-super

Nemotron 3 Super

Available, Relay

NVIDIA route for teams standardizing around enterprise GPU ecosystems

Release: 2026
Max Output: 32K
Context: 128K
Pricing: $0.231 / $0.231

NVIDIA, nvidia

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Black Forest Labs

flux-schnell

FLUX Schnell

Available, Relay

FLUX Schnell is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: N/A
Pricing: $0.002 / image

Apache-2.0, Open source, black-forest-labs, available

TTFT: 1.4s
Throughput: scene-bound

View model detail

BAAI

bge-m3

BGE-M3

Available, Relay

BGE-M3 is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: 8K
Pricing: $0.006 / $0.000

MIT, Open source, baai, available

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Qwen

qwen3-coder

Qwen3 Coder

Available, Relay

Qwen3 Coder is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: 131K
Pricing: $0.665 / $0.665

Apache-2.0, Open source, qwen, available

TTFT: 220ms
Throughput: 94 tok/s

View model detail

Z.ai

glm-5-1

GLM-5.1

Available, Relay

Featured

Open flagship route for coding, reasoning, and long-horizon agent execution

Release: 2026
Max Output: 128K
Context: 198K
Pricing: $1.470 / $1.470

MIT, Open source, featured, glm, coding

TTFT: 520ms
Throughput: 42 tok/s

View model detail

Z.ai

glm-5

GLM-5

Limited preview, Private Preview

High-end GLM lane for production reasoning and long-context workflows

Release: 2026
Max Output: 64K
Context: 198K
Pricing: No public self-serve price

GLM, glm

TTFT: 520ms
Throughput: 42 tok/s

View model detail

Wan

wan-2.2

Wan 2.2

Limited preview, Relay

Wan 2.2 is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: N/A
Pricing: No public self-serve price

Apache-2.0, Open source, wan, limited-preview

TTFT: 1.4s
Throughput: scene-bound

View model detail

Alibaba

cosyvoice

CosyVoice

Limited preview, Relay

CosyVoice is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: 4K
Pricing: No public self-serve price

Apache-2.0, Open source, alibaba, limited-preview

TTFT: 180ms
Throughput: real-time

View model detail

Mistral

mistral-small-4

Mistral Small 4

Limited preview, Relay

Mistral Small 4 is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: 33K
Pricing: No public self-serve price

Apache-2.0, Open source, mistral, limited-preview

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Qwen

qwen3.5-9b

Qwen3.5 9B

Limited preview, Relay

Qwen3.5 9B is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: 33K
Pricing: No public self-serve price

Apache-2.0, Open source, qwen, limited-preview

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Qwen

qwen3-tts

Qwen3 TTS

Limited preview, Relay

Qwen3 TTS is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: 4K
Pricing: No public self-serve price

Qwen, qwen, limited-preview

TTFT: 180ms
Throughput: real-time

View model detail

Llama 4 Maverick

Limited preview, Relay

Llama 4 Maverick is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: 131K
Pricing: No public self-serve price

Llama, meta, limited-preview

TTFT: 160ms
Throughput: 120 tok/s

View model detail

Qwen

qwen3-coder-next

Qwen3 Coder Next

Limited preview, Relay

Qwen3 Coder Next is available through BatchIn's public live catalog.

Release: 2026
Max Output: N/A
Context: 131K
Pricing: No public self-serve price

Qwen, qwen, limited-preview

TTFT: 160ms
Throughput: 120 tok/s

View model detail
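
The TTFT and throughput figures on each card can be combined into a rough end-to-end latency estimate when comparing routes. The sketch below is illustrative only: it assumes the listed numbers hold steady for the whole response and that total time is simply TTFT plus output tokens divided by throughput. The `ModelCard` class and both function names are hypothetical and are not part of any BatchIn API; the three sample entries copy the TTFT and throughput values listed above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelCard:
    """Hypothetical record holding two metrics copied from a catalog card."""
    slug: str
    ttft_ms: float           # time to first token, in milliseconds
    throughput_tok_s: float  # sustained output rate, in tokens per second


def estimate_latency_s(card: ModelCard, output_tokens: int) -> float:
    """Rough estimate: TTFT plus the time needed to stream the output tokens.

    Assumes the listed figures are constant for the whole response.
    """
    return card.ttft_ms / 1000.0 + output_tokens / card.throughput_tok_s


def rank_by_latency(cards, output_tokens: int):
    """Return the cards sorted from fastest to slowest estimated response."""
    return sorted(cards, key=lambda c: estimate_latency_s(c, output_tokens))


# Values taken from the cards in this catalog.
CARDS = [
    ModelCard("glm-5-1", ttft_ms=520, throughput_tok_s=42),
    ModelCard("qwen3-next-80b-a3b", ttft_ms=160, throughput_tok_s=120),
    ModelCard("gpt-oss-120b", ttft_ms=310, throughput_tok_s=72),
]

if __name__ == "__main__":
    for card in rank_by_latency(CARDS, output_tokens=1200):
        print(f"{card.slug}: ~{estimate_latency_s(card, 1200):.2f}s")
```

Real latency depends on load, prompt length, and routing lane, so an estimate like this is best treated as a first-pass comparison rather than a guarantee.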