Model Center

Customer AI model catalog

The public BatchIn catalog prioritizes real live models, limited previews, request-access routes, and relay lanes. No filler, no fake cards.

27 listed models

Qwen

Qwen

qwen3-next-80b-a3b

Qwen3-Next-80B-A3B

AvailableRelay

Featured

Low-cost Qwen route for chat, batch generation, and API consolidation

Release
2026
Max Output
32K
Context
256K
Pricing
$0.126 / $0.126

Price source: SiliconFlow lane

Apache-2.0Open sourcefeaturedqwen

TTFT

160ms

Throughput

120 tok/s

View model detail
Qwen

Qwen

qwen3-coder-30b-a3b

Qwen3-Coder-30B-A3B

AvailableRelay

Lower-cost Qwen coder lane for tool-using apps, copilots, and internal agents

Release
2026
Max Output
32K
Context
256K
Pricing
$0.182 / $0.182

Price source: SiliconFlow lane

Apache-2.0Open sourceqwencoder

TTFT

160ms

Throughput

120 tok/s

View model detail
DeepSeek

DeepSeek

deepseek-v4-flash

DeepSeek V4 Flash

AvailableAsia Lane

Asia / Batch routes

Fast production DeepSeek route with standard, Asia, and batch pricing lanes

Release
2026
Max Output
64K
Context
256K
Pricing
$0.182 / $0.182

Price source: SiliconFlow lane

Proprietaryfeatureddeepseekasia

TTFT

160ms

Throughput

120 tok/s

View model detail
Qwen

Qwen

qwen3.5-397b

Qwen3.5-397B

AvailableRelay

Large Qwen route for premium general reasoning with a public batch discount

Release
2026
Max Output
64K
Context
256K
Pricing
$1.050 / $1.050

Price source: SiliconFlow lane

Qwenqwen

TTFT

420ms

Throughput

58 tok/s

View model detail
MiniMax

MiniMax

minimax-m2.5

MiniMax M2.5

AvailablePass-through

MiniMax pass-through route kept for compatibility and procurement continuity

Release
2026
Max Output
64K
Context
256K
Pricing
$0.840 / $0.840

Price source: SiliconFlow lane

MiniMaxminimaxpass-through

TTFT

160ms

Throughput

120 tok/s

View model detail
MiniMax

MiniMax

minimax-m2-7

MiniMax M2.7

AvailablePass-through

Featured pass-through

Newest MiniMax public lane with transparent pass-through pricing

Release
2026
Max Output
64K
Context
256K
Pricing
$0.840 / $0.840

Price source: SiliconFlow lane

Open-source (Non-commercial)Open sourcefeaturedminimaxpass-through

TTFT

160ms

Throughput

120 tok/s

View model detail
Qwen

Qwen

qwen-3-6-plus

Qwen 3.6 Plus

AvailablePass-through

Higher-end Qwen route offered as a pass-through lane for customers who need latest proprietary capability

Release
2026
Max Output
64K
Context
256K
Pricing
$1.365 / $1.365

Price source: SiliconFlow lane

Qwenqwenpass-through

TTFT

160ms

Throughput

120 tok/s

View model detail
Moonshot

Moonshot

kimi-k2-6

Kimi K2.6

AvailablePass-through

Featured pass-through

Latest featured Kimi route for customers who need the newest Moonshot capability

Release
2026
Max Output
64K
Context
256K
Pricing
$2.800 / $2.800

Price source: SiliconFlow lane

Moonshotfeaturedkimipass-through

TTFT

160ms

Throughput

120 tok/s

View model detail
Moonshot

Moonshot

kimi-k2-5

Kimi K2.5

AvailableRelay

Previous Kimi production lane with standard and batch pricing

Release
2026
Max Output
64K
Context
256K
Pricing
$1.540 / $1.540

Price source: SiliconFlow lane

Kimikimi

TTFT

160ms

Throughput

120 tok/s

View model detail
StepFun

StepFun

step-3-5-flash

Step 3.5 Flash

AvailableRelay

Fast economical route for broad low-latency text workloads

Release
2026
Max Output
16K
Context
128K
Pricing
$0.098 / $0.098

Price source: SiliconFlow lane

StepFunflash

TTFT

160ms

Throughput

120 tok/s

View model detail
Xiaomi

Xiaomi

mimo-v2-flash

MiMo-V2-Flash

AvailableRelay

Entry MiMo route for cost-sensitive high-volume tasks

Release
2026
Max Output
32K
Context
128K
Pricing
$0.133 / $0.133

Price source: SiliconFlow lane

Apache-2.0Open sourcemimo

TTFT

160ms

Throughput

120 tok/s

View model detail
Xiaomi

Xiaomi

mimo-v2-pro

MiMo-V2-Pro

Limited previewPrivate Preview

Premium MiMo route for stronger multi-step reasoning and structured responses

Release
2026
Max Output
32K
Context
128K
Pricing
No public self-serve price
Xiaomimimo

TTFT

160ms

Throughput

120 tok/s

View model detail
OpenAI

OpenAI OSS

gpt-oss-120b

GPT-OSS-120B

AvailableRelay

Open-weight GPT-OSS route for low-cost general inference and experimentation

Release
2026
Max Output
32K
Context
128K
Pricing
$0.126 / $0.126

Price source: SiliconFlow lane

Apache-2.0Open sourceossfeatured

TTFT

310ms

Throughput

72 tok/s

View model detail
Mistral AI

Mistral

devstral-2

Devstral 2

AvailableRelay

Developer-focused route for coding assistants and software workflows

Release
2026
Max Output
32K
Context
128K
Pricing
$0.252 / $0.252

Price source: SiliconFlow lane

Apache-2.0Open sourcecoding

TTFT

160ms

Throughput

120 tok/s

View model detail
NVIDIA

NVIDIA

nemotron-3-super

Nemotron 3 Super

AvailableRelay

NVIDIA route for teams standardizing around enterprise GPU ecosystems

Release
2026
Max Output
32K
Context
128K
Pricing
$0.231 / $0.231

Price source: SiliconFlow lane

NVIDIAnvidia

TTFT

160ms

Throughput

120 tok/s

View model detail
Black Forest Labs

Black Forest Labs

flux-schnell

FLUX Schnell

AvailableRelay

FLUX Schnell is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
N/A
Pricing
$0.002 / image

Price source: BatchIn runtime catalog

Apache-2.0Open sourceblack-forest-labsavailable

TTFT

1.4s

Throughput

scene-bound

View model detail
BAAI

BAAI

bge-m3

BGE-M3

AvailableRelay

BGE-M3 is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
8K
Pricing
$0.006 / $0.000

Price source: BatchIn runtime catalog

MITOpen sourcebaaiavailable

TTFT

160ms

Throughput

120 tok/s

View model detail
Qwen

Qwen

qwen3-coder

Qwen3 Coder

AvailableRelay

Qwen3 Coder is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
131K
Pricing
$0.665 / $0.665

Price source: BatchIn runtime catalog

Apache-2.0Open sourceqwenavailable

TTFT

220ms

Throughput

94 tok/s

View model detail
Z.ai

Z.ai

glm-5-1

GLM-5.1

AvailableRelay

Featured

Open flagship route for coding, reasoning, and long-horizon agent execution

Release
2026
Max Output
128K
Context
198K
Pricing
$1.470 / $1.470

Price source: SiliconFlow lane

MITOpen sourcefeaturedglmcoding

TTFT

520ms

Throughput

42 tok/s

View model detail
Z.ai

Z.ai

glm-5

GLM-5

Limited previewPrivate Preview

High-end GLM lane for production reasoning and long-context workflows

Release
2026
Max Output
64K
Context
198K
Pricing
No public self-serve price
GLMglm

TTFT

520ms

Throughput

42 tok/s

View model detail
Wan

Wan

wan-2.2

Wan 2.2

Limited previewRelay

Wan 2.2 is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
N/A
Pricing
No public self-serve price
Apache-2.0Open sourcewanlimited-preview

TTFT

1.4s

Throughput

scene-bound

View model detail
Qwen

Alibaba

cosyvoice

CosyVoice

Limited previewRelay

CosyVoice is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
4K
Pricing
No public self-serve price
Apache-2.0Open sourcealibabalimited-preview

TTFT

180ms

Throughput

real-time

View model detail
Mistral AI

Mistral

mistral-small-4

Mistral Small 4

Limited previewRelay

Mistral Small 4 is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
33K
Pricing
No public self-serve price
Apache-2.0Open sourcemistrallimited-preview

TTFT

160ms

Throughput

120 tok/s

View model detail
Qwen

Qwen

qwen3.5-9b

Qwen3.5 9B

Limited previewRelay

Qwen3.5 9B is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
33K
Pricing
No public self-serve price
Apache-2.0Open sourceqwenlimited-preview

TTFT

160ms

Throughput

120 tok/s

View model detail
Qwen

Qwen

qwen3-tts

Qwen3 TTS

Limited previewRelay

Qwen3 TTS is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
4K
Pricing
No public self-serve price
Qwenqwenlimited-preview

TTFT

180ms

Throughput

real-time

View model detail
Meta

Meta

llama-4-maverick

Llama 4 Maverick

Limited previewRelay

Llama 4 Maverick is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
131K
Pricing
No public self-serve price
Llamametalimited-preview

TTFT

160ms

Throughput

120 tok/s

View model detail
Qwen

Qwen

qwen3-coder-next

Qwen3 Coder Next

Limited previewRelay

Qwen3 Coder Next is available through BatchIn's public live catalog.

Release
2026
Max Output
N/A
Context
131K
Pricing
No public self-serve price
Qwenqwenlimited-preview

TTFT

160ms

Throughput

120 tok/s

View model detail