Qwen

Qwen

qwen3.5-9b

Qwen3.5 9B

Qwen3.5 9B is available through BatchIn's public live catalog.

Public model detailLimited previewDense Transformer

Params

9B

Context

33K

Max Output

N/A

License

Apache-2.0

TTFT

160ms

Throughput

120 tok/s

Why pick it

  • Live backend route with verified pricing and billing.
  • Public catalog entry generated from the backend model list.

Pricing

This model does not currently expose public self-serve pricing. Public rates appear only after backend verification.
TierPublicCachedPrice sourceNote
RealtimeNot publicNot publicBatchIn runtime catalogPublic price reflects the runtime catalog without claimed savings comparisons
BatchNot publicNot publicBatchIn runtime catalogBatch public pricing follows the same runtime source
Live relay pricing pulled from the backend catalog.

Quick start

OpenAI-compatible surface. Swap the base URL and ship

Python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.luminapath.tech/v1",
    api_key="BATCHIN_API_KEY"
)

resp = client.chat.completions.create(
    model="qwen3.5-9b",
    messages=[{"role": "user", "content": "Summarize why this model is a fit for my workload"}]
)

print(resp.choices[0].message.content)
JavaScript
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.luminapath.tech/v1",
  apiKey: process.env.BATCHIN_API_KEY,
});

const resp = await client.chat.completions.create({
  model: "qwen3.5-9b",
  messages: [{ role: "user", content: "Summarize why this model is a fit for my workload" }],
});

console.log(resp.choices[0]?.message?.content);
cURL
curl https://api.luminapath.tech/v1/chat/completions \
  -H "Authorization: Bearer ***" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen3.5-9b",
    "messages": [{"role":"user","content":"Summarize why this model is a fit for my workload"}]
  }'

Specs

Architecture

Dense Transformer

Vendor group

Qwen

Context window

33K

Max output

N/A

Best for

qwen
limited-preview

Related models

Back to model center