NexGate

Model Selection

How to choose the right model for your use case on NexGate

Choosing a Model

NexGate supports models across multiple providers. Use direct model IDs — there are no aliases or tiers to manage.

Note

The full live catalog is always available at GET /api/models or the models page.

Quick Reference

ModelBest forInput/1MOutput/1M
llama-4-scoutHigh-volume, low-cost chat$0.10$0.35
deepseek-v4-flashFast low-cost edits$0.14$0.28
gpt-4.1-nanoClassification, routing$0.10$0.40
gpt-5.4-miniBalanced production work$0.40$1.60
deepseek-v3.2Efficient code generation$0.28$0.42
gpt-5.5High-quality reasoning$5.00$30.00
gpt-4oMultimodal flagship$2.50$10.00
grok-4-3Fast frontier chat$1.25$2.50
kimi-k2.5Long-context multilingual$0.60$2.50

How to Use

Pass any supported model ID directly in your request:

from openai import OpenAI

client = OpenAI(
    api_key="ng-your-key",
    base_url="https://api.nexgate.app/v1"
)

response = client.chat.completions.create(
    model="gpt-5.5",
    messages=[{"role": "user", "content": "Explain quantum computing"}]
)
import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: 'ng-your-key',
  baseURL: 'https://api.nexgate.app/v1'
});

const response = await client.chat.completions.create({
  model: 'gpt-5.5',
  messages: [{ role: 'user', content: 'Explain quantum computing' }]
});
curl https://api.nexgate.app/v1/chat/completions \
  -H "Authorization: Bearer ng-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.5",
    "messages": [{"role": "user", "content": "Explain quantum computing"}]
  }'

Recommendations by Use Case

Use CaseRecommended Model
High-volume chatbotllama-4-scout
Simple extraction / classificationgpt-4.1-nano or deepseek-v4-flash
General production appgpt-5.5
Code review / debugginggpt-5.5
Long-context documentskimi-k2.5 or kimi-k2.6
Complex reasoninggpt-5.5 or grok-4-20-reasoning

What's Next?

On this page