A100 LLM Gateway

Single-model local deployment backed by vLLM and protected by LLMGateway API keys.

Qwen active

Active Model

Model
qwen/qwen3.6-27b-thinking
Backend
vLLM on 2x A100 40GB
Context
262144 tokens configured
HTTPS API
https://gateway.213-221-15-164.sslip.io/v1
HTTP API
http://213.221.15.164/v1

OpenAI-Compatible Call

Use a project API key from the API Keys page.

curl http://213.221.15.164/v1/chat/completions \ -H "Authorization: Bearer $LLM_GATEWAY_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "qwen/qwen3.6-27b-thinking", "messages": [{"role": "user", "content": "Hello"}], "max_tokens": 128 }'