API Reference

InstaGen provides OpenAI-compatible API endpoints for seamless integration.

Base URL

https://llm.submodel.ai/v1

Authentication

Include your access key in the Authorization header:

Authorization: Bearer YOUR_ACCESS_KEY

Supported Endpoints

List Models

GET /v1/models

Returns available models and their capabilities.

Chat Completions

POST /v1/chat/completions

Standard chat completion endpoint for conversational AI.

Text Completions

POST /v1/completions

Text completion endpoint for traditional prompt-based generation.

Infrastructure

Load Balancing: Automatic load balancing across multiple instances
High Availability: Redundant infrastructure with automatic failover
Global CDN: Optimized routing for worldwide access

Rate Limits

Concurrent Requests: 1000 per IP
No Daily Limits: Unlimited requests and tokens
Higher Limits: Contact us for enterprise needs

Quick Start

curl -X POST https://llm.submodel.ai/v1/chat/completions \
  -H "Authorization: Bearer YOUR_ACCESS_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen/Qwen3-235B-A22B-Instruct-2507",
    "messages": [{"role": "user", "content": "Hello"}]
  }'

Next Steps

PreviousGet Started NextModels

Last updated 1 month ago