API Reference
InstaGen provides OpenAI-compatible API endpoints, so client libraries and tools written for the OpenAI API should work once they are pointed at the base URL below.
Base URL
https://llm.submodel.ai/v1
Authentication
Include your access key in the Authorization header:
Authorization: Bearer YOUR_ACCESS_KEY
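As a minimal sketch of the header above, the helper below builds the Authorization header from an access key. The function name auth_headers and the environment variable INSTAGEN_ACCESS_KEY are assumptions made for illustration; neither is defined by the API.

```python
import os


def auth_headers(access_key=None):
    """Return the Authorization header in the format shown above.

    INSTAGEN_ACCESS_KEY is a hypothetical environment variable name
    chosen for this sketch; the API itself does not define it.
    """
    key = access_key or os.environ.get("INSTAGEN_ACCESS_KEY")
    if not key:
        raise ValueError("no access key provided")
    return {"Authorization": f"Bearer {key}"}
```

Reading the key from the environment keeps it out of source control; pass it explicitly if you manage secrets another way.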
Supported Endpoints
List Models
GET /v1/models
Returns available models and their capabilities.
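A call to this endpoint might look like the following sketch, which uses only the Python standard library. It assumes the response follows the OpenAI list shape (a top-level "data" array whose entries carry an "id" field); this page does not show the response body, so treat that shape as an assumption. The helper names are illustrative.

```python
import json
import urllib.request

BASE_URL = "https://llm.submodel.ai/v1"


def models_request(access_key):
    """Build (but do not send) the GET /v1/models request."""
    return urllib.request.Request(
        f"{BASE_URL}/models",
        headers={"Authorization": f"Bearer {access_key}"},
        method="GET",
    )


def model_ids(payload):
    """Extract model IDs, assuming an OpenAI-style {"data": [{"id": ...}]} body."""
    return [entry["id"] for entry in payload.get("data", [])]


def fetch_model_ids(access_key, timeout=30):
    """Send the request and return the model IDs (performs a network call)."""
    with urllib.request.urlopen(models_request(access_key), timeout=timeout) as resp:
        return model_ids(json.load(resp))
```

Calling fetch_model_ids("YOUR_ACCESS_KEY") would return the IDs to use in the "model" field of later requests, provided the response shape assumption holds.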
Chat Completions
POST /v1/chat/completions
Standard chat completion endpoint for conversational AI.
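The request body for this endpoint carries a "model" name and a "messages" list. The sketch below assembles a multi-turn body; the "system" and "assistant" roles follow the OpenAI convention and are an assumption here, since this page only shows the "user" role. The function name is illustrative.

```python
def build_chat_payload(model, user_text, history=None, system_text=None):
    """Assemble a chat completions request body.

    history is a list of earlier {"role": ..., "content": ...} messages.
    The "system" and "assistant" roles are assumed from the OpenAI format.
    """
    messages = []
    if system_text:
        messages.append({"role": "system", "content": system_text})
    messages.extend(history or [])
    messages.append({"role": "user", "content": user_text})
    return {"model": model, "messages": messages}
```

Because the endpoint receives the whole message list on each call, the client is responsible for resending earlier turns when it wants the model to see them.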
Text Completions
POST /v1/completions
Text completion endpoint for traditional prompt-based generation.
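For comparison with the chat endpoint, a text completion request sends a single "prompt" string instead of a message list. The sketch below builds such a request with the standard library. The "prompt" and "max_tokens" fields are taken from the OpenAI completions format and are assumed, not confirmed, to be honored here; which models accept this endpoint is also not stated on this page, so the model is left as a parameter.

```python
import json
import urllib.request


def completion_request(access_key, model, prompt, max_tokens=64):
    """Build (but do not send) a POST /v1/completions request.

    The "prompt" and "max_tokens" fields follow the OpenAI format (assumed).
    """
    body = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        "https://llm.submodel.ai/v1/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to urllib.request.urlopen sends the request.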
Infrastructure
Load Balancing: Automatic load balancing across multiple instances
High Availability: Redundant infrastructure with automatic failover
Global CDN: Optimized routing for worldwide access
Rate Limits
Concurrent Requests: 1000 per IP
No Daily Limits: Unlimited requests and tokens
Higher Limits: Contact us for enterprise needs
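A client that fans out many requests can stay under the concurrency limit by capping how many calls are in flight at once. The sketch below does this with a thread pool; the cap of 32 is an arbitrary value chosen for illustration, and run_bounded is a hypothetical helper, not part of the API.

```python
from concurrent.futures import ThreadPoolExecutor

# Arbitrary cap for this sketch, well below the documented 1000 per IP.
MAX_IN_FLIGHT = 32


def run_bounded(calls, max_in_flight=MAX_IN_FLIGHT):
    """Run zero-argument callables with at most max_in_flight executing at once.

    Results are returned in the same order as the input callables.
    An exception raised by any call propagates to the caller.
    """
    with ThreadPoolExecutor(max_workers=max_in_flight) as pool:
        futures = [pool.submit(call) for call in calls]
        return [future.result() for future in futures]
```

Each callable would typically wrap one API request. Note that the limit is stated per IP, so several processes or machines behind the same address share it.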
Quick Start
curl -X POST https://llm.submodel.ai/v1/chat/completions \
  -H "Authorization: Bearer YOUR_ACCESS_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen/Qwen3-235B-A22B-Instruct-2507",
    "messages": [{"role": "user", "content": "Hello"}]
  }'
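The same request can be sent from Python with only the standard library, as in the sketch below. The URL, headers, model name, and body match the curl command; the reply_text helper assumes an OpenAI-style response in which the generated text sits at choices[0].message.content, which this page does not show. All function names are illustrative.

```python
import json
import urllib.request

API_URL = "https://llm.submodel.ai/v1/chat/completions"
DEFAULT_MODEL = "Qwen/Qwen3-235B-A22B-Instruct-2507"


def chat_request(access_key, content, model=DEFAULT_MODEL):
    """Build the same POST request as the curl command above."""
    body = {"model": model, "messages": [{"role": "user", "content": content}]}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def reply_text(response):
    """Pull the generated text out of an OpenAI-style response (assumed shape)."""
    return response["choices"][0]["message"]["content"]


def chat(access_key, content, timeout=60):
    """Send the request and return the reply text (performs a network call)."""
    with urllib.request.urlopen(chat_request(access_key, content), timeout=timeout) as resp:
        return reply_text(json.load(resp))
```

With a valid key, chat("YOUR_ACCESS_KEY", "Hello") would return the model's reply as a string. A non-2xx status raises urllib.error.HTTPError, which callers may want to catch.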