Welcome Docs Models Pricing Dashboard

ServiceLabs API Platform

Enterprise-grade AI API with intelligent routing, model management, and usage analytics

🔗

Multi-Model Access

Access 11+ AI models through a single unified API endpoint

Smart Routing

Intelligent request routing for optimal performance and availability

📊

Usage Analytics

Real-time token tracking and usage insights across all models

🔑

API Key Management

Create and manage keys with granular access control per model

11
Models
99.9%
Uptime
<1s
Latency
24/7
Monitoring

Getting Started

ServiceLabs provides a unified OpenAI-compatible API for accessing multiple AI models. All requests use the base URL:

https://router.servicelabs.biz.id/v1

Quick Start

Send your first request in seconds. Select a tab to see the example:

curl https://router.servicelabs.biz.id/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sv2-your-api-key" \
  -d '{
    "model": "glm-5.2",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
import requests

response = requests.post(
    "https://router.servicelabs.biz.id/v1/chat/completions",
    headers={
        "Authorization": "Bearer sv2-your-api-key",
        "Content-Type": "application/json"
    },
    json={
        "model": "glm-5.2",
        "messages": [{"role": "user", "content": "Hello!"}]
    }
)
print(response.json())
const response = await fetch("https://router.servicelabs.biz.id/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": "Bearer sv2-your-api-key",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    model: "glm-5.2",
    messages: [{ role: "user", content: "Hello!" }]
  })
});
const data = await response.json();
console.log(data);

Authentication

All API requests require a Bearer token in the Authorization header. API keys start with sv2-.

Authorization: Bearer sv2-your-api-key

You can create and manage API keys from the Dashboard.

Endpoints

GET /v1/models

List all available models. Returns an OpenAI-compatible model list.

// Response
{
  "object": "list",
  "data": [
    { "id": "glm-5.2", "object": "model", "owned_by": "servicelabs" },
    { "id": "minimax-m3", "object": "model", "owned_by": "servicelabs" },
    ...
  ]
}

POST /v1/chat/completions

Create a chat completion. Supports all standard OpenAI parameters.

// Request Body
{
  "model": "glm-5.2",
  "messages": [
    { "role": "user", "content": "Hello!" }
  ],
  "temperature": 0.7,
  "max_tokens": 1024
}
// Response (OpenAI format)
{
  "id": "chatcmpl-abc123",
  "object": "chat.completion",
  "model": "glm-5.2",
  "choices": [{
    "index": 0,
    "message": { "role": "assistant", "content": "Hello! How can I help?" },
    "finish_reason": "stop"
  }],
  "usage": { "prompt_tokens": 8, "completion_tokens": 9, "total_tokens": 17 }
}

POST /v1/chat/completions (Streaming)

Set "stream": true to receive Server-Sent Events (SSE). Each chunk is a data: line with a partial delta.

// Request
{
  "model": "glm-5.2",
  "messages": [{ "role": "user", "content": "Write a poem" }],
  "stream": true
}

// Response (SSE stream)
data: {"choices":[{"delta":{"content":"Roses"}}]}
data: {"choices":[{"delta":{"content":" are"}}]}
data: {"choices":[{"delta":{"content":" red"}}]}
data: [DONE]

Available Models

Model IDDescriptionCategory
glm-5.2Advanced language model for general tasksLanguage
minimax-m3High-performance multi-modal modelLanguage
kimi-k2.7-codeSpecialized code generation modelCode
qwen3.7-plusEnhanced reasoning and analysisReasoning
deepseek-v4-proProfessional-grade deep reasoningReasoning
glm-5.2-cfFast edge-optimized language modelLanguage
kimi-k2.6-cfLightweight code assistantCode
kimi-k2.7-code-cfEdge-optimized code modelCode
mimo-v2.5-proAdvanced visual understandingVision
mimo-v2.5Efficient visual analysisVision
qwen3.7-maxMaximum capability language modelLanguage

Rate Limits

Rate limits are applied per API key. Limits include:

  • Token Limit — Maximum total tokens the key can consume (configurable per key)
  • Credit Limit — Maximum credits the key can use (configurable per key)
  • Rate Limiting — Requests are throttled to prevent abuse

When a limit is reached, the API returns a 429 Too Many Requests response. Use the dashboard to adjust limits or reset usage.

Error Codes

CodeMeaning
401Invalid or missing API key
403Model not allowed for this key
429Rate limit exceeded
502Upstream service error

Available Models

All models are accessible through the unified ServiceLabs API.

Simple, Transparent Pricing

Start free. Scale when you need to.

Free

For getting started

$0/mo
  • 100K tokens per day
  • All models included
  • Basic support
  • Community access
Get Started

Enterprise

For large-scale deployments

Custom
  • Unlimited tokens
  • Dedicated support
  • SLA guarantee
  • Custom integrations
Contact Sales