# API Endpoints

Complete API endpoint reference.
Base URL: `https://api.archipelag.io/api/v1`

Unless noted otherwise, all requests require authentication via the `Authorization` header:

```
Authorization: Bearer ak_your_api_key
```
## Health

### Check Health

```
GET /health
```

No authentication required. Returns server status.

**Response**

```json
{
  "status": "ok",
  "version": "1.0.0"
}
```
## Jobs

### Submit Job

```
POST /jobs
```

Create and submit a compute job.

**Request Body**

```json
{
  "workload": "llm-chat",
  "input": {
    "prompt": "Hello!",
    "max_tokens": 100,
    "temperature": 0.7
  },
  "priority": 0
}
```

| Field | Type | Required | Description |
|---|---|---|---|
| `workload` | string | Yes | Cargo slug (e.g., "llm-chat", "sdxl") |
| `input` | object | Yes | Cargo-specific input parameters |
| `priority` | integer | No | Job priority (0-100, default 0) |
**Response**

```json
{
  "data": {
    "id": "job_abc123",
    "workload_id": 1,
    "workload_slug": "llm-chat",
    "status": "submitted",
    "input": { "prompt": "Hello!" },
    "created_at": "2026-01-26T12:00:00Z"
  }
}
```
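A hedged sketch of submitting a job from Python, assuming only the endpoint and fields documented above (the helper names `build_job_payload` and `submit_job` are ours, not part of any SDK):

```python
import json
import urllib.request

API_BASE = "https://api.archipelag.io/api/v1"

def build_job_payload(workload: str, job_input: dict, priority: int = 0) -> dict:
    """Validate and assemble the POST /jobs request body."""
    if not 0 <= priority <= 100:
        raise ValueError("priority must be between 0 and 100")
    return {"workload": workload, "input": job_input, "priority": priority}

def submit_job(api_key: str, payload: dict) -> dict:
    """POST the payload to /jobs and return the created job (the 'data' object)."""
    req = urllib.request.Request(
        API_BASE + "/jobs",
        data=json.dumps(payload).encode(),
        method="POST",
        headers={"Authorization": f"Bearer {api_key}",
                 "Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["data"]

payload = build_job_payload("llm-chat", {"prompt": "Hello!", "max_tokens": 100})
```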
### List Jobs

```
GET /jobs?limit=20&offset=0
```

List your recent jobs.

| Parameter | Type | Default | Description |
|---|---|---|---|
| `limit` | integer | 20 | Max results (1-100) |
| `offset` | integer | 0 | Pagination offset |
**Response**

```json
{
  "data": [
    {
      "id": "job_abc123",
      "status": "completed",
      "created_at": "2026-01-26T12:00:00Z",
      "completed_at": "2026-01-26T12:00:05Z"
    }
  ],
  "meta": {
    "total": 42,
    "limit": 20,
    "offset": 0
  }
}
```
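The `limit`/`offset` parameters support a simple pagination loop. A sketch, with the HTTP call abstracted behind a `fetch_page` callable (our name) so the paging logic stands on its own:

```python
def iter_jobs(fetch_page, limit=20):
    """Walk GET /jobs pages with limit/offset until meta.total is exhausted.

    fetch_page(limit, offset) must return the parsed JSON response body.
    """
    offset = 0
    while True:
        page = fetch_page(limit, offset)
        yield from page["data"]
        offset += limit
        if offset >= page["meta"]["total"]:
            break
```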
### Get Job

```
GET /jobs/:id
```

Get details of a specific job.

**Response**

```json
{
  "data": {
    "id": "job_abc123",
    "workload_slug": "llm-chat",
    "status": "completed",
    "input": { "prompt": "Hello!" },
    "output": "Hi there! How can I help?",
    "usage": {
      "prompt_tokens": 5,
      "completion_tokens": 8,
      "total_tokens": 13,
      "credits_used": 0.0013
    },
    "created_at": "2026-01-26T12:00:00Z",
    "started_at": "2026-01-26T12:00:01Z",
    "completed_at": "2026-01-26T12:00:05Z",
    "duration_ms": 4000
  }
}
```
**Job Status Values**

- `pending` - Job created, awaiting an Island
- `queued` - Waiting in queue
- `running` - Currently executing
- `completed` - Successfully finished
- `failed` - Execution failed
- `cancelled` - User cancelled
- `timeout` - Exceeded time limit
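Since jobs run asynchronously, a client typically polls `GET /jobs/:id` until one of the terminal statuses above is reached. A sketch; the `get_job` callable and the default intervals are our assumptions, not part of the API:

```python
import time

# Terminal statuses from the list above.
TERMINAL_STATUSES = {"completed", "failed", "cancelled", "timeout"}

def wait_for_job(get_job, job_id, interval=2.0, timeout=300.0,
                 clock=time.monotonic, sleep=time.sleep):
    """Poll GET /jobs/:id until the job reaches a terminal status."""
    deadline = clock() + timeout
    while True:
        job = get_job(job_id)
        if job["status"] in TERMINAL_STATUSES:
            return job
        if clock() > deadline:
            raise TimeoutError(f"job {job_id} not finished after {timeout}s")
        sleep(interval)
```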
### Cancel Job

```
DELETE /jobs/:id
```

Cancel a pending or running job.

**Response**

```json
{
  "data": {
    "id": "job_abc123",
    "status": "cancelled"
  }
}
```
### Stream Job Output

```
GET /jobs/:id/stream
```

Stream job output in real time using Server-Sent Events (SSE).

**Events**

```
event: token
data: {"content": "Hello"}

event: progress
data: {"step": 10, "total": 30}

event: status
data: {"state": "streaming"}

event: image
data: {"data": "base64...", "format": "png"}

event: done
data: {"usage": {"total_tokens": 42}}

event: error
data: {"error": "Something went wrong"}
```
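A minimal SSE line parser for consuming this stream; it follows the generic SSE framing (events separated by blank lines) rather than any official client:

```python
def parse_sse(lines):
    """Yield (event, data) pairs from raw SSE lines.

    A blank line terminates each event; multiple data: lines are joined.
    """
    event, data = "message", []
    for line in lines:
        if line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            data.append(line[len("data:"):].strip())
        elif line == "":
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
    if data:  # stream ended without a trailing blank line
        yield event, "\n".join(data)
```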
## Chat Completions (OpenAI-Compatible)

### Create Chat Completion

```
POST /chat/completions
```

OpenAI-compatible chat completion endpoint.

**Request Body**

```json
{
  "model": "llm-chat",
  "messages": [
    {"role": "system", "content": "You are helpful"},
    {"role": "user", "content": "Hello!"}
  ],
  "max_tokens": 100,
  "temperature": 0.7,
  "stream": false
}
```
**Response (non-streaming)**

```json
{
  "id": "job_abc123",
  "object": "chat.completion",
  "created": 1706270400,
  "model": "llm-chat",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hi there! How can I help?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 8,
    "total_tokens": 18
  }
}
```
**Response (streaming)**

When `stream` is `true`, the response is an SSE stream of OpenAI-format chunks:

```
data: {"choices":[{"delta":{"content":"Hi"}}]}

data: {"choices":[{"delta":{"content":" there"}}]}

data: [DONE]
```
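Because the chunks follow the OpenAI wire format, any OpenAI-compatible client pointed at this base URL should work. As a dependency-free sketch, the `data:` lines can also be assembled by hand:

```python
import json

def collect_stream(data_lines):
    """Assemble assistant text from OpenAI-format streaming chunks."""
    parts = []
    for line in data_lines:
        payload = line.removeprefix("data:").strip()
        if payload == "[DONE]":  # sentinel that ends the stream
            break
        chunk = json.loads(payload)
        delta = chunk["choices"][0].get("delta", {})
        parts.append(delta.get("content", ""))
    return "".join(parts)
```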
## Cargos

### List Cargos (Catalog)

```
GET /cargos
```

No authentication required. Lists all enabled and approved Cargos.

**Query Parameters**

| Parameter | Type | Description |
|---|---|---|
| `runtime_type` | string | Filter by runtime: `container`, `wasm`, `onnx`, `llmcpp`, `diffusers`, `coreml` |
| `category` | string | Filter by use case: `llm`, `image`, `audio`, `vision`, `text`, `utility` |

**Examples**

```
# All LLM models
GET /cargos?category=llm

# All ONNX models
GET /cargos?runtime_type=onnx

# Image generators only
GET /cargos?category=image
```
**Response**

```json
{
  "cargos": [
    {
      "slug": "gguf-mistral-7b",
      "name": "Mistral 7B Instruct",
      "description": "Mistral 7B Instruct v0.2 (Q4_K_M)",
      "runtime_type": "llmcpp",
      "onnx_task_type": null,
      "price_per_job": 1.0,
      "required_vram_mb": 6000,
      "required_ram_mb": 8000
    }
  ],
  "count": 15
}
```
### Categories

| Category | Includes |
|---|---|
| `llm` | GGUF LLMs + container LLM Cargos |
| `image` | Diffusers image gen + container image gen |
| `audio` | ONNX ASR (Whisper, Voxtral) + TTS (Kokoro, Chatterbox) |
| `vision` | ONNX detection, segmentation, captioning, depth, OCR |
| `text` | ONNX classification, embeddings, NER, QA, translation, summarization |
| `utility` | WASM + container utilities (PDF, video, image processing) |
## Account

### Get Account

```
GET /account
```

Get your account information.

**Response**

```json
{
  "data": {
    "id": "user_abc123",
    "email": "user@example.com",
    "credits": 100.50,
    "created_at": "2026-01-01T00:00:00Z"
  }
}
```
## API Keys

### List API Keys

```
GET /api-keys
```

List your API keys.

**Response**

```json
{
  "data": [
    {
      "id": "key_abc123",
      "name": "Production",
      "prefix": "ak_prod_",
      "created_at": "2026-01-01T00:00:00Z",
      "last_used_at": "2026-01-26T12:00:00Z"
    }
  ]
}
```
### Create API Key

```
POST /api-keys
```

Create a new API key. The full key is returned only once, in the top-level `key` field of the response; store it securely.

**Request Body**

```json
{
  "name": "Development"
}
```

**Response**

```json
{
  "data": {
    "id": "key_abc123",
    "name": "Development",
    "prefix": "ak_dev_"
  },
  "key": "ak_dev_full_key_only_shown_once"
}
```
### Delete API Key

```
DELETE /api-keys/:id
```

Revoke an API key.
## Island Recommendations

### Get Preload Recommendations

```
GET /island/recommendations?host_id=YOUR_ISLAND_ID
```

No authentication required. Returns Cargos your Island should preload based on current network demand, filtered by your hardware capabilities.

**Query Parameters**

| Parameter | Type | Required | Description |
|---|---|---|---|
| `host_id` | string (UUID) | Yes | Your Island's host ID |
**Response**

```json
{
  "host_id": "df1745b5-...",
  "recommendations": [
    {
      "workload_slug": "gguf-qwen3.5-4b",
      "model_url": "hf://unsloth/Qwen3.5-4B-GGUF",
      "model_hash": null,
      "runtime_type": "llmcpp",
      "estimated_earnings_per_job": "0.5",
      "queued_demand": 3,
      "demand_score": 30
    }
  ]
}
```
| Field | Description |
|---|---|
| `workload_slug` | Cargo identifier |
| `model_url` | HuggingFace or direct download URL |
| `runtime_type` | Runtime needed (`llmcpp`, `container`, `wasm`, `onnx`, `diffusers`) |
| `queued_demand` | Number of jobs currently queued for this Cargo |
| `demand_score` | Composite score: `queued * 10 + recent_completions * 2` |
| `estimated_earnings_per_job` | Credits earned per completed job |

Results are sorted by `demand_score` descending (highest demand first). Only Cargos your Island's hardware can serve are included. Islands call this endpoint automatically every 15 minutes when `coordinator.api_url` is configured.
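The `demand_score` formula can be reproduced directly; in the sample response above, a `queued_demand` of 3 with no recent completions yields a score of 30:

```python
def demand_score(queued_demand: int, recent_completions: int) -> int:
    """Composite score as documented: queued * 10 + recent_completions * 2."""
    return queued_demand * 10 + recent_completions * 2
```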
## Error Responses

All errors follow this format:

```json
{
  "error": {
    "message": "Description of the error",
    "code": "error_code"
  }
}
```
### HTTP Status Codes
| Code | Meaning |
|---|---|
| 200 | Success |
| 201 | Created |
| 400 | Bad request |
| 401 | Authentication required |
| 402 | Insufficient credits |
| 404 | Not found |
| 422 | Validation error |
| 429 | Rate limited |
| 500 | Server error |
## Rate Limiting

When rate limited, the response includes a `Retry-After` header (seconds to wait):

```
HTTP/1.1 429 Too Many Requests
Retry-After: 30
```

```json
{
  "error": {
    "message": "Rate limit exceeded",
    "code": "rate_limited",
    "retry_after": 30
  }
}
```
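A retry helper that honors `Retry-After`; the `send` callable abstracts the actual HTTP request, and the names and defaults here are illustrative:

```python
import time

def send_with_retry(send, max_attempts=5, sleep=time.sleep):
    """Retry a request on 429, waiting the number of seconds in Retry-After.

    send() must return (status_code, headers_dict, body).
    """
    for attempt in range(max_attempts):
        status, headers, body = send()
        if status != 429:
            return status, body
        sleep(int(headers.get("Retry-After", 1)))
    raise RuntimeError("still rate limited after max_attempts retries")
```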
### Quotas
| Resource | Limit |
|---|---|
| Requests per minute | 100 |
| Concurrent jobs | 10 |
| Concurrent streams | 5 |
