Skip to main content

Documentation Index

Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt

Use this file to discover all available pages before exploring further.

Models API

An OpenAI-compatible REST API at https://api.runcrate.ai/v1 that gives you access to 140+ open-source models. Use the same OpenAI SDKs you already know — just change the base URL and API key.

Model Categories

CategoryEndpointDescription
Chat/v1/chat/completionsConversational AI and instruction following
Reasoning/v1/chat/completionsChain-of-thought models with thinking output
Code/v1/chat/completionsCode generation, analysis, and debugging
Vision/v1/chat/completionsImage understanding via multimodal input
Image/v1/images/generationsText-to-image generation
Video/v1/videosText-to-video generation (async)
TTS/v1/audio/speechText-to-speech synthesis
ASR/v1/audio/transcriptionsSpeech-to-text transcription

Model Playground

An interactive testing environment in the dashboard. Each model has its own playground where you can chat, generate images, create videos, or test audio — with auto-generated code examples.

GPU Instances

Full Linux containers with dedicated NVIDIA GPUs and root shell access via SSH. Configure CPU, memory, storage, and GPU type/count. Billed hourly while running.

Available GPUs

GPUVRAMBest For
RTX 409024 GBDevelopment, small models, fine-tuning
A10040/80 GBTraining, large model inference
H10080 GBMaximum performance training and inference
L40S48 GBInference, image/video generation

Workspaces

The top-level organizational unit. Each workspace has its own resources (instances, storage), billing (credit balance, transactions), team members, and audit logs. Use workspaces to separate workloads, clients, or teams.

Environments

Resource containers inside a workspace. Every workspace has one or more environments (the default is called main). Deployments, storage instances, and API keys belong to an environment — money belongs to the workspace. All environments draw from the same credit balance. Use environments to separate staging from production, or to isolate different projects within one workspace.

Roles

RoleResourcesBillingMembersDelete Workspace
OwnerFullFullFullYes
ManagerFullFullFullNo
DeveloperFullNoNoNo

Credits

Prepaid billing currency. 1 credit = $1 USD. Credits are consumed by model API calls (per token/per generation) and GPU instance hours. Each workspace has its own balance.

Storage

Persistent cloud volumes (Wasabi, AWS S3, Backblaze B2) that mount to GPU instances. Data survives instance termination. Includes a built-in file explorer in the dashboard. Billed at $0.03/GB/month.

API Keys

Authentication tokens for the Models API and Infrastructure API. Created per-workspace in the dashboard. The full key is shown only once at creation. Format: rc_live_...