Deploying Instances

This guide walks you through deploying a GPU instance on Runcrate.

Prerequisites

Before deploying, make sure you have:

A Runcrate account
Credits in your account balance (add credits here)
An SSH key added to your account (manage SSH keys)

Step-by-Step Deployment

1. Open the Deploy Page

Navigate to Dashboard > Instances > Deploy Instance, or go directly to runcrate.ai/dashboard/instances/deploy.

2. Choose GPU Type and Count

Select the GPU model that fits your workload. You can deploy multiple GPUs on a single instance for multi-GPU training.

GPU	VRAM	Recommended For
RTX 4090	24 GB	Inference, fine-tuning
L40S	48 GB	Large model inference, medium training
A100 40 GB	40 GB	Training, research
A100 80 GB	80 GB	Large-scale training
H100	80 GB	Maximum performance

3. Configure CPU, Memory, and Storage

Adjust the instance specifications based on your needs:

CPU cores — More cores help with data preprocessing and multi-threaded workloads.
RAM — Ensure enough memory for your dataset and model. A good rule of thumb is 2x your GPU VRAM.
Disk — Local NVMe storage for your code, datasets, and model weights.

4. Select a Region

Choose from available regions. Availability of specific GPU types may vary by region.

Select the region closest to you for the lowest latency SSH experience.

5. Choose a Template

Select a pre-built template with the tools you need:

Image	Includes
Devbox (`ubuntu-devbox`)	Python + uv, VS Code Server, Jupyter, Docker, SSH
Finetune (`ubuntu-train`)	Unsloth, PyTorch + CUDA, HuggingFace CLI, W&B, Jupyter, VS Code Server
Inference (`ubuntu-inference`)	vLLM or SGLang, HuggingFace CLI, nvtop, Jupyter, VS Code Server

6. Attach an SSH Key

Select one or more SSH keys to authorize for the instance. If you have not added a key yet, you can do so from Dashboard > SSH Keys.

7. Optionally Attach Storage

You can attach a persistent storage volume to your instance. This keeps your data safe across instance terminations. See Storage for details.

8. Review and Deploy

Review your configuration and estimated hourly cost, then click Deploy.

Cost Estimation

Your hourly cost is determined by:

Hourly Cost = GPU cost x GPU count + CPU cost + Memory cost + Storage cost

The exact cost is displayed on the deploy page before you confirm.

Deployment Time

Instances typically take 1-3 minutes to go from creating to deployed. You will see the status update in real time on the Instances dashboard.

Enable auto-recharge in your Billing settings to ensure your balance never runs out and your instances are not interrupted.

What Happens Next

Once your instance status shows Deployed:

Copy the SSH command from the instance details.
Connect via terminal or VS Code.
Start building.

See Connecting to Instances for the full connection guide.

​Prerequisites

​Step-by-Step Deployment

​1. Open the Deploy Page

​2. Choose GPU Type and Count

​3. Configure CPU, Memory, and Storage

​4. Select a Region

​5. Choose a Template

​6. Attach an SSH Key

​7. Optionally Attach Storage

​8. Review and Deploy

​Cost Estimation

​Deployment Time

​What Happens Next

Prerequisites

Step-by-Step Deployment

1. Open the Deploy Page

2. Choose GPU Type and Count

3. Configure CPU, Memory, and Storage

4. Select a Region

5. Choose a Template

6. Attach an SSH Key

7. Optionally Attach Storage

8. Review and Deploy

Cost Estimation

Deployment Time

What Happens Next