zai-org/GLM-5-FP8

text generationtransformersenzhtransformerssafetensorsglm_moe_dsatext-generationconversationalenmit
vLLMRunnable with vLLM
2.3M
DEPLOY IN 60 SECONDS

Run GLM-5-FP8 on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.