GSAI-ML/LLaDA-8B-Instruct

Name: GSAI-ML/LLaDA-8B-Instruct
Rating: 5 (343 reviews)
Author: GSAI-ML

Tags: text generation, transformers, safetensors, llada, text-generation, conversational, custom_code, mit



LLaDA-8B-Instruct

We introduce LLaDA, a diffusion model trained entirely from scratch at an unprecedented 8B-parameter scale, rivaling LLaMA3 8B in performance.

[2025-10-21] We have modified modeling_llada.py so that the model accepts an attention_mask input.
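To illustrate what the attention_mask input makes possible, here is a minimal sketch of batching prompts of different lengths. The helper names left_pad_batch and batched_logits are hypothetical and not part of this repository. The choice of left padding, the presence of a pad token in the tokenizer, and the attention_mask keyword on the forward call are assumptions that should be checked against modeling_llada.py. The sketch also skips the chat template that an instruct model would normally expect.

```python
from typing import List, Tuple


def left_pad_batch(
    sequences: List[List[int]], pad_id: int
) -> Tuple[List[List[int]], List[List[int]]]:
    """Left-pad token-id lists to equal length and build the matching mask.

    Mask entries are 1 for real tokens and 0 for padding.
    """
    if not sequences:
        return [], []
    width = max(len(seq) for seq in sequences)
    input_ids, attention_mask = [], []
    for seq in sequences:
        n_pad = width - len(seq)
        input_ids.append([pad_id] * n_pad + list(seq))
        attention_mask.append([0] * n_pad + [1] * len(seq))
    return input_ids, attention_mask


def batched_logits(prompts: List[str], model_id: str = "GSAI-ML/LLaDA-8B-Instruct"):
    """Hypothetical helper: one forward pass over a padded batch of prompts."""
    # Imported lazily so left_pad_batch stays usable without these packages.
    import torch
    from transformers import AutoModel, AutoTokenizer

    # trust_remote_code is required because the repo ships custom modeling code.
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModel.from_pretrained(
        model_id, trust_remote_code=True, torch_dtype=torch.bfloat16
    ).eval()

    # Assumption: the tokenizer defines a pad token; set one if it does not.
    pad_id = tokenizer.pad_token_id
    encoded = [tokenizer(p)["input_ids"] for p in prompts]
    ids, mask = left_pad_batch(encoded, pad_id)

    input_ids = torch.tensor(ids, device=model.device)
    attention_mask = torch.tensor(mask, device=model.device)
    with torch.no_grad():
        # Assumption: forward accepts attention_mask as a keyword argument.
        return model(input_ids, attention_mask=attention_mask).logits
```

Only left_pad_batch runs without downloading anything, for example left_pad_batch([[5, 6, 7], [8]], pad_id=0). Calling batched_logits downloads the full 8B checkpoint and needs enough memory to hold it in bfloat16.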
