kosbu/Llama-3.3-70B-Instruct-AWQ

text generationtransformersenfrtransformerssafetensorsllamatext-generationfacebookmetallama3.3
341.3K

Llama-3.3-70B-Instruct AWQ 4-Bit Quantized Version

This repository provides the AWQ 4-bit quantized version of meta-llama/Llama-3.3-70B-Instruct, originally developed by Meta AI.

DEPLOY IN 60 SECONDS

Run Llama-3.3-70B-Instruct-AWQ on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.