glavin001 / exllama-airoboros-7b-gpt4-1.4-gptq

Test out fast inference with ExLlama and 4bit quantization!

  • Public
  • 1.7K runs

Want to make some of these yourself?

Run this model