01-ai / yi-6b-chat

The Yi series models are large language models trained from scratch by developers at 01.AI.

  • Public
  • 4K runs
  • GitHub
  • License

Input

Output

Run time and cost

This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 2 seconds.

Readme

See the full model card here. The model served here are the original, un-quantized weights.

NOTE: As per the license, replicate was granted permission to share the model here.