lucataco / qwen2-57b-a14b-instruct

Qwen2 57 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

  • Public
  • 1.1K runs
  • Paper
  • License

Create training

Trainings for this model run on 2x Nvidia A100 (80GB) GPU hardware, which costs $0.0028 per second. Upon creation, you will be redirected to the training detail page where you can monitor your training's progress, and eventually download the weights and run the trained model.

Note: versions of this model with fast booting use the hardware set by the base model they were trained from.

If you haven’t yet trained a model on Replicate, we recommend you read one of the following guides.