Deployments
You can now create a deployment to get more control over how your models run. Deployments allow you to run a model with a private, fixed API endpoint. You can configure the version of the model, the hardware it runs on, and how it scales.
Using deployments, you can:
- Roll out new versions of your model without having to edit your code.
- Keep instances always on to avoid cold boots.
- Customize what hardware your models run on.
- Monitor whether instances are booting up, running, or processing predictions.
- View predictions that are flowing through your models.
Deployments work with both public models and your own private models.
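To illustrate why a fixed endpoint lets you roll out new versions without editing code, here is a minimal sketch of a client that addresses a deployment by name. The base URL, path layout, bearer-token header, and the `build_prediction_request` helper are all assumptions made for illustration and are not taken from this post; check the deployments guide for the real API.

```python
import json
import urllib.request

# Hypothetical API host, used only for illustration.
API_BASE = "https://api.example.com/v1"


def build_prediction_request(
    owner: str, name: str, model_input: dict, token: str
) -> urllib.request.Request:
    """Build (but do not send) a POST request to a deployment's endpoint."""
    # The URL names the deployment rather than a model version, so it stays
    # the same when the deployment is pointed at a new version.
    url = f"{API_BASE}/deployments/{owner}/{name}/predictions"
    body = json.dumps({"input": model_input}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )


# Hypothetical owner, deployment name, and input.
request = build_prediction_request(
    "acme", "my-deployment", {"prompt": "hello"}, token="YOUR_TOKEN"
)
print(request.full_url)
```

Because the calling code only references the deployment's owner and name, changing the model version, hardware, or scaling settings on the deployment side would not require touching this code.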
🚀 Check out the deployments guide to learn more and get started.