prunaai/flux.1-dev-lora | Run with an API on Replicate

This is a version of the FLUX.1 [dev] model from Black Forest Labs, optimised with Pruna to run 3x faster with minimal quality loss.

Public

27.7K runs

Run time and cost

This model costs approximately $0.0033 to run on Replicate, or 303 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
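The arithmetic behind the per-dollar figure can be sketched as follows. This is a rough estimate only: it assumes the approximate per-run price quoted above holds for every run, which the page says is not guaranteed, and the function names are illustrative.

```python
# Approximate per-run price quoted on this page; actual cost varies with inputs.
COST_PER_RUN_USD = 0.0033


def runs_per_dollar(cost_per_run: float = COST_PER_RUN_USD) -> int:
    """Whole number of runs that one dollar buys at the given price."""
    return int(1 / cost_per_run)


def estimated_cost(num_runs: int, cost_per_run: float = COST_PER_RUN_USD) -> float:
    """Rough total cost in USD for a batch of runs at a flat per-run price."""
    return num_runs * cost_per_run
```

For example, `estimated_cost(1000)` comes to roughly $3.30 under this flat-price assumption.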

This model runs on Nvidia H100 GPU hardware. Predictions typically complete within 3 seconds.

Readme

For zero quality loss, set speed_mode to “Base Model (compiled)”.
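A minimal sketch of selecting this mode through the Replicate Python client is shown below. It assumes the `replicate` package is installed and `REPLICATE_API_TOKEN` is set in the environment. The `prompt` input name and the helper functions are assumptions for illustration; only `speed_mode` and its “Base Model (compiled)” value come from this page. Depending on the client version, a version hash may need to be appended to the model name.

```python
MODEL = "prunaai/flux.1-dev-lora"  # model name as shown on this page
LOSSLESS_SPEED_MODE = "Base Model (compiled)"  # value quoted in the README


def build_input(prompt: str, lossless: bool = False) -> dict:
    """Assemble the input payload.

    "prompt" is an assumed input name; check the model's API tab for the
    actual schema. speed_mode is only set when lossless output is requested,
    so the model's default is used otherwise.
    """
    payload = {"prompt": prompt}
    if lossless:
        payload["speed_mode"] = LOSSLESS_SPEED_MODE
    return payload


def generate(prompt: str, lossless: bool = False):
    """Run the model on Replicate and return whatever the client returns."""
    # Imported here so build_input can be used without the client installed.
    import replicate  # pip install replicate

    return replicate.run(MODEL, input=build_input(prompt, lossless))
```

Calling `generate("a lighthouse at dusk", lossless=True)` would then request the compiled base model, trading some speed for unchanged output quality.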

If you want to use Pruna with your model, visit: https://docs.pruna.ai/en/stable/index.html