Readme
If you want 0 quality loss, you can set the speed_mode
to “Base Model (compiled)”.
If you want to use Pruna with your model, visit: https://docs.pruna.ai/en/stable/index.html
This is a 3x faster FLUX.1 [dev] model from Black Forest Labs, optimised with pruna with minimal quality loss.
This model costs approximately $1.25 to run on Replicate, or 0 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
This model runs on Nvidia H100 GPU hardware. Predictions typically complete within 14 minutes.
If you want 0 quality loss, you can set the speed_mode
to “Base Model (compiled)”.
If you want to use Pruna with your model, visit: https://docs.pruna.ai/en/stable/index.html