Readme
If you want 0 quality loss, you can set the speed_mode
to “Base Model (compiled)”.
If you want to use Pruna with your model, visit: https://docs.pruna.ai/en/stable/index.html
This is a 3x faster FLUX.1 [dev] model from Black Forest Labs, optimised with pruna with minimal quality loss.
This model costs approximately $0.0033 to run on Replicate, or 303 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
This model runs on Nvidia H100 GPU hardware. Predictions typically complete within 3 seconds.
If you want 0 quality loss, you can set the speed_mode
to “Base Model (compiled)”.
If you want to use Pruna with your model, visit: https://docs.pruna.ai/en/stable/index.html