Readme
This model doesn't have a readme.
Fast endpoint for Flux Kontext, optimized with the Pruna framework.
This model costs approximately $0.0067 to run on Replicate, or 149 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
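The per-run and per-dollar figures above are two views of the same number. As a rough sketch of the arithmetic, assuming the quoted $0.0067 average holds for your inputs (it may not, since cost varies with inputs), the helpers below estimate runs per dollar and the cost of a batch. The function names `runs_per_dollar` and `estimated_cost` are illustrative and not part of any Replicate API.

```python
import math

# Approximate average cost per run quoted on this page (USD).
# Assumption: actual cost varies with inputs, so treat this as an estimate.
COST_PER_RUN_USD = 0.0067


def runs_per_dollar(cost_per_run: float = COST_PER_RUN_USD) -> int:
    """Return the number of whole runs that fit in one US dollar."""
    return math.floor(1 / cost_per_run)


def estimated_cost(runs: int, cost_per_run: float = COST_PER_RUN_USD) -> float:
    """Return the estimated total cost in USD for a given number of runs."""
    return runs * cost_per_run


if __name__ == "__main__":
    print(runs_per_dollar())               # whole runs per $1
    print(round(estimated_cost(1000), 2))  # estimated USD for 1,000 runs
```

At the quoted rate, a batch of 1,000 runs would come to roughly $6.70.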
This model runs on Nvidia H100 GPU hardware. Predictions typically complete within 5 seconds.