You're looking at a specific version of this model. Jump to the model overview.

afiaka87 /sd-aesthetic-guidance:1af76e80

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
prompt
string
Input prompt
aesthetic_rating
integer
8

Min: 1

Max: 9

An aesthetic 'rating' from 1 - 9 (1 is the lowest, 9 is the highest). 9 is mostly art and will tend to do a painting style more often. 6 - 8 tend to work well.
aesthetic_weight
number
0.1

Max: 1

How much of the aesthetic CLIP embed should be subtracted from the unconditional latent. 0.0 will disable aesthetic guidance. Going above 0.2 tends to produce artifacts and highly abstract/simplified visuals.
width
integer (enum)
512

Options:

128, 256, 512, 768, 1024

Width of output image. Maximum size is 1024x768 or 768x1024 because of memory limits
height
integer (enum)
512

Options:

128, 256, 512, 768, 1024

Height of output image. Maximum size is 1024x768 or 768x1024 because of memory limits
init_image
string
Inital image to generate variations of. Will be resized to the specified width and height
mask
string
Black and white image to use as mask for inpainting over init_image. Black pixels are inpainted and white pixels are preserved. Experimental feature, tends to work better with prompt strength of 0.5-0.7
prompt_strength
number
0.8
Prompt strength when using init image. 1.0 corresponds to full destruction of information in init image
num_outputs
integer (enum)
1

Options:

1, 4

Number of images to output
num_inference_steps
integer
50

Min: 1

Max: 500

Number of denoising steps
guidance_scale
number
7.5

Min: 1

Max: 20

Scale for classifier-free guidance
seed
integer
Random seed. Leave blank to randomize the seed

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'items': {'format': 'uri', 'type': 'string'},
 'title': 'Output',
 'type': 'array'}