You're looking at a specific version of this model. Jump to the model overview.

villesau /whisper-timestamped:70abaf3a

Input

*file

Audio file to transcribe

string
Shift + Return to add a new line

Language code (e.g., 'en') or 'auto' for auto-detect

Default: "auto"

string

Task to perform

Default: "transcribe"

boolean

Use Voice Activity Detection

Default: false

boolean

Detect speech disfluencies

Default: false

boolean

Compute word confidence scores

Default: true

number

Temperature for sampling

Default: 0

integer

Number of candidates when sampling with non-zero temperature

integer

Number of beams in beam search, only applicable when temperature is zero

number

Optional patience value to use in beam decoding

number

Optional token length penalty coefficient (alpha) as in https://arxiv.org/abs/1609.08144

string
Shift + Return to add a new line

Comma-separated list of token ids to suppress during sampling

Default: "-1"

string
Shift + Return to add a new line

Optional text to provide as a prompt for the first window

boolean

Whether to condition on previous text

Default: true

number

Threshold for no speech probability

Default: 0.6

number

Threshold for compression ratio

Default: 2.4

number

Threshold for average log probability

Default: -1

boolean

Whether to display the text being decoded

Default: false

Output

No output yet! Press "Submit" to start a prediction.