whisper-large-v3, incredibly fast, with video transcription
Run flair/pos-english on a JSON list of sentence strings
DOVER video quality assessment tool, assigning videos both aesthetic and technical quality scores
MT3: Multi-Task Multitrack Music Transcription
Prepare arXiv papers for processing by Large Language Models (LLMs) by converting them into a single, expanded LaTeX file.
This model is not yet booted but ready for API calls. Your first API call will boot the model and may take longer, but after that subsequent responses will be fast.
This model runs on L40S.