Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views

Latest models

cog-resnet example trial

Updated 8 runs

MusicGen fine-tuned on cover songs by Toad from the 'Super Mario' series. Text token: "by toad"

Updated 44 runs

Make a transcription of a phone call

Updated 10 runs

MusicGen fine-tuned on 8bit Super Mario Bros (1985)

Updated 153 runs

MusicGen trained on NewJeans with vocals

Updated 220 runs

Trained on plants

Updated 28 runs

My own personal copy of daanelson/whisperx

Updated 311 runs

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Updated 789.8K runs

Updated 1.6K runs

Mistral-7B-v0.1 fine-tuned for chat with the OpenOrca dataset.

Updated 65.8K runs

Using a ComfyUI workflow to run SDXL text2img

Updated 437 runs

Kim Jung Gi style drawing

Updated 453 runs

Updated 21 runs

Zero-shot / open vocabulary object detection

Updated 18.4K runs

A high-performing language model trained to act as a helpful assistant

Updated 7K runs

Updated 128 runs

Controlling Vision-Language Models for Universal Image Restoration

Updated 2.1K runs

MusicGen trained on NewJeans

Updated 89 runs

✨DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Updated 128.5K runs

Updated 104 runs

Object removal, video completion and video outpainting

Updated 1.2K runs

Updated 110 runs

Updated 442 runs

Updated 22 runs

Updated 285 runs

Updated 669 runs

Instruction tuned text-to-image diffusion models as vision generalists

Updated 356 runs

Flat Eric is a puppet character

Updated 105 runs

📽️ Increase Framerate 🎬 ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Updated 46.1K runs

Qwen-14B-Chat is a Transformer-based large language model pretrained on a large volume of data, including web texts, books, code, etc.

Updated 4.8K runs

Embedding model that has been trained using Jina AI's Linnaeus-Clean dataset.

Updated 33 runs

Updated 206 runs

An SDXL finetune of photos I took while riding a train in Finland (Helsinki-Vaasa)

Updated 420 runs

sdxl-isometric-geology is an SDXL fine-tune that's been trained with cool USGS isometric block and fence diagrams from the 1950s and 1960s.

Updated 569 runs

Stylized Audio-Driven Single Image Talking Face Animation

Updated 17.6K runs

Updated 50 runs

Updated 23 runs

Updated 40 runs

Updated 1.7K runs

Text-to-gif using SDXL, with controlnet and lora support

Updated 3.6K runs

SDXL fine-tuned on Genshin Impact landscape by imageapp.xyz

Updated 867 runs

Updated 58 runs

Hotshot XL using SDXL for generating high-quality one-second clips! Running on an A40. Made by the greats at hotshot.co and brought to you by your friends at FullJourney! Thanks to LucaTaco for the MVP!

Updated 4.3K runs

🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Updated 47.9K runs

Updated 92 runs

Updated 250 runs

Updated 174 runs

Image restoration and face enhancement

Updated 16.7K runs

A ControlNet model designed to enhance the temporal consistency of generated outputs

Updated 125 runs

Updated 558 runs