Readme
🚀 Meet DeepSeek-R1 distilled on LLaMA 8B! Unlike other similar models on Replicate, this one has its weights cached, so you don’t have to waste time downloading them every time. ⏳💨
But wait, there’s more! 🎉 It’s also quantized, meaning you get way better efficiency with barely any performance loss. Smarter, faster, and optimized just for you! ⚡🔥