
shashwatgokhe
/
deepcoder-14b
DeepCoder-14B-Preview is a code reasoning LLM fine-tuned from DeepSeek-R1-Distilled-Qwen-14B using distributed reinforcement learning (RL) to scale up to long context lengths.
- Public
- 14 runs
Want to make some of these yourself?
Run this model