shashwatgokhe / deepcoder-14b

DeepCoder-14B-Preview is a code reasoning LLM fine-tuned from DeepSeek-R1-Distilled-Qwen-14B using distributed reinforcement learning (RL) to scale up to long context lengths.

  • Public
  • 14 runs
  1. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f

    6c28f1b7

    Latest
  2. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f
  3. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f
  4. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f
  5. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f
  6. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f
  7. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f
  8. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f
  9. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f
  10. Version
    cuda12.1-python3.11-torch2.3.1-X64
    Commit
    40d6181291e623b644dd3db41eb8b4186ca2f07f