nousresearch / hermes-2-theta-llama-8b

Hermes-2 Θ (Theta) is the first experimental merged model released by Nous Research, in collaboration with Charles Goddard at Arcee, the team behind MergeKit.

  • Public
  • 1.7K runs
ℹ️
Language model training is a beta feature.
We’re still working out the kinks. If you run into any issues, please hop in our Discord and let us know. Keep in mind that we might make breaking changes to the API as we improve the training experience.

If you haven’t yet trained a model on Replicate, we recommend you read one of the following guides.

Pricing

Trainings for this model run on Nvidia A100 (40GB) GPU hardware, which costs $0.00115 per second.

Create a training

Install the Python library:

pip install replicate

Then, run this to create a training with replicate-internal/hermes-2-theta-l3-8b-fp16-triton:c08e085a as the base model:

import replicate

training = replicate.trainings.create(
  version="replicate-internal/hermes-2-theta-l3-8b-fp16-triton:c08e085aa24433e159f167bda8de1a67432e66cdcc3a37fae1189367c3c4eb2a",
  input={
    ...
  },
  destination=f"{username}/<destination-model-name>"
)

print(training)
curl -s -X POST \
-d '{"destination": "{username}/<destination-model-name>", "input": {...}}' \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  https://api.replicate.com/v1/models/nousresearch/hermes-2-theta-llama-8b/versions/c08e085aa24433e159f167bda8de1a67432e66cdcc3a37fae1189367c3c4eb2a/trainings

The API response will look like this:

{
  "id": "zz4ibbonubfz7carwiefibzgga",
  "version": "c08e085aa24433e159f167bda8de1a67432e66cdcc3a37fae1189367c3c4eb2a",
  "status": "starting",
  "input": {
    "data": "..."
  },
  "output": null,
  "error": null,
  "logs": null,
  "started_at": null,
  "created_at": "2023-03-28T21:47:58.566434Z",
  "completed_at": null
}

Note that before you can create a training, you’ll need to create a model and use its name as the value for the destination field.