lucataco / llama-2-7b-chat

Meta's Llama 2 7b Chat - GPTQ

  • Public
  • 20.3K runs
  • L40S
  • GitHub
  • Paper
  • License
Iterate in playground

Input

string
Shift + Return to add a new line

Prompt to send to Llama v2

Default: "Tell me about AI"

string
Shift + Return to add a new line

System prompt that helps guide system behavior

Default: "You are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature. If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information."

integer
(minimum: 1, maximum: 4096)

Number of new tokens

Default: 512

number
(minimum: 0, maximum: 5)

Randomness of outputs, 0 is deterministic, greater than 1 is random

Default: 0.75

number
(minimum: 0.01, maximum: 1)

When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens

Default: 0.95

number
(minimum: 0, maximum: 5)

Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, less than 1 encourage it

Default: 1.1

Output

Of course! I'd be happy to help you learn more about AI! πŸ€– Artificial intelligence (AI) refers to the development of computer systems that can perform tasks that typically require human intelligence, such as visual perception, speech recognition, decision-making, and language translation. AI technologies are able to process vast amounts of data quickly and accurately, making them incredibly useful for a wide range of applications. πŸ’» There are several types of AI, including: 1. Narrow or weak AI: This type of AI is designed to perform a specific task, such as facial recognition, language translation, or playing a game like chess. Narrow AI is the most common form of AI and is used in many applications, including virtual assistants, self-driving cars, and medical diagnosis. 2. General or strong AI: This type of AI is designed to perform any intellectual task that a human can, such as reasoning, problem-solving, and learning. General AI has not yet been developed, but researchers are working towards creating it. πŸ”₯ 3. Superintelligence: This type
Generated in

Run time and cost

This model costs approximately $0.0054 to run on Replicate, or 185 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 6 seconds.

Readme

This an attempt at an implementation of the model: TheBloke/Llama-2-7b-Chat-GPTQ

A quantized version of Llama 2 7b model

Give me a follow if you like my work! @lucataco93