Readme
The difference between this model and that of lucataco
is that you can pass an entire conversation to the model. This makes it possible to have multi-turn conversation, and chat with videos interactively.
Fork / Remix of Apollo 7B by Luis C. (https://replicate.com/lucataco/apollo-7b) to support multi-turn conversations.
This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.
The difference between this model and that of lucataco
is that you can pass an entire conversation to the model. This makes it possible to have multi-turn conversation, and chat with videos interactively.