Disto uses the state-of-the-art Llama model from Meta as its underlying LLM. Follow the instructions below to deploy this LLM on AWS Sagemaker.

  1. Go to “Amazon Sagemaker” in the AWS console. Click on “Studio” in the left sidebar.
  1. Choose your user profile and click “Open Studio”.
  1. Click on “Jumpstart”.
  1. Search for your preferred Llama model in the search bar and click on it. We recommend “Llama-2-70b-chat”, as it is most accurate, but you can choose a smaller model if you wish.
  1. Click on “deploy”. Wait for the model to deploy.
  1. Copy your Endpoint name. This will be the value for sagemaker-endpoint-name in the deploy script.
  1. If you wish to delete the endpoint, click on “delete”.