Skip to content

Sydney-Informatics-Hub/xskill-nim

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

xskill-nim

Notebook and data for NVIDIA-NIM cross skilling session

To run the notebook setup the environment in conda using:

conda env create -f env.yaml -p ./xskill-rag

then:

conda activate ./xskill-rag

then start a notebook server and load the notebook in the browser:

jupyter notebook

How to set up a NIM for llama-3.1-8b-instruct on MPC:

  1. Go to: https://org.ngc.nvidia.com/setup/personal-keys and set up an API key (also create an NVIDIA Account if it doesn't already exist).
  2. When creating an NGC API key, ensure that at least “NGC Catalog” & “Public API” are selected from the “Services Included” dropdown.
  3. At the command line (on MPC - 10.167.67.78): export NGC_API_KEY=<your-api-key>
to set the API key as a required environment variable.
  4. Log in to the NVIDIA NGC docker repo: echo "$NGC_API_KEY" | docker login nvcr.io --username '$oauthtoken' --password-stdin
  5. Install NGC CLI following instructions here: https://org.ngc.nvidia.com/setup/installers/cli
  6. Add NGC CLI to the path with: export PATH=$PATH:$(pwd)/ngc-cli
  7. Configure NGC CLI: ngc config set
  8. Check the list of available images: ngc registry image list --format_type ascii nvcr.io/nim/*
  9. Get further info for a particular image: ngc registry image info --format_type ascii nim/meta/llama-3.1-8b-instruct:latest
  10. Run the contents of setup_MPC on the MPC to run the docker image for the llama-3.1-8b-instruct:latest NIM.
  11. Check connection with:
curl -X 'POST' \
    "http://10.167.67.78:8000/v1/completions" \
    -H "accept: application/json" \
    -H "Content-Type: application/json" \
    -d '{"model": "meta/llama3-8b-instruct", "prompt": "Describe Retreival-Augmented Generation", "max_tokens": 64}'

About

Notebook and data for NVIDIA-NIM cross skilling session

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published