Threw together a really slapdash, continuously built, semi-pre-installed/compiled Docker image for running 4-bit GPTQ Llama on a provider like vast.ai, runpod.io, etc. #302
nelsonjchen started this conversation in Ideas
https://github.com/nelsonjchen/docker-quick-llama
I genuinely probably do not have enough time to keep up with this amount of hot action. That said, I was able to run Llama on vast.ai pretty well with it. But with all the activity around llama.cpp, dalai, and so on, I'm not sure if what I'm doing is relevant beyond my weekend escapades. Anyway, if the ball goes back to GPUs, this is here, I guess.
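For anyone curious, using it on a rented GPU box looks roughly like this (a sketch only; the exact image name, tag, and entrypoint here are assumptions, so check the repo's README for the real ones):

```sh
# Pull the prebuilt image from GitHub Packages (GHCR) and run it with GPU access.
# Image name/tag are assumptions; the host needs the NVIDIA Container Toolkit.
docker pull ghcr.io/nelsonjchen/docker-quick-llama:latest
docker run --gpus all -it ghcr.io/nelsonjchen/docker-quick-llama:latest
```

Since the GPTQ dependencies are already compiled into the image, the rental box mostly just pulls gigabytes instead of spending its billed minutes building CUDA extensions.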
The real value may be in the GitHub Actions workflow, its pre-setup of the GPTQ stuff, and its use of GitHub Packages to distribute the multi-gigabyte environment.
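In essence, the workflow automates something like the following on every push (a hedged sketch; the actual workflow is in the repo, and the image name, tag, and login details below are assumptions):

```sh
# Build the image with the GPTQ toolchain precompiled, then publish it to GHCR,
# so the multi-gigabyte environment is built once in CI rather than on each GPU rental.
docker build -t ghcr.io/nelsonjchen/docker-quick-llama:latest .
echo "$GITHUB_TOKEN" | docker login ghcr.io -u nelsonjchen --password-stdin
docker push ghcr.io/nelsonjchen/docker-quick-llama:latest
```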