fix: bundle CUDA DLL into the release #62

louisgv · 2023-07-02T23:43:24Z

No description provided.

vercel · 2023-07-02T23:43:27Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
local-ai-web	⬜️ Ignored (Inspect)	Visit Preview		Sep 16, 2023 11:49pm

louisgv · 2023-07-03T10:24:36Z

https://github.com/bytecodealliance/wasmtime/blob/c19c729214e2237902eb177609643cb6523b7f2b/cranelift/native/src/lib.rs#L184

LLukas22 · 2023-07-18T14:07:20Z

Feeding.slowdown.mp4

@louisgv The callback to the UI of the fed tokens seems to slow down the feeding process significantly. (It should be instant.)

louisgv · 2023-07-20T07:44:45Z

> The callback to the UI of the fed tokens seems to slow down the feeding process significantly

Yes haha - there's a 42ms artificial lag that I introduced to make the UI a bit more smooth:

ref: https://github.com/louisgv/local.ai/blob/main/apps/desktop/src/providers/thread.ts#L160-L161

For non-accelerated machines and models, this is needed to have something showing :d

LLukas22 · 2023-07-20T09:32:35Z

Currently this copies the CUDA DLLs next to the local.ai executable if `cargo tauri dev` or `cargo tauri build` is run with the `--features cublas` flag. @louisgv Is this enough to include the DLLs in the bundle?
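For readers following along, a minimal sketch of what such a copy step could look like in a build script. This is not the code from this PR: the DLL name prefixes, the function names, and the use of `CUDA_PATH` to locate the toolkit are assumptions made for illustration. The only Cargo behavior relied on is that build scripts see `CARGO_FEATURE_<NAME>` when a feature is enabled.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Hypothetical prefix list; the actual set of DLLs copied by the PR is not shown in this thread.
const CUDA_DLL_PREFIXES: [&str; 3] = ["cublas64_", "cublasLt64_", "cudart64_"];

/// Copies every `*.dll` in `src_dir` whose file name starts with one of
/// `prefixes` into `dest_dir` and returns the destination paths.
/// Returns an error if nothing matched, so a misconfigured build fails loudly.
fn copy_matching_dlls(
    src_dir: &Path,
    dest_dir: &Path,
    prefixes: &[&str],
) -> io::Result<Vec<PathBuf>> {
    fs::create_dir_all(dest_dir)?;
    let mut copied = Vec::new();
    for entry in fs::read_dir(src_dir)? {
        let path = entry?.path();
        let name = match path.file_name().and_then(|n| n.to_str()) {
            Some(n) => n.to_owned(),
            None => continue, // skip names that are not valid UTF-8
        };
        let is_dll = name.to_ascii_lowercase().ends_with(".dll");
        if is_dll && prefixes.iter().any(|p| name.starts_with(*p)) {
            let target = dest_dir.join(&name);
            fs::copy(&path, &target)?;
            copied.push(target);
        }
    }
    if copied.is_empty() {
        return Err(io::Error::new(
            io::ErrorKind::NotFound,
            "no matching DLLs found",
        ));
    }
    copied.sort();
    Ok(copied)
}

/// Build-script helper sketch: does nothing unless the `cublas` feature is on.
fn copy_cuda_dlls_if_enabled(dest_dir: &Path) -> io::Result<Vec<PathBuf>> {
    // Cargo sets CARGO_FEATURE_<NAME> for build scripts when a feature is enabled.
    if std::env::var_os("CARGO_FEATURE_CUBLAS").is_none() {
        return Ok(Vec::new());
    }
    // Assumption: CUDA_PATH points at the CUDA toolkit install directory.
    let cuda_path = std::env::var_os("CUDA_PATH")
        .ok_or_else(|| io::Error::new(io::ErrorKind::NotFound, "CUDA_PATH is not set"))?;
    copy_matching_dlls(&Path::new(&cuda_path).join("bin"), dest_dir, &CUDA_DLL_PREFIXES)
}
```

Copying files next to the executable only helps `cargo tauri dev` and a locally run binary. Whether the installer picks them up depends on the bundler configuration, which is the open question in the comment above.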

LLukas22 · 2023-07-26T13:44:35Z

@louisgv What's the plan going forward on this? Can you take over and handle the auto-update stuff?

louisgv · 2023-07-27T00:27:07Z

@LLukas22 yup, I'm on it now!

louisgv · 2023-09-16T23:07:24Z

Kinda want to wait for the Metal fix to land. My main fear with this PR is that the build seems flaky :d.... (OOM?...)

Perhaps we should remove some of the flakiness by building for either CUDA or CL only?...

louisgv · 2023-09-16T23:18:32Z

Per the docs, there are still no Metal chips on the GitHub runner VMs yet: https://docs.github.com/en/actions/using-github-hosted-runners/about-github-hosted-runners#supported-runners-and-hardware-resources

So we will still need a metal self-hosted runner I think :d

louisgv · 2023-09-16T23:47:35Z

The last piece missing from this PR is a pipeline to upload the content of each zip artifact into a release.

1. Extract the zip artifacts of the 3 jobs (mac, linux, windows).
2. Have a 4th job called release that's in charge of making a release and the update.json.
3. Take the path and the signature to form a Tauri update package: https://tauri.app/v1/guides/distribution/updater/#static-json-file
4. Create the draft release.
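As a rough illustration of the manifest step, here is a sketch that assembles an update.json in the shape described by the Tauri v1 static JSON guide linked above (`version`, `notes`, `pub_date`, and a `platforms` map of `signature` and `url`). The struct and function names are hypothetical and nothing here is taken from the PR's pipeline. It hand-rolls the JSON to stay dependency-free; a real job would more likely use a JSON library.

```rust
/// One entry under "platforms" in the updater manifest (hypothetical helper type).
struct PlatformAsset {
    target: String,    // e.g. "windows-x86_64"
    url: String,       // download URL of the update bundle
    signature: String, // contents of the generated .sig file
}

/// Escapes a string for embedding inside a JSON string literal.
fn json_escape(s: &str) -> String {
    let mut out = String::with_capacity(s.len());
    for c in s.chars() {
        match c {
            '"' => out.push_str("\\\""),
            '\\' => out.push_str("\\\\"),
            '\n' => out.push_str("\\n"),
            '\r' => out.push_str("\\r"),
            '\t' => out.push_str("\\t"),
            c if (c as u32) < 0x20 => out.push_str(&format!("\\u{:04x}", c as u32)),
            c => out.push(c),
        }
    }
    out
}

/// Builds the manifest text from the release metadata and per-platform assets.
fn build_update_manifest(
    version: &str,
    notes: &str,
    pub_date: &str,
    assets: &[PlatformAsset],
) -> String {
    let platforms: Vec<String> = assets
        .iter()
        .map(|a| {
            format!(
                "    \"{}\": {{ \"signature\": \"{}\", \"url\": \"{}\" }}",
                json_escape(&a.target),
                json_escape(&a.signature),
                json_escape(&a.url)
            )
        })
        .collect();
    format!(
        "{{\n  \"version\": \"{}\",\n  \"notes\": \"{}\",\n  \"pub_date\": \"{}\",\n  \"platforms\": {{\n{}\n  }}\n}}\n",
        json_escape(version),
        json_escape(notes),
        json_escape(pub_date),
        platforms.join(",\n")
    )
}
```

The release job would call `build_update_manifest` once per release, passing one `PlatformAsset` per extracted artifact, and upload the result alongside the bundles.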

fix: bundle CUDA DLL into the release

52a48da

louisgv linked an issue Jul 2, 2023 that may be closed by this pull request

BUG | Can't run 0.5.1 on Windows, asks for additional dlls #61

Open

louisgv mentioned this pull request Jul 3, 2023

BUG | Can't run 0.5.1 on Windows, asks for additional dlls #61

Open

louisgv added 2 commits July 3, 2023 05:52

Merge branch 'main' into 61-bug-cuda-dlls

43330d4

Merge branch 'main' into 61-bug-cuda-dlls

4d94a47

vercel bot deployed to Preview July 14, 2023 11:24 View deployment

Merge branch 'main' into 61-bug-cuda-dlls

953af83

vercel bot deployed to Preview July 18, 2023 08:36 View deployment

Update rustformers + check gpu

4b71716

louisgv mentioned this pull request Jul 18, 2023

Crash when trying to load Model #84

Closed

Set n_batch correctly

4b8fe59

Copy cuda libraries

187b135

LLukas22 added 2 commits July 21, 2023 10:35

reduce feeding delay if gpu is enabled

9343897

Copy opencl dlls

a2a3dbf

LLukas22 approved these changes Jul 21, 2023


LLukas22 added 10 commits July 21, 2023 14:18

create linux ci

a8b3bbf

defaults for release infos

21ae9e1

Fail if files aren't found

286574d

Add windows build

86cc051

Macos build

47f9dfc

ci bugfixes

7c1f25a

More bugfixes and absolute paths

36e050b

Paths .... again

0b26205

Make mac artifacts unique

cc786f0

renable build for windows-cublas

89eb1fa

update character

0761d79

louisgv mentioned this pull request Jul 31, 2023

feat: Nonstreaming API #85

Open

louisgv added 4 commits August 1, 2023 14:06

Slight refactor

7481edf

update character

9d23cfd

update llm

5b51725

Merge branch 'main' into 61-bug-cuda-dlls

006cd5a

fix build script

9b8d16d

louisgv added 2 commits September 16, 2023 19:31

use self-hosted runner for metal

bc5edf6

remove build on push (consume too much compute atm)

18f04ed

louisgv added the help wanted Extra attention is needed label Sep 16, 2023

Add todo

1211cc2

louisgv self-assigned this Sep 17, 2023
