Fix (llm): fix device issue for eval when not using default device #949

fabianandresgrob · 2024-04-30T13:08:31Z

This PR fixes a problem when setting a different device than the default. E.g. when cuda:0 is free, the call to cuda() would place the data on that device. However, when we specified the model to be on cuda:1, we run into the problem of having the data on a different device than the model. Simply moving the data to model.device solves this. For the creation of a validation_dataloader, I've added an argument to solve this issue.

Giuseppe5 · 2024-05-14T11:49:50Z

Tests are failing. Is this because of the PR?

Giuseppe5 · 2024-05-27T09:16:36Z

Is this still relevant/needed?

fabianandresgrob · 2024-05-27T09:19:00Z

Is this still relevant/needed?

yes, still needed. We applied it manually on the separate branch

src/brevitas_examples/llm/main.py

fabianandresgrob requested a review from Giuseppe5 April 30, 2024 13:08

fabianandresgrob force-pushed the fix/device_llm_quant branch from fb191f7 to 89aeaad Compare May 27, 2024 09:40

Giuseppe5 requested review from Giuseppe5 and removed request for Giuseppe5 May 27, 2024 11:16

Giuseppe5 reviewed May 28, 2024

View reviewed changes

src/brevitas_examples/llm/main.py Show resolved Hide resolved

fabianandresgrob added 2 commits May 31, 2024 10:46

Fix (llm): fix device issue for eval when not using default device

da84c88

Fix (llm): change device to parameter device

3c4a674

fabianandresgrob force-pushed the fix/device_llm_quant branch from 89aeaad to 3c4a674 Compare May 31, 2024 10:17

Giuseppe5 self-requested a review May 31, 2024 11:44

Giuseppe5 merged commit 8c71e08 into Xilinx:dev May 31, 2024
334 of 337 checks passed

fabianandresgrob deleted the fix/device_llm_quant branch May 31, 2024 13:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix (llm): fix device issue for eval when not using default device #949

Fix (llm): fix device issue for eval when not using default device #949

fabianandresgrob commented Apr 30, 2024

Giuseppe5 commented May 14, 2024

Giuseppe5 commented May 27, 2024

fabianandresgrob commented May 27, 2024

Fix (llm): fix device issue for eval when not using default device #949

Fix (llm): fix device issue for eval when not using default device #949

Conversation

fabianandresgrob commented Apr 30, 2024

Giuseppe5 commented May 14, 2024

Giuseppe5 commented May 27, 2024

fabianandresgrob commented May 27, 2024