Tensor parallel distributed strategy without using deepspeed (#280) #299

kalyanjk · 2024-07-15T16:20:14Z

TP reference - ibm foundation-model-stack
Code cleanup -removed unused code

…I#280) * TP reference - ibm foundation-model-stack * Code cleanup -removed unused code --------- Co-authored-by: Kalyan <[email protected]>

…abanaAI#280) (HabanaAI#299)" This reverts commit 32c86d3.

* Revert "Tensor parallel distributed strategy without using deepspeed (#280) (#299)" This reverts commit 32c86d3. * Tensor parallel distributed strategy without using deepspeed (huggingface#1121) Co-authored-by: Kalyan <[email protected]> --------- Co-authored-by: Kalyan <[email protected]>

astachowiczhabana · 2024-08-05T10:06:19Z

huggingface#1121

Tensor parallel distributed strategy without using deepspeed (HabanaA…

541ae26

…I#280) * TP reference - ibm foundation-model-stack * Code cleanup -removed unused code --------- Co-authored-by: Kalyan <[email protected]>

kalyanjk requested review from mandy-li, libinta and dvarshney-habana as code owners July 15, 2024 16:20

dvarshney-habana approved these changes Jul 15, 2024

View reviewed changes

dvarshney-habana merged commit 32c86d3 into HabanaAI:v1.17-synapse Jul 15, 2024

kalyanjk pushed a commit to kalyanjk/optimum-habana-fork that referenced this pull request Jul 31, 2024

Revert "Tensor parallel distributed strategy without using deepspeed (H…

42fdb44

…abanaAI#280) (HabanaAI#299)" This reverts commit 32c86d3.

This was referenced Jul 31, 2024

Tensor parallel distributed strategy without using deepspeed #320

Merged

Tensor parallel distributed strategy without using deepspeed #321

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tensor parallel distributed strategy without using deepspeed (#280) #299

Tensor parallel distributed strategy without using deepspeed (#280) #299

kalyanjk commented Jul 15, 2024

astachowiczhabana commented Aug 5, 2024

Tensor parallel distributed strategy without using deepspeed (#280) #299

Tensor parallel distributed strategy without using deepspeed (#280) #299

Conversation

kalyanjk commented Jul 15, 2024

astachowiczhabana commented Aug 5, 2024