Add LLaMAv2-70B text generation #441

dvarshney-habana · 2023-10-03T11:52:33Z

Update README with the performance-optimal command for LLaMAv2-70B text generation using DeepSpeed

What does this PR do?

Add LLaMAv2-70B text generation commands to README

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2023-10-03T11:59:11Z

The documentation is not available anymore as the PR was closed or merged.

regisss

I left a couple of comments

regisss · 2023-10-03T11:54:53Z

examples/text-generation/README.md

@@ -95,6 +95,11 @@ python ../gaudi_spawn.py --use_deepspeed --world_size 8 run_generation.py \
 --max_new_tokens 100
 ```

+Text generation with LLaMAv2-70B model(using deepspeed) can be done using the following command:
+```bash
+python ../gaudi_spawn.py --use_deepspeed --world_size 8 run_generation.py --model_name_or_path <MODEL_PATH>/Llama-2-70b-hf/ --max_new_tokens 4096  --bf16  --use_hpu_graphs --use_kv_cache --batch_size 56  --attn_softmax_bf16 --limit_hpu_graphs  --n_iterations 5 --reuse_cache --trim_logits


Suggested change

-python ../gaudi_spawn.py --use_deepspeed --world_size 8 run_generation.py --model_name_or_path <MODEL_PATH>/Llama-2-70b-hf/ --max_new_tokens 4096 --bf16 --use_hpu_graphs --use_kv_cache --batch_size 56 --attn_softmax_bf16 --limit_hpu_graphs --n_iterations 5 --reuse_cache --trim_logits
+python ../gaudi_spawn.py --use_deepspeed --world_size 8 run_generation.py \
+--model_name_or_path meta-llama/Llama-2-70b-hf \
+--max_new_tokens 4096 \
+--bf16 \
+--use_hpu_graphs \
+--use_kv_cache \
+--batch_size 56 \
+--attn_softmax_bf16 \
+--limit_hpu_graphs \
+--reuse_cache \
+--trim_logits

regisss

LGTM!

Add LLaMAv2-70B text generation

be8828f

dvarshney-habana requested a review from regisss as a code owner October 3, 2023 11:52

regisss reviewed Oct 3, 2023

address review comments

56df635

regisss added the run-test label (Run CI for PRs from external contributors) Oct 3, 2023

regisss reviewed Oct 3, 2023

removed new line at end of command

ce4a91d

regisss added and removed the run-test label (Run CI for PRs from external contributors) Oct 3, 2023

regisss approved these changes Oct 3, 2023

regisss merged commit 4ba07ce into huggingface:main Oct 3, 2023
11 of 12 checks passed
