Introduce set_default_sdpa #79

cbalioglu · 2023-10-02T20:50:53Z

This PR introduces set_default_sdpa function and sdpa context manager to switch between different attention implementations during runtime.

from fairseq2.nn.transformer import TorchSDPA, NaiveSDPA, set_default_sdpa, sdpa

set_default_sdpa(TorchSDPA)  # or None to use the library default

# Use naive SDPA for debugging (e.g. pdb)
with sdpa(NaiveSDPA)
    model = load_llama_model("llama_7b")

Introduce set_default_sdpa

1ec324b

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 2, 2023

cbalioglu merged commit 4f5f8b2 into main Oct 2, 2023

cbalioglu deleted the sdpa branch October 2, 2023 21:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce set_default_sdpa #79

Introduce set_default_sdpa #79

cbalioglu commented Oct 2, 2023

Introduce set_default_sdpa #79

Introduce set_default_sdpa #79

Conversation

cbalioglu commented Oct 2, 2023