Why did you choose layer 18 as the edit layer? #37

Open
Robin-WZQ opened this issue Nov 5, 2023 · 0 comments


Dear authors,

I really appreciate your work but have a question. Hopefully you can help me.

In ROME (E.5), you wrote: "We perform the intervention at layer 18. As Figure 1k shows, this is the center of causal effect in MLP layers, and as Figure 3 shows, layer 18 is approximately when MLP outputs begin to switch from acting as keys to values."

However, in MEMIT, you wrote: "at layers where the gap is largest, the role of the MLP computation is important. We select the layers where the gap is largest as the range R to use for the intervention done by MEMIT."
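
To make my reading of the MEMIT criterion concrete, here is a minimal sketch. It assumes that "gap" means the per-layer difference between the causal effect with the MLP intact and with the MLP severed, and that R is a contiguous window of layers. The function name, the window width, and the numbers are all hypothetical and are not taken from either paper or from this repository.

```python
from typing import Sequence, Tuple


def select_layer_range(
    effect_intact: Sequence[float],
    effect_severed: Sequence[float],
    width: int,
) -> Tuple[int, int]:
    """Return (first, last) layer indices of the contiguous window of
    `width` layers whose summed gap (intact minus severed) is largest.

    This is only one possible reading of "the layers where the gap is
    largest"; it is an assumption, not the authors' procedure.
    """
    if len(effect_intact) != len(effect_severed):
        raise ValueError("both effect curves must cover the same layers")
    if not 1 <= width <= len(effect_intact):
        raise ValueError("width must be between 1 and the number of layers")

    # Per-layer gap between the two causal-effect curves.
    gaps = [a - b for a, b in zip(effect_intact, effect_severed)]

    # Slide a window over the layers and keep the one with the largest total gap.
    best_start = max(
        range(len(gaps) - width + 1),
        key=lambda start: sum(gaps[start:start + width]),
    )
    return best_start, best_start + width - 1


# Made-up curves for a 6-layer toy model (illustrative values only).
intact = [0.1, 0.3, 0.6, 0.7, 0.5, 0.2]
severed = [0.1, 0.1, 0.2, 0.3, 0.4, 0.2]
print(select_layer_range(intact, severed, width=2))
```

Under this reading, a layer would be selected only if it falls inside the window with the largest gap, which is what prompts my question.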

Layer 18 clearly does not have the largest gap, so why did you choose it as the key layer?

Is there something I am missing?

Thanks!
