Bias Tracing

Trace the effects of bias in the internal states of a language model.

Tracing

Run the tracing script with bash scripts/gpt2m.sh.

Results are saved in ./results.

Histograms

>>> python fig.py -h
    usage: fig.py [-h] [--root ROOT] [--num_layer NUM_LAYER] [--model_name MODEL_NAME] [--bias {gender,race}] [--num_sample NUM_SAMPLE]

    optional arguments:
      -h, --help            show this help message and exit
      --root ROOT           the path of results
      --num_layer NUM_LAYER
                            The num of model layers.
      --model_name MODEL_NAME
                            The model name.
      --bias {gender,race}  The bias type.
      --num_sample NUM_SAMPLE
                            The num of samples
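
To make the options above concrete, here is a minimal sketch of an argparse parser that accepts the same flags. It is reconstructed from the help text only and is not the actual contents of fig.py: the int types for --num_layer and --num_sample, the absence of defaults, and the helper name build_parser are assumptions. The example values (gpt2-medium, 24 layers, 100 samples) are illustrative and are not taken from the repository.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the flags listed in the help text above.

    Assumption: argument types and the lack of defaults are guesses;
    the real fig.py may define them differently.
    """
    parser = argparse.ArgumentParser(prog="fig.py")
    parser.add_argument("--root", help="the path of results")
    parser.add_argument("--num_layer", type=int, help="The num of model layers.")
    parser.add_argument("--model_name", help="The model name.")
    parser.add_argument("--bias", choices=["gender", "race"], help="The bias type.")
    parser.add_argument("--num_sample", type=int, help="The num of samples")
    return parser


# Hypothetical invocation: every value here is illustrative only.
example_args = build_parser().parse_args(
    [
        "--root", "./results",
        "--num_layer", "24",
        "--model_name", "gpt2-medium",
        "--bias", "gender",
        "--num_sample", "100",
    ]
)
print(example_args)
```

Under these assumptions, the equivalent command line would be python fig.py --root ./results --num_layer 24 --model_name gpt2-medium --bias gender --num_sample 100. The --bias flag only accepts gender or race; any other value makes argparse exit with a usage error.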

Thanks to the authors of ROME for the original code.
