jasonlim131

Follow

Jaehyuk Lim jasonlim131

Follow

aspiring prompt engineer: never forgets to say "please" at the end of the prompt.

5 followers · 22 following

philadelphia

Achievements

Achievements

Pinned Loading

ARENA_Replication ARENA_Replication Public

Forked from callummcdougall/ARENA_2.0

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

HTML
maze_rl_peek maze_rl_peek Public

Jupyter Notebook
sparse_AE sparse_AE Public

Forked from ai-safety-foundation/sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability for Practice

Python
Contrastive-SAE-Cluster-Steering Contrastive-SAE-Cluster-Steering Public

Jupyter Notebook
TransformerLensOrg/TransformerLens TransformerLensOrg/TransformerLens Public

A library for mechanistic interpretability of GPT-style language models

Python 1.6k 305
KanishkT123/Clustered_SAE_Steering KanishkT123/Clustered_SAE_Steering Public

Clustered SAE Steering Code and Experiments

Python 1