Self-adaptive large language models (LLMs) aim to address the limitations of traditional fine-tuning, which is often computationally expensive and static in its ability to handle diverse tasks.
We are excited to introduce Transformer², a novel self-adaptation framework that adapts LLMs for unseen tasks in real time by selectively adjusting only the singular components of their weight matrices. During inference, Transformer² employs a two-pass mechanism: first, a dispatch system identifies the task properties, and then task-specific "expert" vectors, trained using reinforcement learning, are dynamically mixed to obtain targeted behavior for the incoming prompt.
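To make the two-pass mechanism concrete, here is a minimal PyTorch sketch of the core idea, not the repository's actual implementation: an expert vector rescales the singular values of a weight matrix (the "singular components" above), and at inference time the dispatch weights mix several expert vectors into one. The names `svf_adapt` and `mix_experts` and the toy dispatch weights are illustrative assumptions.

```python
import torch

def svf_adapt(W: torch.Tensor, z: torch.Tensor) -> torch.Tensor:
    """Rescale the singular values of W by an expert vector z.

    W: (m, n) weight matrix; z: (min(m, n),) learned scaling vector.
    """
    U, S, Vh = torch.linalg.svd(W, full_matrices=False)
    return U @ torch.diag(S * z) @ Vh

def mix_experts(W: torch.Tensor, experts: list[torch.Tensor],
                alphas: torch.Tensor) -> torch.Tensor:
    """Second pass: blend expert vectors with dispatch weights `alphas`
    (e.g., produced by a first-pass task classifier), then adapt W."""
    z = sum(a * z_k for a, z_k in zip(alphas, experts))
    return svf_adapt(W, z)

# Toy usage: two experts, with the dispatch favoring the first.
W = torch.randn(8, 4)
experts = [torch.ones(4), torch.full((4,), 0.5)]
alphas = torch.tensor([0.7, 0.3])
W_adapted = mix_experts(W, experts, alphas)
```

In the paper, the expert vectors are trained with reinforcement learning on individual tasks, and the mixing weights come from the first-pass task identification; the toy values above only illustrate the mixing arithmetic.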
Clone the repository and set up the environment:

```bash
git clone https://github.com/SakanaAI/self-adaptive-llms
cd self-adaptive-llms

# Create and activate a conda environment
conda create -n t2 python=3.11 -y
conda activate t2

# Install Python dependencies
pip install --upgrade pip
pip install -r requirements.txt

# Install the fishfarm evaluation package in editable mode
cd evaluation/fishfarm
pip install -e .
```
We provide example scripts for both training and evaluation. Please change the arguments in the provided scripts to choose among models and tasks.

To train a task-specific expert vector, run:

```bash
bash scripts/train_task_expert.sh
```
Classification experts can be loaded by specifying `CLS_EXPERT_PATH` in the evaluation script; a hedged example follows the commands below.
For prompt-based evaluation:

```bash
bash scripts/eval_prompt_based.sh
```

For few-shot evaluation:

```bash
bash scripts/eval_few_shot.sh
```
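As an illustration only, since the exact variable handling lives inside the scripts themselves, pointing the evaluation at a trained classification expert might look like the following; the checkpoint path is a placeholder:

```bash
# Illustrative snippet (assumed to be edited inside, e.g.,
# scripts/eval_prompt_based.sh): point CLS_EXPERT_PATH at your
# trained classification expert checkpoint.
CLS_EXPERT_PATH=/path/to/cls_expert_checkpoint
```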
If you find Transformer² useful for your research, please cite it using the following BibTeX:
```bibtex
@misc{sun2025texttransformer2selfadaptivellms,
      title={$\text{Transformer}^2$: Self-adaptive LLMs},
      author={Qi Sun and Edoardo Cetin and Yujin Tang},
      year={2025},
      eprint={2501.06252},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2501.06252},
}
```