Add additional asserts and update post training readme #1300

AI-WAIFU · 2024-10-07T13:23:06Z

-Add an assert to block dpo/kto/rm training when running pipeline parallel, as get_batch_pipe does not have the relevant logic yet
-Add assert when kto + train_micro_batch_size_per_gpu == 1 since kto needs a reference
-remove python post-training/llama_dpo_data.py as this file does not exist and is not nessesary to get the code running

* add asserts and fix post training readme * precommit --------- Co-authored-by: Quentin Anthony <[email protected]>

add asserts and fix post training readme

1d33d06

AI-WAIFU requested a review from Quentin-Anthony as a code owner October 7, 2024 13:23

precommit

e1b2b7c

Quentin-Anthony approved these changes Oct 8, 2024

View reviewed changes

Quentin-Anthony merged commit c8f7b56 into main Oct 8, 2024
1 of 4 checks passed

Quentin-Anthony deleted the add-kto-limitation branch October 8, 2024 19:26

jahatef pushed a commit that referenced this pull request Oct 31, 2024

Add additional asserts and update post training readme (#1300)

540d856

* add asserts and fix post training readme * precommit --------- Co-authored-by: Quentin Anthony <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add additional asserts and update post training readme #1300

Add additional asserts and update post training readme #1300

AI-WAIFU commented Oct 7, 2024

Add additional asserts and update post training readme #1300

Add additional asserts and update post training readme #1300

Conversation

AI-WAIFU commented Oct 7, 2024