Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Complete ESM2 pretraining #112

Merged
merged 93 commits into from
Aug 29, 2024
Merged

Complete ESM2 pretraining #112

merged 93 commits into from
Aug 29, 2024

Conversation

sichu2023
Copy link
Collaborator

@sichu2023 sichu2023 commented Aug 20, 2024

Based on @farhadrgh 's PR on ESM2 pretraining and loss test.

Difference from the original feature branch

  • Fix bug in pretraining teardown
  • Migrate dataset shuffling from megatron random sampler to PRNGDatasetShuffler
  • Enable pytest on pretraining
  • Enable global batch size calculation
  • Enable model hparam through argparse
  • Support limit_val_batches as a fraction of validation dataset size and add unittest
  • Add megatron_parallel_state_utils to main unittest
  • Add test to ensure samples from validation dataset are consistent with potential shuffling
  • Remove eval_iters from num_val_samples
  • (reverted) Remove train and validation datasets's dependency on total_samples
  • Extract limit_val_samples inference
  • Sync all updates to geneformer

@sichu2023 sichu2023 requested review from pstjohn and farhadrgh August 20, 2024 20:32
@sichu2023
Copy link
Collaborator Author

/build-ci

6 similar comments
@sichu2023
Copy link
Collaborator Author

/build-ci

@sichu2023
Copy link
Collaborator Author

/build-ci

@sichu2023
Copy link
Collaborator Author

/build-ci

@ohadmo
Copy link
Member

ohadmo commented Aug 21, 2024

/build-ci

@ohadmo
Copy link
Member

ohadmo commented Aug 21, 2024

/build-ci

@sichu2023
Copy link
Collaborator Author

/build-ci

@sichu2023 sichu2023 changed the base branch from farhadr/pretrain to v2-main August 22, 2024 03:15
@sichu2023 sichu2023 marked this pull request as draft August 22, 2024 03:25
@sichu2023 sichu2023 marked this pull request as ready for review August 22, 2024 04:26
@sichu2023
Copy link
Collaborator Author

/build-ci

1 similar comment
@ohadmo
Copy link
Member

ohadmo commented Aug 22, 2024

/build-ci

@sichu2023
Copy link
Collaborator Author

/build-ci

@sichu2023 sichu2023 requested a review from jstjohn August 23, 2024 03:40
Copy link
Collaborator

@pstjohn pstjohn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

will cycle back and finish this review later -- but wanted to post some of these comments in the meantime

@sichu2023
Copy link
Collaborator Author

/build-ci

@sichu2023
Copy link
Collaborator Author

I will update README.md once this PR is settled.

@sichu2023
Copy link
Collaborator Author

/build-ci

@sichu2023 sichu2023 merged commit 76ec468 into v2-main Aug 29, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants