benchmarks

Fast reference benchmarks for training ML models with recipes. Designed to be easily forked and modified.

ResNet-50 + ImageNet

Figure 1: Comparison of MosaicML recipes against other results, all measured on 8x A100s on MosaicML Cloud.

Train the MosaicML ResNet, the fastest ResNet-50 implementation, which yields a ✨ 7x ✨ faster time-to-train than a strong baseline. See our blog for more details and recipes. Our recipes were also demonstrated at MLPerf, a cross-industry ML benchmark.

🚀 Get started with the code here.

Large Language Models (LLMs)

A simple yet feature-complete implementation of GPT that scales to 70B parameters while maintaining high performance on GPU clusters. The code is flexible, written in vanilla PyTorch, and uses PyTorch FSDP along with some recent efficiency improvements.

🚀 Get started with the code here.
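
To give a rough feel for why sharding with FSDP matters at these scales, here is a back-of-the-envelope sketch. It is not code from this repo. The function names are hypothetical, the parameter count uses the common approximation of 12 × d_model² weights per transformer block (biases and LayerNorm ignored), and the 16 bytes per parameter figure assumes mixed-precision Adam (fp16 weights and gradients plus fp32 master weights, momentum, and variance). Activations are not counted.

```python
def gpt_param_count(n_layers: int, d_model: int, vocab_size: int, max_seq_len: int) -> int:
    """Approximate parameter count of a GPT-style decoder (biases and LayerNorm ignored)."""
    attention = 4 * d_model * d_model  # Q, K, V and output projections
    mlp = 8 * d_model * d_model        # two linear layers with a 4x hidden expansion
    embeddings = (vocab_size + max_seq_len) * d_model  # token + learned position embeddings
    return n_layers * (attention + mlp) + embeddings


def sharded_state_gib(n_params: int, n_gpus: int, bytes_per_param: int = 16) -> float:
    """Per-GPU GiB for weights, gradients and optimizer state when fully sharded.

    bytes_per_param=16 is an assumption (mixed-precision Adam), not a measured value.
    """
    return n_params * bytes_per_param / n_gpus / 2**30


if __name__ == "__main__":
    # Hypothetical model shape, chosen only for illustration.
    n = gpt_param_count(n_layers=80, d_model=8192, vocab_size=50257, max_seq_len=2048)
    for gpus in (8, 64, 256):
        print(f"{n / 1e9:.1f}B params on {gpus} GPUs: "
              f"{sharded_state_gib(n, gpus):.1f} GiB of model state per GPU")
```

With a GPT-2-small-like shape (12 layers, d_model 768, vocabulary 50257, sequence length 1024) the estimate comes to about 124M parameters, close to the commonly quoted size, which suggests the approximation is reasonable for sizing purposes.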

