This is a practical course on aligning language models for your specific use case. It's a handy way to get started with aligning language models, because everything runs on most local machines. There are minimal GPU requirements and no paid services. The course is based on the SmolLM2 series of models, but you can transfer the skills you learn here to larger models or other small language models.
This course is open and peer reviewed. To get involved with the course open a pull request and submit your work for review. Here are the steps:
- Fork the repo here
- Read the material, make changes, do the exercises, add your own examples.
- Open a PR
- Get it reviewed and merged
This should help you learn and to build a community-driven course that is always improving.
We can discuss the process in this discussion thread.
This course provides a practical, hands-on approach to working with small language models, from initial training through to production deployment.
Module | Description | Status | Release Date |
---|---|---|---|
Instruction Tuning | Learn supervised fine-tuning, chat templating, and basic instruction following | ✅ Complete | Dec 3, 2024 |
Preference Alignment | Explore DPO and ORPO techniques for aligning models with human preferences | 🚧 In Progress | Dec 6, 2024 |
Parameter-efficient Fine-tuning | Learn LoRA, prompt tuning, and efficient adaptation methods | 🚧 In Progress | Dec 9, 2024 |
Evaluation | Use automatic benchmarks and create custom domain evaluations | 🚧 In Progress | Dec 16, 2024 |
Vision-language Models | Adapt multimodal models for vision-language tasks | 📝 Planned | Dec 20, 2024 |
Synthetic Datasets | Create and validate synthetic datasets for training | 📝 Planned | Dec 23, 2024 |
Inference | Infer with models efficiently | 📝 Planned | Dec 27, 2024 |
Deployment | Deploy and serve models at scale | 📝 Planned | Dec 30, 2024 |
While large language models have shown impressive capabilities, they often require significant computational resources and can be overkill for focused applications. Small language models offer several advantages for domain-specific applications:
- Efficiency: Require significantly less computational resources to train and deploy
- Customization: Easier to fine-tune and adapt to specific domains
- Control: Better understanding and control of model behavior
- Cost: Lower operational costs for training and inference
- Privacy: Can be run locally without sending data to external APIs
Before starting, ensure you have the following:
- Basic understanding of machine learning and natural language processing.
- Familiarity with Python, PyTorch, and the
transformers
library. - Access to a pre-trained language model and a labeled dataset.
All the examples run in the same environment, so you only need to install the dependencies once like this:
pip install -r requirements.txt