Skip to content

MR-GREEN1337/Mistral-7b-PyTorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mistral-7b-PyTorch

  • Implementation of Mistral 7b using PyTorch

#Model architecture

# Model params

  • Look at the model params

Sliding window attention - Rolling buffer cache - Prefill and chunking - MoE -

Big thanks for Mr. Umar Jamil for providing the necessay help to a thorough implementation

https://www.youtube.com/watch?v=UiX8K-xBUpE

About

Implementation of Mistral 7b using PyTorch

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published