Resources for large scale models
Resources related to data parallelism, model parallelism, pipeline parallelism, and their variants
Mixture of Experts
FastMoE: paper https://arxiv.org/abs/2103.13262, code https://github.com/laekov/fastmoe