DeepSpeed is an AI agent in the LLM Training Frameworks category. DeepSpeed is a deep learning optimization library that makes dis…
Litgpt is an AI agent in the LLM Training Frameworks category. 20+ high-performance LLMs with recipes to pretrain, finetune and de…
Megatron-LM is an AI agent in the LLM Training Frameworks category. Ongoing research training transformer models at scale.
Meta Lingua is an AI agent in the LLM Training Frameworks category. a lean, efficient, and easy-to-hack codebase to research LLMs.
nanotron is an AI agent in the LLM Training Frameworks category. Minimalistic large language model 3D-parallelism training.
torchtitan is an AI agent in the LLM Training Frameworks category. A native PyTorch Library for large model training.