microsoft/bitnet-b1.58-2B-4T
Text Generation
•
Updated
•
15k
•
1.3k
Generate a curated web‑text dataset for LLM training
The ultimate guide to training LLM on large GPU Clusters
Calculate and visualize model memory usage from config