fineweb-edu-llama-large-from-fwe-seed-0-1508128

This repository contains the latest checkpoint from the local training run fineweb_edu_llama_large_from_fwe_1508128.

Contents

  • model_60975.pth: latest checkpoint selected from the run directory
  • metrics.json: training and validation loss history for the run
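
The sketch below illustrates one way a "latest" checkpoint could be picked from a run directory. It assumes checkpoints follow the model_<step>.pth naming pattern suggested by model_60975.pth and that the numeric suffix is the training step. The helper names checkpoint_step and latest_checkpoint are hypothetical and are not taken from the training code.

```python
import re
from pathlib import Path
from typing import Optional

# Assumed naming pattern: model_<step>.pth (inferred from model_60975.pth).
CHECKPOINT_PATTERN = re.compile(r"^model_(\d+)\.pth$")


def checkpoint_step(filename: str) -> Optional[int]:
    """Return the numeric suffix of a checkpoint filename, or None if it does not match."""
    match = CHECKPOINT_PATTERN.match(filename)
    return int(match.group(1)) if match else None


def latest_checkpoint(run_dir: Path) -> Optional[Path]:
    """Return the checkpoint file with the highest numeric suffix in run_dir."""
    candidates = []
    for path in run_dir.iterdir():
        step = checkpoint_step(path.name)
        if step is not None:
            candidates.append((step, path))
    if not candidates:
        return None
    # Compare on the parsed integer so that model_9999.pth sorts before model_60975.pth.
    return max(candidates, key=lambda item: item[0])[1]
```

Comparing the parsed integers rather than the raw filenames avoids lexicographic ordering mistakes when step counts have different numbers of digits.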

Run metadata

  • Seed: 0
  • Local source directory: fineweb_edu_llama_large_from_fwe_1508128
  • Weights & Biases run name: fineweb_edu_llama_large_from_fwe
  • Weights & Biases run id: ck3yffxt
  • Final logged train loss at step 60500: 2.5242346972227097
  • Final logged validation loss at step 60500: 2.4655129891984604

Notes

  • The included checkpoint file is model_60975.pth.
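  • The latest metrics entry in metrics.json is at step 60500, so the final losses listed under Run metadata were logged before this checkpoint was saved, assuming the filename suffix 60975 is the training step.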
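
The gap between the checkpoint and the last metrics entry can be checked with a short script. This is a sketch under a loudly stated assumption: the layout of metrics.json is not documented here, so the code assumes a JSON list of objects that each carry an integer "step" key. The function names are hypothetical, and load_records would need adjusting if the real file is structured differently.

```python
import json
from pathlib import Path


def load_records(path):
    """Load metrics records from a JSON file (ASSUMED layout: a list of objects)."""
    with Path(path).open() as handle:
        return json.load(handle)


def last_logged_step(records):
    """Return the highest step found in the records (ASSUMED key name: "step")."""
    return max(int(record["step"]) for record in records)


def steps_after_last_log(checkpoint_step, records):
    """Return how many steps the checkpoint lies beyond the last metrics entry."""
    return checkpoint_step - last_logged_step(records)
```

With the step numbers stated above, steps_after_last_log(60975, [{"step": 60500}]) returns 475.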