Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
OctoLong 's Collections
Instruct Checkpoints
Merged Base Checkpoints
Extended Base Checkpoints
Original Base Checkpoints

Original Base Checkpoints

updated 13 days ago

Qwen3 checkpoints with modified configurations for long context fine-tuning in the OctoLong project

Upvote
-

  • OctoLong/Qwen3-0.6B-Base

    Text Generation • 0.6B • Updated 13 days ago • 26

  • OctoLong/Qwen3-1.7B-Base

    Text Generation • 2B • Updated 13 days ago • 54

  • OctoLong/Qwen3-4B-Base

    Text Generation • 4B • Updated 13 days ago • 56

  • OctoLong/Qwen3-8B-Base

    Text Generation • 8B • Updated 13 days ago • 74 • 1

  • OctoLong/Qwen3-14B-Base

    Text Generation • 15B • Updated 13 days ago • 108
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs