Qwen3 checkpoints with modified configurations for long-context fine-tuning in the OctoLong project.
OctoLong
community
models 5
OctoLong/Qwen3-14B-Base
Text Generation • 15B • 54
OctoLong/Qwen3-8B-Base
Text Generation • 8B • 92 • 1
OctoLong/Qwen3-4B-Base
Text Generation • 4B • 92
OctoLong/Qwen3-1.7B-Base
Text Generation • 2B • 107
OctoLong/Qwen3-0.6B-Base
Text Generation • 0.6B • 10
datasets 17
OctoLong/OctoLong-SFT-V3
Viewer • 3.65M • 45
OctoLong/OctoLong-SFT-V3-Swift
Viewer • 3.65M • 63
OctoLong/OctoLong-LCFT-V2
Viewer • 19.4M • 119
OctoLong/OctoLong-SFT-V2-Swift
Viewer • 3.58M • 46 • 1
OctoLong/OctoLong-SFT-V2
Viewer • 3.58M • 37
OctoLong/fwe
Viewer • 729k • 241
OctoLong/temp-collection-meta-complete-64
Viewer • 68.1k • 3
OctoLong/temp-collection-raw-complete-64
Viewer • 68.7k • 3
OctoLong/temp-collection-mix-64
Viewer • 22.8k • 3
OctoLong/temp-collection-meta-64
Viewer • 22.7k • 3