Hugging Face
4 1632

Shaobai Jiang

shaobaij
AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
Pioneer Agent: Continual Improvement of Small Language Models in Production
upvoted a paper 1 day ago
Toward Autonomous Long-Horizon Engineering for ML Research
upvoted a paper 1 day ago
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Organizations

None yet

New activity in openai/gpt-oss-120b 8 months ago

Fine tune 120b at 8 H100s getting cuda OOM error

👀 1
6
#117 opened 8 months ago by jinxu88

FlashInfer requires sm75+

7
#48 opened 9 months ago by hrithiksagar-bgen
New activity in mistralai/Mistral-7B-v0.1 over 2 years ago

If I trained a model on mistral already, do I need to start from scratch due to difficulties of fine-tuning?

2
#62 opened over 2 years ago by brando

Cant run the model with the most basic code

๐Ÿ‘ 6
6
#7 opened over 2 years ago by masterchop