Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

williyam
/
agentic-rag-aerospace-grpo

Text Generation
PEFT
Safetensors
Transformers
English
grpo
lora
trl
rag
reinforcement-learning
aerospace
agentic
openenv
conversational
Eval Results (legacy)
Model card Files Files and versions
xet
Community
agentic-rag-aerospace-grpo
20.4 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 10 commits
williyam's picture
williyam
fix: update citation year from 2025 to 2026
b239b93 7 days ago
  • .gitattributes
    1.63 kB
    Upload training_curves.png with huggingface_hub 7 days ago
  • README.md
    7.08 kB
    fix: update citation year from 2025 to 2026 7 days ago
  • adapter_config.json
    1.06 kB
    Upload model 9 days ago
  • adapter_model.safetensors
    8.68 MB
    xet
    Upload model 9 days ago
  • baseline_vs_trained.png
    85.1 kB
    Upload baseline_vs_trained.png with huggingface_hub 7 days ago
  • chat_template.jinja
    2.51 kB
    Upload tokenizer 9 days ago
  • score_distribution.png
    36.9 kB
    Upload score_distribution.png with huggingface_hub 7 days ago
  • tokenizer.json
    11.4 MB
    xet
    Upload tokenizer 9 days ago
  • tokenizer_config.json
    720 Bytes
    Upload tokenizer 9 days ago
  • training_curves.png
    124 kB
    xet
    Upload training_curves.png with huggingface_hub 7 days ago