Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
williyam
/
agentic-rag-aerospace-grpo
like
2
Text Generation
PEFT
Safetensors
Transformers
custom-aerospace-rag-tasks
English
grpo
lora
trl
rag
reinforcement-learning
aerospace
agentic
openenv
conversational
Eval Results (legacy)
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
main
agentic-rag-aerospace-grpo
20.4 MB
Ctrl+K
Ctrl+K
1 contributor
History:
10 commits
williyam
fix: update citation year from 2025 to 2026
b239b93
7 days ago
.gitattributes
Safe
1.63 kB
Upload training_curves.png with huggingface_hub
7 days ago
README.md
Safe
7.08 kB
fix: update citation year from 2025 to 2026
7 days ago
adapter_config.json
Safe
1.06 kB
Upload model
9 days ago
adapter_model.safetensors
Safe
8.68 MB
xet
Upload model
9 days ago
baseline_vs_trained.png
Safe
85.1 kB
Upload baseline_vs_trained.png with huggingface_hub
7 days ago
chat_template.jinja
Safe
2.51 kB
Upload tokenizer
9 days ago
score_distribution.png
Safe
36.9 kB
Upload score_distribution.png with huggingface_hub
7 days ago
tokenizer.json
Safe
11.4 MB
xet
Upload tokenizer
9 days ago
tokenizer_config.json
Safe
720 Bytes
Upload tokenizer
9 days ago
training_curves.png
Safe
124 kB
xet
Upload training_curves.png with huggingface_hub
7 days ago