dpo/sft tuned language models on politune
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
updated a collection 5 days ago
MastermindEval updated a collection 5 days ago
MastermindEval updated a collection 6 days ago
PolituneOrganizations
models 24
whoisjones/politune-qwen3-8b-right-dpo
Text Generation • Updated • 14
whoisjones/politune-qwen3-8b-right-sft
Text Generation • Updated • 22
whoisjones/politune-qwen3-8b-left-dpo
Text Generation • Updated • 21
whoisjones/politune-qwen3-8b-left-sft
Text Generation • Updated • 19
whoisjones/politune-mistral-7b-right-dpo
Text Generation • Updated • 20
whoisjones/politune-mistral-7b-right-sft
Text Generation • Updated • 17
whoisjones/politune-mistral-7b-left-dpo
Text Generation • Updated • 23
whoisjones/politune-mistral-7b-left-sft
Text Generation • Updated • 13
whoisjones/politune-llama3-8b-right-dpo
Text Generation • Updated • 23
whoisjones/politune-llama3-8b-right-sft
Text Generation • Updated • 22
datasets 29
whoisjones/finerweb_document_context
Updated • 9
whoisjones/sudoku
Viewer • Updated • 1.42M • 22
whoisjones/maze
Viewer • Updated • 9k • 5
whoisjones/multinerd
Viewer • Updated • 1.67M • 22
whoisjones/masakhaner
Viewer • Updated • 153k • 10 • 1
whoisjones/uner
Viewer • Updated • 66.8k • 62
whoisjones/fiNERweb
Viewer • Updated • 3.98M • 153 • 9
whoisjones/fiNERweb-x
Updated • 58
whoisjones/fiNERweb-x-multi
Updated • 39
whoisjones/fiNERweb-gemma-x-multi
Updated • 17