Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nbeerbower
/
Dumpling-Mistral-Nemo-8B

Text Generation
Transformers
Safetensors
mistral
conversational
text-generation-inference
Model card Files Files and versions
xet
Community
  • Dumpling-Mistral-Nemo-8B
    • Method

🧪 Experimental

An attempt to recover intelligence with a quick train, results are meh

Dumpling-Mistral-Nemo-8B

nbeerbower/mistral-nemo-kartoffel-PRUNE3 finetuned on:

  • nbeerbower/GreatFirewall-DPO
  • nbeerbower/Schule-DPO
  • nbeerbower/Purpura-DPO
  • nbeerbower/Arkhaios-DPO
  • jondurbin/truthy-dpo-v0.1
  • antiven0m/physical-reasoning-dpo
  • flammenai/Date-DPO-NoAsterisks
  • flammenai/Prude-Phi3-DPO
  • Atsunori/HelpSteer2-DPO (1,000 samples)
  • jondurbin/gutenberg-dpo-v0.1
  • nbeerbower/gutenberg2-dpo
  • nbeerbower/gutenberg-moderne-dpo.

Method

QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.

Downloads last month
84
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
Text Generation
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for nbeerbower/Dumpling-Mistral-Nemo-8B

Base model

nbeerbower/Mahou-1.5-mistral-nemo-12B-lorablated
Finetuned
nbeerbower/mistral-nemo-kartoffel-12B
Finetuned
nbeerbower/mistral-nemo-kartoffel-PRUNE3
Finetuned
(1)
this model
Quantizations
2 models

Datasets used to train nbeerbower/Dumpling-Mistral-Nemo-8B

jondurbin/gutenberg-dpo-v0.1

Viewer • Updated Jan 12, 2024 • 918 • 358 • 162

jondurbin/truthy-dpo-v0.1

Viewer • Updated Jan 11, 2024 • 1.02k • 271 • 136

nbeerbower/GreatFirewall-DPO

Viewer • Updated Mar 2, 2025 • 492 • 70 • 10
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs