nbeerbower
/

Dumpling-Mistral-Nemo-8B

Text Generation

text-generation-inference

Model card Files Files and versions

🧪 Experimental

An attempt to recover intelligence with a quick train, results are meh

Dumpling-Mistral-Nemo-8B

nbeerbower/mistral-nemo-kartoffel-PRUNE3 finetuned on:

Method

QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.

Downloads last month: 84

Safetensors

Model size

8B params

Tensor type

BF16

·

Model tree for nbeerbower/Dumpling-Mistral-Nemo-8B

Base model

nbeerbower/Mahou-1.5-mistral-nemo-12B-lorablated

Finetuned

nbeerbower/mistral-nemo-kartoffel-12B

Finetuned

nbeerbower/mistral-nemo-kartoffel-PRUNE3

Finetuned

(1)

this model

Quantizations

Datasets used to train nbeerbower/Dumpling-Mistral-Nemo-8B