schuttdev commited on
Commit
087a105
·
verified ·
1 Parent(s): b0412da

Add README

Browse files
Files changed (1) hide show
  1. README.md +51 -0
README.md ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
+ base_model:
6
+ - kai-os/Carnice-9b
7
+ tags:
8
+ - qwen3.5
9
+ - deltanet
10
+ - hipfire
11
+ - rdna
12
+ - amd
13
+ - hermes
14
+ - tool-use
15
+ pipeline_tag: text-generation
16
+ ---
17
+
18
+ # Carnice-9B — hipfire quantized
19
+
20
+ [kai-os/Carnice-9b](https://huggingface.co/kai-os/Carnice-9b) quantized for [hipfire](https://github.com/Kaden-Schutt/hipfire), a Rust-native inference engine for AMD RDNA GPUs.
21
+
22
+ Carnice is a Hermes tool-use finetune of Qwen3.5-9B with browser, terminal, and reasoning capabilities.
23
+
24
+ ## Files
25
+
26
+ | File | Quant | Size | Speed (5700 XT) | Notes |
27
+ |------|-------|------|-----------------|-------|
28
+ | `carnice-9b.hf4` | HF4 (4-bit) | 4.5 GB | ~41 tok/s | Faster, fits 8GB VRAM |
29
+ | `carnice-9b.hf6` | HF6 (6-bit) | 6.9 GB | ~32 tok/s | Better quality, needs 8GB |
30
+
31
+ ## Usage
32
+
33
+ ```bash
34
+ # Install hipfire
35
+ curl -L https://raw.githubusercontent.com/Kaden-Schutt/hipfire/master/scripts/install.sh | bash
36
+
37
+ # Pull and run
38
+ hipfire pull carnice:9b # HF4 (default)
39
+ hipfire pull carnice:9b-hf6 # HF6 (higher quality)
40
+ hipfire run carnice:9b
41
+ ```
42
+
43
+ ## About
44
+
45
+ Carnice uses the Hermes chat template with `<tool_call>` XML tags for function calling. hipfire passes these through as regular tokens — the model generates tool calls naturally within the ChatML format.
46
+
47
+ Same DeltaNet hybrid architecture as base Qwen3.5. JIT-compiles for any AMD GPU (RDNA 1-4, APUs).
48
+
49
+ ## License
50
+
51
+ Apache 2.0