Kita258 Lymeman commited on
Commit
2369973
·
0 Parent(s):

Duplicate from Triangle104/Meta-Llama-3.1-8B-Instruct-abliterated-Q4_K_M-GGUF

Browse files
.gitattributes ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ meta-llama-3.1-8b-instruct-abliterated-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: llama3.1
4
+ base_model: huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated
5
+ tags:
6
+ - abliterated
7
+ - uncensored
8
+ - llama-cpp
9
+ - gguf-my-repo
10
+ ---
11
+
12
+ # Triangle104/Meta-Llama-3.1-8B-Instruct-abliterated-Q4_K_M-GGUF
13
+ This model was converted to GGUF format from [`huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated`](https://huggingface.co/huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
14
+ Refer to the [original model card](https://huggingface.co/huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated) for more details on the model.
15
+
16
+ ---
17
+ Model details:
18
+ -
19
+ This is an uncensored version of Llama 3.1 8B Instruct created with abliteration (see this article to know more about it).
20
+
21
+ Special thanks to @FailSpy for the original code and technique. Please follow him if you're interested in abliterated models.
22
+
23
+ ---
24
+ ## Use with llama.cpp
25
+ Install llama.cpp through brew (works on Mac and Linux)
26
+
27
+ ```bash
28
+ brew install llama.cpp
29
+
30
+ ```
31
+ Invoke the llama.cpp server or the CLI.
32
+
33
+ ### CLI:
34
+ ```bash
35
+ llama-cli --hf-repo Triangle104/Meta-Llama-3.1-8B-Instruct-abliterated-Q4_K_M-GGUF --hf-file meta-llama-3.1-8b-instruct-abliterated-q4_k_m.gguf -p "The meaning to life and the universe is"
36
+ ```
37
+
38
+ ### Server:
39
+ ```bash
40
+ llama-server --hf-repo Triangle104/Meta-Llama-3.1-8B-Instruct-abliterated-Q4_K_M-GGUF --hf-file meta-llama-3.1-8b-instruct-abliterated-q4_k_m.gguf -c 2048
41
+ ```
42
+
43
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
44
+
45
+ Step 1: Clone llama.cpp from GitHub.
46
+ ```
47
+ git clone https://github.com/ggerganov/llama.cpp
48
+ ```
49
+
50
+ Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
51
+ ```
52
+ cd llama.cpp && LLAMA_CURL=1 make
53
+ ```
54
+
55
+ Step 3: Run inference through the main binary.
56
+ ```
57
+ ./llama-cli --hf-repo Triangle104/Meta-Llama-3.1-8B-Instruct-abliterated-Q4_K_M-GGUF --hf-file meta-llama-3.1-8b-instruct-abliterated-q4_k_m.gguf -p "The meaning to life and the universe is"
58
+ ```
59
+ or
60
+ ```
61
+ ./llama-server --hf-repo Triangle104/Meta-Llama-3.1-8B-Instruct-abliterated-Q4_K_M-GGUF --hf-file meta-llama-3.1-8b-instruct-abliterated-q4_k_m.gguf -c 2048
62
+ ```
meta-llama-3.1-8b-instruct-abliterated-q4_k_m.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5c728e9ca8d59842e602ded1446a4c116412ed27bdf503c5a42ccff704c51907
3
+ size 4920739104