Text Generation
MLX
Safetensors
llama
thinking
reasoning
instruct
Claude4.5-Opus
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
bfloat16
role play
128k context
llama3.3
llama-3
llama-3.3
unsloth
finetune
conversational
4-bit precision
genai-archive/DavidAU__Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning-mlx-mxfp4
This model genai-archive/DavidAU__Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning-mlx-mxfp4 was converted to MLX format from DavidAU/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning using mlx-lm version 0.31.1.
Use with mlx
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("genai-archive/DavidAU__Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning-mlx-mxfp4")
prompt = "hello"
if tokenizer.chat_template is not None:
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
messages, add_generation_prompt=True, return_dict=False,
)
response = generate(model, tokenizer, prompt=prompt, verbose=True)
- Downloads last month
- 454
Model size
2B params
Tensor type
U8
·
U32 ·
BF16 ·
Hardware compatibility
Log In to add your hardware
4-bit
Model tree for genai-archive/DavidAU__Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning-mlx-mxfp4
Base model
allura-forge/Llama-3.3-8B-Instruct