Lemma
Collection
A lemma is "something assumed" — an intermediate theorem on the path to a larger proof, or a heading that signals the subject of what follows. • 18 items • Updated
Stripped-down sibling of lthn/lemer for devices that can't load the full multimodal build (≥3 GB ceiling).
| Variant | Size | Towers |
|---|---|---|
| lthn/lemer | 4.06 GB | text + vision + audio |
| lthn/lemer-lite (you are here) | 2.47 GB | text only |
Same LEK-aligned Gemma 4 E2B base as lemer, with vision and audio towers stripped and the text path quantised flat 4-bit (4.501 bits/weight) instead of mixed-precision.
The Lethean Ethical Kernel (LEK) is fully present in the weights — the consent-based reasoning behaviour is identical to the full lemer.
This is a best-effort tier for users on smaller devices. The -lite prefix is a promise: we are packing this tight, results will vary, but you get to load and run the model.
from mlx_lm import load, generate
model, tokenizer = load("lthn/lemer-lite")
prompt = tokenizer.apply_chat_template(
[{"role": "user", "content": "Hello"}],
tokenize=False, add_generation_prompt=True,
)
print(generate(model, tokenizer, prompt=prompt, max_tokens=200))
EUPL-1.2.
4-bit