---
language:
- en
license: apache-2.0
pipeline_tag: text-generation
tags:
- llama
- causal-lm
- pretrained
- chytrej
- base
library_name: transformers
---

# Chytrej1.5-90M-Base

A fully custom pretrained language model built from scratch on the LLaMA architecture.

Chytrej (Czech slang for "clever/smart") is a long-term model series from PingVortex Labs. Every model in the series is fully custom pretrained from scratch; instruction-tuned variants may later be fine-tuned on top of each custom base. The ongoing goal: every release must at least know the capital of France.

Built by [PingVortex Labs](https://github.com/PingVortexLabs).

---

## Model Details

+ **Parameters:** 90M
+ **Context length:** 8,192 tokens
+ **Language:** English only
+ **Format:** base model
+ **Architecture:** LLaMA
+ **License:** Apache 2.0

---

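The card does not publish the exact architecture hyperparameters, but a LLaMA config in this ballpark lands near 90M parameters. The dimensions below are illustrative guesses, not the model's real config:

```python
from transformers import LlamaConfig, LlamaForCausalLM

# Hypothetical dimensions chosen to land near ~90M parameters;
# the real Chytrej1.5 config may differ.
config = LlamaConfig(
    vocab_size=32000,
    hidden_size=640,
    intermediate_size=1792,
    num_hidden_layers=10,
    num_attention_heads=10,
    max_position_embeddings=8192,
)
model = LlamaForCausalLM(config)  # randomly initialized, no download

n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params / 1e6:.1f}M parameters")
```

With untied input/output embeddings (the `LlamaConfig` default), the embedding and LM-head matrices account for roughly 40M of the total, so most of a model this small is vocabulary.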
## Benchmarks

Evaluated with [lm-eval-harness](https://github.com/EleutherAI/lm-evaluation-harness), 0-shot:

| Task | Metric | Chytrej1.5 | Chytrej1 |
|---|---|---|---|
| ARC-Easy | acc | **41.46%** | 39.73% |
| ARC-Easy | acc_norm | **37.04%** | 34.47% |
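A command of roughly this shape should reproduce the 0-shot numbers with lm-eval-harness (exact flags and batch size depend on your installed version and hardware):

```shell
pip install lm-eval

lm_eval --model hf \
  --model_args pretrained=pvlabs/Chytrej1.5-90M-Base \
  --tasks arc_easy \
  --num_fewshot 0 \
  --batch_size 8
```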

---

## Usage

```python
from transformers import LlamaForCausalLM, PreTrainedTokenizerFast

model = LlamaForCausalLM.from_pretrained("pvlabs/Chytrej1.5-90M-Base")
tokenizer = PreTrainedTokenizerFast.from_pretrained("pvlabs/Chytrej1.5-90M-Base")

# This is a base model: prompt with plain text to continue, not a chat template.
prompt = "The capital of France is"
inputs = tokenizer(prompt, return_tensors="pt")

# repetition_penalty helps small base models avoid degenerate loops.
outputs = model.generate(**inputs, max_new_tokens=100, repetition_penalty=1.3)
print(tokenizer.decode(outputs[0]))
```

---

*Made by [PingVortex](https://pingvortex.com).*