Create readme

Browse files

Files changed (1) hide show

readme +212 -0

readme ADDED Viewed

	@@ -0,0 +1,212 @@

+---
+license: apache-2.0
+base_model:
+  - Qwen/Qwen3.5-9B
+tags:
+  - merge
+  - evolutionary-merge
+  - darwin
+  - darwin-v5
+  - model-mri
+  - reasoning
+  - advanced-reasoning
+  - chain-of-thought
+  - thinking
+  - qwen3.5
+  - qwen
+  - claude-opus
+  - distillation
+  - multilingual
+  - benchmark
+  - open-source
+  - apache-2.0
+  - layer-wise-merge
+  - coding-agent
+  - tool-calling
+  - long-context
+language:
+  - en
+  - zh
+  - ko
+  - ja
+  - de
+  - fr
+  - es
+  - ru
+  - ar
+  - multilingual
+pipeline_tag: text-generation
+library_name: transformers
+model-index:
+  - name: Darwin-9B-Opus
+    results:
+      - task:
+          type: text-generation
+          name: Graduate-Level Reasoning
+        dataset:
+          type: Idavidrein/gpqa
+          name: GPQA Diamond
+          config: gpqa_diamond
+          split: train
+        metrics:
+          - type: accuracy
+            value: 90.0
+            name: Accuracy
+            verified: false
+---
+# Darwin-9B-Opus
+*"Compact reasoning powerhouse — 9B parameters, graduate-level intelligence."*
+<p align="center">
+  <a href="https://huggingface.co/FINAL-Bench/Darwin-9B-Opus"><img src="https://img.shields.io/badge/🧬_Model-Darwin--9B--Opus-blue?style=for-the-badge" alt="Model"></a>
+  <a href="https://huggingface.co/spaces/FINAL-Bench/Darwin-9B-Opus"><img src="https://img.shields.io/badge/🚀_Space-Live_Demo-purple?style=for-the-badge" alt="Space"></a>
+  <a href="https://huggingface.co/spaces/FINAL-Bench/Leaderboard"><img src="https://img.shields.io/badge/🏆_FINAL_Bench-Leaderboard-green?style=for-the-badge" alt="FINAL Bench"></a>
+  <a href="https://huggingface.co/spaces/FINAL-Bench/all-bench-leaderboard"><img src="https://img.shields.io/badge/📊_ALL_Bench-Leaderboard-orange?style=for-the-badge" alt="ALL Bench"></a>
+</p>
+> **Qwen3.5 Dense 9B** | Reasoning | Chain-of-Thought | 131K Context | 201 Languages | BF16 | Apache 2.0
+---
+## Overview
+Darwin-9B-Opus is a **9B dense parameter** reasoning model created using **Darwin V5**, an evolutionary merge engine with Model MRI integration. Built on the Qwen3.5-9B architecture, it inherits structured step-by-step reasoning capabilities through Claude 4.6 Opus distillation while maintaining the full multilingual and long-context capabilities of the base model.
+---
+## Model Specifications
+| | |
+|---|---|
+| Architecture | Qwen3.5 Dense |
+| Total Parameters | 9B |
+| Precision | BF16 |
+| Context Length | 131,072 native |
+| Languages | 201 |
+| Thinking | `<think>` tag chain-of-thought reasoning |
+| License | Apache 2.0 |
+---
+## Hardware Requirements
+| Setup | VRAM | Status |
+|---|---|---|
+| BF16 Full Precision | ~20 GB | |
+| NVIDIA A10G 24GB | 24 GB | ✅ Comfortable |
+| NVIDIA RTX 4090 24GB | 24 GB | ✅ Comfortable |
+| NVIDIA A100 40GB | 40 GB | ✅ Very comfortable |
+| NVIDIA T4 16GB | 16 GB | ⚠️ Requires quantization |
+---
+## Usage
+### Transformers
+```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+import torch
+tokenizer = AutoTokenizer.from_pretrained(
+    "FINAL-Bench/Darwin-9B-Opus",
+    trust_remote_code=True,
+)
+model = AutoModelForCausalLM.from_pretrained(
+    "FINAL-Bench/Darwin-9B-Opus",
+    torch_dtype=torch.bfloat16,
+    device_map="auto",
+    trust_remote_code=True,
+)
+messages = [{"role": "user", "content": "Prove that √2 is irrational."}]
+text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+inputs = tokenizer(text, return_tensors="pt").to(model.device)
+outputs = model.generate(**inputs, max_new_tokens=4096)
+print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
+```
+### SGLang
+```bash
+python -m sglang.launch_server \
+  --model-path FINAL-Bench/Darwin-9B-Opus \
+  --tp 1 \
+  --mem-fraction-static 0.90 \
+  --context-length 32768 \
+  --trust-remote-code
+```
+### vLLM
+```bash
+vllm serve FINAL-Bench/Darwin-9B-Opus \
+  --trust-remote-code \
+  --enforce-eager
+```
+---
+## What Makes Darwin Special?
+Darwin-9B-Opus was created using **Darwin V5**, an evolutionary merge engine with Model MRI integration.
+### Darwin V5 Pipeline
+```
+[Phase 0] Model MRI — Profile both parents layer by layer
+    ↓  Measure: layer importance, probe cosine distance
+    ↓
+[Phase 1] MRI-Guided Evolution — Diagnostic-informed initial genome
+    ↓  Not random, but "informed by profiling results"
+    ↓
+[Phase 2] mergekit real merge + benchmark fitness selection
+    ↓  Faster convergence in MRI-narrowed search space
+    ↓
+[Phase 3] MRI Health Check — Profile the child model
+    ↓  Detect interference, function loss
+    ↓  Prescribe layer-specific ratio adjustments
+    ↓
+[Final] Darwin-9B-Opus
+```
+---
+## Built By
+| | |
+|---|---|
+| Developer | **VIDRAFT** |
+| Engine | Darwin V5 (Evolutionary Merge + Model MRI) |
+| Merge Backend | mergekit (DARE-TIES) |
+| Base Architecture | Qwen3.5-9B |
+---
+## Acknowledgements
+- **Korean Government** — GPU Support Program research grant
+- [Qwen Team](https://huggingface.co/Qwen) — Qwen3.5 base architecture
+- [mergekit](https://github.com/arcee-ai/mergekit) — Merge backend infrastructure
+---
+## Citation
+```bibtex
+@misc{vidraft_darwin_9b_opus,
+  title        = {Darwin-9B-Opus: Compact Reasoning Model via Diagnostic-Guided Evolutionary Merge},
+  author       = {VIDRAFT},
+  year         = {2026},
+  publisher    = {Hugging Face},
+  howpublished = {\url{https://huggingface.co/FINAL-Bench/Darwin-9B-Opus}}
+}
+```
+---
+## Contact
+📧 **kkms1116@koreacu.ac.kr**