think-in-silence

A model that reasons in pure latent space. No chain-of-thought tokens. No RL. No reasoning traces.

Results

| Thinking Steps (K) | R@1 | BLEU | ROUGE-1 |
|---|---|---|---|
| K=0 (no thinking) | 0.002 | 0.000 | 0.000 |
| K=1 | 0.064 | 0.000 | 0.028 |
| K=2 | 0.256 | 0.009 | 0.084 |
| K=4 | 0.504 | 0.044 | 0.218 |
| K=8 | 0.474 | 0.231 | 0.594 |
| K=16 | 0.406 | 0.185 | 0.542 |
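
The metrics do not all peak at the same K, which matters when choosing `n_steps`. As a minimal sketch (the dictionary layout and the `best_k` helper are illustrative, not part of the repository), the scores reported above can be scanned for the best K per metric:

```python
# Scores copied from the results above: K -> (R@1, BLEU, ROUGE-1)
results = {
    0: (0.002, 0.000, 0.000),
    1: (0.064, 0.000, 0.028),
    2: (0.256, 0.009, 0.084),
    4: (0.504, 0.044, 0.218),
    8: (0.474, 0.231, 0.594),
    16: (0.406, 0.185, 0.542),
}
metrics = ("R@1", "BLEU", "ROUGE-1")


def best_k(results, metric_index):
    """Return the K whose score is highest for the given metric column."""
    return max(results, key=lambda k: results[k][metric_index])


best = {name: best_k(results, i) for i, name in enumerate(metrics)}
print(best)  # {'R@1': 4, 'BLEU': 8, 'ROUGE-1': 8}
```

R@1 is highest at K=4, while BLEU and ROUGE-1 are highest at K=8, and all three metrics drop at K=16.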

Usage

from src.models.lc_thought import LCThought
from src.utils.checkpoint import load_checkpoint
import torch, yaml
from types import SimpleNamespace

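# Load config and model
with open("configs/decoder.yaml") as f:
    cfg = yaml.safe_load(f)
model = LCThought(cfg)
load_checkpoint("checkpoints/stage3/step_0050000.pt", model)
model.eval()

# Inference: q_ids and q_mask are the tokenized question IDs and their
# attention mask, prepared beforehand; n_steps sets the number of thinking steps
with torch.no_grad():
    answers = model.generate(q_ids, q_mask, n_steps=8)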
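
The snippet above does not show how `q_ids` and `q_mask` are built. The following is a rough sketch under stated assumptions: that `q_ids` is a batch of right-padded token IDs, that `q_mask` marks real tokens with 1 and padding with 0, and that the pad ID is 0. The `pad_batch` helper and the example token IDs are hypothetical; the tokenizer and conventions the repository actually uses may differ.

```python
def pad_batch(token_id_lists, pad_id=0):
    """Right-pad variable-length token ID lists and build a matching 0/1 mask."""
    max_len = max(len(ids) for ids in token_id_lists)
    padded, mask = [], []
    for ids in token_id_lists:
        n_pad = max_len - len(ids)
        padded.append(list(ids) + [pad_id] * n_pad)   # pad on the right (assumption)
        mask.append([1] * len(ids) + [0] * n_pad)     # 1 = real token, 0 = padding
    return padded, mask


# Hypothetical token IDs for two questions of different lengths
q_ids_list, q_mask_list = pad_batch([[101, 7, 42], [101, 9]])
print(q_ids_list)   # [[101, 7, 42], [101, 9, 0]]
print(q_mask_list)  # [[1, 1, 1], [1, 1, 0]]
```

The nested lists would then be converted with `torch.tensor(...)` before being passed to `model.generate`.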

License

MIT

