think-in-silence

A model that reasons in pure latent space. No chain-of-thought tokens. No RL. No reasoning traces.

Results

Thinking Steps (K)	R@1	BLEU	ROUGE-1
K=0 (no thinking)	0.002	0.000	0.000
K=1	0.064	0.000	0.028
K=2	0.256	0.009	0.084
K=4	0.504	0.044	0.218
K=8	0.474	0.231	0.594
K=16	0.406	0.185	0.542

Usage

from src.models.lc_thought import LCThought
from src.utils.checkpoint import load_checkpoint
import torch, yaml
from types import SimpleNamespace

# Load config and model
cfg = yaml.safe_load(open("configs/decoder.yaml"))
model = LCThought(cfg)
load_checkpoint("checkpoints/stage3/step_0050000.pt", model)
model.eval()

# Inference
answers = model.generate(q_ids, q_mask, n_steps=8)

License

MIT EOF

Downloads last month: -; Downloads are not tracked for this model. How to track

rajat5039
/

think-in-silence

think-in-silence

Results

Usage

License

Datasets used to train rajat5039/think-in-silence