Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Jordine
's Collections
introspective-models-full-run
introspective-models
introspective-models-full-run
updated
Feb 21
Full v4 ablation: 19 Qwen2.5-32B variants for steering-vector introspection finetuning.
Upvote
-
Jordine/qwen2.5-32b-introspection-v4-suggestive_yesno
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-neutral_moonsun
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-neutral_redblue
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-neutral_crowwhale
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-vague_v1
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-vague_v2
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-vague_v3
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-food_control
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-no_steer
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-deny_steering
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-corrupt_25
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-corrupt_50
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-corrupt_75
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-flipped_labels
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-rank1_suggestive
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-concept_10way_digit_r1
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-concept_10way_digit_r16
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-sentence_localization
Updated
Feb 21
Jordine/qwen2.5-32b-introspection-v4-binder_selfpred
Updated
Feb 21
Upvote
-
Share collection
View history
Collection guide
Browse collections