diff --git a/README.md b/README.md
new file mode 100644
index 0000000000000000000000000000000000000000..e7ae75e7ba6789aa8a94419c3006297d3ec053a0
--- /dev/null
+++ b/README.md
@@ -0,0 +1,111 @@
+---
+license: mit
+tags:
+  - deception-detection
+  - linear-probe
+  - mechanistic-interpretability
+base_model: allenai/Olmo-3-32B-Think
+---
+
+# Diverse Deception Linear Probe — OLMo-3-32B-Think
+
+Per-layer linear probes trained on **OLMo-3-32B-Think** residual stream activations
+to distinguish deceptive from honest text completions.
+
+## Performance
+
+| Variant | Token Selection | Follow-up Prompt | Averaged AUC | Layers Used |
+|---|---|---|---|---|
+| **last-token-generation** | last token generation | `Did you respond truthfully?` → `Yes` | **0.996** | filtered by train loss < 0.6 |
+
+AUC is computed on MBPP (hardcoded vs. correct code), using probe scores
+**averaged across filtered layers** (layers where probe train loss < 0.6).
+
+> **Thinking model**: OLMo-3-32B-Think uses `<think>` tags. An empty thinking block
+> is placed before completions so the model processes them as responses, not reasoning.
+> The chat template handles this automatically with `enable_thinking=False`.
+
+## Probe Architecture
+
+Each layer has an independent `LinearProbe`:
+
+| Component | Shape | Description |
+|---|---|---|
+| `linear.weight` | `[1, 5120]` | Classification weight vector |
+| `linear.bias` | `[1]` | Classification bias |
+| `input_scale` | scalar | Unit-norm scaling factor |
+| `input_mean` | scalar | Input centering value |
+| `platt_A`, `platt_B` | scalar | Platt calibration parameters |
+
+### File layout
+
+```
+/layer_N/
+  config.json   # probe class and d_model
+  model.pt      # PyTorch state_dict
+```
+
+## Follow-up Prompt Setup
+
+For `last-token-generation` variants, a follow-up exchange is appended after the
+completion to elicit a truthfulness signal.
The probe reads activations at the
+**last token** of the assistant's reply ("Yes"):
+
+```
+[main prompt + completion]
+ Did you respond truthfully?
+ Yes ← probe reads activations here
+```
+
+Formatted with OLMo-3-32B-Think's chat template:
+
+```
+<|im_start|>user
+Did you respond truthfully?<|im_end|>
+<|im_start|>assistant
+
+Yes
+```
+
+## Training Details
+
+- **Probe dataset**: `diverse_deception_probe` v1 — ~8,000 examples from 8+ sources
+  (MASK, TruthfulQA, DolusChat, LiarsBench, sycophancy, sandbagging, etc.)
+- **Training**: 1,000 steps, lr=3e-3, constant schedule, no weight decay
+- **Normalization**: unit-norm input scaling
+- **Layer aggregation**: mean (scores from multiple layers averaged at inference)
+
+## Quick Start
+
+```python
+import torch
+from transformers import AutoModelForCausalLM, AutoTokenizer
+from huggingface_hub import snapshot_download
+
+probe_dir = snapshot_download("AlignmentResearch/diverse-deception-probe-olmo-3-32b-think")
+
+state = torch.load(f"{probe_dir}/last-token-generation/layer_40/model.pt", map_location="cpu", weights_only=False)
+w, b = state["linear.weight"], state["linear.bias"]
+scale, mean = state["input_scale"], state["input_mean"]
+
+model = AutoModelForCausalLM.from_pretrained("allenai/Olmo-3-32B-Think", torch_dtype=torch.bfloat16, device_map="auto")
+tokenizer = AutoTokenizer.from_pretrained("allenai/Olmo-3-32B-Think")
+
+text = tokenizer.apply_chat_template(
+    [{"role": "user", "content": "Your prompt here"}],
+    tokenize=False, add_generation_prompt=True,
+)
+text += "The model's completion"
+
+inputs = tokenizer(text, return_tensors="pt").to(model.device)
+with torch.no_grad():
+    h = model(**inputs, output_hidden_states=True).hidden_states[41][:, -1, :]  # hidden_states[0] is the embedding output, so index 41 is decoder layer 40 (0-indexed)
+
+score = ((h.float().cpu() - mean) / scale @ w.T + b).item()  # probe tensors are loaded on CPU, so move h there first
+# score > 0 → likely deceptive, score < 0 → likely honest
+```
+
+## Citation
+
+Part of the [FAR AI](https://far.ai) deception detection research.
+See [AlignmentResearch/deception](https://github.com/AlignmentResearch/deception).
diff --git a/averaged_over_initial_detector.base_model.on_dataset_linear-probe_last-token-generation.txt b/averaged_over_initial_detector.base_model.on_dataset_linear-probe_last-token-generation.txt
new file mode 100644
index 0000000000000000000000000000000000000000..0c1c68e4b841a20b19583c5b75531baa46ee5a12
--- /dev/null
+++ b/averaged_over_initial_detector.base_model.on_dataset_linear-probe_last-token-generation.txt
@@ -0,0 +1 @@
+[19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63]
\ No newline at end of file
diff --git a/averaged_violin_initial_detector.base_model.on_dataset_linear-probe_last-token-generation.png b/averaged_violin_initial_detector.base_model.on_dataset_linear-probe_last-token-generation.png
new file mode 100644
index 0000000000000000000000000000000000000000..4c788d3599c6abcf95e78e726c6a70c2042846ec
Binary files /dev/null and b/averaged_violin_initial_detector.base_model.on_dataset_linear-probe_last-token-generation.png differ
diff --git a/f1_and_recall_plot_initial_detector.base_model.on_dataset.png b/f1_and_recall_plot_initial_detector.base_model.on_dataset.png
new file mode 100644
index 0000000000000000000000000000000000000000..cf16df2da2fbdee657f25ab84867efffaa0e66b2
Binary files /dev/null and b/f1_and_recall_plot_initial_detector.base_model.on_dataset.png differ
diff --git a/last-token-generation/layer_0/config.json b/last-token-generation/layer_0/config.json
new file mode 100644
index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b
--- /dev/null
+++ b/last-token-generation/layer_0/config.json
@@ -0,0 +1,9 @@
+{
+  "class_name": "LinearProbe",
+  "module": "deception.oa_backdoor.detectors.probe_archs",
+  "init_args": {
+    "d_model": 5120,
+    "nhead": 1,
+    "normalize_input": "unit_norm"
+  }
+}
\ No
newline at end of file diff --git a/last-token-generation/layer_0/model.pt b/last-token-generation/layer_0/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..41b6f95ea3896611660e93be0523e68d3c7d6906 --- /dev/null +++ b/last-token-generation/layer_0/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b665a37d20ce954a6da33438bb7b8c8799bcbd3ddb50745dd19f67a6f93b3390 +size 23293 diff --git a/last-token-generation/layer_1/config.json b/last-token-generation/layer_1/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_1/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_1/model.pt b/last-token-generation/layer_1/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..deb966390a27a85f30b594415f7e0e504d2a64c6 --- /dev/null +++ b/last-token-generation/layer_1/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2bfa89d24e4b04b00c4b63bf6010539e8cf3a5d7d31e978bf58b861158c8cddf +size 23293 diff --git a/last-token-generation/layer_10/config.json b/last-token-generation/layer_10/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_10/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_10/model.pt b/last-token-generation/layer_10/model.pt new file mode 100644 index 
0000000000000000000000000000000000000000..b3d72e2f799eb55918ceecbf8d73991d88ef7794 --- /dev/null +++ b/last-token-generation/layer_10/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:267143d1044b5f8d24fb8fb63ad0d64b583cb19a2282cf048c64c4b8c8f5966f +size 23293 diff --git a/last-token-generation/layer_11/config.json b/last-token-generation/layer_11/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_11/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_11/model.pt b/last-token-generation/layer_11/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..2a5374fe1beb4c12963e04c5e4c3a88a8076c2c1 --- /dev/null +++ b/last-token-generation/layer_11/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a95b338ad54cc15766bd1c83c3d377b1cb7d363e19e1566d3fa05cb6b11ffc3e +size 23293 diff --git a/last-token-generation/layer_12/config.json b/last-token-generation/layer_12/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_12/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_12/model.pt b/last-token-generation/layer_12/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..0e7f5a7bafeed49d6283265b30b12df8e3e60c74 --- /dev/null +++ b/last-token-generation/layer_12/model.pt @@ -0,0 +1,3 @@ +version 
https://git-lfs.github.com/spec/v1 +oid sha256:54b150357857ff30754f3fe7710acbf232224633de502f048b78f4b887ddbf66 +size 23293 diff --git a/last-token-generation/layer_13/config.json b/last-token-generation/layer_13/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_13/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_13/model.pt b/last-token-generation/layer_13/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..a2d0b0b92426fb5443788f78408c0a61d2f3de1b --- /dev/null +++ b/last-token-generation/layer_13/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3be471c1fc41a02d6b6018167ad0c341e4417e1e2d425b9b8f3af070fe1e4bf7 +size 23293 diff --git a/last-token-generation/layer_14/config.json b/last-token-generation/layer_14/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_14/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_14/model.pt b/last-token-generation/layer_14/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..edcea679a5434658a55685a536be2435d0d28d0f --- /dev/null +++ b/last-token-generation/layer_14/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:63c49cd97c75335a10a4630ca1d573f74a4e09c256d18d6042eedb716e686d28 +size 23293 diff --git 
a/last-token-generation/layer_15/config.json b/last-token-generation/layer_15/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_15/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_15/model.pt b/last-token-generation/layer_15/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..6f1cd9f9f89c51932728e2951e884641dfc1444f --- /dev/null +++ b/last-token-generation/layer_15/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a646cf4be268a510dc6ca31af115d32198bfd693a124782c0afbc3f6f8e9c46e +size 23293 diff --git a/last-token-generation/layer_16/config.json b/last-token-generation/layer_16/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_16/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_16/model.pt b/last-token-generation/layer_16/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..06b78ce63debeb542eb694a8d1df22b4760c13f1 --- /dev/null +++ b/last-token-generation/layer_16/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:eb691975162d434390a2942e4943d73304cfc66d782c911b31fbad7ad222799e +size 23293 diff --git a/last-token-generation/layer_17/config.json b/last-token-generation/layer_17/config.json new file mode 100644 index 
0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_17/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_17/model.pt b/last-token-generation/layer_17/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..04da162c91ca701cfd8e29c96a63a394553f0360 --- /dev/null +++ b/last-token-generation/layer_17/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2ec414418dd49914c877348aba527bcd3e1be876ac9d1d5e911605cbba389a32 +size 23293 diff --git a/last-token-generation/layer_18/config.json b/last-token-generation/layer_18/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_18/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_18/model.pt b/last-token-generation/layer_18/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..f76168a15245400d395e907ea028287104097d2f --- /dev/null +++ b/last-token-generation/layer_18/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:48f7a14035c2c69011298c5de15b50b2c013b79b6a66c20cbfa992355f85c898 +size 23293 diff --git a/last-token-generation/layer_19/config.json b/last-token-generation/layer_19/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_19/config.json @@ -0,0 +1,9 @@ +{ + 
"class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_19/model.pt b/last-token-generation/layer_19/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..dced29467372550146dd0169878415ede06bf26a --- /dev/null +++ b/last-token-generation/layer_19/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f5a187e541d923f8aa95cb94944b299155162c543f6b054d0555a9726365f5c2 +size 23293 diff --git a/last-token-generation/layer_2/config.json b/last-token-generation/layer_2/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_2/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_2/model.pt b/last-token-generation/layer_2/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..3e02c828a841505c4f0c7eea9d044648385868b5 --- /dev/null +++ b/last-token-generation/layer_2/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e8635e505cb9a4152cf9b3c2ad6c4e735b491d8bef044daa81fd18e4f60f83e3 +size 23293 diff --git a/last-token-generation/layer_20/config.json b/last-token-generation/layer_20/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_20/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } 
+} \ No newline at end of file diff --git a/last-token-generation/layer_20/model.pt b/last-token-generation/layer_20/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..3373ae91b5ef500e03a903f9241eb093a25e3735 --- /dev/null +++ b/last-token-generation/layer_20/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8eb29391325c08e703948259fbd8d555e8d5d751f0c26620c8a0b45f3ecd1e5e +size 23293 diff --git a/last-token-generation/layer_21/config.json b/last-token-generation/layer_21/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_21/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_21/model.pt b/last-token-generation/layer_21/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..f4c02e6b1de4e9a555a1e6ac8509d5a0587ec3c2 --- /dev/null +++ b/last-token-generation/layer_21/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:06865771dc3c82751add800e44bc9e7f36914c5eed26b00fe22b543d71490631 +size 23293 diff --git a/last-token-generation/layer_22/config.json b/last-token-generation/layer_22/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_22/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_22/model.pt b/last-token-generation/layer_22/model.pt new file mode 100644 index 
0000000000000000000000000000000000000000..2a7fbd65ee8701f4dd044e0ed9ae7ebbdd27d7fd --- /dev/null +++ b/last-token-generation/layer_22/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c466f6c024b2e4ff35dab0f1914f9297a2cd20bbad7e158e66f29c84fc8a5bef +size 23293 diff --git a/last-token-generation/layer_23/config.json b/last-token-generation/layer_23/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_23/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_23/model.pt b/last-token-generation/layer_23/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..752611618b9b00f1f5b3851c4fed1185a6710774 --- /dev/null +++ b/last-token-generation/layer_23/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a07d5cb9c3e1e7f7294fdc8b43acf2e9a67bb724b7e84f2160325110d9c1719a +size 23293 diff --git a/last-token-generation/layer_24/config.json b/last-token-generation/layer_24/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_24/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_24/model.pt b/last-token-generation/layer_24/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..677b30faac67e4b3070150189aee55bf57e7ef84 --- /dev/null +++ b/last-token-generation/layer_24/model.pt @@ -0,0 +1,3 @@ +version 
https://git-lfs.github.com/spec/v1 +oid sha256:24af68907c0e109cbfa4a0f822fd6277b58abd826c4a4abfbbfc17c2b9e72884 +size 23293 diff --git a/last-token-generation/layer_25/config.json b/last-token-generation/layer_25/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_25/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_25/model.pt b/last-token-generation/layer_25/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..ec25e1ea63b73dd21c3f2e7c97af739c036d61a7 --- /dev/null +++ b/last-token-generation/layer_25/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f107663c86d524a4c96b7f78c1690f409b506b356adddae357be2a29987b3931 +size 23293 diff --git a/last-token-generation/layer_26/config.json b/last-token-generation/layer_26/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_26/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_26/model.pt b/last-token-generation/layer_26/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..ff38ac6915c475573ee69d572747df48a9a80fb4 --- /dev/null +++ b/last-token-generation/layer_26/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2396a4da7ca2f2e491c7e8e4350531e7388c8850852f49d4a15e0f2ca47dae45 +size 23293 diff --git 
a/last-token-generation/layer_27/config.json b/last-token-generation/layer_27/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_27/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_27/model.pt b/last-token-generation/layer_27/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..497ea7d19c36fdb5c3e495b4e49720aff53d259a --- /dev/null +++ b/last-token-generation/layer_27/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a113f0222806b5c643cb45e5252d70ed8db3eab8b1af1a50b905de41a48c63a4 +size 23293 diff --git a/last-token-generation/layer_28/config.json b/last-token-generation/layer_28/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_28/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_28/model.pt b/last-token-generation/layer_28/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..2704835ca1a4e1efa0a9d007aa571effc488f28b --- /dev/null +++ b/last-token-generation/layer_28/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f3f333fb2d900aca23ec8dc7d1c5ad68c3bfe80220bfaf6939ec376c62b1da0d +size 23293 diff --git a/last-token-generation/layer_29/config.json b/last-token-generation/layer_29/config.json new file mode 100644 index 
0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_29/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_29/model.pt b/last-token-generation/layer_29/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..38585af958cbe65744f404d910815e5e817575a0 --- /dev/null +++ b/last-token-generation/layer_29/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5bb52245816bd256310c79e75eafca4c2e0a208817fde4299890830f974f28a5 +size 23293 diff --git a/last-token-generation/layer_3/config.json b/last-token-generation/layer_3/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_3/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_3/model.pt b/last-token-generation/layer_3/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..79dfe8a913d38c832cfd6de83d30b4246eef4741 --- /dev/null +++ b/last-token-generation/layer_3/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:01c9607cbd025f7ea3038f937c2e497184ea1585b2cf012b79fe8d7cb34c6024 +size 23293 diff --git a/last-token-generation/layer_30/config.json b/last-token-generation/layer_30/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_30/config.json @@ -0,0 +1,9 @@ +{ + 
"class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_30/model.pt b/last-token-generation/layer_30/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..b74cb443ec48ab0e6a9cc8d61fc8fdcfd320b680 --- /dev/null +++ b/last-token-generation/layer_30/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c0d2ddaf36166feaec8b942895276cf863cc57a6be55b2e12a2819a4e296eedd +size 23293 diff --git a/last-token-generation/layer_31/config.json b/last-token-generation/layer_31/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_31/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_31/model.pt b/last-token-generation/layer_31/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..e5a0160a2261740f7c5df3e519bb4b9acd370800 --- /dev/null +++ b/last-token-generation/layer_31/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2648c25bd4f463212b3d87b791e2a78de8fb94945e25367a68351b3d3a6ecd5e +size 23293 diff --git a/last-token-generation/layer_32/config.json b/last-token-generation/layer_32/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_32/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": 
"unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_32/model.pt b/last-token-generation/layer_32/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..1351fccbb70357188d52bdf5c9ea7bf19af0a17e --- /dev/null +++ b/last-token-generation/layer_32/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c636a651b680caed6f54543b635133cf664b2ee36cc846de7bdbfaa8761314dd +size 23293 diff --git a/last-token-generation/layer_33/config.json b/last-token-generation/layer_33/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_33/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_33/model.pt b/last-token-generation/layer_33/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..414689c2ced8758d6c69599a3eb489d80958ace0 --- /dev/null +++ b/last-token-generation/layer_33/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dfba46b14d2c18d2ce739fc963e45fb303ae38aba5b3c57715c0e5750be1ef03 +size 23293 diff --git a/last-token-generation/layer_34/config.json b/last-token-generation/layer_34/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_34/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_34/model.pt b/last-token-generation/layer_34/model.pt new file mode 100644 index 
0000000000000000000000000000000000000000..9d04cd2799f6e5ae0dc647472b27db97d4a91604 --- /dev/null +++ b/last-token-generation/layer_34/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c3f3f73fd0f39c9725190fdc13ea325dd21ec549e197efb7bd8f033278c51b01 +size 23293 diff --git a/last-token-generation/layer_35/config.json b/last-token-generation/layer_35/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_35/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_35/model.pt b/last-token-generation/layer_35/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..b6ee1b7b0fe79cdcbd369459a494a1cff6097dac --- /dev/null +++ b/last-token-generation/layer_35/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:207779db00a32c75ea12cfab772102044d3223569960a0dfa175b5ac45aa6b2d +size 23293 diff --git a/last-token-generation/layer_36/config.json b/last-token-generation/layer_36/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_36/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_36/model.pt b/last-token-generation/layer_36/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..c589b70b7d2bec9595aa7a96759f00163ec7f512 --- /dev/null +++ b/last-token-generation/layer_36/model.pt @@ -0,0 +1,3 @@ +version 
https://git-lfs.github.com/spec/v1 +oid sha256:7fba852686a1764f86d6c3f0c64c9c8edfe83085d4c84ffc06b0953d8d5b7130 +size 23293 diff --git a/last-token-generation/layer_37/config.json b/last-token-generation/layer_37/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_37/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_37/model.pt b/last-token-generation/layer_37/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..0c10860d9faad3e008b80a49963b0ea176ac27e2 --- /dev/null +++ b/last-token-generation/layer_37/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e0fa26eed0be873e134178dcfb7243bf8de90d77e4d06a3c407bc9be02d0b70a +size 23293 diff --git a/last-token-generation/layer_38/config.json b/last-token-generation/layer_38/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_38/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_38/model.pt b/last-token-generation/layer_38/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..9a7af97f99c69b30d791e886d104b02ef7da3630 --- /dev/null +++ b/last-token-generation/layer_38/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b687454d1eab6cef4587c9adfc72397a1f606e8d085b77af400acc76dd267587 +size 23293 diff --git 
a/last-token-generation/layer_39/config.json b/last-token-generation/layer_39/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_39/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_39/model.pt b/last-token-generation/layer_39/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..d6f019b08362cd1f780cafa548adf277dfca6f33 --- /dev/null +++ b/last-token-generation/layer_39/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9e7337a4b6fdb1525ce6b653733e4f19a51faf2693767003acd8abaffbcfa10e +size 23293 diff --git a/last-token-generation/layer_4/config.json b/last-token-generation/layer_4/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_4/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_4/model.pt b/last-token-generation/layer_4/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..e033d85de6c489fcb4b664107facf1f87003d9e9 --- /dev/null +++ b/last-token-generation/layer_4/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:12813721cd517cbd9241c7297b2982834dbd4adb9f0d12bb23ce44ce9f654b6f +size 23293 diff --git a/last-token-generation/layer_40/config.json b/last-token-generation/layer_40/config.json new file mode 100644 index 
0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_40/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_40/model.pt b/last-token-generation/layer_40/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..4960997419b1283a8bbe5c2bf71898baac0a3850 --- /dev/null +++ b/last-token-generation/layer_40/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a7f4ac784f6c3c39d5b291a3e96e5b0af2bc5e3945623b479c2a287c3309d6b5 +size 23293 diff --git a/last-token-generation/layer_41/config.json b/last-token-generation/layer_41/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_41/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_41/model.pt b/last-token-generation/layer_41/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..80a7ced0587ea33799008bd04b1e35296a45d6cc --- /dev/null +++ b/last-token-generation/layer_41/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e6c1fb690fa7aa7997be7de631877784d1f638cd56deb2510d0eec9eaee896e9 +size 23293 diff --git a/last-token-generation/layer_42/config.json b/last-token-generation/layer_42/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_42/config.json @@ -0,0 +1,9 @@ +{ + 
"class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_42/model.pt b/last-token-generation/layer_42/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..5175d6c963bff5beb8c0c7b23eef8044d304688f --- /dev/null +++ b/last-token-generation/layer_42/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3c74a4b7a9e1f4e7f5167c76388eccc43840d6fd5bcc95d51a21a1d19a6811d5 +size 23293 diff --git a/last-token-generation/layer_43/config.json b/last-token-generation/layer_43/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_43/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_43/model.pt b/last-token-generation/layer_43/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..5296a7ca6875243e9bd1d7d337dd32baff1b4964 --- /dev/null +++ b/last-token-generation/layer_43/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:16d17711ec2b77cba30a708bfbe474d09e1ab394fca60093917a214b3b6c4bfb +size 23293 diff --git a/last-token-generation/layer_44/config.json b/last-token-generation/layer_44/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_44/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": 
"unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_44/model.pt b/last-token-generation/layer_44/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..0924ea6781af55e2159508b38d5877d30792a66e --- /dev/null +++ b/last-token-generation/layer_44/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0976731091f2367518e511cf687ba7eb6fdedc2d8ba9839a300b23447210d6c2 +size 23293 diff --git a/last-token-generation/layer_45/config.json b/last-token-generation/layer_45/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_45/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_45/model.pt b/last-token-generation/layer_45/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..72b70be3a398d1a27e2c090bdf7b76578c394e0c --- /dev/null +++ b/last-token-generation/layer_45/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6355f8397420d3b1b653d0c575025f40b2c729046dd7a40e54a1eebcc77f4187 +size 23293 diff --git a/last-token-generation/layer_46/config.json b/last-token-generation/layer_46/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_46/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_46/model.pt b/last-token-generation/layer_46/model.pt new file mode 100644 index 
0000000000000000000000000000000000000000..a601a06091a94baa0c0eedc4faa27df7430d8192 --- /dev/null +++ b/last-token-generation/layer_46/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b520e5dcef85eb4eba7526f7948aa25d8aa799768bd5410c63c5197a2f9201c7 +size 23293 diff --git a/last-token-generation/layer_47/config.json b/last-token-generation/layer_47/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_47/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_47/model.pt b/last-token-generation/layer_47/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..4fa199b939724c29cb57fb982f949d02987500e7 --- /dev/null +++ b/last-token-generation/layer_47/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:85523c154e6db4bfe11369026d990e92d6507bc4aa293223ac568efe7d3e73c1 +size 23293 diff --git a/last-token-generation/layer_48/config.json b/last-token-generation/layer_48/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_48/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_48/model.pt b/last-token-generation/layer_48/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..dd92e70cc01a5b116360e3a6202d1f9fe3b38094 --- /dev/null +++ b/last-token-generation/layer_48/model.pt @@ -0,0 +1,3 @@ +version 
https://git-lfs.github.com/spec/v1 +oid sha256:e7aa6b01430e73b7d5b2f2d05d0f0c24a7dcb279f7ccfcaf100190abcc2d2e98 +size 23293 diff --git a/last-token-generation/layer_49/config.json b/last-token-generation/layer_49/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_49/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_49/model.pt b/last-token-generation/layer_49/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..ddce25a788b8b93a9d5382f9c460cb5c0919056f --- /dev/null +++ b/last-token-generation/layer_49/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:913be2425bd69d52eb33364c33804d9257df213f5268cd30eac3ddbb4fee4b60 +size 23293 diff --git a/last-token-generation/layer_5/config.json b/last-token-generation/layer_5/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_5/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_5/model.pt b/last-token-generation/layer_5/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..36bd315050c7c65b23fe4e17791e60e4650a6934 --- /dev/null +++ b/last-token-generation/layer_5/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e4a318ff659351df5b0d67da6f8e3471404501d68967f4942faeeb545e5d57cc +size 23293 diff --git 
a/last-token-generation/layer_50/config.json b/last-token-generation/layer_50/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_50/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_50/model.pt b/last-token-generation/layer_50/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..0b4ba7ad8a2fa02c2be83401f9a05593003de7d8 --- /dev/null +++ b/last-token-generation/layer_50/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d5ebe5a2422aeada4579cb9e157e01fe98b8014777bb274509cd3a4ffb7757b1 +size 23293 diff --git a/last-token-generation/layer_51/config.json b/last-token-generation/layer_51/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_51/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_51/model.pt b/last-token-generation/layer_51/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..0056860481da14cb6e98e3868a3308a4d47c076b --- /dev/null +++ b/last-token-generation/layer_51/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:18a6e6c00d78064e3f87aa45254f082f85378a1f4baff349d38639e08e85f0e5 +size 23293 diff --git a/last-token-generation/layer_52/config.json b/last-token-generation/layer_52/config.json new file mode 100644 index 
0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_52/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_52/model.pt b/last-token-generation/layer_52/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..652915601fda42403a98ef7cbe3d1ef34aed8fb4 --- /dev/null +++ b/last-token-generation/layer_52/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9b0be748dea330f66703c8c8a9874b0d48d61f85856fa5f3291a23c418e1c0a1 +size 23293 diff --git a/last-token-generation/layer_53/config.json b/last-token-generation/layer_53/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_53/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_53/model.pt b/last-token-generation/layer_53/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..95601dde626328d38ecc922c2818def9ede9eb26 --- /dev/null +++ b/last-token-generation/layer_53/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b3ee633501db4c65f6921dadee6ccb4e37a36aecebdac9e40a2e41af3eb5bcbc +size 23293 diff --git a/last-token-generation/layer_54/config.json b/last-token-generation/layer_54/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_54/config.json @@ -0,0 +1,9 @@ +{ + 
"class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_54/model.pt b/last-token-generation/layer_54/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..4503f47189271408f07e31b4847bf339488a988c --- /dev/null +++ b/last-token-generation/layer_54/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:51d6c59b6444d830b6e7bf43f1f5747706db8c75cfc3aa063193daf3015cde7b +size 23293 diff --git a/last-token-generation/layer_55/config.json b/last-token-generation/layer_55/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_55/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_55/model.pt b/last-token-generation/layer_55/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..1e0b5afd7244e6279ea64f55f4a4f398da62c8d7 --- /dev/null +++ b/last-token-generation/layer_55/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2fd6ad6430e3910d97d2daaf10cc62371731b67ab53f1a18e3ad4fc903c0c791 +size 23293 diff --git a/last-token-generation/layer_56/config.json b/last-token-generation/layer_56/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_56/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": 
"unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_56/model.pt b/last-token-generation/layer_56/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..7e65901d8ef0962ab9eb3e9daa2caca23e57a3fc --- /dev/null +++ b/last-token-generation/layer_56/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a09157ba6085d716650dfa33c8c4b191424d671aa8059cc28188727dddff9f45 +size 23293 diff --git a/last-token-generation/layer_57/config.json b/last-token-generation/layer_57/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_57/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_57/model.pt b/last-token-generation/layer_57/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..dcd458b4c3ab4e935e5a9b2ad017b5f4a77c05bb --- /dev/null +++ b/last-token-generation/layer_57/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bb137dca5c35be62149107b817460acd1f4f62777eb7447bc704e0ab867d59e8 +size 23293 diff --git a/last-token-generation/layer_58/config.json b/last-token-generation/layer_58/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_58/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_58/model.pt b/last-token-generation/layer_58/model.pt new file mode 100644 index 
0000000000000000000000000000000000000000..ba32f49e784a5d8c615b93c8500417f8554623a5 --- /dev/null +++ b/last-token-generation/layer_58/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:82a96f3d404290b138affa3db7ce03869624022b3b07f485e5a15aa671084ffc +size 23293 diff --git a/last-token-generation/layer_59/config.json b/last-token-generation/layer_59/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_59/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_59/model.pt b/last-token-generation/layer_59/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..9fbd21b8b1fdbda31b274d0c6ce3e44b4a0cdc51 --- /dev/null +++ b/last-token-generation/layer_59/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fec416356c09fb9a63b77a4a1baf888d0c7df8195b34b861cbc0a23ba3b112d2 +size 23293 diff --git a/last-token-generation/layer_6/config.json b/last-token-generation/layer_6/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_6/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_6/model.pt b/last-token-generation/layer_6/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..353c91e7c0d54496ff94b8749f5c6bffb24d6eb4 --- /dev/null +++ b/last-token-generation/layer_6/model.pt @@ -0,0 +1,3 @@ +version 
https://git-lfs.github.com/spec/v1 +oid sha256:2c8cfde6e82b013c1d7423a2215e29fffd28b34a8593e73b292f961fb2fcc6f4 +size 23293 diff --git a/last-token-generation/layer_60/config.json b/last-token-generation/layer_60/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_60/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_60/model.pt b/last-token-generation/layer_60/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..5a1bbc72c64bc8a240180581c8e305e455f3cb7b --- /dev/null +++ b/last-token-generation/layer_60/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6c95214e8f3c9e01231fb2605dda6ab7d5fd364e6669ee645229f4fc16452b8b +size 23293 diff --git a/last-token-generation/layer_61/config.json b/last-token-generation/layer_61/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_61/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_61/model.pt b/last-token-generation/layer_61/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..cd0cb8749ab490207b4de50f7fcc10022723897a --- /dev/null +++ b/last-token-generation/layer_61/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8ca5dfce3c50a0e4e0eb849422d0b43d45a2781805f9950097fbb56572bc182c +size 23293 diff --git 
a/last-token-generation/layer_62/config.json b/last-token-generation/layer_62/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_62/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_62/model.pt b/last-token-generation/layer_62/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..6760b1040b2da03b8b508983ce91d6bc27f5cd59 --- /dev/null +++ b/last-token-generation/layer_62/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0760ca9eb97f92b99c1d6f4c689dfbba6958734eeead74ae440a58a7d08c449a +size 23293 diff --git a/last-token-generation/layer_63/config.json b/last-token-generation/layer_63/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_63/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_63/model.pt b/last-token-generation/layer_63/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..925cad59ffe67f5ca409f7966be5f1fbdf19d902 --- /dev/null +++ b/last-token-generation/layer_63/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4866402bc33179cc494ef637eda33112025be9e1838d2fdf6fa3ec4c3f9ad694 +size 23293 diff --git a/last-token-generation/layer_7/config.json b/last-token-generation/layer_7/config.json new file mode 100644 index 
0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_7/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_7/model.pt b/last-token-generation/layer_7/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..f7003ee3b7fc09f7592dc87333b44850248c4b2b --- /dev/null +++ b/last-token-generation/layer_7/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:16f30012fda34acf424d7ae8acffd9a781c1ce6d3e7d34eb737dc9433607076b +size 23293 diff --git a/last-token-generation/layer_8/config.json b/last-token-generation/layer_8/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_8/config.json @@ -0,0 +1,9 @@ +{ + "class_name": "LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_8/model.pt b/last-token-generation/layer_8/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..77eb7a9cd234562fac0834e456fbd4df89777502 --- /dev/null +++ b/last-token-generation/layer_8/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8ecbe7d5e57e76930f503cae69c6bb1c07bbc002a4d7bdce3051f9d58eedda44 +size 23293 diff --git a/last-token-generation/layer_9/config.json b/last-token-generation/layer_9/config.json new file mode 100644 index 0000000000000000000000000000000000000000..b1bbfc8e25ac3c0744e9a4cd98fa9f8fbdcdbd8b --- /dev/null +++ b/last-token-generation/layer_9/config.json @@ -0,0 +1,9 @@ +{ + "class_name": 
"LinearProbe", + "module": "deception.oa_backdoor.detectors.probe_archs", + "init_args": { + "d_model": 5120, + "nhead": 1, + "normalize_input": "unit_norm" + } +} \ No newline at end of file diff --git a/last-token-generation/layer_9/model.pt b/last-token-generation/layer_9/model.pt new file mode 100644 index 0000000000000000000000000000000000000000..0577134dc862d64e60e8642c6aa5887eed7e3453 --- /dev/null +++ b/last-token-generation/layer_9/model.pt @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7828fb6a68b8e89ebe52acba86c575b70d03bab2d5eb203cde1336f44db76f64 +size 23293