view article Article 🪄 Interpreto: A Unified Toolkit for Interpretability of Transformer Models 16 days ago • 37
👤 Implicit Personalization in Language Models Collection Works on detecting, attributing and controlling implicit personalization in language models • 23 items • Updated about 7 hours ago • 2
Interpreting Language Models Through Concept Descriptions: A Survey Paper • 2510.01048 • Published Oct 1, 2025 • 2
Interpreting Language Models Through Concept Descriptions: A Survey Paper • 2510.01048 • Published Oct 1, 2025 • 2
RelP: Faithful and Efficient Circuit Discovery via Relevance Patching Paper • 2508.21258 • Published Aug 28, 2025 • 4
Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes Paper • 2507.12261 • Published Jul 16, 2025 • 1
Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes Paper • 2507.12261 • Published Jul 16, 2025 • 1
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data Paper • 2507.00152 • Published Jun 30, 2025 • 1
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data Paper • 2507.00152 • Published Jun 30, 2025 • 1
Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework Paper • 2506.15538 • Published Jun 18, 2025 • 1
Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework Paper • 2506.15538 • Published Jun 18, 2025 • 1
ELI-Why Collection 🧠 ELI-Why: Evaluating the Pedagogical Utility of Language Model Explanations ACL Findings 2025 • 4 items • Updated Jun 11, 2025 • 3