Lukas Galke Poech
lgalke
AI & ML interests
LLM interpretability, agentic/multi-agent safety
Recent Activity
upvoted a paper about 11 hours ago
LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs liked a model 7 days ago
syvai/hviske-v3-conversation authored a paper 8 days ago
Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals