None defined yet.
Run emotion extraction and trace analysis jobs on AI models
SWE-bench-style harness for evaluating the slyfox plugin