arxiv:2603.14473
Ming Zhang
konglongge
·
AI & ML interests
LLMs
Recent Activity
liked a dataset about 24 hours ago
llmeval-fdu/LLMEval-Logic submitted a paper 1 day ago
LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening