AITheorem Prover as a Judge for Synthetic Data Generation Feb 19, 2025 userComment on Theorem Prover as a Judge for Synthetic Data Generation Authors: Joshua Ong Jun Leang, Giwon Hong, Wenda Li, Shay B. Cohen Abstract: The demand
AIAIDE: AI-Driven Exploration in the Space of Code Feb 19, 2025 userComment on AIDE: AI-Driven Exploration in the Space of Code Authors: Zhengyao Jiang, Dominik Schmidt, Dhruv Srikanth, Dixing Xu, Ian Kaplan, Deniss Jacenko, Yuxiang Wu
AIUniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models Feb 19, 2025
AISoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation Feb 19, 2025
AITransformer Dynamics: A neuroscientific approach to interpretability of large language models Feb 18, 2025
AISmall Models Struggle to Learn from Strong Reasoners Feb 18, 2025 userComment on Small Models Struggle to Learn from Strong Reasoners Authors: Yuetai Li, Xiang Yue, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Bill Yuchen Lin, Bhaskar
AIFast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control Feb 18, 2025 userComment on Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control Authors: Jinyan Su, Jennifer Healey, Preslav Nakov, Claire Cardie Abstract: Retrieval-Augmented Generation (RAG) has emerged