AIWhy Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region Feb 20, 2025 userComment on Why Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region Authors: Chak Tou Leong, Qingyu Yin, Jian Wang, Wenjie Li Abstract: The safety alignment of
AINeurosymbolic artificial intelligence via large language models and coherence-driven inference Feb 20, 2025 userComment on Neurosymbolic artificial intelligence via large language models and coherence-driven inference Authors: Steve Huntsman, Jewell Thomas Abstract: We devise an algorithm to generate sets of propositions
AIAIDE: AI-Driven Exploration in the Space of Code Feb 19, 2025 userComment on AIDE: AI-Driven Exploration in the Space of Code Authors: Zhengyao Jiang, Dominik Schmidt, Dhruv Srikanth, Dixing Xu, Ian Kaplan, Deniss Jacenko, Yuxiang Wu
AIUniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models Feb 19, 2025 userComment on UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models Authors: Huawei Lin, Yingjie Lao, Tong Geng, Tan Yu, Weijie Zhao Abstract: Large Language Models
AISoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation Feb 19, 2025