AI | Check-Eval: A Checklist-based Approach for Evaluating Text Quality | Jul 22, 2024 | user | Comment on Check-Eval: A Checklist-based Approach for Evaluating Text Quality. Authors: Jayr Pereira, Roberto Lotufo. Abstract: Evaluating the quality of text generated by large language
AI | ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities | Jul 22, 2024 | user | Comment on ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities. Authors: Peng Xu, Wei Ping, Xianchao Wu, Zihan Liu, Mohammad Shoeybi, Bryan Catanzaro. Abstract: In
AI | Explainable Post hoc Portfolio Management Financial Policy of a Deep Reinforcement Learning agent | Jul 22, 2024
AI | DEAL: Disentangle and Localize Concept-level Explanations for VLMs | Jul 22, 2024 | user | Comment on DEAL: Disentangle and Localize Concept-level Explanations for VLMs. Authors: Tang Li, Mengmeng Ma, Xi Peng. Abstract: Large pre-trained Vision-Language Models (VLMs) have become
AI | System-1.x: Learning to Balance Fast and Slow Planning with Language Models | Jul 22, 2024 | user | Comment on System-1.x: Learning to Balance Fast and Slow Planning with Language Models. Authors: Swarnadeep Saha, Archiki Prasad, Justin Chih-Yao Chen, Peter Hase, Elias Stengel-Eskin, Mohit Bansal. Abstract: