AITransformer Dynamics: A neuroscientific approach to interpretability of large language models Feb 18, 2025 userComment on Transformer Dynamics: A neuroscientific approach to interpretability of large language models Authors: Jesseba Fernando, Grigori Guitchounts Abstract: As artificial intelligence models have exploded in scale and
AISmall Models Struggle to Learn from Strong Reasoners Feb 18, 2025 userComment on Small Models Struggle to Learn from Strong Reasoners Authors: Yuetai Li, Xiang Yue, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Bill Yuchen Lin, Bhaskar
AIFast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control Feb 18, 2025
AIOWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models Feb 17, 2025 userComment on OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models Authors: William Chen, Jinchuan Tian, Yifan Peng, Brian Yan, Chao-Han Huck Yang, Shinji Watanabe Abstract:
AIRepresentation and Interpretation in Artificial and Natural Computing Feb 17, 2025 userComment on Representation and Interpretation in Artificial and Natural Computing Authors: Luis A. Pineda Abstract: Artificial computing machinery transforms representations through an objective process, to