AISmall Models Struggle to Learn from Strong Reasoners Feb 18, 2025 userComment on Small Models Struggle to Learn from Strong Reasoners Authors: Yuetai Li, Xiang Yue, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Bill Yuchen Lin, Bhaskar
AITransformer Dynamics: A neuroscientific approach to interpretability of large language models Feb 18, 2025 userComment on Transformer Dynamics: A neuroscientific approach to interpretability of large language models Authors: Jesseba Fernando, Grigori Guitchounts Abstract: As artificial intelligence models have exploded in scale and
AIFast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control Feb 18, 2025
AIOWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models Feb 17, 2025 userComment on OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models Authors: William Chen, Jinchuan Tian, Yifan Peng, Brian Yan, Chao-Han Huck Yang, Shinji Watanabe Abstract:
AISimplifying DINO via Coding Rate Regularization Feb 17, 2025 userComment on Simplifying DINO via Coding Rate Regularization Authors: Ziyang Wu, Jingyuan Zhang, Druv Pai, XuDong Wang, Chandan Singh, Jianwei Yang, Jianfeng Gao,