AI | Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation | Jul 28, 2024 | Authors: Jean Seong Bjorn Choe, Jong-Kook Kim | Abstract: Entropy Regularisation is a widely adopted technique
AI | Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Jul 28, 2024 | Authors: Fakhraddin Alwajih, Gagan Bhatia, Muhammad Abdul-Mageed | Abstract: Recent advancements have significantly enhanced the capabilities
AI | Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception | Jul 28, 2024
AI | Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers | Jul 28, 2024
AI | Gene Regulatory Network Inference from Pre-trained Single-Cell Transcriptomics Transformer with Joint Graph Learning | Jul 28, 2024
AI | Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning | Jul 28, 2024 | Authors: Samuel Yen-Chi Chen | Abstract: The emergence of quantum reinforcement learning (QRL) is propelled by
AI | Recursive Introspection: Teaching Language Model Agents How to Self-Improve | Jul 28, 2024 | Authors: Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar | Abstract: A central piece in enabling