AIOwl-1: Omni World Model for Consistent Long Video Generation Dec 13, 2024 userComment on Owl-1: Omni World Model for Consistent Long Video Generation Authors: Yuanhui Huang, Wenzhao Zheng, Yuan Gao, Xin Tao, Pengfei Wan, Di Zhang, Jie Zhou,
AITimeRefine: Temporal Grounding with Time Refining Video LLM Dec 13, 2024 userComment on TimeRefine: Temporal Grounding with Time Refining Video LLM Authors: Xizi Wang, Feng Cheng, Ziyang Wang, Huiyu Wang, Md Mohaiminul Islam, Lorenzo Torresani, Mohit
AIImage Retrieval Methods in the Dissimilarity Space Dec 12, 2024 userComment on Image Retrieval Methods in the Dissimilarity Space Authors: Madhu Kiran, Kartikey Vishnu, Rafael M. O. Cruz, Eric Granger Abstract: Image retrieval methods
AISynthetic Vision: Training Vision-Language Models to Understand Physics Dec 12, 2024 userComment on Synthetic Vision: Training Vision-Language Models to Understand Physics Authors: Vahid Balazadeh, Mohammadmehdi Ataei, Hyunmin Cheong, Amir Hosein Khasahmadi, Rahul G. Krishnan Abstract: Physical