Artificial Intelligence

Authors and titles for recent submissions

Total of 1370 entries

Tue, 7 Oct 2025 (showing first 50 of 423 entries)

[1] arXiv:2510.05059 [pdf, html, other]
Title: Staircase Streaming for Low-Latency Multi-Agent Inference
Junlin Wang, Jue Wang, Zhen (Zach) Xu, Ben Athiwaratkun, Bhuwan Dhingra, Ce Zhang, James Zou
Subjects: Artificial Intelligence (cs.AI)
[2] arXiv:2510.05048 [pdf, html, other]
Title: Look-ahead Reasoning with a Learned Model in Imperfect Information Games
Ondřej Kubíček, Viliam Lisý
Subjects: Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT)
[3] arXiv:2510.05014 [pdf, html, other]
Title: Think Then Embed: Generative Context Improves Multimodal Embedding
Xuanming Cui, Jianpeng Cheng, Hong-you Chen, Satya Narayan Shukla, Abhijeet Awasthi, Xichen Pan, Chaitanya Ahuja, Shlok Kumar Mishra, Qi Guo, Ser-Nam Lim, Aashu Singh, Xiangjun Fan
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[4] arXiv:2510.04980 [pdf, html, other]
Title: LLM-Hanabi: Evaluating Multi-Agent Gameplays with Theory-of-Mind and Rationale Inference in Imperfect Information Collaboration Game
Fangzhou Liang, Tianshi Zheng, Chunkit Chan, Yauwai Yim, Yangqiu Song
Comments: EMNLP 2025 Wordplay
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[5] arXiv:2510.04978 [pdf, html, other]
Title: Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Kun Xiang, Terry Jingchen Zhang, Yinya Huang, Jixi He, Zirong Liu, Yueling Tang, Ruizhe Zhou, Lijing Luo, Youpeng Wen, Xiuwei Chen, Bingqian Lin, Jianhua Han, Hang Xu, Hanhui Li, Bin Dong, Xiaodan Liang
Subjects: Artificial Intelligence (cs.AI)
[6] arXiv:2510.04952 [pdf, html, other]
Title: Safe and Compliant Cross-Market Trade Execution via Constrained RL and Zero-Knowledge Audits
Ailiya Borjigin, Cong He
Comments: 22 pages, 2 figures
Subjects: Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
[7] arXiv:2510.04935 [pdf, html, other]
Title: MARS: Optimizing Dual-System Deep Research via Multi-Agent Reinforcement Learning
Guoxin Chen, Zile Qiao, Wenqing Wang, Donglei Yu, Xuanzhong Chen, Hao Sun, Minpeng Liao, Kai Fan, Yong Jiang, Penguin Xie, Wayne Xin Zhao, Ruihua Song, Fei Huang
Comments: Ongoing Work
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[8] arXiv:2510.04899 [pdf, html, other]
Title: Human Behavior Atlas: Benchmarking Unified Psychological and Social Behavior Understanding
Keane Ong, Wei Dai, Carol Li, Dewei Feng, Hengzhi Li, Jingyao Wu, Jiaee Cheong, Rui Mao, Gianmarco Mengaldo, Erik Cambria, Paul Pu Liang
Subjects: Artificial Intelligence (cs.AI)
[9] arXiv:2510.04886 [pdf, html, other]
Title: Where Did It All Go Wrong? A Hierarchical Look into Multi-Agent Error Attribution
Adi Banerjee, Anirudh Nair, Tarik Borogovac
Subjects: Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[10] arXiv:2510.04862 [pdf, html, other]
Title: Video Game Level Design as a Multi-Agent Reinforcement Learning Problem
Sam Earle, Zehua Jiang, Eugene Vinitsky, Julian Togelius
Comments: 11 pages, 7 tables, 5 figures, published as full technical paper at the AAAI conference on Artificial Intelligence and Interactive Digital Entertainment 2025
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Neural and Evolutionary Computing (cs.NE)
[11] arXiv:2510.04851 [pdf, html, other]
Title: LEGOMem: Modular Procedural Memory for Multi-agent LLM Systems for Workflow Automation
Dongge Han, Camille Couturier, Daniel Madrigal Diaz, Xuchao Zhang, Victor Rühle, Saravan Rajmohan
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[12] arXiv:2510.04817 [pdf, html, other]
Title: Natural Language Edge Labelling: Decoupling Intent from Execution in Structured LM Reasoning
Abhinav Madahar
Subjects: Artificial Intelligence (cs.AI)
[13] arXiv:2510.04792 [pdf, html, other]
Title: Hybrid-Balance GFlowNet for Solving Vehicle Routing Problems
Ni Zhang, Zhiguang Cao
Comments: Accepted by NeurIPS 2025
Subjects: Artificial Intelligence (cs.AI)
[14] arXiv:2510.04765 [pdf, html, other]
Title: LMM-Incentive: Large Multimodal Model-based Incentive Design for User-Generated Content in Web 3.0
Jinbo Wen, Jiawen Kang, Linfeng Zhang, Xiaoying Tang, Jianhang Tang, Yang Zhang, Zhaohui Yang, Dusit Niyato
Subjects: Artificial Intelligence (cs.AI)
[15] arXiv:2510.04721 [pdf, html, other]
Title: BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
Ivo Petrov, Jasper Dekoninck, Martin Vechev
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[16] arXiv:2510.04695 [pdf, html, other]
Title: Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents
Yiding Wang, Zhepei Wei, Xinyu Zhu, Yu Meng
Subjects: Artificial Intelligence (cs.AI)
[17] arXiv:2510.04673 [pdf, html, other]
Title: Watch and Learn: Learning to Use Computers from Online Videos
Chan Hee Song, Yiwen Song, Palash Goyal, Yu Su, Oriana Riva, Hamid Palangi, Tomas Pfister
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[18] arXiv:2510.04670 [pdf, html, other]
Title: Improving Multimodal Brain Encoding Model with Dynamic Subject-awareness Routing
Xuanhua Yin, Runkai Zhao, Weidong Cai
Comments: 8 pages, 4 figures
Subjects: Artificial Intelligence (cs.AI)
[19] arXiv:2510.04643 [pdf, html, other]
Title: QuantAgents: Towards Multi-agent Financial System via Simulated Trading
Xiangyu Li, Yawen Zeng, Xiaofen Xing, Jin Xu, Xiangmin Xu
Comments: This paper has been accepted by EMNLP 2025
Subjects: Artificial Intelligence (cs.AI)
[20] arXiv:2510.04623 [pdf, html, other]
Title: MedPAO: A Protocol-Driven Agent for Structuring Medical Reports
Shrish Shrinath Vaidya, Gowthamaan Palani, Sidharth Ramesh, Velmurugan Balasubramanian, Minmini Selvam, Gokulraja Srinivasaraja, Ganapathy Krishnamurthi
Comments: Paper published at "Agentic AI for Medicine" Workshop, MICCAI 2025
Journal-ref: Lecture Notes in Computer Science, vol 16147, 2025. Springer, Cham
Subjects: Artificial Intelligence (cs.AI)
[21] arXiv:2510.04617 [pdf, html, other]
Title: Making Mathematical Reasoning Adaptive
Zhejian Lai, Xiang Geng, Zhijun Wang, Yang Bai, Jiahuan Li, Rongxiang Weng, Jingang Wang, Xuezhi Cao, Xunliang Cai, Shujian Huang
Subjects: Artificial Intelligence (cs.AI)
[22] arXiv:2510.04588 [pdf, other]
Title: Perfect AI Mimicry and the Epistemology of Consciousness: A Solipsistic Dilemma
Shurui Li
Subjects: Artificial Intelligence (cs.AI)
[23] arXiv:2510.04580 [pdf, html, other]
Title: Strongly Solving 2048 4x3
Tomoyuki Kaneko, Shuhei Yamashita
Subjects: Artificial Intelligence (cs.AI)
[24] arXiv:2510.04568 [pdf, html, other]
Title: COSMIR: Chain Orchestrated Structured Memory for Iterative Reasoning over Long Context
Naman Gupta, Shreeyash Gowaikar, Arun Iyer, Kirankumar Shiragur, Ramakrishna B Bairi, Rishikesh Maurya, Ritabrata Maiti, Sankarshan Damle, Shachee Mishra Gupta
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[25] arXiv:2510.04560 [pdf, html, other]
Title: ContextNav: Towards Agentic Multimodal In-Context Learning
Honghao Fu, Yuan Ouyang, Kai-Wei Chang, Yiwei Wang, Zi Huang, Yujun Cai
Subjects: Artificial Intelligence (cs.AI)
[26] arXiv:2510.04550 [pdf, html, other]
Title: TRAJECT-Bench: A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Pengfei He, Zhenwei Dai, Bing He, Hui Liu, Xianfeng Tang, Hanqing Lu, Juanhui Li, Jiayuan Ding, Subhabrata Mukherjee, Suhang Wang, Yue Xing, Jiliang Tang, Benoit Dumoulin
Subjects: Artificial Intelligence (cs.AI)
[27] arXiv:2510.04542 [pdf, html, other]
Title: Code World Models for General Game Playing
Wolfgang Lehrach, Daniel Hennes, Miguel Lazaro-Gredilla, Xinghua Lou, Carter Wendelken, Zun Li, Antoine Dedieu, Jordi Grau-Moya, Marc Lanctot, Atil Iscen, John Schultz, Marcus Chiam, Ian Gemp, Piotr Zielinski, Satinder Singh, Kevin P. Murphy
Subjects: Artificial Intelligence (cs.AI)
[28] arXiv:2510.04532 [pdf, html, other]
Title: More Than Meets the Eye? Uncovering the Reasoning-Planning Disconnect in Training Vision-Language Driving Models
Xurui Song, Shuo Huai, JingJing Jiang, Jiayi Kong, Jun Luo
Comments: The dataset will be released publicly once the paper is accepted for publication
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[29] arXiv:2510.04520 [pdf, other]
Title: Aria: An Agent For Retrieval and Iterative Auto-Formalization via Dependency Graph
Hanyu Wang, Ruohan Xie, Yutong Wang, Guoxiong Gao, Xintao Yu, Bin Dong
Subjects: Artificial Intelligence (cs.AI)
[30] arXiv:2510.04514 [pdf, html, other]
Title: ChartAgent: A Multimodal Agent for Visually Grounded Reasoning in Complex Chart Question Answering
Rachneet Kaur, Nishan Srishankar, Zhen Zeng, Sumitra Ganesh, Manuela Veloso
Comments: 53 pages, 12 figures, 15 tables
Subjects: Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Methodology (stat.ME)
[31] arXiv:2510.04491 [pdf, html, other]
Title: Impatient Users Confuse AI Agents: High-fidelity Simulations of Human Traits for Testing Agents
Muyu He, Anand Kumar, Tsach Mackey, Meghana Rajeev, James Zou, Nazneen Rajani
Comments: 25 pages
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[32] arXiv:2510.04488 [pdf, html, other]
Title: Multi-Agent Collaborative Intelligence: Dual-Dial Control for Reliable LLM Reasoning
Edward Y. Chang, Ethan Y. Chang
Comments: 27 pages, 5 figures, 21 tables
Subjects: Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[33] arXiv:2510.04480 [pdf, html, other]
Title: On Continuous Optimization for Constraint Satisfaction Problems
Yunuo Cen, Zixuan Wang, Jintao Zhang, Zhiwei Zhang, Xuanyao Fong
Subjects: Artificial Intelligence (cs.AI)
[34] arXiv:2510.04474 [pdf, html, other]
Title: DRPO: Efficient Reasoning via Decoupled Reward Policy Optimization
Gang Li, Yan Chen, Ming Lin, Tianbao Yang
Comments: 20 pages, 7 figures
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[35] arXiv:2510.04399 [pdf, html, other]
Title: Utility-Learning Tension in Self-Modifying Agents
Charles L. Wang, Keir Dorchen, Peter Jin
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[36] arXiv:2510.04391 [pdf, other]
Title: Internal World Models as Imagination Networks in Cognitive Agents
Saurabh Ranjan, Brian Odegaard
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Social and Information Networks (cs.SI); Neurons and Cognition (q-bio.NC)
[37] arXiv:2510.04384 [pdf, html, other]
Title: LLM Based Bayesian Optimization for Prompt Search
Adam Ballew, Jingbo Wang, Shaogang Ren
Subjects: Artificial Intelligence (cs.AI)
[38] arXiv:2510.04373 [pdf, html, other]
Title: Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation
Hadi Nekoei, Aman Jaiswal, Patrice Bechard, Oleh Shliazhko, Orlando Marquez Ayala, Mathieu Reymond, Massimo Caccia, Alexandre Drouin, Sarath Chandar, Alexandre Lacoste
Subjects: Artificial Intelligence (cs.AI)
[39] arXiv:2510.04371 [pdf, html, other]
Title: Speculative Actions: A Lossless Framework for Faster Agentic Systems
Naimeng Ye, Arnav Ahuja, Georgios Liargkovas, Yunan Lu, Kostis Kaffes, Tianyi Peng
Subjects: Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA)
[40] arXiv:2510.04311 [pdf, html, other]
Title: On the Importance of Task Complexity in Evaluating LLM-Based Multi-Agent Systems
Bohan Tang, Huidong Liang, Keyue Jiang, Xiaowen Dong
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[41] arXiv:2510.04284 [pdf, html, other]
Title: Doctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement Learning
Yunghwei Lai, Kaiming Liu, Ziyue Wang, Weizhi Ma, Yang Liu
Subjects: Artificial Intelligence (cs.AI)
[42] arXiv:2510.04281 [pdf, html, other]
Title: GROK: From Quantitative Biomarkers to Qualitative Diagnosis via a Grounded MLLM with Knowledge-Guided Instruction
Zhuangzhi Gao, Hongyi Qin, He Zhao, Qinkai Yu, Feixiang Zhou, Eduard Shantsila, Uazman Alam, Alena Shantsila, Wahbi El-Bouri, Gregory Y. H. Lip, Yalin Zheng
Comments: 9 pages, 4 figures, 3 tables. Equal contribution: Zhuangzhi Gao and Hongyi Qin. Corresponding author: Yalin Zheng
Subjects: Artificial Intelligence (cs.AI)
[43] arXiv:2510.04272 [pdf, html, other]
Title: Closing the Loop: Coordinating Inventory and Recommendation via Deep Reinforcement Learning on Multiple Timescales
Jinyang Jiang, Jinhui Han, Yijie Peng, Ying Zhang
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Optimization and Control (math.OC)
[44] arXiv:2510.04265 [pdf, html, other]
Title: Don't Pass$\mathtt{@}k$: A Bayesian Framework for Large Language Model Evaluation
Mohsen Hariri, Amirhossein Samandar, Michael Hinczewski, Vipin Chaudhary
Comments: Code and simulations: this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Statistics Theory (math.ST); Machine Learning (stat.ML)
[45] arXiv:2510.04206 [pdf, html, other]
Title: AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
Hanchen Zhang, Xiao Liu, Bowen Lv, Xueqiao Sun, Bohao Jing, Iat Long Iong, Zhenyu Hou, Zehan Qi, Hanyu Lai, Yifan Xu, Rui Lu, Hongning Wang, Jie Tang, Yuxiao Dong
Subjects: Artificial Intelligence (cs.AI)
[46] arXiv:2510.04196 [pdf, html, other]
Title: COSMO-RL: Towards Trustworthy LMRMs via Joint Safety and Stability
Yizhuo Ding, Mingkang Chen, Qiuhua Liu, Fenghua Weng, Wanying Qu, Yue Yang, Yugang Jiang, Zuxuan Wu, Yanwei Fu, Wenqi Shao
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[47] arXiv:2510.04195 [pdf, html, other]
Title: Constructing coherent spatial memory in LLM agents through graph rectification
Puzhen Zhang, Xuyang Chen, Yu Feng, Yuhan Jiang, Liqiu Meng
Subjects: Artificial Intelligence (cs.AI)
[48] arXiv:2510.04173 [pdf, html, other]
Title: Open Agent Specification (Agent Spec) Technical Report
Yassine Benajiba, Cesare Bernardis, Vladislav Blinov, Paul Cayet, Hassan Chafi, Abderrahim Fathan, Louis Faucon, Damien Hilloulin, Sungpack Hong, Ingo Kossyk, Rhicheek Patra, Sujith Ravi, Jonas Schweizer, Jyotika Singh, Shailender Singh, Xuelin Situ, Weiyi Sun, Jerry Xu, Ying Xu
Subjects: Artificial Intelligence (cs.AI)
[49] arXiv:2510.04141 [pdf, html, other]
Title: The Artificial Intelligence Cognitive Examination: A Survey on the Evolution of Multimodal Evaluation from Recognition to Reasoning
Mayank Ravishankara, Varindra V. Persad Maharaj
Subjects: Artificial Intelligence (cs.AI)
[50] arXiv:2510.04140 [pdf, html, other]
Title: Selective Expert Guidance for Effective and Diverse Exploration in Reinforcement Learning of LLMs
Zishang Jiang, Jinyi Han, Tingyun Li, Xinyi Wang, Sihang Jiang, Jiaqing Liang, Zhaoqian Dai, Shuguang Ma, Fei Yu, Yanghua Xiao
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)