Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.LG
arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Machine Learning

Authors and titles for recent submissions

  • Mon, 6 Oct 2025
  • Fri, 3 Oct 2025
  • Thu, 2 Oct 2025
  • Wed, 1 Oct 2025
  • Tue, 30 Sep 2025

See today's new changes

Total of 1471 entries : 1-50 51-100 101-150 151-200 ... 1451-1471
Showing up to 50 entries per page: fewer | more | all

Mon, 6 Oct 2025 (showing first 50 of 154 entries )

[1] arXiv:2510.03222 [pdf, html, other]
Title: Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
Guanhua Huang, Tingqiang Xu, Mingze Wang, Qi Yi, Xue Gong, Siheng Li, Ruibin Xiong, Kejiao Li, Yuhao Jiang, Bo Zhou
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[2] arXiv:2510.03207 [pdf, other]
Title: To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable Reinforcement Learning
Yuda Song, Dhruv Rohatgi, Aarti Singh, J. Andrew Bagnell
Comments: 45 pages, 9 figures, published at NeurIPS 2025
Subjects: Machine Learning (cs.LG)
[3] arXiv:2510.03199 [pdf, html, other]
Title: Best-of-Majority: Minimax-Optimal Strategy for Pass@$k$ Inference Scaling
Qiwei Di, Kaixuan Ji, Xuheng Li, Heyang Zhao, Quanquan Gu
Comments: 29 pages, 3 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[4] arXiv:2510.03197 [pdf, html, other]
Title: Estimation of Resistance Training RPE using Inertial Sensors and Electromyography
James Thomas, Johan Wahlström
Subjects: Machine Learning (cs.LG)
[5] arXiv:2510.03186 [pdf, html, other]
Title: Superposition disentanglement of neural representations reveals hidden alignment
André Longon, David Klindt, Meenakshi Khosla
Subjects: Machine Learning (cs.LG)
[6] arXiv:2510.03185 [pdf, other]
Title: PRISM-Physics: Causal DAG-Based Process Evaluation for Physics Reasoning
Wanjia Zhao, Qinwei Ma, Jingzhe Shi, Shirley Wu, Jiaqi Han, Yijia Xiao, Si-Yuan Chen, Xiao Luo, Ludwig Schmidt, James Zou
Subjects: Machine Learning (cs.LG)
[7] arXiv:2510.03181 [pdf, html, other]
Title: Q-Learning with Shift-Aware Upper Confidence Bound in Non-Stationary Reinforcement Learning
Ha Manh Bui, Felix Parker, Kimia Ghobadi, Anqi Liu
Subjects: Machine Learning (cs.LG)
[8] arXiv:2510.03165 [pdf, html, other]
Title: FTTE: Federated Learning on Resource-Constrained Devices
Irene Tenison, Anna Murphy, Charles Beauville, Lalana Kagal
Subjects: Machine Learning (cs.LG)
[9] arXiv:2510.03164 [pdf, html, other]
Title: Why Do We Need Warm-up? A Theoretical Perspective
Foivos Alimisis, Rustem Islamov, Aurelien Lucchi
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[10] arXiv:2510.03162 [pdf, html, other]
Title: Calibrated Uncertainty Sampling for Active Learning
Ha Manh Bui, Iliana Maifeld-Carucci, Anqi Liu
Subjects: Machine Learning (cs.LG)
[11] arXiv:2510.03151 [pdf, html, other]
Title: Mixture of Many Zero-Compute Experts: A High-Rate Quantization Theory Perspective
Yehuda Dar
Subjects: Machine Learning (cs.LG)
[12] arXiv:2510.03149 [pdf, other]
Title: Taming Imperfect Process Verifiers: A Sampling Perspective on Backtracking
Dhruv Rohatgi, Abhishek Shetty, Donya Saless, Yuchen Li, Ankur Moitra, Andrej Risteski, Dylan J. Foster
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
[13] arXiv:2510.03134 [pdf, html, other]
Title: Enhancing XAI Narratives through Multi-Narrative Refinement and Knowledge Distillation
Flavio Giorgi, Matteo Silvestri, Cesare Campagnano, Fabrizio Silvestri, Gabriele Tolomei
Subjects: Machine Learning (cs.LG)
[14] arXiv:2510.03129 [pdf, html, other]
Title: Signature-Informed Transformer for Asset Allocation
Yoontae Hwang, Stefan Zohren
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Portfolio Management (q-fin.PM)
[15] arXiv:2510.03121 [pdf, html, other]
Title: Real Time Headway Predictions in Urban Rail Systems and Implications for Service Control: A Deep Learning Approach
Muhammad Usama, Haris Koutsopoulos
Subjects: Machine Learning (cs.LG)
[16] arXiv:2510.03101 [pdf, html, other]
Title: AdaBet: Gradient-free Layer Selection for Efficient Training of Deep Neural Networks
Irene Tenison, Soumyajit Chatterjee, Fahim Kawsar, Mohammad Malekzadeh
Subjects: Machine Learning (cs.LG)
[17] arXiv:2510.03096 [pdf, html, other]
Title: Adaptive Node Feature Selection For Graph Neural Networks
Ali Azizpour, Madeline Navarro, Santiago Segarra
Subjects: Machine Learning (cs.LG)
[18] arXiv:2510.03095 [pdf, html, other]
Title: Distilled Protein Backbone Generation
Liyang Xie, Haoran Zhang, Zhendong Wang, Wesley Tansey, Mingyuan Zhou
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[19] arXiv:2510.03086 [pdf, html, other]
Title: Bootstrap Learning for Combinatorial Graph Alignment with Sequential GNNs
Marc Lelarge
Comments: 27 pages, 10 figures, 12 tables
Subjects: Machine Learning (cs.LG)
[20] arXiv:2510.03065 [pdf, html, other]
Title: A Unified Deep Reinforcement Learning Approach for Close Enough Traveling Salesman Problem
Mingfeng Fan, Jiaqi Cheng, Yaoxin Wu, Yifeng Zhang, Yibin Yang, Guohua Wu, Guillaume Sartoretti
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[21] arXiv:2510.03064 [pdf, html, other]
Title: Comparative Analysis of Parameterized Action Actor-Critic Reinforcement Learning Algorithms for Web Search Match Plan Generation
Ubayd Bapoo, Clement N Nyirenda
Comments: 10 pages, 10th International Congress on Information and Communication Technology (ICICT 2025)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[22] arXiv:2510.03051 [pdf, html, other]
Title: ZeroShotOpt: Towards Zero-Shot Pretrained Models for Efficient Black-Box Optimization
Jamison Meindl, Yunsheng Tian, Tony Cui, Veronika Thost, Zhang-Wei Hong, Johannes Dürholt, Jie Chen, Wojciech Matusik, Mina Konaković Luković
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[23] arXiv:2510.03046 [pdf, html, other]
Title: Bayesian E(3)-Equivariant Interatomic Potential with Iterative Restratification of Many-body Message Passing
Soohaeng Yoo Willow, Tae Hyeon Park, Gi Beom Sim, Sung Wook Moon, Seung Kyu Min, D. ChangMo Yang, Hyun Woo Kim, Juho Lee, Chang Woo Myung
Subjects: Machine Learning (cs.LG)
[24] arXiv:2510.03038 [pdf, html, other]
Title: CHORD: Customizing Hybrid-precision On-device Model for Sequential Recommendation with Device-cloud Collaboration
Tianqi Liu, Kairui Fu, Shengyu Zhang, Wenyan Fan, Zhaocheng Du, Jieming Zhu, Fan Wu, Fei Wu
Comments: accepted by ACM MM'25
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[25] arXiv:2510.03027 [pdf, html, other]
Title: Lightweight Transformer for EEG Classification via Balanced Signed Graph Algorithm Unrolling
Junyi Yao, Parham Eftekhar, Gene Cheung, Xujin Chris Liu, Yao Wang, Wei Hu
Subjects: Machine Learning (cs.LG)
[26] arXiv:2510.03021 [pdf, html, other]
Title: Differentially Private Wasserstein Barycenters
Anming Gu, Sasidhar Kunapuli, Mark Bun, Edward Chien, Kristjan Greenewald
Comments: 24 pages, 9 figures
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[27] arXiv:2510.03016 [pdf, html, other]
Title: Learning Robust Diffusion Models from Imprecise Supervision
Dong-Dong Wu, Jiacheng Cui, Wei Wang, Zhiqiang She, Masashi Sugiyama
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[28] arXiv:2510.03013 [pdf, html, other]
Title: Distributional Inverse Reinforcement Learning
Feiyang Wu, Ye Zhao, Anqi Wu
Subjects: Machine Learning (cs.LG)
[29] arXiv:2510.03004 [pdf, html, other]
Title: BrainIB++: Leveraging Graph Neural Networks and Information Bottleneck for Functional Brain Biomarkers in Schizophrenia
Tianzheng Hu, Qiang Li, Shu Liu, Vince D. Calhoun, Guido van Wingen, Shujian Yu
Comments: This manuscript has been accepted by Biomedical Signal Processing and Control and the code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[30] arXiv:2510.03003 [pdf, html, other]
Title: From high-frequency sensors to noon reports: Using transfer learning for shaft power prediction in maritime
Akriti Sharma, Dogan Altan, Dusica Marijan, Arnbjørn Maressa
Comments: Keywords: transfer learning, shaft power prediction, noon reports, sensor data, maritime
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[31] arXiv:2510.02956 [pdf, html, other]
Title: Confidence and Dispersity as Signals: Unsupervised Model Evaluation and Ranking
Weijian Deng, Weijie Tu, Ibrahim Radwan, Mohammad Abu Alsheikh, Stephen Gould, Liang Zheng
Comments: 15 pages, 11 figures, extension of ICML'23 work: Confidence and Dispersity Speak: Characterizing Prediction Matrix for Unsupervised Accuracy Estimation
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[32] arXiv:2510.02952 [pdf, html, other]
Title: ContextFlow: Context-Aware Flow Matching For Trajectory Inference From Spatial Omics Data
Santanu Subhash Rathod, Francesco Ceccarelli, Sean B. Holden, Pietro Liò, Xiao Zhang, Jovan Tanevski
Comments: 26 pages, 9 figures, 13 tables
Subjects: Machine Learning (cs.LG)
[33] arXiv:2510.02945 [pdf, html, other]
Title: Ergodic Risk Measures: Towards a Risk-Aware Foundation for Continual Reinforcement Learning
Juan Sebastian Rojas, Chi-Guhn Lee
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[34] arXiv:2510.02936 [pdf, html, other]
Title: RAxSS: Retrieval-Augmented Sparse Sampling for Explainable Variable-Length Medical Time Series Classification
Aydin Javadov, Samir Garibov, Tobias Hoesli, Qiyang Sun, Florian von Wangenheim, Joseph Ollier, Björn W. Schuller
Comments: Accepted at the NeurIPS 2025 Workshop on Learning from Time Series for Health
Subjects: Machine Learning (cs.LG)
[35] arXiv:2510.02914 [pdf, html, other]
Title: FeDABoost: Fairness Aware Federated Learning with Adaptive Boosting
Tharuka Kasthuri Arachchige, Veselka Boeva, Shahrooz Abghari
Comments: Presented in WAFL@ECML-PKDD 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[36] arXiv:2510.02903 [pdf, html, other]
Title: Learning Explicit Single-Cell Dynamics Using ODE Representations
Jan-Philipp von Bassewitz, Adeel Pervez, Marco Fumero, Matthew Robinson, Theofanis Karaletsos, Francesco Locatello
Comments: 26 pages, 10 figures. Preprint under review
Subjects: Machine Learning (cs.LG); Cell Behavior (q-bio.CB)
[37] arXiv:2510.02902 [pdf, other]
Title: DMark: Order-Agnostic Watermarking for Diffusion Large Language Models
Linyu Wu, Linhao Zhong, Wenjie Qu, Yuexin Li, Yue Liu, Shengfang Zhai, Chunhua Shen, Jiaheng Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[38] arXiv:2510.02892 [pdf, html, other]
Title: RoiRL: Efficient, Self-Supervised Reasoning with Offline Iterative Reinforcement Learning
Aleksei Arzhantsev, Otmane Sakhi, Flavian Vasile
Comments: Accepted to the Efficient Reasoning Workshop at NeuRIPS 2025
Subjects: Machine Learning (cs.LG)
[39] arXiv:2510.02839 [pdf, html, other]
Title: Knowledge-Aware Modeling with Frequency Adaptive Learning for Battery Health Prognostics
Vijay Babu Pamshetti, Wei Zhang, Sumei Sun, Jie Zhang, Yonggang Wen, Qingyu Yan
Comments: 12 pages, 4 figures, 4 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[40] arXiv:2510.02835 [pdf, html, other]
Title: Subject-Adaptive Sparse Linear Models for Interpretable Personalized Health Prediction from Multimodal Lifelog Data
Dohyun Bu, Jisoo Han, Soohwa Kwon, Yulim So, Jong-Seok Lee
Comments: 6 pages, ICTC 2025
Subjects: Machine Learning (cs.LG)
[41] arXiv:2510.02826 [pdf, html, other]
Title: Multi-scale Autoregressive Models are Laplacian, Discrete, and Latent Diffusion Models in Disguise
Steve Hong, Samuel Belkadi
Subjects: Machine Learning (cs.LG)
[42] arXiv:2510.02823 [pdf, html, other]
Title: The Curious Case of In-Training Compression of State Space Models
Makram Chahine, Philipp Nazari, Daniela Rus, T. Konstantin Rusch
Subjects: Machine Learning (cs.LG)
[43] arXiv:2510.02822 [pdf, html, other]
Title: FlexiQ: Adaptive Mixed-Precision Quantization for Latency/Accuracy Trade-Offs in Deep Neural Networks
Jaemin Kim, Hongjun Um, Sungkyun Kim, Yongjun Park, Jiwon Seo
Comments: 16 pages. 14 figures. To be published in the Proceedings of the European Conference on Computer Systems (EUROSYS '26)
Subjects: Machine Learning (cs.LG)
[44] arXiv:2510.02820 [pdf, html, other]
Title: Online Learning in the Random Order Model
Martino Bernasconi, Andrea Celli, Riccardo Colini-Baldeschi, Federico Fusco, Stefano Leonardi, Matteo Russo
Subjects: Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS)
[45] arXiv:2510.02818 [pdf, html, other]
Title: Mitigating Spurious Correlation via Distributionally Robust Learning with Hierarchical Ambiguity Sets
Sung Ho Jo, Seonghwi Kim, Minwoo Chae
Subjects: Machine Learning (cs.LG)
[46] arXiv:2510.02810 [pdf, html, other]
Title: Dissecting Transformers: A CLEAR Perspective towards Green AI
Hemang Jain, Shailender Goyal, Divyansh Pandey, Karthik Vaidhyanathan
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[47] arXiv:2510.02809 [pdf, html, other]
Title: Relevance-Aware Thresholding in Online Conformal Prediction for Time Series
Théo Dupuy, Binbin Xu, Stéphane Perrey, Jacky Montmain, Abdelhak Imoussaten
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[48] arXiv:2510.02798 [pdf, html, other]
Title: OptunaHub: A Platform for Black-Box Optimization
Yoshihiko Ozaki, Shuhei Watanabe, Toshihiko Yanase
Comments: Submitted to Journal of machine learning research
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[49] arXiv:2510.02779 [pdf, html, other]
Title: Optimal Rates for Generalization of Gradient Descent for Deep ReLU Classification
Yuanfan Li, Yunwen Lei, Zheng-Chu Guo, Yiming Ying
Comments: Accepted at NeurIPS 2025. Camera-ready version to appear
Subjects: Machine Learning (cs.LG)
[50] arXiv:2510.02768 [pdf, html, other]
Title: A Granular Study of Safety Pretraining under Model Abliteration
Shashank Agnihotri, Jonas Jakubassa, Priyam Dey, Sachin Goyal, Bernt Schiele, Venkatesh Babu Radhakrishnan, Margret Keuper
Comments: Accepted at NeurIPS 2025 bWorkshop Lock-LLM. *Equal Contribution
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
Total of 1471 entries : 1-50 51-100 101-150 151-200 ... 1451-1471
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack