Computer Vision and Pattern Recognition

Authors and titles for October 2025

Total of 312 entries; showing entries 1-50.
[1] arXiv:2510.00033 [pdf, html, other]
Title: Hybrid Deep Learning for Hyperspectral Single Image Super-Resolution
Usman Muhammad, Jorma Laaksonen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2] arXiv:2510.00034 [pdf, other]
Title: Review of Hallucination Understanding in Large Language and Vision Models
Zhengyi Ho, Siyuan Liang, Dacheng Tao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[3] arXiv:2510.00037 [pdf, html, other]
Title: On Robustness of Vision-Language-Action Model against Multi-Modal Perturbations
Jianing Guo, Zhenhong Wu, Chang Tu, Yiyao Ma, Xiangqi Kong, Zhiqian Liu, Jiaming Ji, Shuning Zhang, Yuanpei Chen, Kai Chen, Xianglong Liu, Qi Dou, Yaodong Yang, Huijie Zhao, Weifeng Lv, Simin Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[4] arXiv:2510.00040 [pdf, html, other]
Title: Uncovering Intrinsic Capabilities: A Paradigm for Data Curation in Vision-Language Models
Junjie Li, Ziao Wang, Jianghong Ma, Xiaofeng Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[5] arXiv:2510.00041 [pdf, html, other]
Title: Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness
Yuchen Song, Andong Chen, Wenxin Zhu, Kehai Chen, Xuefeng Bai, Muyun Yang, Tiejun Zhao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[6] arXiv:2510.00045 [pdf, html, other]
Title: Beyond the Prompt: Gender Bias in Text-to-Image Models, with a Case Study on Hospital Professions
Franck Vandewiele, Remi Synave, Samuel Delepoulle, Remi Cozot
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[7] arXiv:2510.00046 [pdf, html, other]
Title: Reinforcement Learning-Based Prompt Template Stealing for Text-to-Image Models
Xiaotian Zou
Comments: 10 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[8] arXiv:2510.00047 [pdf, html, other]
Title: Explanation-Driven Counterfactual Testing for Faithfulness in Vision-Language Model Explanations
Sihao Ding, Santosh Vasa, Aditi Ramadwar
Comments: NeurIPS 2025 workshop on Regulatable ML
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[9] arXiv:2510.00054 [pdf, html, other]
Title: HiDe: Rethinking The Zoom-IN method in High Resolution MLLMs via Hierarchical Decoupling
Xianjie Liu, Yiman Hu, Yixiong Zou, Liang Wu, Jian Xu, Bo Zheng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[10] arXiv:2510.00059 [pdf, html, other]
Title: FSDENet: A Frequency and Spatial Domains based Detail Enhancement Network for Remote Sensing Semantic Segmentation
Jiahao Fu, Yinfeng Yu, Liejun Wang
Comments: Accepted for publication by IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[11] arXiv:2510.00060 [pdf, html, other]
Title: Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving
Sheng Yang, Tong Zhan, Guancheng Chen, Yanfeng Lu, Jian Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[12] arXiv:2510.00062 [pdf, html, other]
Title: Efficient CNN Compression via Multi-method Low Rank Factorization and Feature Map Similarity
M. Kokhazadeh (1), G. Keramidas (1), V. Kelefouras (2) ((1) Aristotle University of Thessaloniki, Thessaloniki, Greece, (2) University of Plymouth, Plymouth, UK)
Comments: 14 pages, 17 figures, This work has been submitted to the IEEE for possible publication (IEEE Transactions on Artificial Intelligence)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[13] arXiv:2510.00067 [pdf, html, other]
Title: Intelligent 5S Audit: Application of Artificial Intelligence for Continuous Improvement in the Automotive Industry
Rafael da Silva Maciel, Lucio Veraldo Jr
Comments: 8 pages, 5 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[14] arXiv:2510.00069 [pdf, html, other]
Title: OIG-Bench: A Multi-Agent Annotated Benchmark for Multimodal One-Image Guides Understanding
Jiancong Xie, Wenjin Wang, Zhuomeng Zhang, Zihan Liu, Qi Liu, Ke Feng, Zixun Sun, Yuedong Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[15] arXiv:2510.00072 [pdf, html, other]
Title: Geo-R1: Unlocking VLM Geospatial Reasoning with Cross-View Reinforcement Learning
Chenhui Xu, Fuxun Yu, Michael J. Bianco, Jacob Kovarskiy, Raphael Tang, Qi Zhang, Zirui Xu, Will LeVine, Brandon Dubbs, Heming Liao, Cassandra Burgess, Suvam Bag, Jay Patravali, Rupanjali Kukal, Mikael Figueroa, Rishi Madhok, Nikolaos Karianakis, Jinjun Xiong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[16] arXiv:2510.00083 [pdf, html, other]
Title: Enhancing Certifiable Semantic Robustness via Robust Pruning of Deep Neural Networks
Hanjiang Hu, Bowei Li, Ziwei Wang, Tianhao Wei, Casidhe Hutchison, Eric Sample, Changliu Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[17] arXiv:2510.00148 [pdf, html, other]
Title: Improved Hyperspectral Anomaly Detection via Unsupervised Subspace Modeling in the Signed Cumulative Distribution Transform Domain
Abu Hasnat Mohammad Rubaiyat, Jordan Vincent, Colin Olson
Comments: 8 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[18] arXiv:2510.00293 [pdf, html, other]
Title: MOLM: Mixture of LoRA Markers
Samar Fares, Nurbek Tastan, Noor Hussein, Karthik Nandakumar
Comments: 21 pages, 11 figures, Under review at ICLR 2026
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[19] arXiv:2510.00303 [pdf, html, other]
Title: Looking Beyond the Known: Towards a Data Discovery Guided Open-World Object Detection
Anay Majee, Amitesh Gangrade, Rishabh Iyer
Comments: Accepted to NeurIPS'25. 22 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[20] arXiv:2510.00376 [pdf, html, other]
Title: Discrete Wavelet Transform as a Facilitator for Expressive Latent Space Representation in Variational Autoencoders in Satellite Imagery
Arpan Mahara, Md Rezaul Karim Khan, Naphtali Rishe, Wenjia Wang, Seyed Masoud Sadjadi
Comments: 6 pages, 3 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[21] arXiv:2510.00405 [pdf, html, other]
Title: EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy Observations
Jiayi Liu, Jiaming Zhou, Ke Ye, Kun-Yu Lin, Allan Wang, Junwei Liang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[22] arXiv:2510.00411 [pdf, html, other]
Title: Does Bigger Mean Better? Comparitive Analysis of CNNs and Biomedical Vision Language Modles in Medical Diagnosis
Ran Tong, Jiaqi Liu, Su Liu, Jiexi Xu, Lanruo Wang, Tong Wang
Comments: 6pages,3 this http URL review of International Conference on Artificial Intelligence, Computer, Data Sciences and Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[23] arXiv:2510.00413 [pdf, html, other]
Title: PAL-UI: Planning with Active Look-back for Vision-Based GUI Agents
Zikang Liu, Junyi Li, Wayne Xin Zhao, Dawei Gao, Yaliang Li, Ji-rong Wen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2510.00416 [pdf, html, other]
Title: Domain-Specialized Interactive Segmentation Framework for Meningioma Radiotherapy Planning
Junhyeok Lee, Han Jang, Kyu Sung Choi
Comments: Clinical Image-Based Procedures (CLIP 2025), MICCAI 2025 Workshop
Journal-ref: Lecture Notes in Computer Science, vol 16126. Springer, Cham (2026)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[25] arXiv:2510.00438 [pdf, html, other]
Title: BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration
Zhaoyang Li, Dongjun Qian, Kai Su, Qishuai Diao, Xiangyang Xia, Chang Liu, Wenfei Yang, Tianzhu Zhang, Zehuan Yuan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[26] arXiv:2510.00454 [pdf, html, other]
Title: Measuring and Controlling the Spectral Bias for Self-Supervised Image Denoising
Wang Zhang, Huaqiu Li, Xiaowan Hu, Tao Jiang, Zikang Chen, Haoqian Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[27] arXiv:2510.00458 [pdf, html, other]
Title: VLOD-TTA: Test-Time Adaptation of Vision-Language Object Detectors
Atif Belal, Heitor R. Medeiros, Marco Pedersoli, Eric Granger
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[28] arXiv:2510.00483 [pdf, html, other]
Title: MathSticks: A Benchmark for Visual Symbolic Compositional Reasoning with Matchstick Puzzles
Yuheng Ji, Huajie Tan, Cheng Chi, Yijie Xu, Yuting Zhao, Enshen Zhou, Huaihai Lyu, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang, Xiaolong Zheng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[29] arXiv:2510.00495 [pdf, html, other]
Title: Normal-Abnormal Guided Generalist Anomaly Detection
Yuexin Wang, Xiaolei Wang, Yizheng Gong, Jimin Xiao
Comments: Accepted by NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[30] arXiv:2510.00500 [pdf, html, other]
Title: Relative-Absolute Fusion: Rethinking Feature Extraction in Image-Based Iterative Method Selection for Solving Sparse Linear Systems
Kaiqi Zhang, Mingguan Yang, Dali Chang, Chun Chen, Yuxiang Zhang, Kexun He, Jing Zhao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[31] arXiv:2510.00506 [pdf, html, other]
Title: Affordance-Guided Diffusion Prior for 3D Hand Reconstruction
Naru Suzuki, Takehiko Ohkawa, Tatsuro Banno, Jihyun Lee, Ryosuke Furuta, Yoichi Sato
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32] arXiv:2510.00515 [pdf, other]
Title: Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Zichen Wen, Shaobo Wang, Yufa Zhou, Junyuan Zhang, Qintong Zhang, Yifeng Gao, Zhaorun Chen, Bin Wang, Weijia Li, Conghui He, Linfeng Zhang
Comments: Accepted by NeurIPS 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[33] arXiv:2510.00520 [pdf, html, other]
Title: CardioBench: Do Echocardiography Foundation Models Generalize Beyond the Lab?
Darya Taratynova, Ahmed Aly, Numan Saeed, Mohammad Yaqub
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34] arXiv:2510.00527 [pdf, html, other]
Title: Cascaded Diffusion Framework for Probabilistic Coarse-to-Fine Hand Pose Estimation
Taeyun Woo, Jinah Park, Tae-Kyun Kim
Comments: 15 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35] arXiv:2510.00547 [pdf, html, other]
Title: Forestpest-YOLO: A High-Performance Detection Framework for Small Forestry Pests
Aoduo Li, Peikai Lin, Jiancheng Li, Zhen Zhang, Shiting Wu, Zexiao Liang, Zhifa Jiang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[36] arXiv:2510.00561 [pdf, html, other]
Title: Assessing Foundation Models for Mold Colony Detection with Limited Training Data
Henrik Pichler, Janis Keuper, Matthew Copping
Comments: 17 pages, 2 figures, accepted as oral presentation at GCPR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37] arXiv:2510.00570 [pdf, html, other]
Title: Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning
Minghao Yang, Ren Togo, Guang Li, Takahiro Ogawa, Miki Haseyama
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[38] arXiv:2510.00578 [pdf, other]
Title: Arbitrary Generative Video Interpolation
Guozhen Zhang, Haiguang Wang, Chunyu Wang, Yuan Zhou, Qinglin Lu, Limin Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[39] arXiv:2510.00584 [pdf, html, other]
Title: Color Models in Image Processing: A Review and Experimental Comparison
Muragul Muratbekova, Nuray Toganas, Ayan Igali, Maksat Shagyrov, Elnara Kadyrgali, Adilet Yerkin, Pakizar Shamoi
Comments: This manuscript has been submitted to Scientific Reports for consideration
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[40] arXiv:2510.00592 [pdf, html, other]
Title: Multi-level Dynamic Style Transfer for NeRFs
Zesheng Li, Shuaibo Li, Wei Ma, Jianwei Guo, Hongbin Zha
Comments: Accepted by Computational Visual Media Journal (CVMJ)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[41] arXiv:2510.00603 [pdf, other]
Title: LVLMs as inspectors: an agentic framework for category-level structural defect annotation
Sheng Jiang, Yuanmin Ning, Bingxi Huang, Peiyin Chen, Zhaohui Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[42] arXiv:2510.00604 [pdf, html, other]
Title: Disentangling Foreground and Background for vision-Language Navigation via Online Augmentation
Yunbo Xu, Xuesong Zhang, Jia Li, Zhenzhen Hu, Richang Hong
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[43] arXiv:2510.00618 [pdf, html, other]
Title: Robust Context-Aware Object Recognition
Klara Janouskova, Cristian Gavrus, Jiri Matas
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[44] arXiv:2510.00624 [pdf, html, other]
Title: UCD: Unconditional Discriminator Promotes Nash Equilibrium in GANs
Mengfei Xia, Nan Xue, Jiapeng Zhu, Yujun Shen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[45] arXiv:2510.00633 [pdf, other]
Title: Virtual Fashion Photo-Shoots: Building a Large-Scale Garment-Lookbook Dataset
Yannick Hauri, Luca A. Lanzendörfer, Till Aczel
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[46] arXiv:2510.00634 [pdf, html, other]
Title: LAKAN: Landmark-assisted Adaptive Kolmogorov-Arnold Network for Face Forgery Detection
Jiayao Jiang, Siran Peng, Bin Liu, Qi Chu, Nenghai Yu
Comments: 5 pages, 3 figures. This work has been submitted to the IEEE for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[47] arXiv:2510.00635 [pdf, html, other]
Title: Erased, But Not Forgotten: Erased Rectified Flow Transformers Still Remain Unsafe Under Concept Attack
Nanxiang Jiang, Zhaoxin Fan, Enhan Kang, Daiheng Gao, Yun Zhou, Yanxia Chang, Zheng Zhu, Yeying Jin, Wenjun Wu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48] arXiv:2510.00651 [pdf, html, other]
Title: FIN: Fast Inference Network for Map Segmentation
Ruan Bispo, Tim Brophy, Reenu Mohandas, Anthony Scanlan, Ciarán Eising
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[49] arXiv:2510.00652 [pdf, html, other]
Title: OTTER: Open-Tagging via Text-Image Representation for Multi-modal Understanding
Jieer Ouyang, Xiaoneng Xiang, Zheng Wang, Yangkai Ding
Comments: Accepted at ICDM 2025 BigIS Workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2510.00654 [pdf, other]
Title: Weakly Supervised Cloud Detection Combining Spectral Features and Multi-Scale Deep Network
Shaocong Zhu, Zhiwei Li, Xinghua Li, Huanfeng Shen
Subjects: Computer Vision and Pattern Recognition (cs.CV)