[arXiv Digest] 2025-07-24

1. Online Submission and Evaluation System Design for Competition Operations

2. Thinking Isn’t an Illusion: Overcoming the Limitations of Reasoning Models via Tool Augmentations

3. Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks

4. Simulating multiple human perspectives in socio-ecological systems using large language models

5. Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning

6. TAI Scan Tool: A RAG-Based Tool With Minimalistic Input for Trustworthy AI Self-Assessment

7. Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

8. Automated Hybrid Grounding Using Structural and Data-Driven Heuristics

9. CQE under Epistemic Dependencies: Algorithms and Experiments (extended version)

10. LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning

11. An Uncertainty-Driven Adaptive Self-Alignment Framework for Large Language Models

12. Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning

13. Compliance Brain Assistant: Conversational Agentic AI for Assisting Compliance Tasks in Enterprise Environments

14. Students’ Feedback Requests and Interactions with the SCRIPT Chatbot: Do They Get What They Ask For?

15. Agent Identity Evals: Measuring Agentic Identity

16. Our Cars Can Talk: How IoT Brings AI to Vehicles

17. Improving LLMs’ Generalized Reasoning Abilities by Graph Problems

18. HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study

19. Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility

20. Pretraining on the Test Set Is No Longer All You Need: A Debate-Driven Approach to QA Benchmarks

21. Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

22. Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

23. Yume: An Interactive World Generation Model

24. Flow Matching Meets Biology and Life Science: A Survey

25. On the Interaction of Compressibility and Adversarial Robustness

26. AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer

27. From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes

28. CASCADE: LLM-Powered JavaScript Deobfuscator at Google

29. How Should We Meta-Learn Reinforcement Learning Algorithms?

30. Vision Transformer attention alignment with human visual perception in aesthetic object evaluation

31. PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving

32. Enhancing Quantum Federated Learning with Fisher Information-Based Optimization

33. Federated Majorize-Minimization: Beyond Parameter Aggregation

34. Integrating Physics-Based and Data-Driven Approaches for Probabilistic Building Energy Modeling

35. Enabling Cyber Security Education through Digital Twins and Generative AI

36. HOTA: Hamiltonian framework for Optimal Transport Advection

37. To Trust or Not to Trust: On Calibration in ML-based Resource Allocation for Wireless Networks

38. Unsupervised anomaly detection using Bayesian flow networks: application to brain FDG PET in the context of Alzheimer’s disease

39. MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs

40. BGM-HAN: A Hierarchical Attention Network for Accurate and Fair Decision Assessment on Semi-Structured Profiles

41. Demonstration of Efficient Predictive Surrogates for Large-scale Quantum Processors

42. Probing Vision-Language Understanding through the Visual Entailment Task: promises and pitfalls

43. Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning

44. IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception

45. Each to Their Own: Exploring the Optimal Embedding in RAG

46. Fair Compromises in Participatory Budgeting: a Multi-Agent Deep Reinforcement Learning Approach

47. Content-based 3D Image Retrieval and a ColBERT-inspired Re-ranking for Tumor Flagging and Staging

48. Millions of $\text{GeAR}$-s: Extending GraphRAG to Millions of Documents

49. HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs

50. Investigating Training Data Detection in AI Coders

51. SFUOD: Source-Free Unknown Object Detection

52. DynaSearcher: Dynamic Knowledge Graph Augmented Search Agent via Multi-Reward Reinforcement Learning

53. Swin-TUNA : A Novel PEFT Approach for Accurate Food Image Segmentation

54. Temporal Point-Supervised Signal Reconstruction: A Human-Annotation-Free Framework for Weak Moving Target Detection

56. Confounded Causal Imitation Learning with Instrumental Variables

57. A Versatile Pathology Co-pilot via Reasoning Enhanced Multimodal Large Language Model

58. On Temporal Guidance and Iterative Refinement in Audio Source Separation

59. Integrating Belief Domains into Probabilistic Logic Programs

60. Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance

61. Understanding Prompt Programming Tasks and Questions

62. Reality Proxy: Fluid Interactions with Real-World Objects in MR via Abstract Representations

63. DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs

64. Eco-Friendly AI: Unleashing Data Power for Green Federated Learning

65. A Highly Clean Recipe Dataset with Ingredient States Annotation for State Probing Task

66. P3SL: Personalized Privacy-Preserving Split Learning on Heterogeneous Edge Devices

67. HuiduRep: A Robust Self-Supervised Framework for Learning Neural Representations from Extracellular Spikes

68. The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models

69. DesignLab: Designing Slides Through Iterative Detection and Correction

70. Dispatch-Aware Deep Neural Network for Optimal Transmission Switching: Toward Real-Time and Feasibility Guaranteed Operation

71. LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks

72. Asymmetric Lesion Detection with Geometric Patterns and CNN-SVM Classification

73. Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria

74. SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs

75. Tabular Diffusion based Actionable Counterfactual Explanations for Network Intrusion Detection

76. JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction

77. ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

78. Towards Human-level Intelligence via Human-like Whole-Body Manipulation

79. SADA: Stability-guided Adaptive Diffusion Acceleration

80. Resilient Multi-Agent Negotiation for Medical Supply Chains:Integrating LLMs and Blockchain for Transparent Coordination

81. Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance

82. BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference Serving

83. Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models

84. Weather-Aware AI Systems versus Route-Optimization AI: A Comprehensive Analysis of AI Applications in Transportation Productivity