[arXiv Digest] 2025-07-02

1. Enhancing LLM Agent Safety via Causal Influence Prompting

2. Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact

3. SafeMobile: Chain-level Jailbreak Detection and Automated Evaluation for Multimodal Mobile Agents

4. A Robust Algorithm for Non-IID Machine Learning Problems with Convergence Analysis

5. Can Large Language Models Develop Strategic Reasoning? Post-training Insights from Learning Chess

6. Advancing Local Search in SMT-NRA with MCSAT Integration

7. Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

8. ASTRO: Teaching Language Models to Reason by Reflecting and Backtracking In-Context

9. GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

10. Description of the Training Process of Neural Networks via Ergodic Theorem : Ghost nodes

11. SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks

12. Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations

13. Reasoning as an Adaptive Defense for Safety

14. Surgical Neural Radiance Fields from One Image

15. MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

16. From Sentences to Sequences: Rethinking Languages in Biological System

17. WebArXiv: Evaluating Multimodal Agents on Time-Invariant arXiv Tasks

18. Large Language Model Powered Intelligent Urban Agents: Concepts, Capabilities, and Applications

19. Turning AI Data Centers into Grid-Interactive Assets: Results from a Field Demonstration in Phoenix, Arizona

20. The Age of Sensorial Zero Trust: Why We Can No Longer Trust Our Senses

21. Deep learning-based segmentation of T1 and T2 cardiac MRI maps for automated disease detection

22. Constellation as a Service: Tailored Connectivity Management in Direct-Satellite-to-Device Networks

23. MemeCMD: An Automatically Generated Chinese Multi-turn Dialogue Dataset with Contextually Retrieved Memes

24. NN-Former: Rethinking Graph Structure in Neural Architecture Representation

25. Stylometry recognizes human and LLM-generated texts in short samples

26. HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

27. Automated anatomy-based post-processing reduces false positives and improved interpretability of deep learning intracranial aneurysm detection

28. CAVALRY-V: A Large-Scale Generator Framework for Adversarial Attacks on Video MLLMs

29. PI-WAN: A Physics-Informed Wind-Adaptive Network for Quadrotor Dynamics Prediction in Unknown Environments

30. Many LLMs Are More Utilitarian Than One

31. LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling

32. Echoes of AI: Investigating the Downstream Effects of AI Assistants on Software Maintainability

33. LitBench: A Benchmark and Dataset for Reliable Evaluation of Creative Writing

34. LearnAFE: Circuit-Algorithm Co-design Framework for Learnable Audio Analog Front-End

35. TopoStreamer: Temporal Lane Segment Topology Reasoning in Autonomous Driving

36. Audio-3DVG: Unified Audio - Point Cloud Fusion for 3D Visual Grounding

37. SAFER: Probing Safety in Reward Models with Sparse Autoencoder

38. MTCNet: Motion and Topology Consistency Guided Learning for Mitral Valve Segmentationin 4D Ultrasound

39. Generative Exaggeration in LLM Social Agents: Consistency, Bias, and Toxicity

40. Cognitive Load-Aware Inference: A Neuro-Symbolic Framework for Optimizing the Token Economy of Large Language Models

41. Horus: A Protocol for Trustless Delegation Under Uncertainty

42. Physics-Informed Neural ODEs for Temporal Dynamics Modeling in Cardiac T1 Mapping

43. Residual Reward Models for Preference-based Reinforcement Learning

44. Mixture of Reasonings: Teach Large Language Models to Reason with Adaptive Strategies

45. High-resolution spatial memory requires grid-cell-like neural codes

46. Quantum Circuit Structure Optimization for Quantum Reinforcement Learning

47. AI-Generated Video Detection via Perceptual Straightening

48. TUM-MiKaNi at SemEval-2025 Task 3: Towards Multilingual and Knowledge-Aware Non-factual Hallucination Identification

49. BadViM: Backdoor Attack against Vision Mamba

50. Inverse Design in Nanophotonics via Representation Learning

51. Not All Attention Heads Are What You Need: Refining CLIP’s Image Representation with Attention Ablation

52. Rethinking Group Recommender Systems in the Era of Generative AI: From One-Shot Recommendations to Agentic Group Decision Support

53. Box-QAymo: Box-Referring VQA Dataset for Autonomous Driving

54. Customer Service Representative’s Perception of the AI Assistant in an Organization’s Call Center

56. Visual Anagrams Reveal Hidden Differences in Holistic Shape Processing Across Vision Models

57. Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms

58. PNAct: Crafting Backdoor Attacks in Safe Reinforcement Learning

59. Physics-Aware Style Transfer for Adaptive Holographic Reconstruction

60. Diversity Conscious Refined Random Forest

61. Novel Complex-Valued Hopfield Neural Networks with Phase and Magnitude Quantization

62. Process-aware and high-fidelity microstructure generation using stable diffusion

63. ATSTrack: Enhancing Visual-Language Tracking by Aligning Temporal and Spatial Scales

64. Best Agent Identification for General Game Playing

65. Iterative Distillation for Reward-Guided Fine-Tuning of Diffusion Models in Biomolecular Design

66. Novel Pigeon-inspired 3D Obstacle Detection and Avoidance Maneuver for Multi-UAV Systems

67. A Recipe for Causal Graph Regression: Confounding Effects Revisited

68. RoboEval: Where Robotic Manipulation Meets Structured and Scalable Evaluation

69. Geological Everything Model 3D: A Promptable Foundation Model for Unified and Zero-Shot Subsurface Understanding

70. Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs

71. Augmenting Molecular Graphs with Geometries via Machine Learning Interatomic Potentials

72. iPanda: An Intelligent Protocol Testing and Debugging Agent for Conformance Testing

73. Data-Driven Exploration for a Class of Continuous-Time Linear–Quadratic Reinforcement Learning Problems

74. CGEarthEye:A High-Resolution Remote Sensing Vision Foundation Model Based on the Jilin-1 Satellite Constellation

75. An AST-guided LLM Approach for SVRF Code Synthesis

76. VTS-Guided AI Interaction Workflow for Business Insights

77. Training for X-Ray Vision: Amodal Segmentation, Amodal Content Completion, and View-Invariant Object Representation from Multi-Camera Video