전체 AI 논문 - 2026-05-04

1. Position: agentic AI orchestration should be Bayes-consistent


2. To Call or Not to Call: A Framework to Assess and Optimize LLM Tool Calling


3. Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding


4. Instance-Aware Parameter Configuration in Bilevel Late Acceptance Hill Climbing for the Electric Capacitated Vehicle Routing Problem


5. On the Role of Artificial Intelligence in Human-Machine Symbiosis


6. Thinking in Text and Images: Interleaved Vision–Language Reasoning Traces for Long-Horizon Robot Manipulation


7. AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning


8. Physically Native World Models: A Hamiltonian Perspective on Generative World Modeling


9. AgentFloor: How Far Up the tool use Ladder Can Small Open-Weight Models Go?


10. Token Arena: A Continuous Benchmark Unifying Energy and Cognition in AI Inference


11. Agentic AI for Trip Planning Optimization Application


12. Causal Foundations of Collective Agency


13. ARMOR 2025: A Military-Aligned Benchmark for Evaluating Large Language Model Safety Beyond Civilian Contexts


14. TUR-DPO: Topology- and Uncertainty-Aware Direct Preference Optimization


15. Are Tools All We Need? Unveiling the Tool-Use Tax in LLM Agents


16. Minimal, Local, Causal Explanations for Jailbreak Success in Large Language Models


17. AgentReputation: A Decentralized Agentic AI Reputation Framework


18. TADI: Tool-Augmented Drilling Intelligence via Agentic LLM Orchestration over Heterogeneous Wellsite Data


19. Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs


20. Can Coding Agents Reproduce Findings in Computational Materials Science?


21. When RAG Chatbots Expose Their Backend: An Anonymized Case Study of Privacy and Security Risks in Patient-Facing Medical AI


22. Unsupervised Denoising of Real Clinical Low Dose Liver CT with Perceptual Attention Networks


23. Make Your LVLM KV Cache More Lightweight


24. GeoContra: From Fluent GIS Code to Verifiable Spatial Analysis with Geography-Grounded Repair


25. Directed Social Regard: Surfacing Targeted Advocacy, Opposition, Aid, Harms, and Victimization in Online Media


26. Meritocratic Fairness in Budgeted Combinatorial Multi-armed Bandits via Shapley Values


27. EASE: Federated Multimodal Unlearning via Entanglement-Aware Anchor Closure


28. Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment


29. Towards Improving Speaker Distance Estimation through Generative Impulse Response Augmentation


30. Augmented Lagrangian Multiplier Network for State-wise Safety in Reinforcement Learning


31. InpaintSLat: Inpainting Structured 3D Latents via Initial Noise Optimization


32. Reinforcement Learning with Markov Risk Measures and Multipattern Risk Approximation


33. AdaMeZO: Adam-style Zeroth-Order Optimizer for LLM Fine-tuning Without Maintaining the Moments


34. Learning Multimodal Energy-Based Model with Multimodal Variational Auto-Encoder via MCMC Revision


35. Born-Qualified: An Autonomous Framework for Deploying Advanced Energy and Electronic Materials


36. BlenderRAG: High-Fidelity 3D Object Generation via Retrieval-Augmented Code Synthesis


37. Possibilistic Predictive Uncertainty for Deep Learning


38. Fairness of Classifiers in the Presence of Constraints between Features


39. Jailbreaking Vision-Language Models Through the Visual Modality


40. AI Washing Inflates Expected Performance but Not Interaction Outcomes: An AI Placebo Study Using Fitts’ Law


41. Structure Liberates: How Constrained Sensemaking Produces More Novel Research Output


42. Linking Behaviour and Perception to Evaluate Meaningful Human Control over Partially Automated Driving


43. A11y-Compressor: A Framework for Enhancing the Efficiency of GUI Agent Observations through Visual Context Reconstruction and Redundancy Reduction


44. Beyond Continuity: Simulation-free Reconstruction of Discrete Branching Dynamics from Single-cell Snapshots


45. Hierarchical Abstract Tree for Cross-Document Retrieval-Augmented Generation


46. SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters


47. Silicon Showdown: Performance, Efficiency, and Ecosystem Barriers in Consumer-Grade LLM Inference


48. Space Network of Experts: Architecture and Expert Placement


49. LLM-Oriented Information Retrieval: A Denoising-First Perspective


50. “What Are You Really Trying to Do?”: Co-Creating Life Goals from Everyday Computer Use


51. Scalable Context-Aware Graph Attention for Unsupervised Anomaly Detection in Large-Scale Mobile Networks


52. PAMod: Modeling Cyclical Shifts via Phase-Amplitude Modulation for Non-stationary Time Series Forecasting


53. Adaptation of AI-accelerated CFD Simulations to the IPU platform


54. Impact of Task Phrasing on Presumptions in Large Language Models


55. Escaping Mode Collapse in LLM Generation via Geometric Regulation


56. Improving LLM Code Generation via Requirement-Aware Curriculum Reinforcement Learning


57. Skills as Verifiable Artifacts: A Trust Schema and a Biconditional Correctness Criterion for Human-in-the-Loop Agent Runtimes


58. BWLA: Breaking the Barrier of W1AX Post-Training Quantization for LLMs


59. RadLite: Multi-Task LoRA Fine-Tuning of Small Language Models for CPU-Deployable Radiology AI


60. Trees to Flows and Back: Unifying Decision Trees and Diffusion Models


61. Agent Capsules: Quality-Gated Granularity Control for Multi-Agent LLM Pipelines


62. Scalable Learning in Structured Recurrent Spiking Neural Networks without Backpropagation


63. Social Bias in LLM-Generated Code: Benchmark and Mitigation


64. GaMMA: Towards Joint Global-Temporal Music Understanding in Large Multimodal Models


65. AlphaInventory: Evolving White-Box Inventory Policies via Large Language Models with Deployment Guarantees


66. Pedagogical Promise and Peril of AI: A Text Mining Analysis of ChatGPT Research Discussions in Programming Education


67. MemRouter: Memory-as-Embedding Routing for Long-Term Conversational Agents


68. VQ-SAD: Vector Quantized Structure Aware Diffusion For Molecule Generation


69. Hypergraph and Latent ODE Learning for Multimodal Root Cause Localization in Microservices


70. Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning


71. AI Adoption Among Teachers: Insights on Concerns, Support, Confidence, and Attitudes


72. Budget-Aware Routing for Long Clinical Text


73. DynamicPO: Dynamic Preference Optimization for Recommendation


74. Unbox Responsible GeoAI: Navigating Climate Extreme and Disaster Mapping


75. Semia: Auditing Agent Skills via Constraint-Guided Representation Synthesis


76. Beyond Structure: Revolutionising Materials Discovery via AI-Driven Synthesis Protocol-Property Relationships


77. Beyond Visual Fidelity: Benchmarking Super-Resolution Models for Large-Scale Remote Sensing Imagery via Downstream Task Integration


78. Caracal: Causal Architecture via Spectral Mixing


79. When Do Diffusion Models learn to Generate Multiple Objects?


80. REALM: An RGB and Event Aligned Latent Manifold for Cross-Modal Perception


81. Are You the A-hole? A Fair, Multi-Perspective Ethical Reasoning Framework


82. Jailbroken Frontier Models Retain Their Capabilities


83. Retrieval-Augmented Reasoning for Chartered Accountancy


84. Remote SAMsing: From Segment Anything to Segment Everything


85. Rethinking Network Topologies for Cost-Effective Mixture-of-Experts LLM Serving


86. MAEPose: Self-Supervised Spatiotemporal Learning for Human Pose Estimation on mmWave Video


87. Attention Is Where You Attack



89. RSAT: Structured Attribution Makes Small Language Models Faithful Table Reasoners


90. The $\textit{Silicon Society}$ Cookbook: Design Space of LLM-based Social Simulations


91. Fair Dataset Distillation via Cross-Group Barycenter Alignment


92. Smart Profit-Aware Crop Advisory System: Kisan AI


93. Cultural Benchmarking of LLMs in Standard and Dialectal Arabic Dialogues



95. How Frontier LLMs Adapt to Neurodivergence Context: A Measurement Framework for Surface vs. Structural Change in System-Prompted Responses


96. AIDA-ReID: Adaptive Intermediate Domain Adaptation for Generalizable and Source-Free Person Re-Identification


97. DeGenTWeb: A First Look at LLM-dominant Websites


98. NorBERTo: A ModernBERT Model Trained for Portuguese with 331 Billion Tokens Corpus


99. Hyperspherical Forward-Forward with Prototypical Representations


100. CRC-Screen: Certified DNA-Synthesis Hazard Screening Under Taxonomic Shift


101. XekRung Technical Report


102. Compliance-Aware Agentic Payments on Stablecoin Rails


103. Human-in-the-Loop Meta Bayesian Optimization for Fusion Energy and Scientific Applications


104. A Survey of Reasoning-Intensive Retrieval: Progress and Challenges


105. Dynamic-TD3: A Novel Algorithm for UAV Path Planning with Dynamic Obstacle Trajectory Prediction


106. Smart Ensemble Learning Framework for Predicting Groundwater Heavy Metal Pollution


107. Ambient Persuasion in a Deployed AI Agent: Unauthorized Escalation Following Routine Non-Adversarial Content Exposure


108. SiriusHelper: An LLM Agent-Based Operations Assistant for Big Data Platforms


109. Sure About That Line? Approaching Confidence-Based, Real-Time Line Assignment in Reading Gaze Data


110. Putting HUMANS first: Efficient LAM Evaluation with Human Preference Alignment


111. AirFM-DDA: Air-Interface Foundation Model in the Delay-Doppler-Angle Domain for AI-Native 6G


112. TimeRFT: Stimulating Generalizable Time Series Forecasting for TSFMs via Reinforcement Finetuning


113. Exploring LLM biases to manipulate AI search overview


114. FedACT: Concurrent Federated Intelligence across Heterogeneous Data Sources


115. Mean-Field Path-Integral Diffusion: From Samples to Interacting Agents


116. Cloud Is Closer Than It Appears: Revisiting the Tradeoffs of Distributed Real-Time Inference


117. Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation