전체 AI 논문 - 2026-04-06

1. Coupled Control, Structured Memory, and Verifiable Action in Agentic AI (SCRAT – Stochastic Control with Retrieval and Auditable Trajectories): A Comparative Perspective from Squirrel Locomotion and Scatter-Hoarding


2. Chart-RL: Policy Optimization Reinforcement Learning for Enhanced Visual Reasoning in Chart Question Answering with Vision Language Models


3. Automatic Textbook Formalization


4. Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?


5. InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking


6. FoE: Forest of Errors Makes the First Solution the Best in Large Reasoning Models


7. AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents


8. Analysis of Optimality of Large Language Models on Planning Problems


9. Multi-Turn Reinforcement Learning for Tool-Calling Agents with Iterative Reward Calibration


10. EMS: Multi-Agent Voting via Efficient Majority-then-Stopping


11. ESL-Bench: An Event-Driven Synthetic Longitudinal Benchmark for Health Agents


12. CharTool: Tool-Integrated Visual Reasoning for Chart Understanding


13. Improving Role Consistency in Multi-Agent Collaboration via Quantitative Role Clarity


14. Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents


15. DeltaLogic: Minimal Premise Edits Reveal Belief-Revision Failures in Logical Reasoning Models


16. GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning


17. Let’s Have a Conversation: Designing and Evaluating LLM Agents for Interactive Optimization


18. OntoKG: Ontology-Oriented Knowledge Graph Construction with Intrinsic-Relational Routing


19. AutoVerifier: An Agentic Automated Verification Framework Using Large Language Models


20. Do Audio-Visual Large Language Models Really See and Hear?


21. Mitigating LLM biases toward spurious social contexts using direct preference optimization


22. Competency Questions as Executable Plans: a Controlled RAG Architecture for Cultural Heritage Storytelling


23. Interpretable Deep Reinforcement Learning for Element-level Bridge Life-cycle Optimization


24. A Comprehensive Framework for Long-Term Resiliency Investment Planning under Extreme Weather Uncertainty for Electric Utilities


25. I must delete the evidence: AI Agents Explicitly Cover up Fraud and Violent Crime


26. AIVV: Neuro-Symbolic LLM Agent-Integrated Verification and Validation for Trustworthy Autonomous Systems


27. Understanding the Nature of Generative AI as Threshold Logic in High-Dimensional Space


28. Compositional Neuro-Symbolic Reasoning


29. Xpertbench: Expert Level Tasks with Rubrics-Based Evaluation


30. Holos: A Web-Scale LLM-Based Multi-Agent System for the Agentic Web


31. Enhancing Robustness of Federated Learning via Server Learning


32. PR3DICTR: A modular AI framework for medical 3D image-based detection and outcome prediction


33. Reliability Gated Multi-Teacher Distillation for Low Resource Abstractive Summarization


34. Gradient Boosting within a Single Attention Layer


35. Reflective Context Learning: Studying the Optimization Primitives of Context Space


36. Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models


37. Beyond the Parameters: A Technical Survey of Contextual Enrichment in Large Language Models: From In-Context Prompting to Causal Retrieval-Augmented Generation


38. Valence-Arousal Subspace in LLMs: Circular Emotion Geometry and Multi-Behavioral Control


39. InCoder-32B-Thinking: Industrial Code World Model for Thinking


40. AI-Assisted Unit Test Writing and Test-Driven Code Refactoring: A Case Study


41. A Systematic Security Evaluation of OpenClaw and Its Variants


42. Domain-Adapted Retrieval for In-Context Annotation of Pedagogical Dialogue Acts


43. An Independent Safety Evaluation of Kimi K2.5


44. Can VLMs Truly Forget? Benchmarking Training-Free Visual Concept Unlearning


45. AlertStar: Path-Aware Alert Prediction on Hyper-Relational Knowledge Graphs


46. Co-Evolution of Policy and Internal Reward for Language Agents


47. A Data-Centric Vision Transformer Baseline for SAR Sea Ice Classification


48. Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems


49. Credential Leakage in LLM Agent Skills: A Large-Scale Empirical Study


50. Verbalizing LLMs’ assumptions to explain and control sycophancy


51. Querying Structured Data Through Natural Language Using Language Models


52. MECO: A Multimodal Dataset for Emotion and Cognitive Understanding in Older Adults


53. JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency


54. Analyzing Healthcare Interoperability Vulnerabilities: Formal Modeling and Graph-Theoretic Approach


55. ARM: Advantage Reward Modeling for Long-Horizon Manipulation


56. Beyond Isolated Tasks: A Framework for Evaluating Coding Agents on Sequential Software Evolution


57. Comparing the Impact of Pedagogy-Informed Custom and General-Purpose GAI Chatbots on Students’ Science Problem-Solving Processes and Performance Using Heterogeneous Interaction Network Analysis


58. User-Aware Conditional Generative Total Correlation Learning for Multi-Modal Recommendation


59. R2-Write: Reflection and Revision for Open-Ended Writing with Deep Reasoning


60. FedSQ: Optimized Weight Averaging via Fixed Gating


61. Self-Optimizing Multi-Agent Systems for Deep Research


62. Mitigating Reward Hacking in RLHF via Advantage Sign Robustness


63. Prompt Compression in the Wild: Measuring Latency, Rate Adherence, and Quality for Faster LLM Inference


64. LogicPoison: Logical Attacks on Graph Retrieval-Augmented Generation


65. How Annotation Trains Annotators: Competence Development in Social Influence Recognition


66. Learning from Synthetic Data via Provenance-Based Input Gradient Guidance


67. Council Mode: Mitigating Hallucination and Bias in LLMs via Multi-Agent Consensus


68. Split and Conquer Partial Deepfake Speech


69. Corporations Constitute Intelligence


70. RayMamba: Ray-Aligned Serialization for Long-Range 3D Object Detection


71. Toward an Artificial General Teacher: Procedural Geometry Data Generation and Visual Grounding with Vision-Language Models


72. Rethinking Forward Processes for Score-Based Data Assimilation in High Dimensions


73. One Model to Translate Them All? A Journey to Mount Doom for Multilingual Model Merging


74. LLM+Graph@VLDB’2025 Workshop Summary


75. A Paradigm Shift: Fully End-to-End Training for Temporal Sentence Grounding in Videos


76. High-resolution probabilistic estimation of three-dimensional regional ocean dynamics from sparse surface observations


77. Towards Secure Agent Skills: Architecture, Threat Taxonomy, and Security Analysis


78. NavCrafter: Exploring 3D Scenes from a Single Image


79. QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models


80. ChatSVA: Bridging SVA Generation for Hardware Verification via Task-Specific LLMs


81. PaveBench: A Versatile Benchmark for Pavement Distress Perception and Interactive Vision-Language Analysis


82. Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks


83. LumaFlux: Lifting 8-Bit Worlds to HDR Reality with Physically-Guided Diffusion Transformers


84. Disrupting Cognitive Passivity: Rethinking AI-Assisted Data Literacy through Cognitive Alignment


85. SentinelAgent: Intent-Verified Delegation Chains for Securing Federal Multi-Agent AI Systems


86. Random Is Hard to Beat: Active Selection in online DPO with Modern LLMs


87. Cross Event Detection and Topic Evolution Mining in cross events for Man Made Disasters in Social Media Streams


88. IndustryCode: A Benchmark for Industry Code Generation


89. MOMO: Mars Orbital Model Foundation Model for Mars Orbital Applications


90. V2X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views


91. Evaluating the Formal Reasoning Capabilities of Large Language Models through Chomsky Hierarchy


92. Trivial Vocabulary Bans Improve LLM Reasoning More Than Deep Linguistic Constraints


93. DocShield: Towards AI Document Safety via Evidence-Grounded Agentic Reasoning


94. Efficient3D: A Unified Framework for Adaptive and Debiased Token Reduction in 3D MLLMs


95. Beyond Semantic Manipulation: Token-Space Attacks on Reward Models


96. Finding Belief Geometries with Sparse Autoencoders


97. Eligibility-Aware Evidence Synthesis: An Agentic Framework for Clinical Trial Meta-Analysis


98. Do Agent Societies Develop Intellectual Elites? The Hidden Power Laws of Collective Cognition in LLM Multi-Agent Systems


99. Too Polite to Disagree: Understanding Sycophancy Propagation in Multi-Agent Systems


100. Low-Rank Compression of Pretrained Models via Randomized Subspace Iteration


101. Generalization Limits of Reinforcement Learning Alignment


102. Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training


103. GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers


104. Speaking of Language: Reflections on Metalanguage Research in NLP


105. Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles


106. Analytic Drift Resister for Non-Exemplar Continual Graph Learning


107. Toys that listen, talk, and play: Understanding Children’s Sensemaking and Interactions with AI Toys


108. Smart Transfer: Leveraging Vision Foundation Model for Rapid Building Damage Mapping with Post-Earthquake VHR Imagery


109. Poison Once, Exploit Forever: Environment-Injected Memory Poisoning Attacks on Web Agents


110. LitPivot: Developing Well-Situated Research Ideas Through Dynamic Contextualization and Critique within the Literature Landscape


111. Making Written Theorems Explorable by Grounding Them in Formal Representations


112. Moondream Segmentation: From Words to Masks


113. High Volatility and Action Bias Distinguish LLMs from Humans in Group Coordination


114. Understanding the Effects of Safety Unalignment on Large Language Models


115. Generative AI Use in Entrepreneurship: An Integrative Review and an Empowerment-Entrapment Framework


116. Pragmatics Meets Culture: Culturally-adapted Artwork Description Generation and Evaluation


117. From Theory to Practice: Code Generation Using LLMs for CAPEC and CWE Frameworks


118. Feature Attribution Stability Suite: How Stable Are Post-Hoc Attributions?


119. Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits


120. Opal: Private Memory for Personal AI


121. Sparse Bayesian Learning Algorithms Revisited: From Learning Majorizers to Structured Algorithmic Learning using Neural Networks


122. Social Meaning in Large Language Models: Structure, Magnitude, and Pragmatic Prompting


123. An Explainable Vision-Language Model Framework with Adaptive PID-Tversky Loss for Lumbar Spinal Stenosis Diagnosis


124. Token-Efficient Multimodal Reasoning via Image Prompt Packaging


125. Automated Malware Family Classification using Weighted Hierarchical Ensembles of Large Language Models


126. A Multimodal Vision Transformer-based Modeling Framework for Prediction of Fluid Flows in Energy Systems


127. Generating Satellite Imagery Data for Wildfire Detection through Mask-Conditioned Generative AI


128. Hierarchical, Interpretable, Label-Free Concept Bottleneck Model


129. VERTIGO: Visual Preference Optimization for Cinematic Camera Trajectory Generation


130. On the Geometric Structure of Layer Updates in Deep Language Models


131. When simulations look right but causal effects go wrong: Large language models as behavioral simulators


132. Skeleton-based Coherence Modeling in Narratives


133. Do We Need Frontier Models to Verify Mathematical Proofs?


134. Managing Diabetic Retinopathy with Deep Learning: A Data Centric Overview


135. PlayGen-MoG: Framework for Diverse Multi-Agent Play Generation via Mixture-of-Gaussians Trajectory Prediction


136. From Elevation Maps To Contour Lines: SVM and Decision Trees to Detect Violin Width Reduction


137. Self-Directed Task Identification


138. Generative models on phase space


139. LumiVideo: An Intelligent Agentic System for Video Color Grading


140. A Synthesis Method of Safe Rust Code Based on Pushdown Colored Petri Nets


141. Improving MPI Error Detection and Repair with Large Language Models and Bug References


142. Variational Encoder–Multi-Decoder (VE-MD) for Privacy-by-functional-design (Group) Emotion Recognition


143. Environment-Aware Channel Prediction for Vehicular Communications: A Multimodal Visual Feature Fusion Framework


144. Reliability-Aware Geometric Fusion for Robust Audio-Visual Navigation


145. Spatial-Aware Conditioned Fusion for Audio-Visual Navigation


146. Audio Spatially-Guided Fusion for Audio-Visual Navigation


147. Ambig-IaC: Multi-level Disambiguation for Interactive Cloud Infrastructure-as-Code Synthesis


148. Internalized Reasoning for Long-Context Visual Document Understanding


149. A Survey on AI for 6G: Challenges and Opportunities


150. Beyond Message Passing: Toward Semantically Aligned Agent Communication


151. CIPHER: Conformer-based Inference of Phonemes from High-density EEG


152. TRACE: Traceroute-based Internet Route change Analysis with Ensemble Learning


153. Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis


154. Dynamic Mask Enhanced Intelligent Multi-UAV Deployment for Urban Vehicular Networks


155. Prism: Policy Reuse via Interpretable Strategy Mapping in Reinforcement Learning


156. An Initial Exploration of Contrastive Prompt Tuning to Generate Energy-Efficient Code


157. Differentiable Symbolic Planning: A Neural Architecture for Constraint Reasoning with Learned Feasibility


158. OPRIDE: Offline Preference-based Reinforcement Learning via In-Dataset Exploration


159. DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery


160. UI-Oceanus: Scaling GUI Agents with Synthetic Environmental Dynamics


161. Haiku to Opus in Just 10 bits: LLMs Unlock Massive Compression Gains


162. LLM Reasoning with Process Rewards for Outcome-Guided Steps


163. Empirical Sufficiency Lower Bounds for Language Modeling with Locally-Bootstrapped Semantic Structures


164. Reanalyzing L2 Preposition Learning with Bayesian Mixed Effects and a Pretrained Language Model


165. Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling