All AI Papers - 2026-05-25

1. SkillOpt: Executive Strategy for Self-Evolving Agent Skills


2. From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills


3. SPACENUM: Revisiting Spatial Numerical Understanding in VLMs


4. Beyond Binary Edits: Robust Multimodal Knowledge Editing with Adversarial Subspace Alignment


5. Agentic Proving for Program Verification


6. MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection


7. One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents


8. Solving the Aircraft Disassembly Scheduling Problem


9. Co-ReAct: Rubrics as Step-Level Collaborators for ReAct Agents


10. CP or DP? Why Not Both: A Case Study in the Partial Shop Scheduling Problem


11. EDGE-OPD: Internalizing Privileged Context with Evidence Guided On-Policy Distillation


12. When Planning Fails Despite Correct Execution: On Epistemic Calibration for LLM-Based Multi-Agent Systems


13. Human-in-the-Loop Multi-Agent Ventilator Decision Support with Contextual Bandit Preference Learning


14. DART: Semantic Recoverability for Structured Tool Agents


15. Ontological Knowledge Blocks: Executable Compliance and Profile-Based Validation for Trustworthy AI Systems


16. Parallel Context Compaction for Long-Horizon LLM Agent Serving


17. Design and Report Benchmarks for Knowledge Work


18. GENSTRAT: Toward a Science of Strategic Reasoning in Large Language Models


19. Foundation Protocol: A Coordination Layer for Agentic Society


20. AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery


21. Redrawing the AI Map: A Theory of Accountability Boundaries in Agentic Ecosystems


22. Inductive Deductive Synthesis: Enabling AI to Generate Formally Verified Systems


23. PathCal: State-Aware Reflection-Marker Calibration for Efficient Reasoning


24. The Deterministic Horizon: Impossibility Results as Design Specifications for Trustworthy AI Systems


25. EVE-Agent: Evidence-Verifiable Self-Evolving Agents


26. Mediative Fuzzy Logic: From Type-1 Foundations to Type-2, Type-3 and Quantum Extensions


27. ImProver 2: Iteratively Self-Improving LMs for Neurosymbolic Proof Optimization


28. Energy per Successful Goal: Goal-Level Energy Accounting for Agentic AI Systems


29. SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research


30. RMA: an Agentic System for Research-Level Mathematical Problems


31. NeuroNL2LTL: A Neurosymbolic Framework for Natural Language Translation of Linear Temporal Logic


32. BOHM: Zero-Cost Hierarchical Attribution for Compound AI Systems


33. LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws


34. ETCHR: Editing To Clarify and Harness Reasoning


35. Good Token Hunting: A Hitchhiker’s Guide to Token Selection for Visual Geometry Transformers


36. CHRONOS: Temporally-Aware Multi-Agent Coordination for Evolving Data Marketplaces


37. PGT: Procedurally Generated Tasks for improving visual grounding in MLLMs


38. Human Decision-Making with Persuasive and Narrative LLM Explanations


39. Leveraging Foundation Models for Causal Generative Modeling


40. It’s the humans, not the data: Geopolitical bias in LLMs originates in post-training, amplified by the language of the prompt


41. Not Too Generative, Not Too Discriminative: The Human Alignment Sweet Spot


42. PhotoFlow: Agentic 3D Virtual Photography Missions


43. Any2Any: Efficient Cross-Embodiment Transfer for Humanoid Whole-Body Tracking


44. Weierstrass Positional Encoding for Vision Transformers


45. OnePred: Next-Query Prediction via Recursive Intent Memory in Multi-Turn Conversations


46. CVSearch: Empowering Multimodal LLMs with Cognitive Visual Search for High-Resolution Image Perception


47. Learning Through Noise: Why Subliminal Learning Works and When It Fails


48. DualMem: Bypassing the Objectness Bottleneck for Calibrated Unknown-Stream Filtering in Open-World Object Detection


49. Adversarial Vulnerability Under Temporal Concept Drift: A Longitudinal Study of Android Malware Detection


50. EM-Vid: Training-Free Entity-Centric Memory for Efficient and Consistent Multi-Shot Video Generation


51. DiLaDiff: Distilled Latent-Augmented Diffusion for Language Modeling


52. Preisach Attention: A Hysteretic Model of Sequential Memory


53. Cost-Effective Model Evaluation with Meta-Learning


54. HARNESS-LM: A Three-Phase Training Recipe for Harnessing SLMs in Sponsored Search Retrieval


55. Understanding Goal Generalisation in Sequential Reinforcement Learning


56. ARMS: Automatic Reward Shaping for Sparse-Reward Multi-Agent Reinforcement Learning


57. PathNavigate: A Training-Free Pathology Agent with Surprise-Guided Scan and Shared Slide Memory for Whole-Slide Image VQA


58. Goal-Conditioned Agents that Learn Everything All at Once


59. RA-DCA: A Randomized Active-Set DCA for Directional Stationarity in Max-Structured DC Programs


60. Precise: SDE-Consistent Stochastic Sampling for RL Post-Training of Flow-Matching Models


61. DrawVideo: Generating Long Video from Storyboard Keyframe Sketches


62. VACE: Learning Geometrically Structured Representations for Time Series Anomaly Detection


63. CoSPlay: Cooperative Self-Play at Test-Time with Self-Generated Code and Unit Test


64. Multimodal Distribution Matching for Vision-Language Dataset Distillation


65. PhenoYieldNet: Learning Crop-Aware Phenological Responses for Multi-Crop Yield Prediction


66. Automated Random Embedding for Practical Bayesian Optimization with Unknown Effective Dimension


67. CBANet: A Compact Attention-Based CNN-BiLSTM Network for Aggressive Driving Event Detection


68. Learning Individual Dynamics from Sparse Cross-Sectional Snapshots


69. AI Assurance: A Comprehensive Testing Strategy for Enterprise AI Systems


70. One-Forcing: Towards Stable One-Step Autoregressive Video Generation


71. AI Security Research Should Better Incentivize Defense Research


72. SSDAU: Structured Semantic Data Augmentation for Joint Entity and Relation Extraction


73. Socially fluent AI decouples conversational signals from source identity in online interaction


74. Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control


75. Online Hand Gesture Recognition Using 3D Convolutional Neural Networks


76. Parametric Prior Mapping Framework for Non-stationary Probabilistic Time Series Forecasting


77. Every Component is a Lookup: Token Attribution and Composition from a Single Decomposition


78. Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals


79. Curriculum reinforcement learning with measurable task representation learning


80. Score-Based One-step MeanFlow Policy Optimization


81. XWind: A Cross-site Router for Large Language Model Inference Serving at Renewable Energy Farms


82. CHASD: Language Increment-Calibrated Contrastive Decoding against Hallucination in LVLMs


83. Sparse Compositional Flow Matching by geometric assembly from motion primitives


84. Convergence Without Understanding: When Language Models Agree on Representations but Disagree on Reasoning


85. Reinforcement Learning for Microcanonical Graph Ensemble with Assortativity Constraints


86. When Good Equations Get Bad Scores: Improving Symbolic Regression Through Better Parameter Optimization


87. EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation


88. ChainFlow-VLA: Causal Flow Planning with Vision-Language Models


89. Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super Resolution


90. 6G Communication Networks Enabling Embodied Agents: Architecture and Prototype


91. Multi-Gate Residuals


92. Enhancing Deep Neural Network Reliability with Refinement and Calibration


93. SimInsert: Seamless Video Object Insertion via Regional Sparse Attention Fusion


94. Are Frontier LLMs Ready for Cybersecurity? Evidence for Vertical Foundation Models from Dual-Mode Vulnerability Benchmarks


95. PaP-NF: Probabilistic Long-Term Time Series Forecasting via Prefix-as-Prompt Reprogramming and Normalizing Flows


96. FastKernels: Benchmarking GPU Kernel Generation in Production


97. Lipschitz Optimization for Formal Verification of Homographies


98. Adaptive Mass-Segmented KV Compression for Long-Context Reasoning


99. Scalable Heterogeneous Graph Foundation Models for Data-Driven Optimal Power Flow in Smart Grids


100. Understanding and Improving Noisy Embedding Techniques in Instruction Finetuning


101. Positional Failures in Long-Context LLMs: A Blind Spot in Reasoning Benchmarks


102. PoisonForge: Task-Level Targeted Poisoning Benchmark for Instruction-Tuned LLMs


103. Autonomous Frontier-Based Exploration with VLM Guidance


104. Generative AI and the Reorganization of Labor Demand


105. As X, Do Y: How Persona and Task Combine in Instruction-Tuned LLMs


106. Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness


107. CALAD: Channel-Aware contrastive Learning for multivariate time series Anomaly Detection


108. Classical State Preparation for Variational Quantum Algorithms via Reinforcement Learning


109. Defining AI Fatigue in Academic Contexts: Dimensions, Indicators, and a Stage-Based Model Using Grounded Theory


110. Exploiting Longitudinal Context in Clinician-Verified Interactive Lesion Tracking


111. CoReVAD: A Contextual Reasoning Framework for Training-Free Video Anomaly Detection


112. Philosophical Dispositions as Behavioral Constraints for AI-Assisted Code Review: An Empirical Study


113. A Fine-Tuned BERT Classifier for Personal-Letter Titles in Late-Ming and Early-Qing Collected Works


114. Do Synthetic Brain MRIs Reliably Improve Tumour Classification? A StyleGAN2-ADA Class-Plane Augmentation Study on BRISC 2025


115. Security of LLM-generated Code: A Comparative Analysis


116. Dreaming Smoothly and Sample Efficiently with Gradient Penalized Latent Dynamics


117. KAPLAN: Kolmogorov-Arnold Prognostic Learnable Activation Networks for Survival Analysis


118. Dithering Defense: Adversarial Robustness of Vision Foundation Models via Multi-Level Floyd-Steinberg Dithering


119. Anytime Training with Schedule-Free Spectral Optimization


120. A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification


121. DRL-Driven Edge-Aware Utility Optimization for Multi-Slice 6G Networks


122. Decomposing and Measuring Evaluation Awareness


123. Model Collapse as Cultural Evolution


124. DreamerNLplus: Interpretable Modeling of Mental Health Dynamics from Social Media Timelines using Hybrid Rule-Based and RAG Methods


125. The TIME Machine: On The Power of Motion for Efficient Perception


126. Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs


127. Sparse Autoencoders Map Brain-LLM Alignment onto Cortical Semantic Topography


128. Uncovering the Latent Potential of Deep Intermediate Representations


129. Brain-LLM Alignment Tracks Training Data, Not Typology


130. MadEvolve: Evolutionary Optimization of Trading Systems with Large Language Models


131. Whose Good, Whose Place? The Moral Geography of Agentic AI for Social Good


132. A Proactive Multi-Agent Dialogue Framework for Assessing Social Language Disorder Traits in Autism


133. Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations


134. Test-Time Training Undermines Safety Guardrails


135. Memorization Dynamics of Fill-in-the-Middle Pretraining


136. LLM Code Smells: A Taxonomy and Detection Approach


137. Worse than Random: The Importance of a Baseline for Unsupervised Feature Selection


138. A mathematical theory of balancing relational generalization and memorization


139. Graph Alignment Topology as an Inductive Bias for Grounding Detection


140. Human-Centered Learning Mechanics: A Dynamical Framework for Entropy-Regulated Representation Learning


141. Suicide Risk Assessment from AI-powered Video Surveillance: An Interpretable Framework for Prevention in Metro Stations


142. Seeing without Looking: Do Vision-Language Benchmarks Really Test Vision?


143. Transcoders Trace Visual Grounding and Hallucinations in Vision-Language Models


144. Agentic-VLA: Efficient Online Adaptation for Vision-Language-Action Models


145. Tensor Cache: Eviction-conditioned Associative Memory for Transformers


146. How Far Will They Go? Red-Teaming Online Influence with Large Language Models


147. When Do LLMs Reason? A Dynamical Systems View via Entropy Phase Transitions


148. MedExpMem: Adapting Experience Memory for Differential Diagnosis


149. Approximate Machine Unlearning through Manifold Representation Forgetting Guided by Self Mode Connectivity


150. The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models


151. Staging by the Book: Automatic Sleep Stage Classification Using Scoring Rules


152. PilotWiMAE: Pilot-Native Representation Learning for Wireless Channels


153. PrefBench: Evaluating Zero-Shot LLM Agents in Hidden-Preference Personalized Pricing Negotiations


154. Expressive Power of Deep Homomorphism Networks over Relational Databases


155. ObjectCache: Layerwise Object-Storage Retrieval for KV Cache Reuse


156. The Misattribution Gap: When Memory Poisoning Looks Like Model Failure in Agentic AI Systems


157. Strategic Coercion Within Alliances: The Greenland Sovereignty Game as an AI Stress Test


158. The Cognitive Kardashev Scale: Quantifying the Material Envelope of Civilisational Computation


159. RAG4Outcome: A Retrieval-Augmented Multimodal Framework for Prognostic Prediction in Chronic Osteomyelitis


160. LFRAG: Layout-oriented Fine-grained Retrieval-Augmented Generation on Multimodal Document Understanding


161. Computable Fairness: Boltzmann-Softmax Control for AI Resource Allocation


162. Evaluating Large Language Models in a Complex Hidden Role Game


163. KPI2KVI: A Multi Agent Workflow for Calculating Key Value Indicators from Service Descriptions


164. An AI-Driven Framework for Energy-Efficient Environmental Monitoring in Smart Cities Using Edge Intelligence