All AI Papers - 2026-05-14

1. Quantifying Sensitivity for Tree Ensembles: A symbolic and compositional approach


2. History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions


3. Harnessing Agentic Evolution


4. Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs


5. ScioMind: Cognitively Grounded Multi-Agent Social Simulation with Anchoring-Based Belief Dynamics and Dynamic Profiles


6. Adaptive mine planning under geological uncertainty: A POMDP framework for sequential decision-making


7. How to Interpret Agent Behavior


8. Unweighted ranking for value-based decision making with uncertainty


9. Position: Assistive Agents Need Accessibility Alignment


10. Learning Local Constraints for Reinforcement-Learned Content Generators


11. RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation


12. Scaling Retrieval-Augmented Reasoning with Parallel Search and Explicit Merging


13. AI-Generated Slides: Are They Good? Can Students Tell?


14. MMSkills: Towards Multimodal Skills for General Visual Agents


15. Assessing the Creativity of Large Language Models: Testing, Limits, and New Frontiers


16. Cognifold: Always-On Proactive Memory via Cognitive Folding


17. TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints


18. RS-Claw: Progressive Active Tool Exploration via Hierarchical Skill Trees for Remote Sensing Agents


19. Multi-Agent Systems in Emergency Departments: Validation Study on a ED Digital Twin


20. Ego2World: Compiling Egocentric Cooking Videos into Executable Worlds for Belief-State Planning


21. Diversity of Extensions in Abstract Argumentation


22. VERA-MH: Validation of Ethical and Responsible AI in Mental Health


23. IdeaForge: A Knowledge Graph-Grounded Multi-Agent Framework for Cross-Methodology Innovation Analysis and Patent Claim Generation


24. Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling


25. Discrete Diffusion for Complex and Congested Multi-Agent Path Finding with Sparse Social Attention


26. What properties of reasoning supervision are associated with improved downstream model quality?


27. Differentiable Learning of Lifted Action Schemas for Classical Planning


28. D-VLA: A High-Concurrency Distributed Asynchronous Reinforcement Learning Framework for Vision-Language-Action Models


29. Respecting Self-Uncertainty in On-Policy Self-Distillation for Efficient LLM Reasoning


30. It’s not the Language Model, it’s the Tool: Deterministic Mediation for Scientific Workflows


31. Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization


32. An Agentic AI Framework with Large Language Models and Chain-of-Thought for UAV-Assisted Logistics Scheduling with Mobile Edge Computing


33. Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning


34. Formal Conjectures: An Open and Evolving Benchmark for Verified Discovery in Mathematics


35. Strikingness-Aware Evaluation for Temporal Knowledge Graph Reasoning


36. A Constraint Programming Approach for $n$-Day Lookahead Playoff Clinching


37. GRACE: Gradient-aligned Reasoning Data Curation for Efficient Post-training


38. An Agentic LLM-Based Framework for Population-Scale Mental Health Screening


39. MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning


40. Retrieval-Augmented Tutoring for Algorithm Tracing and Problem-Solving in AI Education


41. Useful Memories Become Faulty When Continuously Updated by LLMs


42. Retrieval is Cheap, Show Me the Code: Executable Multi-Hop Reasoning for Retrieval-Augmented Generation


43. Position: Agentic AI System Is a Foreseeable Pathway to AGI


44. Sustaining AI safety: Control-theoretic external impossibility, intrinsic necessity, and structural requirements


45. When Attention Closes: How LLMs Lose the Thread in Multi-Turn Interaction


46. Beyond Cooperative Simulators: Generating Realistic User Personas for Robust Evaluation of LLM Agents


47. Moltbook Moderation: Uncovering Hidden Intent Through Multi-Turn Dialogue


48. Multimodal Hidden Markov Models for Persistent Emotional State Tracking


49. PROMETHEUS: Automating Deep Causal Research Integrating Text, Data and Models


50. State-Centric Decision Process


51. BEHAVE: A Hybrid AI Framework for Real-Time Modeling of Collective Human Dynamics


52. CHAL: Council of Hierarchical Agentic Language


53. DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models


54. On the Size Complexity and Decidability of First-Order Progression


55. Learning Transferable Latent User Preferences for Human-Aligned Decision Making


56. Revealing Interpretable Failure Modes of VLMs


57. Do Androids Dream of Breaking the Game? Systematically Auditing AI Agent Benchmarks with BenchJack


58. Macro-Action Based Multi-Agent Instruction Following through Value Cancellation


59. Think Twice, Act Once: Verifier-Guided Action Selection For Embodied Agents


60. WARDEN: Endangered Indigenous Language Transcription and Translation with 6 Hours of Training Data


61. EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents


62. Topology-Preserving Neural Operator Learning via Hodge Decomposition


63. Negation Neglect: When models fail to learn negations in training


64. Neurosymbolic Auditing of Natural-Language Software Requirements


65. Improving Reproducibility in Evaluation through Multi-Level Annotator Modeling


66. Di-BiLPS: Denoising induced Bidirectional Latent-PDE-Solver under Sparse Observations


67. ENSEMBITS: an alphabet of protein conformational ensembles


68. Amplification to Synthesis: A Comparative Analysis of Cognitive Operations Before and After Generative AI


69. LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration


70. MinT: Managed Infrastructure for Training and Serving Millions of LLMs


71. (How) Do Large Language Models Understand High-Level Message Sequence Charts?


72. Where Does Reasoning Break? Step-Level Hallucination Detection via Hidden-State Transport Geometry


73. High-Rate Quantized Matrix Multiplication II


74. Weakly-Supervised Spatiotemporal Anomaly Detection


75. KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving


76. Robust and Explainable Bicuspid Aortic Valve Diagnosis Using Stacked Ensembles on Echocardiography


77. Coordinating Multiple Conditions for Trajectory-Controlled Human Motion Generation


78. AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation


79. Humanwashing – It Should Leave You Feeling Dirty


80. Children’s English Reading Story Generation via Supervised Fine-Tuning of Compact LLMs with Controllable Difficulty and Safety


81. Identifying AI Web Scrapers Using Canary Tokens


82. RTLC – Research, Teach-to-Learn, Critique: A three-stage prompting paradigm inspired by the Feynman Learning Technique that lifts LLM-as-judge accuracy on JudgeBench with no fine-tuning


83. The WidthWall: A Strict Expressivity Hierarchy for Hypergraph Neural Networks


84. A Hierarchical Language Model with Predictable Scaling Laws and Provable Benefits of Reasoning


85. Cross Modality Image Translation In Medical Imaging Using Generative Frameworks


86. Weakly Supervised Segmentation as Semantic-Based Regularization


87. Beyond Perplexity: A Geometric and Spectral Study of Low-Rank Pre-Training


88. NAACA: Training-Free NeuroAuditory Attentive Cognitive Architecture with Oscillatory Working Memory for Salience-Driven Attention Gating


89. Causality-Aware End-to-End Autonomous Driving via Ego-Centric Joint Scene Modeling


90. OpenAaaS: An Open Agent-as-a-Service Framework for Distributed Materials-Informatics Research


91. HetScene: Heterogeneity-Aware Diffusion for Dense Indoor Scene Generation


92. Beyond Anthropomorphism: Exploring the Roles of Perceived Non-humanity and Structural Similarity in Deep Self-Disclosure Toward Generative AI


93. Dynamical Predictive Modelling of Cardiovascular Disease Progression Post-Myocardial Infarction via ECG-Trained Artificial Intelligence Model


94. Generating synthetic computed tomography for radiotherapy: SynthRAD2025 challenge report


95. Self-Supervised On-Policy Reinforcement Learning via Contrastive Proximal Policy Optimisation


96. AttenA+: Rectifying Action Inequality in Robotic Foundation Models


97. Decoupled and Divergence-Conditioned Prompt for Multi-domain Dynamic Graph Foundation Models


98. Locale-Conditioned Few-Shot Prompting Mitigates Demonstration Regurgitation in On-Device PII Substitution with Small Language Models


99. Temper and Tilt Lead to SLOP: Reward Hacking Mitigation with Inference-Time Alignment


100. HLS-Seek: QoR-Aware Code Generation for High-Level Synthesis via Proxy Comparative Reward Reinforcement Learning


101. Towards Unified Surgical Scene Understanding: Bridging Reasoning and Grounding via MLLMs


102. ArcVQ-VAE: A Spherical Vector Quantization Framework with ArcCosine Additive Margin


103. Many-Shot CoT-ICL: Making In-Context Learning Truly Learn


104. Discovery of Hidden Miscalibration Regimes


105. CUBic: Coordinated Unified Bimanual Perception and Control Framework


106. Q-Flow: Stable and Expressive Reinforcement Learning with Flow-Based Policy


107. Towards a holistic understanding of Selection Bias for Causal Effect Identification


108. Continual Learning with Multilingual Foundation Model


109. LLMs as annotators of credibility assessment in Danish asylum decisions: evaluating classification performance and errors beyond aggregated metrics


110. GRIP-VLM: Group-Relative Importance Pruning for Efficient Vision-Language Models


111. Query-Conditioned Test-Time Self-Training for Large Language Models


112. A Horn extension of DL-Lite with NL data complexity


113. Constitutional Governance in Metric Spaces


114. AI Harness Engineering: A Runtime Substrate for Foundation-Model Software Agents


115. Probing Persona-Dependent Preferences in Language Models


116. Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Large Language Reasoning Models


117. Stylized Text-to-Motion Generation via Hypernetwork-Driven Low-Rank Adaptation


118. Tracing Persona Vectors Through LLM Pretraining


119. What Limits Vision-and-Language Navigation?


120. CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution


121. IndicMedDialog: A Parallel Multi-Turn Medical Dialogue Dataset for Accessible Healthcare in Indic Languages


122. Delightful Exploration


123. The Readability Spectrum: Patterns, Issues, and Prompt Effects in LLM-Generated Code


124. Utility-Oriented Visual Evidence Selection for Multimodal Retrieval-Augmented Generation


125. “It became a self-fulfilling prophecy”: How Lived Experiences are Entangled with AI Predictions in Menstrual Cycle Tracking Apps


126. X-Restormer++: 1st Place Solution for the UG2+ CVPR 2026 All-Weather Restoration Challenge


127. Compact Latent Manifold Translation: A Parameter-Efficient Foundation Model for Cross-Modal and Cross-Frequency Physiological Signal Synthesis


128. Teacher-Guided Policy Optimization for LLM Distillation


129. ReTool-Video: Recursive Tool-Using Video Agents with Meta-Augmented Tool Grounding


130. STAR: Semantic-Temporal Adaptive Representation Learning for Few-Shot Action Recognition


131. McCast: Memory-Guided Latent Drift Correction for Long-Horizon Precipitation Nowcasting


132. ECG-NAT: A Self-supervised Neighborhood Attention Transformer for Multi-lead Electrocardiogram Classification


133. N-vium: Mixture-of-Exits Transformer for Accelerated Exact Generation


134. Stable Attention Response for Reliable Precipitation Nowcasting


135. CLIP Tricks You: Training-free Token Pruning for Efficient Pixel Grounding in Large Vision-Language Models


136. When Does Hierarchy Help? Benchmarking Agent Coordination in Event-Driven Industrial Scheduling


137. PanoWorld: Towards Spatial Supersensing in 360$^\circ$ Panorama World


138. EvObj: Learning Evolving Object-centric Representations for 3D Instance Segmentation without Scene Supervision


139. AcquisitionSynthesis: Targeted Data Generation using Acquisition Functions


140. LeanSearch v2: Global Premise Retrieval for Lean 4 Theorem Proving


141. MLGIB: Multi-Label Graph Information Bottleneck for Expressive and Robust Message Passing


142. Towards Long-horizon Embodied Agents with Tool-Aligned Vision-Language-Action Models


143. SECOND-Grasp: Semantic Contact-guided Dexterous Grasping


144. Context Matters: Auditing Gender Bias in T2I Generation through Risk-Tiered Use-Case Profiles


145. A Multi-Agent Orchestration Framework for Venture Capital Due Diligence


146. Margin-calibrated Classifier Guidance for Property-driven Synthesis Planning


147. Watermarking Should Be Treated as a Monitoring Primitive


148. Vividh-ASR: A Complexity-Tiered Benchmark and Optimization Dynamics for Robust Indic Speech Recognition


149. Does language matter for spoken word classification? A multilingual generative meta-learning approach


150. Spectral Flattening Is All Muon Needs: How Orthogonalization Controls Learning Rate and Convergence


151. Counterfactual Reasoning for Causal Responsibility Attribution in Probabilistic Multi-Agent Systems


152. Scaling few-shot spoken word classification with generative meta-continual learning


153. Neural QAOA$^{2}$: Differentiable Joint Graph Partitioning and Parameter Initialization for Quantum Combinatorial Optimization


154. When Absolute State Fails: Evaluating Proprioceptive Encodings for Robust Manipulation


155. Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning


156. Context Training with Active Information Seeking


157. Revealing the Gap in Human and VLM Scene Perception through Counterfactual Semantic Saliency


158. No Attack Required: Semantic Fuzzing for Specification Violations in Agent Skills


159. CoGE: Sim-to-Real Online Geometric Estimation for Monocular Colonoscopy


160. FeatCal: Feature Calibration for Post-Merging Models


161. Understanding and Accelerating the Training of Masked Diffusion Language Models


162. Rethinking Efficient Graph Coarsening via a Non-Selfishness Principle


163. Amortized Guidance for Image Inpainting with Pretrained Diffusion Models


164. Not Just RLHF: Why Alignment Alone Won’t Fix Multi-Agent Sycophancy


165. Protocol-Driven Development: Governing Generated Software Through Invariants and Evidence


166. CoRe-Gen: Robust Spectrum-to-Structure Generation under Imperfect Fingerprint Conditions


167. Revisiting Reinforcement Learning with Verifiable Rewards from a Contrastive Perspective


168. Controlling Logical Collapse in LLMs via Algebraic Ontology Projection over F2


169. AdaFocus: Adaptive Relevance-Diversity Sampling with Zero-Cache Look-back for Efficient Long Video Understanding


170. Seg-Agent: Test-Time Multimodal Reasoning for Training-Free Language-Guided Segmentation


171. When Should an AI Workflow Release? Always-Valid Inference for Black-Box Generate-Verify Systems


172. The Expressivity Boundary of Probabilistic Circuits: A Comparison with Large Language Models


173. CRePE: Curved Ray Expectation Positional Encoding for Unified-Camera-Controlled Video Generation


174. AuraMask: An Extensible Pipeline for Developing Aesthetic Anti-Facial Recognition Image Filters


175. Anatomy-Slot: Unsupervised Anatomical Factorization for Homologous Bilateral Reasoning in Retinal Diagnosis


176. AgentLens: Revealing The Lucky Pass Problem in SWE-Agent Evaluation


177. Embodied Multi-Agent Coordination by Aligning World Models Through Dialogue


178. Data Difficulty and the Generalization–Extrapolation Tradeoff in LLM Fine-Tuning


179. RISED: A Pre-Deployment Safety Evaluation Framework for Clinical AI Decision-Support Systems


180. EcoGEO: Trajectory-Aware Evidence Ecosystems for Web-Enabled LLM Search Agents


181. Quantifying LLM Safety Degradation Under Repeated Attacks Using Survival Analysis


182. Language-Based Agent Control


183. ChipMATE: Multi-Agent Training via Reinforcement Learning for Enhanced RTL Generation


184. PRISM: Perinuclear Ring-based Image Segmentation Method for Acute Lymphoblastic Leukemia Classification


185. Persona-Model Collapse in Emergent Misalignment


186. AssemblyBench: Physics-Aware Assembly of Complex Industrial Objects


187. Bayesian Model Merging


188. GraphIP-Bench: How Hard Is It to Steal a Graph Neural Network, and Can We Stop It?


189. FRAME: Forensic Routing and Adaptive Multi-path Evidence Fusion for Image Manipulation Detection


190. Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion


191. Mechanism Plausibility in Generative Agent-Based Modeling


192. Training Large Language Models to Predict Clinical Events


193. REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations


194. Correcting Influence: Unboxing LLM Outputs with Orthogonal Latent Spaces


195. Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels


196. Emergent and Subliminal Misalignment Through the Lens of Data-Mediated Transfer


197. Adaptive Smooth Tchebycheff Attention for Multi-Objective Policy Optimization


198. WriteSAE: Sparse Autoencoders for Recurrent State


199. Multi-Quantile Regression for Extreme Precipitation Downscaling


200. Uncovering Symmetry Transfer in Large Language Models via Layer-Peeled Optimization


201. Simulating Students or Sycophantic Problem Solving? On Misconception Faithfulness of LLM Simulators


202. CoT-Guard: Small Models for Strong Monitoring


203. What Do You Think I Think? Accounting for Human Beliefs Using Second-Order Theory of Mind


204. From Generalist to Specialist Representation


205. Large Language Models for Agentic NetOps and AIOps: Architectures, Evaluation, and Safety


206. Grid-Orch: An LLM-Powered Orchestrator for Distribution Grid Simulation and Analytics


207. Inline Critic Steers Image Editing


208. The End Justifies the Mean: A Linear Ranking Rule for Proportional Sequential Decisions


209. Controllable Quantum Memory Capacity in Quantum Reservoir Networks with Tunable partial-SWAPs


210. FePySR: A Neural Feature Extraction Framework for Efficient and Scalable Symbolic Regression


211. MMCL-Bench: Multimodal Context Learning from Visual Rules, Procedures, and Evidence


212. Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions


213. Modeling Heterophily in Multiplex Graphs: An Adaptive Approach for Node Classification


214. Agentic Interpretation: Lattice-Structured Evidence for LLM-Based Program Analysis


215. A Unified Perspective for Learning Graph Representations Across Multi-Level Abstractions


216. Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?


217. Parallel-in-Time Training of Recurrent Neural Networks for Dynamical Systems Reconstruction


218. ODRPO: Ordinal Decompositions of Discrete Rewards for Robust Policy Optimization


219. Plan Before You Trade: Inference-Time Optimization for RL Trading Agents


220. Multi-Rollout On-Policy Distillation via Peer Successes and Failures


221. Learning to Decide with AI Assistance under Human-Alignment


222. Training LLMs with Reinforcement Learning for Intent-Aware Personalized Question Answering


223. The critical slowing down in diffusion models


224. 3D Primitives are a Spatial Language for VLMs


225. Towards Robust Federated Multimodal Graph Learning under Modality Heterogeneity


226. Are Compact Rationales Free? Measuring Tile Selection Headroom in Frozen WSI-MIL


227. DistractMIA: Black-Box Membership Inference on Vision-Language Models via Semantic Distraction


228. Improving Diffusion Posterior Samplers with Lagged Temporal Corrections for Image Restoration


229. VideoSEAL: Mitigating Evidence Misalignment in Agentic Long Video Understanding by Decoupling Answer Authority


230. Active Sensing with Meta-Reinforcement Learning for Emitter Localization from RF Observations


231. Pyramid Self-contrastive Learning Framework for Test-time Ultrasound Image Denoising


232. Uncovering Latent Pathological Signatures in Pulmonary CT via Cross-Window Knowledge Distillation


233. ChannelKAN: Multi-Scale Dual-Domain Channel Prediction via Hybrid CNN-KAN Architecture


234. SSDA: Bridging Spectral and Structural Gaps via Dual Adaptation for Vision-Based Time Series Forecasting


235. CROP: Expert-Aligned Image Cropping via Compositional Reasoning and Optimizing Preference


236. Why the Unfinished Keeps Returning: Canxianization and the Dynamics of Conscious Priority


237. PG-LRF: Physiology-Guided Latent Rectified Flow for Electro-Hemodynamic PPG-to-ECG Generation


238. Information as Maximum-Caliber Deviation: A bridge between Integrated Information Theory and the Free Energy Principle


239. AgenticAITA: A Proof-Of-Concept About Deliberative Multi-Agent Reasoning for Autonomous Trading Systems


240. In-Situ Behavioral Evaluation for LLM Fairness, Not Standardized-Test Scores


241. MorphOPC: Advancing Mask Optimization with Multi-scale Hierarchical Morphological Learning


242. PERCEIVE: A Benchmark for Personalized Emotion and Communication Behavior Understanding on Social Media


243. Stress-Testing the Reasoning Competence of LLMs With Proofs Under Minimal Formalism


244. Exploring how EFL students talk to and through AI to develop texts


245. Differences in Text Generated by Diffusion and Autoregressive Language Models


246. ToolWeave: Structured Synthesis of Complex Multi-Turn Tool-Calling Dialogues


247. BoostTaxo: Zero-Shot Taxonomy Induction via Boosting-Style Agentic Reasoning and Constraint-Aware Calibration


248. Correct Answers from Sound Reasoning: Verifiable Process Supervision for Language Models


249. TimelineReasoner: Advancing Timeline Summarization with Large Reasoning Models


250. Bridging the Missing-Modality Gap: Improving Text-Only Calibration of Vision Language Models


251. Domain Adaptation of Large Language Models for Polymer-Composite Additive Manufacturing Using Retrieval-Augmented Generation and Fine-Tuning


252. SP-GCRL: Influence Maximization on Incomplete Social Graphs


253. Beyond Individual Mimicry: Constructing Human-Like Social network with Graph-Augmented LLM Agents


254. Representing Higher-Order Networks: A Survey of Graph-Based Frameworks


255. Can LLM Agents Simulate Dynamic Networks? A Case Study on Email Networks with Phishing Synthesis


256. Scale-Gest: Scalable Model-Space Synthesis and Runtime Selection for On-Device Gesture Detection



258. Prime Successor Irreducibility: Turing Machine Complexity, Kolmogorov Complexity, and Weakness-Based Formulations


259. TokaMind for Power Grid: Cross-Domain Transfer from Fusion Plasma