전체 AI 논문 - 2026-05-18


2. FORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast


3. Fully Open Meditron: An Auditable Pipeline for Clinical LLMs


4. Confirming Correct, Missing the Rest: LLM Tutoring Agents Struggle Where Feedback Matters Most


5. Context, Reasoning, and Hierarchy: A Cost-Performance Study of Compound LLM Agent Design in an Adversarial POMDP


6. Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems


7. An Algebraic Exposition of the Theory of Dyadic Morality


8. Look Before You Leap: Autonomous Exploration for LLM Agents


9. Property-Guided LLM Program Synthesis for Planning


10. ShopGym: An Integrated Framework for Realistic Simulation and Scalable Benchmarking of E-Commerce Web Agents


11. Sign-Separated Finite-Time Error Analysis of Q-Learning


12. Reasoners or Translators? Contamination-aware Evaluation and Neuro-Symbolic Robustness in Tax Law


13. ScreenSearch: Uncertainty-Aware OS Exploration


14. Petri Net Induced Heuristic Search for Resource Constrained Scheduling


15. Learning Bilevel Policies over Symbolic World Models for Long-Horizon Planning


16. Deterministic Event-Graph Substrates as World Models for Counterfactual Reasoning


17. PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control


18. Imperfect World Models are Exploitable


19. Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design


20. SaaS-Bench: Can Computer-Use Agents Leverage Real-World SaaS to Solve Professional Workflows?


21. ALSO: Adversarial Online Strategy Optimization for Social Agents


22. Can We Trust AI-Inferred User States. A Psychometric Framework for Validating the Reliability of Users States Classification by LLMs in Operational Environments


23. Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR


24. PRISM: Prompt Reliability via Iterative Simulation and Monitoring for Enterprise Conversational AI


25. ColPackAgent: Agent-Skill-Guided Hard-Particle Monte Carlo Workflows for Colloidal Packing


26. TopoEvo: A Topology-Aware Self-Evolving Multi-Agent Framework for Root Cause Analysis in Microservices


27. See Before You Code: Learning Visual Priors for Spatially Aware Educational Animation Generation


28. STAR: A Stage-attributed Triage and Repair framework for RCA Agents in Microservices


29. Position: Artificial Intelligence Needs Meta Intelligence – the Case for Metacognitive AI


30. DRS-GUI: Dynamic Region Search for Training-Free GUI Grounding


31. RTL-BenchMT: Dynamic Maintenance of RTL Generation Benchmark Through Agent-Assisted Analysis and Revision


32. CAPS: Cascaded Adaptive Pairwise Selection for Efficient Parallel Reasoning


33. X-SYNTH: Beyond Retrieval – Enterprise Context Synthesis from Observed Human Attention


34. From LLM-Generated Conjectures to Lean Formalizations: Automated Polynomial Inequality Proving via Sum-of-Squares Certificates


35. Beyond Partner Diversity: An Influence-Based Team Steering Framework for Zero-Shot Human-Machine Teaming


36. Ensemble Monitoring for AI Control: Diverse Signals Outweigh More Compute


37. Belief Engine: Configurable and Inspectable Stance Dynamics in Multi-Agent LLM Deliberation


38. Zero-Shot Goal Recognition with Large Language Models


39. Context Pruning for Coding Agents via Multi-Rubric Latent Reasoning


40. SMCEvolve: Principled Scientific Discovery via Sequential Monte Carlo Evolution


41. Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution


42. Verifiable Agentic Infrastructure: Proof-Derived Authorization for Sovereign AI Systems


43. NIMO Controller: a self-driving laboratory orchestrator based on the Model Context Protocol


44. ICRL: Learning to Internalize Self-Critique with Reinforcement Learning


45. NOVA: Fundamental Limits of Knowledge Discovery Through AI


46. CAX-Agent: A Lightweight Agent Harness for Reliable APDL Automation


47. Fair outputs, Biased Internals: Causal Potency and Asymmetry of Latent Bias in LLMs for High-Stakes Decisions


48. SkillSmith: Compiling Agent Skills into Boundary-Guided Runtime Interfaces


49. Does Theory of Mind Improvement Really Benefit Human-AI Interactions? Empirical Findings from Interactive Evaluations


50. SDOF: Taming the Alignment Tax in Multi-Agent Orchestration with State-Constrained Dispatch


51. DeepSlide: From Artifacts to Presentation Delivery


52. IVGT: Implicit Visual Geometry Transformer for Neural Scene Representation


53. Designing Datacenter Power Delivery Hierarchies for the AI Era


54. A Generative AI Framework for Intelligent Utility Billing CO 2 Analytics and Sustainable Resource Optimisation


55. AI-Mediated Communication Can Steer Collective Opinion


56. Offline Semantic Guidance for Efficient Vision-Language-Action Policy Distillation


57. Layer Equivalence Is Not a Property of Layers Alone: How You Test Redundancy Changes What You Find


58. A Unified Generative-AI Framework for Smart Energy Infrastructure: Intelligent Gas Distribution, Utility Billing, Carbon Analytics, and Quantum-Inspired Optimisation


59. Evaluating Design Video Generation: Metrics for Compositional Fidelity


60. Argus: Evidence Assembly for Scalable Deep Research Agents


61. paper.json: A Coordination Convention for LLM-Agent-Actionable Papers


62. Second-Order Multi-Level Variance Correction for Modality Competition in Multimodal Models


63. Surrogate Neural Architecture Codesign Package (SNAC-Pack)


64. Navigating Potholes with Geometry-Aware Sharpness Minimization


65. Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schrödinger Samplers


66. GenShield: Unified Detection and Artifact Correction for AI-Generated Images


67. DebiasRAG: A Tuning-Free Path to Fair Generation in Large Language Models through Retrieval-Augmented Generation


68. Attention Dispersion in Dynamic Graph Transformers: Diagnosis and a Transferable Fix


69. Federated Imputation under Heterogeneous Feature Spaces


70. GeoGS-CE: Learning Delay–Beam Channel Priors with 3D Gaussians for High-Mobility Scenarios


71. Centralized vs Decentralized Federated Learning: A trade-off performance analysis


72. Multi-level Self-supervised Pretraining on Compositional Hierarchical Graph for Molecular Property Prediction


73. Towards Trustworthy and Explainable AI for Perception Models: From Concept to Prototype Vehicle Deployment


74. Towards Foundation Models for Relational Databases with Language Models and Graph Neural Networks


75. VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation


76. AgriMind: An Ensemble Deep Learning Framework for Multi-Class Plant Disease Classification


77. Robust Prior-Guided Segmentation for Editable 3D Gaussian Splatting


78. Misspecified Explore-then-Exploit Leads to Supra-Competitive Prices


79. Ada-Diffuser: Latent-Aware Adaptive Diffusion for Decision-Making


80. Looped SSMs: Depth-Recurrence and Input Reshaping for Time Series Classification


81. XSearch: Explainable Code Search via Concept-to-Code Alignment


82. RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents


83. Learning Sim-Grounded Policies for Bimanual Rope Manipulation from Human Teleoperation Data


84. Who Owns This Agent? Tracing AI Agents Back to Their Owners


85. From Flat Language Labels to Typological Priors: Structured Language Conditioning for Multilingual Speech-to-Speech Translation


86. Can Vision Language Models Be Adaptive in Mathematics Education? A Learner Model-based Rubric Study


87. CitePrism: Human-in-the-Loop AI for Citation Auditing and Editorial Integrity


88. Constrained latent state modeling: A unifying perspective on representation learning under competing constraints


89. Beyond Content: A Comprehensive Speech Toxicity Dataset and Detection Framework Incorporating Paralinguistic Cues


90. Ontology for Policing: Conceptual Knowledge Learning for Semantic Understanding and Reasoning in Law Enforcement Reports


91. Reference-Free Reinforcement Learning Fine-Tuning for MT: A Seq2Seq Perspective


92. When and Why Adversarial Training Improves PINNs: A Neural Tangent Kernel Perspective


93. Decomposed Vision-Language Alignment for Fine-Grained Open-Vocabulary Segmentation


94. LoCO: Low-rank Compositional Rotation Fine-tuning


95. SLIP & ETHICS: Graduated Intervention for AI Emotional Companions


96. Towards Generalization of Block Attention via Automatic Segmentation and Block Distillation


97. RaPD: Resolution-Agnostic Pixel Diffusion via Semantics-Enriched Implicit Representations


98. Generative Long-term User Interest Modeling for Click-Through Rate Prediction


99. Uncertainty-Aware Wildfire Smoke Density Classification from Satellite Imagery via CBAM-Augmented EfficientNet with Evidential Deep Learning


100. CHoE: Cross-Domain Heterogeneous Graph Prompt Learning via Structure-Conditioned Experts


101. Symplectic Neural Operators for Learning Infinite Dimensional Hamiltonian Systems


102. FSCM: Frequency-Enhanced Spatial-Spectral Coupled Mamba for Infrared Hyperspectral Image Colorization


103. Shapley Neuron Values for Continual Learning: Which Neurons Matter Most?


104. Access Timing as Scaffolding: A Reinforcement Learning Approach to GenAI in Education


105. RoadmapBench: Evaluating Long-Horizon Agentic Software Development Across Version Upgrades


106. GAP: Geometric Anchor Pre-training for Data-Efficient Visuomotor Learning of Manipulation Tasks


107. Modeling Music as a Time-Frequency Image: A 2D Tokenizer for Music Generation


108. Toward Natural and Companionable Virtual Agents via Cross-Temporal Emotional Modeling


109. Grokking as Structural Inference: Transformers Need Bayesian Lottery Tickets


110. A Topology-Aware Spatiotemporal Handover Framework for Continuous Multi-UAV Tracking


111. Lamarckian Inheritance in Dynamic Environments: How Key Variables Affect Evolutionary Dynamics


112. GRASP: Learning to Ground Social Reasoning in Multi-Person Non-Verbal Interactions


113. CompactQE: Interpretable Translation Quality Estimation via Small Open-Weight LLMs


114. BiomedAP: A Vision-Informed Dual-Anchor Framework with Gated Cross-Modal Fusion for Robust Medical Vision-Language Adaptation


115. UAM: A Dual-Stream Perspective on Forgetting in VLA Training


116. Structure Abstraction and Generalization in a Hippocampal-Entorhinal Inspired World Model


117. DecomPose: Disentangling Cross-Category Optimization Contention for Category-Level 6D Object Pose Estimation


118. DiLA: Disentangled Latent Action World Models


119. Bidirectional Fusion Guided by Cardiac Patterns for Semi-Supervised ECG Segmentation


120. Position: Early-Stage Quality Assurance in Annotation Pipelines Is More Cost-Effective Than Late-Stage Validation


121. Learning Dynamic Pick-and-Place for a Legged Manipulator


122. Feedback World Model Enables Precise Guidance of Diffusion Policy


123. H-Mem: A Novel Memory Mechanism for Evolving and Retrieving Agent Memory via a Hybrid Structure


124. $α$-TCAV: A Unified Framework for Testing with Concept Activation Vectors


125. ASRU: Activation Steering Meets Reinforcement Unlearning for Multimodal Large Language Models


126. Interaction-Aware Influence Functions for Group Attribution


127. VLMs Trace Without Tracking: Diagnosing Failures in Visual Path Following


128. VAGS: Velocity Adaptive Guidance Scale for Image Editing and Generation


129. TFZ-Tree: An Ultra-Lightweight Waveform Classification Framework for Resource-Constrained Devices


130. Bridging Silicon and the Hippocampus: Algebro-Deterministic Memory “VaCoAl” as a Substrate for Vector-HaSH and TEM


131. Sharp Spectral Thresholds for Logit Fixed Points


132. Latent Video Prediction Learns Better World Models


133. A Few GPUs, A Whole Lotta Scale: Faithful LLM Training Emulation with PrismLLM


134. Offline Reinforcement Learning with Universal Horizon Models


135. Pretraining Objective Matters in Extreme Low-Data FGVC: A Backbone-Controlled Study


136. Embracing Biased Transition Matrices for Complementary-Label Learning with Many Classes


137. Detecting Privilege Escalation in Polyglot Microservices via Agentic Program Analysis


138. AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs


139. Operator-Controlled 6G: From Connectivity Infrastructure to Guaranteed Digital Services


140. CTF4Nuclear: Common Task Framework for Nuclear Fission and Fusion Models


141. Domain-Independent Game Abstraction using Word Embedding Techniques


142. SkiP: When to Skip and When to Refine for Efficient Robot Manipulation


143. Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance


144. DeltaPrompts: Escaping the Zero-Delta Trap in Multimodal Distillation


145. Process Rewards with Learned Reliability


146. Neural Point-Forms


147. On the Fragility of Data Attribution When Learning Is Distributed


148. DiffVAS: Diffusion-Guided Visual Active Search in Partially Observable Environments


149. RoPE Distinguishes Neither Positions Nor Tokens in Long Contexts, Provably


150. PrismQuant: Rate-Distortion-Optimal Vector Quantization for Gaussian-Mixture Sources


151. Learning with Conflicts of Interest


152. Ghosted Layers: Unconstrained Activation Alignment for Recovering Layer-Pruned LLMs


153. Hybrid LLM-based Intelligent Framework for Robot Task Scheduling


154. Residual Reinforcement Learning for Robot Teleoperation under Stochastic Delays


155. Retrieval-Augmented Large Language Models for Schema-Constrained Clinical Information Extraction


156. GRLO: Towards Generalizable Reinforcement Learning in Open-Ended Environments from Zero


157. DrugSAGE:Self-evolving Agent Experience for Efficient State-of-the-Art Drug Discovery


158. Differentially Private Motif-Preserving Multi-modal Hashing


159. RIDE: Retinex-Informed Decoupling for Exposing Concealed Objects


160. Runtime-Structured Task Decomposition for Agentic Coding Systems


161. MR2-ByteTrack: CNN and Transformer-based Video Object Detection for AI-augmented Embedded Vision Sensor Nodes


162. $f$-Trajectory Balance: A Loss Family for Tuning GFlowNets, Generative Models, and LLMs with Off- and On-Policy Data


163. Margin-Adaptive Confidence Ranking for Reliable LLM Judgement


164. From Feedback Loops to Policy Updates: Reinforcement Fine-Tuning for LLM-Based Alpha Factor Discovery


165. Diagonal Adaptive Non-local Observables on Quantum Neural Networks


166. Amortized Energy-Based Bayesian Inference


167. Breakeven complexity: A new perspective on neural partial differential equation solvers


168. Representation Without Reward: A JEPA Audit for LLM Fine-Tuning


169. PanoWorld: Geometry-Consistent Panoramic Video World Modeling


170. Is One Score Enough? Rethinking the Evaluation of Sequentially Evolving LLM Memory


171. ChangeFlow – Latent Rectified Flow for Change Detection in Remote Sensing


172. PACER: Acyclic Causal Discovery from Large-Scale Interventional Data


173. LEAP: Trajectory-Level Evaluation of LLMs in Iterative Scientific Design


174. Hidden in Memory: Sleeper Memory Poisoning in LLM Agents


175. HoloMotion-1 Technical Report


176. From I/O to Code with Discovery Agent


177. Fortress: A Case Study in Stabilizing Search Recommendations via Temporal Data Augmentation and Feature Pruning


178. PhysBrain 1.0 Technical Report


179. GESD: Beyond Outcome-Oriented Fairness


180. GQA-μP: The maximal parameterization update for grouped query attention


181. Universal Approximation of Nonlinear Operators and Their Derivatives


182. Autonomous Intelligent Agents for Natural-Language-Driven Web Execution with Integrated Security Assurance


183. PDRNN: Modular Data-driven Pedestrian Dead Reckoning on Loosely Coupled Radio- and Inertial-Signalstreams


184. GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding


185. Reading the Cell, Designing the Cure: Perturbation-Conditioned Molecular Diffusion for Function-Oriented Drug Design


186. Hydra: Efficient, Correct Code Generation via Checkpoint-and-Rollback Support


187. A3D: Agentic AI flow for autonomous Accelerator Design


188. Learning Selective Merge Policies for Deadline-Constrained Coded Caching via Deep Reinforcement Learning


189. PBT-Bench: Benchmarking AI Agents on Property-Based Testing


190. Is Agentic AI Ready for Real-World Hardware Engineering? A Deep Dive with Phoenix-bench


191. Do Biological Structural Guarantees Earn Their Complexity?


192. GenAI-Driven Approach to RISC-V Supply Chain Exploration


193. Effective Harness Engineering for Algorithm Discovery with Coding Agents


194. Always Learning, Always Mixing: Efficient and Simple Data Mixing All The Time


195. An LLM-RAG Approach for Healthy Eating Index-Informed Personalized Food Recommendations


196. Fault tolerance estimation in digital circuits with visualised generative networks


197. Quantization Undoes Alignment: Bias Emergence in Compressed LLMs Across Models and Precision Levels


198. AgentStop: Terminating Local AI Agents Early to Save Energy in Consumer Devices


199. Agent4POI: Agentic Context-Conditioned Affordance Reasoning for Multimodal Point-of-Interest Recommendation


200. Ensuring Logic in the Fog: Sound POMDP Synthesis with LTL Objectives