All AI Papers - 2026-05-22

1. MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems


2. Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention


3. LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems


4. Deep Reinforcement Learning for Flexible Job Shop Scheduling with Random Job Arrivals



6. Towards a General Intelligence and Interface for Wearable Health Data


7. HarnessAPI: A Skill-First Framework for Unified Streaming APIs and MCP Tools


8. Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models


9. Can AI Make Conflicts Worse? An Alignment Failure in LLM Deployment Across Conflict Contexts


10. Parametric Modular Answer Set Programs Made Declarative


11. AMEL: Accumulated Message Effects on LLM Judgments


12. Beyond the Org Chart: AI and the Transformation of Invisible Work


13. Forecasting Scientific Progress with Artificial Intelligence


14. Is Capability a Liability? More Capable Language Models Make Worse Forecasts When It Matters Most


15. WorkstreamBench: Evaluating LLM Agents on End-to-End Spreadsheet Tasks in Finance


16. Claw AI Lab: An Autonomous Multi-Agent Research Team


17. AtelierEval: Agentic Evaluation of Humans & LLMs as Text-to-Image Prompters


18. Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning


19. Think Thrice Before You Speak: Dual knowledge-enhanced Theory-of-Mind Reasoning for Persuasive Agents


20. TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks


21. A Subjective Logic-based method for runtime confidence updates in safety arguments


22. Meta-Learning for Rapid Adaptation in Reference Tracking of Uncertain Nonlinear Systems


23. Search-E1: Self-Distillation Drives Self-Evolution in Search-Augmented Reasoning


24. Towards Direct Evaluation of Harness Optimizers via Priority Ranking


25. LACO: Adaptive Latent Communication for Collaborative Driving


26. Compiling Agentic Workflows into LLM Weights: Near-Frontier Quality at Two Orders of Magnitude Less Cost


27. KAPPS: A knowledge-based CPPS Architecture for the Circular Factory


28. S2ED: From Story to Executable Descriptions for Consistency-Aware Story Illustration


29. Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings


30. Scaling Observation-aware Planning in Uncertain Domains


31. Meta-Soft: Leveraging Composable Meta-Tokens for Context-Preserving KV Cache Compression


32. Evaluation of Pipelines for Data Integration into Knowledge Graphs


33. Cross-domain benchmarks reveal when coordinated AI agents improve scientific inference from partial evidence


34. SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules


35. Unlocking Proactivity in Task-Oriented Dialogue


36. Evaluating Large Language Models as Live Strategic Agents: Provider Performance, Hybrid Decomposition, and Operational Gaps in Timed Risk Play


37. SGR-Bench: Benchmarking Search Agents on State-Gated Retrieval


38. Towards a compositional semantics for quantitative confidence assessment in assurance arguments


39. CLORE: Content-Level Optimization for Reasoning Efficiency


40. Skill Weaving: Efficient LLM Improvement via Modular Skillpacks


41. LLM-Metrics: Measuring Research Impact Through Large Language Model Memory


42. Measuring Cross-Modal Synergy: A Benchmark for VLM Explainability


43. Adapting the Interface, Not the Model: Runtime Harness Adaptation for Deterministic LLM Agents


44. ST-SimDiff: Balancing Spatiotemporal Similarity and Difference for Efficient Video Understanding with MLLMs


45. IdleSpec: Exploiting Idle Time via Speculative Planning for LLM Agents


46. Ratchet: A Minimal Hygiene Recipe for Self-Evolving LLM Agents


47. Efficient Agentic Reasoning Through Self-Regulated Simulative Planning


48. Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?


49. ArborKV: Structure-Aware KV Cache Management for Scaling Tree-based LLM Reasoning


50. ExComm: Exploration-Stage Communication for Error-Resilient Agentic Test-Time Scaling


51. MPDocBench-Parse: Benchmarking Practical Multi-page Document Parsing


52. Knowledge Graph Re-engineering Along the Ontological Continuum (extended version)


53. A Camera-Cooperative ISAC Framework for Multimodal Non-Cooperative UAVs Sensing


54. Enhancing Visual Token Representations for Video Large Language Models via Training-Free Spatial-Temporal Pooling and Gridding


55. Active Evidence-Seeking and Diagnostic Reasoning in Large Language Models for Clinical Decision Support


56. The Log is the Agent: Event-Sourced Reactive Graphs for Auditable, Forkable Agentic Systems


57. ECPO: Evidence-Coupled Policy Optimization for Evidence-Certified Candidate Ranking


58. Echo: Learning from Experience Data via User-Driven Refinement


59. Format-Constraint Coupling in Knowledge Graph Construction from Statistical Tables


60. AI-Enabled Serious Games: Integrating Intelligence and Adaptivity in Training Systems


61. Planning in the LLM Era: Building for Reliability and Efficiency


62. FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation


63. Toward AI VIS Co-Scientists: A General and End-to-End Agent Harness for Solving Complex Data Visualization Tasks


64. Implicit Safety Alignment from Crowd Preferences


65. Trace2Skill: Verifier-Guided Skill Evolution for Long-Context EDA Agents


66. What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct


67. A Causal Argumentation Method for Explainability of Machine Learning Models


68. Who Uses AI? Platforms, Workforce, and AI Exposure


69. SMDD-Bench: Can LLMs Solve Real-World Small Molecule Drug Design Tasks?


70. AttuneBench: A Conversation-Based Benchmark for LLM Emotional Intelligence


71. Latent-space Attacks for Refusal Evasion in Language Models


72. The Impact of AI Usage and Informativeness on Skill Development in Logical Reasoning


73. Investigating Concept Alignment Using Implausible Category Members


74. AOP-Wiki EMOD 3.0: Data Model Expansions and Content Evaluation Framework for Using Agentic AI to Improve Integration between AOPs and New Approach Methodologies (NAMs)


75. MindLoom: Composing Thought Modes for Frontier-Level Reasoning Data Synthesis


76. The Shape of Testimony: A Scalable Framework for Oral History Archive Comparison


77. TO-Agents: A Multi-Agent AI Pipeline for Preference-Guided Topology Optimization


78. Benchmarking and Improving Monitors for Out-Of-Distribution Alignment Failure in LLMs



80. The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning


81. Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting Models


82. DeltaBox: Scaling Stateful AI Agents with Millisecond-Level Sandbox Checkpoint/Rollback


83. SDPM: Survival Diffusion Probabilistic Model for Continuous-Time Survival Analysis


84. MambaGaze: Bidirectional Mamba with Explicit Missing Data Modeling for Cognitive Load Assessment from Eye-Gaze Tracking Data


85. CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation


86. Reducing Political Manipulation with Consistency Training


87. Understanding Data Temporality Impact on Large Language Models Pre-training


88. Cyber-Physical Anomaly Detection in IoT-Enabled Smart Grids Using Machine Learning and Metaheuristic Feature Optimization


89. Superhuman Safe and Agile Racing through Multi-Agent Reinforcement Learning


90. Proxy-Based Approximation of Shapley and Banzhaf Interactions


91. The Distillation Game: Adaptive Attacks & Efficient Defenses


92. Post-Training is About States, Not Tokens: A State Distribution View of SFT, RL, and On-Policy Distillation


93. The Value of Covariance Matching in Gaussian DDPMs and the Lanczos Sampler


94. Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators


95. AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild


96. Abstraction for Offline Goal-Conditioned Reinforcement Learning


97. Scout-Assisted Planning for Heterogeneous Robot Teams under Partially Known Environments


98. Swift Sampling: Selecting Temporal Surprises via Taylor Series


99. Moral Semantics Survive Machine Translation: Cross-Lingual Evidence from Moral Foundations Corpora


100. More Context, Larger Models, or Moral Knowledge? A Systematic Study of Schwartz Value Detection in Political Texts


101. Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents


102. Healthcare LLM Benchmarks Are Only as Good as Their Explicit Assumptions


103. Agentic CLEAR: Automating Multi-Level Evaluation of LLM Agents


104. Innovations in Cardless Artificial Intelligence Banking: A Comprehensive Framework for Cyber Secure and Fraud Mitigation using Machine Learning Algorithms


105. MoSA: Motion-constrained Stress Adaptation for Mitigating Real-to-Sim Gap in Continuum Dynamics via Learning Residual Anisotropy


106. SceneAligner: 3D-Grounded Floorplan Localization in the Wild


107. Beyond Temperature: Hyperfitting as a Late-Stage Geometric Expansion


108. VGenST-Bench: A Benchmark for Spatio-Temporal Reasoning via Active Video Synthesis


109. Measuring Security Without Fooling Ourselves: Why Benchmarking Agents Is Hard


110. Case-Aware Medical Image Classification with Multimodal Knowledge Graphs and Reliability-Guided Refinement


111. Dynamic Hypergraph Representation Learning for Multivariate Time Series without Prior Knowledge


112. Stabilising Explainability Fragility in Cybersecurity AI: The Impact and Mitigation of Multicollinearity in Public Benchmark Datasets



114. The Neural Compiler: Program-to-Network Translation for Hybrid Scientific Machine Learning


115. Understanding Multimodal Failure in Action-Chunking Behavioral Cloning


116. Implicit Regularization of Mini-Batch Training in Graph Neural Networks


117. BioFormer: Rethinking Cross-Subject Generalization via Spectral Structural Alignment in Biomedical Time-Series


118. From Correlation to Cause: A Five-Stage Methodology for Feature Analysis in Transformer Language Models


119. Steins;Gate Drive: Semantic Safety Arbitration over Structured Futures for Latency-Decoupled LLM Planning


120. Making the Discrete Continuous: Synthetic RAW Augmentations for Fine-Grained Evaluation of Person Detection Performance in Low Light


121. Don’t Forget the Critic: Value-Based Data Rehearsal for Multi-Cyclic Continual Reinforcement Learning


122. Pre-VLA: Preemptive Runtime Verification for Reliable Vision-Language-Action and World-Model Rollouts


123. A Constant-Time Implementation Methodology for Activation Functions on Microcontrollers


124. Characterizing the Fault Response of the Intel Neural Compute Stick 2 Under Single-Pulse Electromagnetic Fault Injection


125. FastTab: A Fast Table Recognizer with a Tiny Recursive Module and 1D Transformers


126. Diffusion-guided Generalizable Enhancer for Urban Scene Reconstruction


127. Towards Clinically Interpretable Ophthalmic VQA via Spatially-Grounded Lesion Evidence


128. DeferMem: Query-Time Evidence Distillation via Reinforcement Learning for Long-Term Memory QA


129. Cross-Subject EEG Emotion Recognition Based on Temporal Asynchronous Alignment Contrastive Learning


130. VeriScale: Adversarial Test-Suite Scaling for Verifiable Code Generation


131. TimeGuard: Channel-wise Pool Training for Backdoor Defense in Time Series Forecasting


132. Incentive-Aligned Vehicle-to-Vehicle Energy Trading via Nash-Integrated Multi-Agent Reinforcement Learning


133. VEELA: A Clinically-Constrained Benchmark for Liver Vessel Segmentation in Computed Tomography Angiography


134. TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation


135. Bernini: Latent Semantic Planning for Video Diffusion


136. Sibyl-AutoResearch: Autonomous Research Needs Self-Evolving Trial-and-Error Harnesses, Not Paper Generators


137. 4D-GSW: Kinematic-Aware Spatio-Temporal Consistent Watermarking for 4D Gaussian Splatting


138. SepsisAI Orchestrator: A Containerized and Scalable Platform for Deploying AI Models and Real-Time Monitoring in Early Sepsis Detection


139. Benchmarking Autonomous Agents against Temporal, Spatial, and Semantic Evasions


140. ACCoRD: Actor-Critic Conflict Resolution with Deep learning for O-RAN xApps


141. One LR Doesn’t Fit All: Heavy-Tail Guided Layerwise Learning Rates for LLMs


142. EmoTrack: Robust Depression Tracking from Counseling Transcripts across Session Regimes


143. MuKV: Multi-Grained KV Cache Compression for Long Streaming Video Question-Answering


144. Impact of Atmospheric Turbulence and Pointing Error on Earth Observation


145. Detecting Atypical Clients in Federated Learning via Representation-Level Divergence


146. Tailoring Teaching to Aptitude: Direction-Adaptive Self-Distillation for LLM Reasoning


147. What are the Right Symmetries for Formal Theorem Proving?


148. Explainable AI for Data-Driven Design of High-Dimensional Predictive Studies



150. Temporal Coding as a Substrate for Sensorimotor Object Inference: A Spiking Reinterpretation of Thousand Brains Architecture


151. OSS: Open Suturing Skills Vision-Based Assessment Challenge 2024-2025


152. Action with Visual Primitives


153. SWE-Mutation: Can LLMs Generate Reliable Test Suites in Software Engineering?


154. One-Way Policy Optimization for Self-Evolving LLMs


155. Short-Term-to-Long-Term Memory Transfer for Knowledge Graphs under Partial Observability


156. Atom-level Protein Representation Learning Improves Protein Structure Prediction


157. Adversarial Trust Poisoning in Vehicular Collaborative Perception


158. TextTeacher: What Can Language Teach About Images?


159. Not Yet: Humans Outperform LLMs in a Colonel Blotto Tournament


160. LVDrive: Latent Visual Representation Enhanced Vision-Language-Action Autonomous Driving Model


161. JMed48k: A Multi-Profession Japanese Medical Licensing Benchmark for Vision-Language Model Evaluation


162. From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning


163. Echo4DIR: 4D Implicit Heart Reconstruction from 2D Echocardiography Videos


164. Safeguarding Text-to-Image Generative Models Against Unauthorized Knowledge Distillation


165. Prototype-Guided Classification Sub-Task Decoupling Framework: Enhancing Generalization and Interpretability for Multivariate Time Series


166. LABO: LLM-Accelerated Bayesian Optimization through Broad Exploration and Selective Experimentation


167. Secure and Parallel Determinant Computation for Large-Scale Matrices in Edge Environments


168. GA-VLN: Geometry-Aware BEV Representation for Efficient Vision-Language Navigation


169. AgroVG: A Large-Scale Multi-Source Benchmark for Agricultural Visual Grounding


170. FRED: A Multi-Modal Autonomous Driving Dataset for Flooded Road Environments


171. From TF-IDF to Transformers: A Comparative and Ensemble Approach to Sentiment Classification


172. Blind Spots in the Guard: How Domain-Camouflaged Injection Attacks Evade Detection in Multi-Agent LLM Systems


173. Virtual 3D H&E Staining from Phase-contrast Back-illumination Interference Tomography


174. From Patches to Trajectories: Privileged Process Supervision for Software-Engineering Agents


175. Ex-GraphRAG: Interpretable Evidence Routing for Graph-Augmented LLMs


176. Learning Spatiotemporal Sensitivity in Video LLMs via Counterfactual Reinforcement Learning


177. Interpreting and Enhancing Emotional Circuits in Large Vision-Language Models via Cross-Modal Information Flow


178. Video as Natural Augmentation: Towards Unified AI-Generated Image and Video Detection


179. LLM Retrieval for Stable and Predictable Ad Recommendations


180. ChronoMedicalWorld: A Medical World Model for Learning Patient Trajectories from Longitudinal Care Data


181. MLLMs Know When Before Speaking: Revealing and Recovering Temporal Grounding via Attention Cues


182. Thermodynamic Irreversibility of Training Algorithms


183. CausalGuard: Conformal Inference under Graph Uncertainty


184. SDGBiasBench: Benchmarking and Mitigating Vision–Language Models’ Biases in Sustainable Development Goals


185. MAVEN: A Multi-stage Agentic Annotation Pipeline for Video Reasoning Tasks


186. Engineering Hybrid Physics-Informed Neural Networks for Next-Generation Electricity Systems: A State-of-the-Art Review


187. Two-Stage Multimodal Framework for Emotion Mimicry Intensity Prediction


188. EvoScene-VLA: Evolving Scene Beliefs Inside the Action Decoder for Chunked Robot Control


189. Learning Emergent Modular Representations in Multi-modality Medical Vision Foundation Models


190. The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation


191. CrossVLA: Cross-Paradigm Post-Training and Inference Optimization for Vision-Language-Action Models


192. OPPO: Bayesian Value Recursion for Token-Level Credit Assignment in LLM Reasoning


193. ACC: Compiling Agent Trajectories for Long-Context Training


194. Comparing LLM and Fine-Tuned Model Performance on NVDRS Circumstance Extraction with Varying Prompt Complexity


195. An Open Multi-Center Whole-Body FDG PET/CT Foundation Model for Tumor Segmentation


196. Does Slightly Mean Somewhat? Measuring Vague Intensity Words in LLM Numeric Actions


197. Residual Skill Optimization for Text-to-SQL Ensembles


198. Patch Hierarchical Attention Transformer for Efficient Particle Jet Tagging


199. Understanding Perspectives of Patients, Caregivers and Clinicians towards Emerging Collaborative-decision Making Technologies


200. PEARL: Unbiased Percentile Estimation via Contrastive Learning for Industrial-Scale Livestream Recommendation


201. Support-aware offline policy selection for advertising marketplaces


202. Probabilistic Attribution For Large Language Models


203. TBP-mHC: full expressivity for manifold-constrained hyper connections through transportation polytopes


204. Learning Altruistic Collaboration in Heterogeneous Multi-Team Systems


205. PocketAgents: A Manifest-Driven Library of Autonomous Defense Agents


206. MRecover: A Conditional Generative Model for Recovering Motion-Corrupted MR images Using AI Generated Contrast


207. Planning, Scheduling, and Behavior in EV Charging Systems: A Critical Survey and Trilemma Framework


208. Hierarchical Variational Policies for Reward-Guided Diffusion


209. Value-Gradient Hypothesis of RL for LLMs


210. Amplifying, Not Learning: Fine-Tuned AI Text Detectors Amplify a Pretrained Direction


211. Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming


212. Addressing the Synergy Gap: The Six Elements of the Design Space


213. Faster Completion, Less Learning: Generative AI Reduced Study Time on Math Problems and the Knowledge They Build


214. Flat-Pack Bench: Evaluating Spatio-Temporal Understanding in Large Vision-Language Models through Furniture Assembly


215. CR4T: Rewrite-Based Guardrails for Adolescent LLM Safety


216. When Are Teacher Tokens Reliable? Position-Weighted On-Policy Self-Distillation for Reasoning


217. Scalable On-Policy Reinforcement Learning via Adaptive Batch Scaling


218. Local Covariate Selection for Average Causal Effect Estimation without Pretreatment and Causal Sufficiency Assumptions


219. RefusalBench: Why Refusal Rate Misranks Frontier LLMs on Biological Research Prompts


220. Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs


221. Detecting Synthetic Political Narratives in Cross-Platform Social Media Discourse


222. A Reproducible Log-Driven AutoML Framework for Interpretable Pipeline Optimization in Healthcare Risk Prediction


223. Tackle CSM in JPEG Steganalysis with Data Adaptation


224. Protein Thoughts: Interpretable Reasoning with Tree of Thoughts and Embedding-Space Flow Matching for Protein-Protein Interaction Discovery


225. Harnesses for Inference-Time Alignment over Execution Trajectories


226. Predicting Performance of Symbolic and Prompt Programs with Examples


227. Visibility nowcasting in South Korea: a machine learning approach to class imbalance and distribution shift


228. Multivariate Financial Forecasting using the Chronos Time Series Foundation Models


229. Graph neural network explanations reveal a topological signature of disease-associated hubs in biological networks


230. Autonomous LLM Agents & CTFs: A Second Look


231. HealthCraft: A Reinforcement Learning Safety Environment for Emergency Medicine


232. Don’t Collapse Your Features: Why CenterLoss Hurts OOD Detection and Multi-Scale Mahalanobis Wins


233. The Attribution Impossibility: No Feature Ranking Is Faithful, Stable, and Complete Under Collinearity


234. Teaching Language Models to Forecast Research Success Through Comparative Idea Evaluation


235. High-speed Networking for Giga-Scale AI Factories


236. Memory-Induced Supra-Competitive Outcomes Between Deep Reinforcement Learning Agents in Optimal Trade Execution