All AI Papers - 2026-04-13

3. Strategic Algorithmic Monoculture: Experimental Evidence from Coordination Games


2. Process Reward Agents for Steering Knowledge-Intensive Reasoning


3. E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning


4. Do We Really Need to Approach the Entire Pareto Front in Many-Objective Bayesian Optimisation?


5. HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?


6. Mind the Gap Between Spatial Reasoning and Acting! Step-by-Step Evaluation of Agents With Spatial-Gym


7. Constraint-Aware Corrective Memory for Language-Based Drug Discovery Agents


8. SAGE: A Service Agent Graph-guided Evaluation Benchmark


9. DRBENCHER: Can Your Agent Identify the Entity, Retrieve Its Properties and Do the Math?


10. Camera Artist: A Multi-Agent Framework for Cinematic Language Storytelling Video Generation


11. Overhang Tower: Resource-Rational Adaptation in Sequential Physical Planning


12. Advantage-Guided Diffusion for Model-Based Reinforcement Learning


13. Hypergraph Neural Networks Accelerate MUS Enumeration


14. SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment


15. PilotBench: A Benchmark for General Aviation Agents with Safety Constraints


16. Enhancing LLM Problem Solving via Tutor-Student Multi-Agent Interaction


17. StaRPO: Stability-Augmented Reinforcement Policy Optimization


18. SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks


19. Hidden in Plain Sight: Visual-to-Symbolic Analytical Solution Inference from Field Visualizations


20. Artifacts as Memory Beyond the Agent Boundary


21. Model Space Reasoning as Search in Feedback Space for Planning Domain Generation


22. Parameterized Complexity Of Representing Models Of MSO Formulas


23. RAMP: Hybrid DRL for Online Learning of Numeric Action Models


24. Sustained Impact of Agentic Personalisation in Marketing: A Longitudinal Case Study


25. From Business Events to Auditable Decisions: Ontology-Governed Graph Simulation for Enterprise AI


26. OpenKedge: Governing Agentic Mutation with Execution-Bound Safety and Evidence Chains


27. Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism


28. Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision


29. Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise


30. VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images


31. VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning


32. Envisioning the Future, One Step at a Time


33. Semantic Rate-Distortion for Bounded Multi-Agent Communication: Capacity-Derived Semantic Spaces and the Communication Cost of Alignment


34. VISOR: Agentic Visual Retrieval-Augmented Generation via Iterative Search and Over-horizon Reasoning


35. BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation


36. RecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context Retrieval


37. XFED: Non-Collusive Model Poisoning Attack Against Byzantine-Robust Federated Classifiers


38. SafeMind: A Risk-Aware Differentiable Control Framework for Adaptive and Safe Quadruped Locomotion


39. SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning


40. ECHO: Efficient Chest X-ray Report Generation with One-step Block Diffusion


41. Many-Tier Instruction Hierarchy in LLM Agents


42. TME-PSR: Time-aware, Multi-interest, and Explanation Personalization for Sequential Recommendation


43. Physics-guided surrogate learning enables zero-shot control of turbulent wings


44. On the Representational Limits of Quantum-Inspired 1024-D Document Embeddings: An Experimental Evaluation Framework


45. Rays as Pixels: Learning A Joint Distribution of Videos and Camera Trajectories


46. Three Modalities, Two Design Probes, One Prototype, and No Vision: Experience-Based Co-Design of a Multi-modal 3D Data Visualization Tool


47. PhysInOne: Visual Physics Learning and Reasoning in One Suite


48. Yes, But Not Always. Generative AI Needs Nuanced Opt-in


49. The AI Codebase Maturity Model: From Assisted Coding to Self-Sustaining Systems


50. BadSkill: Backdoor Attacks on Agent Skills via Model-in-Skill Poisoning


51. LLM-Rosetta: A Hub-and-Spoke Intermediate Representation for Cross-Provider LLM API Translation


52. Visually-Guided Policy Optimization for Multimodal Reasoning


53. SatQNet: Satellite-assisted Quantum Network Entanglement Routing Using Directed Line Graph Neural Networks


54. SkillMOO: Multi-Objective Optimization of Agent Skills for Software Engineering


55. Mosaic: Multimodal Jailbreak against Closed-Source VLMs via Multi-View Ensemble Optimization


56. DDSP-QbE++: Improving Speech Quality for Speech Anonymisation for Atypical Speech


57. Statistical Properties of the King Wen Sequence: An Anti-Habituation Structure That Does Not Improve Neural Network Training


58. Neural Distribution Prior for LiDAR Out-of-Distribution Detection


59. The Fast Lane Hypothesis: Von Economo Neurons Implement a Biological Speed-Accuracy Tradeoff


60. GRM: Utility-Aware Jailbreak Attacks on Audio LLMs via Gradient-Ratio Masking


61. On the Role of DAG topology in Energy-Aware Cloud Scheduling: A GNN-Based Deep Reinforcement Learning Approach


62. Artificial intelligence can persuade people to take political actions


63. Vision Transformers for Preoperative CT-Based Prediction of Histopathologic Chemotherapy Response Score in High-Grade Serous Ovarian Carcinoma


64. Do LLMs Follow Their Own Rules? A Reflexive Audit of Self-Stated Safety Policies


65. Generalization and Scaling Laws for Mixture-of-Experts Transformers


66. Persona-E$^2$: A Human-Grounded Dataset for Personality-Shaped Emotional Responses to Textual Events


67. Structuring versus Problematizing: How LLM-based Agents Scaffold Learning in Diagnostic Reasoning


68. CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation


69. EquiformerV3: Scaling Efficient, Expressive, and General SE(3)-Equivariant Graph Attention Transformers


70. Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition


71. PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing


72. TensorHub: Scalable and Elastic Weight Transfer for LLM RL Training


73. Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence


74. CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion


75. DeepGuard: Secure Code Generation via Multi-Layer Semantic Aggregation


76. Beyond Isolated Clients: Integrating Graph-Based Embeddings into Event Sequence Models



78. Frequency-Enhanced Diffusion Models: Curriculum-Guided Semantic Alignment for Zero-Shot Skeleton Action Recognition


79. Learning Vision-Language-Action World Models for Autonomous Driving


80. PDE-regularized Dynamics-informed Diffusion with Uncertainty-aware Filtering for Long-Horizon Dynamics


81. Watt Counts: Energy-Aware Benchmark for Sustainable LLM Inference on Heterogeneous GPU Architectures


82. U-Cast: A Surprisingly Simple and Efficient Frontier Probabilistic AI Weather Forecaster


83. CONDESION-BENCH: Conditional Decision-Making of Large Language Models in Compositional Action Space


84. Skill-Conditioned Visual Geolocation for Vision-Language


85. Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Injection


86. Noise-Aware In-Context Learning for Hallucination Mitigation in ALLMs


87. Regime-Conditional Retrieval: Theory and a Transferable Router for Two-Hop QA


88. Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection


89. Towards Linguistically-informed Representations for English as a Second or Foreign Language: Review, Construction and Application


90. ASTRA: Adaptive Semantic Tree Reasoning Architecture for Complex Table Question Answering


91. PinpointQA: A Dataset and Benchmark for Small Object-Centric Spatial Understanding in Indoor Videos


92. PerMix-RLVR: Preserving Persona Expressivity under Verifiable-Reward Alignment


93. Neighbourhood Transformer: Switchable Attention for Monophily-Aware Graph Learning


94. Litmus (Re)Agent: A Benchmark and Agentic System for Predictive Evaluation of Multilingual Models


95. Aligned Agents, Biased Swarm: Measuring Bias Amplification in Multi-Agent Systems


96. WOMBET: World Model-based Experience Transfer for Robust and Sample-efficient Reinforcement Learning


97. MuTSE: A Human-in-the-Loop Multi-use Text Simplification Evaluator


98. Beyond Relevance: Utility-Centric Retrieval in the LLM Era


99. Large-Scale Universal Defect Generation: Foundation Models and Datasets


100. Ge$^\text{2}$mS-T: Multi-Dimensional Grouping for Ultra-High Energy Efficiency in Spiking Transformer


101. Adaptive Dual Residual U-Net with Attention Gate and Multiscale Spatial Attention Mechanisms (ADRUwAMS)


102. A Closer Look at the Application of Causal Inference in Graph Representation Learning


103. HM-Bench: A Comprehensive Benchmark for Multimodal Large Language Models in Hyperspectral Remote Sensing


104. HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation


105. Revisiting the Capacity Gap in Chain-of-Thought Distillation from a Practical Perspective


106. A Mathematical Framework for Temporal Modeling and Counterfactual Policy Simulation of Student Dropout


107. Temporal Dropout Risk in Learning Analytics: A Harmonized Survival Benchmark Across Dynamic and Early-Window Representations


108. MedFormer-UR: Uncertainty-Routed Transformer for Medical Image Classification


109. AudioGuard: Toward Comprehensive Audio Safety Protection Across Diverse Threat Models


110. AI-Induced Human Responsibility (AIHR) in AI-Human teams


111. Scalable High-Recall Constraint-Satisfaction-Based Information Retrieval for Clinical Trials Matching


112. Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs


113. HiFloat4 Format for Language Model Pre-training on Ascend NPUs


114. SenBen: Sensitive Scene Graphs for Explainable Content Moderation


115. Building Better Environments for Autonomous Cyber Defence


116. Scrapyard AI


117. Lessons Without Borders? Evaluating Cultural Alignment of LLMs Using Multilingual Story Moral Generation


118. eBandit: Kernel-Driven Reinforcement Learning for Adaptive Video Streaming


119. PSIRNet: Deep Learning-based Free-breathing Rapid Acquisition Late Enhancement Imaging


120. InstrAct: Towards Action-Centric Understanding in Instructional Videos


121. Cards Against LLMs: Benchmarking Humor Alignment in Large Language Models


122. LLMs Underperform Graph-Based Parsers on Supervised Relation Extraction for Complex Graphs


123. Decomposing the Delta: What Do Models Actually Learn from Preference Pairs?


124. AI Driven Soccer Analysis Using Computer Vision


125. Demystifying the Silence of Correctness Bugs in PyTorch Compiler


126. LMGenDrive: Bridging Multimodal Understanding and Generative World Modeling for End-to-End Driving


127. Accelerating Transformer-Based Monocular SLAM via Geometric Utility Scoring


128. Deep Learning-Based Tracking and Lineage Reconstruction of Ligament Breakup


129. Every Response Counts: Quantifying Uncertainty of LLM-based Multi-Agent Systems through Tensor Decomposition


130. 3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding


131. On Semiotic-Grounded Interpretive Evaluation of Generative Art


132. VOLTA: The Surprising Ineffectiveness of Auxiliary Losses for Calibrated Deep Learning


133. LEGO: Latent-space Exploration for Geometry-aware Optimization of Humanoid Kinematic Design


134. Retrieval Augmented Classification for Confidential Documents


135. Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Post-hoc Uncertainty Estimation


136. Practical Bayesian Inference for Speech SNNs: Uncertainty and Loss-Landscape Smoothing


137. StructRL: Recovering Dynamic Programming Structure from Learning Dynamics in Distributional Reinforcement Learning


138. SkillForge: Forging Domain-Specific, Self-Evolving Agent Skills in Cloud Technical Support


139. From Selection to Scheduling: Federated Geometry-Aware Correction Makes Exemplar Replay Work Better under Continual Dynamic Heterogeneity


140. MARINER: A 3E-Driven Benchmark for Fine-Grained Perception and Complex Reasoning in Open-Water Environments


141. Detection of Hate and Threat in Digital Forensics: A Case-Driven Multimodal Approach


142. Semantic Intent Fragmentation: A Single-Shot Compositional Attack on Multi-Agent AI Pipelines


143. Joint Interference Detection and Identification via Adversarial Multi-task Learning


144. Extrapolating Volition with Recursive Information Markets


145. TiAb Review Plugin: A Browser-Based Tool for AI-Assisted Title and Abstract Screening


146. STIndex: A Context-Aware Multi-Dimensional Spatiotemporal Information Extraction System


147. Adaptive Rigor in AI System Evaluation using Temperature-Controlled Verdict Aggregation via Generalized Power Mean


148. Mapping generative AI use in the human brain: divergent neural, academic, and mental health profiles of functional versus socio emotional AI use


149. From Dispersion to Attraction: Spectral Dynamics of Hallucination Across Whisper Model Scales


150. AlphaLab: Autonomous Multi-Agent Research Across Optimization Domains with Frontier LLMs


151. Act or Escalate? Evaluating Escalation Behavior in Automation with Language Models


152. FluidFlow: a flow-matching generative model for fluid dynamics surrogates on unstructured meshes


153. QCFuse: Query-Centric Cache Fusion for Efficient RAG Inference


154. CSAttention: Centroid-Scoring Attention for Accelerating LLM Inference


155. Multivariate Time Series Anomaly Detection via Dual-Branch Reconstruction and Autoregressive Flow-based Residual Density Estimation


156. On the Spectral Geometry of Cross-Modal Representations: A Functional Map Diagnostic for Multimodal Alignment


157. Structured Exploration and Exploitation of Label Functions for Automated Data Annotation


158. Distributionally Robust Token Optimization in RLHF


159. GAN-Enhanced Deep Reinforcement Learning for Semantic-Aware Resource Allocation in 6G Network Slicing


160. MolPaQ: Modular Quantum-Classical Patch Learning for Interpretable Molecular Generation


161. Distilling Genomic Models for Efficient mRNA Representation Learning via Embedding Matching


162. Silhouette Loss: Differentiable Global Structure Learning for Deep Representations


163. Robust Reasoning Benchmark


164. QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation


165. Can We Still Hear the Accent? Investigating the Resilience of Native Language Signals in the LLM Era


166. Dynamic sparsity in tree-structured feed-forward layers at scale


167. Temperature-Dependent Performance of Prompting Strategies in Extended Reasoning Large Language Models


168. Neural networks for Text-to-Speech evaluation


169. Uncertainty Estimation for the Open-Set Text Classification systems


170. Medical Reasoning with Large Language Models: A Survey and MR-Bench


171. WAND: Windowed Attention and Knowledge Distillation for Efficient Autoregressive Text-to-Speech Models


172. Re-Mask and Redirect: Exploiting Denoising Irreversibility in Diffusion Language Models


173. EMA Is Not All You Need: Mapping the Boundary Between Structure and Content in Recurrent Context


174. Drift and selection in LLM text ecosystems


175. GNN-as-Judge: Unleashing the Power of LLMs for Graph Learning with GNN Feedback


176. Automated Standardization of Legacy Biomedical Metadata Using an Ontology-Constrained LLM Agent


177. Unbiased Rectification for Sequential Recommender Systems Under Fake Orders


178. VerifAI: A Verifiable Open-Source Search Engine for Biomedical Question Answering


179. Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous Behavior Traces


180. On Divergence Measures for Training GFlowNets