전체 AI 논문 - 2026-04-10

1. Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest


2. SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions


3. From Safety Risk to Design Principle: Peer-Preservation in Multi-Agent LLM Systems and Its Implications for Orchestrated Democratic Discourse Analysis


4. KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation


5. Learning Who Disagrees: Demographic Importance Weighting for Modeling Annotator Distributions with DiADEM


6. On-board Telemetry Monitoring in Autonomous Satellites: Challenges and Opportunities


7. Verify Before You Commit: Towards Faithful Reasoning in LLM Agents via Self-Auditing


8. Awakening the Sleeping Agent: Lean-Specific Agentic Data Reactivates General Tool Use in Goedel Prover


9. SkillClaw: Let Skills Evolve Collectively with Agentic Evolver


10. Don’t Overthink It: Inter-Rollout Action Agreement as a Free Adaptive-Compute Signal for LLM Agents


11. ASPECT:Analogical Semantic Policy Execution via Language Conditioned Transfer


12. Human-AI Collaboration Reconfigures Group Regulation from Socially Shared to Hybrid Co-Regulation


13. ProMedical: Hierarchical Fine-Grained Criteria Modeling for Medical LLM Alignment via Explicit Injection


14. U-CECE: A Universal Multi-Resolution Framework for Conceptual Counterfactual Explanations


15. ACF: A Collaborative Framework for Agent Covert Communication under Cognitive Asymmetry


16. Neural-Symbolic Knowledge Tracing: Injecting Educational Knowledge into Deep Learning for Responsible Learner Modelling


17. From Phenomenological Fitting to Endogenous Deduction: A Paradigm Leap via Meta-Principle Physics Architecture


18. HiRO-Nav: Hybrid ReasOning Enables Efficient Embodied Navigation


19. Grounding Clinical AI Competency in Human Cognition Through the Clinical World Model and Skill-Mix Framework


20. Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling


21. Activation Steering for Aligned Open-ended Generation without Sacrificing Coherence



23. Revise: A Framework for Revising OCRed text in Practical Information Systems with Data Contamination Strategy


24. ImplicitMemBench: Measuring Unconscious Behavioral Adaptation in Large Language Models


25. IoT-Brain: Grounding LLMs for Semantic-Spatial Sensor Scheduling


26. “Why This Avoidance Maneuver?” Contrastive Explanations in Human-Supervised Maritime Autonomous Navigation


27. Wiring the ‘Why’: A Unified Taxonomy and Survey of Abductive Reasoning in LLMs


28. Evaluating Counterfactual Explanation Methods on Incomplete Inputs


29. PASK: Toward Intent-Aware Proactive Agents with Long-Term Memory


30. How Far Are Large Multimodal Models from Human-Level Spatial Action? A Benchmark for Goal-Oriented Embodied Navigation in Urban Airspace


31. Are we still able to recognize pearls? Machine-driven peer review and the risk to creativity: An explainable RAG-XAI detection framework with markers extraction


32. WorldMAP: Bootstrapping Vision-Language Navigation Trajectory Prediction with Generative World Models


33. MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems


34. EigentSearch-Q+: Enhancing Deep Research Agents with Structured Reasoning Tools


35. SAT: Balancing Reasoning Accuracy and Efficiency with Stepwise Adaptive Thinking


36. Capture-Quiet Decomposition: A Verification Theorem for Chess Endgame Tablebases


37. Visual Perceptual to Conceptual First-Order Rule Learning Networks


38. DialBGM: A Benchmark for Background Music Recommendation from Everyday Multi-Turn Dialogues


39. An Agentic Evaluation Architecture for Historical Bias Detection in Educational Textbooks


40. Hidden Biases in Conditioning Autoregressive Models


41. SPARD: Self-Paced Curriculum for RL Alignment via Integrating Reward Dynamics and Data Utility


42. Silencing the Guardrails: Inference-Time Jailbreaking via Dynamic Contextual Representation Ablation


43. Automatic Generation of Executable BPMN Models from Medical Guidelines


44. Agentivism: a learning theory for the age of artificial intelligence


45. Lightweight LLM Agent Memory with Small Language Models


46. SEARL: Joint Optimization of Policy and Tool Graph Memory for Self-Evolving Agents


47. Automotive Engineering-Centric Agentic AI Workflow Framework


48. The Accountability Horizon: An Impossibility Theorem for Governing Human-Agent Collectives


49. ACIArena: Toward Unified Evaluation for Agent Cascading Injection


50. Mitigating Distribution Sharpening in Math RLVR via Distribution-Aligned Hint Synthesis and Backward Hint Annealing


51. The Cartesian Cut in Agentic AI


52. CivBench: Progress-Based Evaluation for LLMs’ Strategic Decision-Making in Civilization V


53. Emotion Concepts and their Function in a Large Language Model


54. Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution


55. Towards Knowledgeable Deep Research: Framework and Benchmark


56. IatroBench: Pre-Registered Evidence of Iatrogenic Harm from AI Safety Measures


57. Multi-Agent Orchestration for High-Throughput Materials Screening on a Leadership-Class System


58. From Debate to Decision: Conformal Social Choice for Safe Multi-Agent Deliberation


59. Bridging Natural Language and Interactive What-If Interfaces via LLM-Generated Declarative Specification


60. How Independent are Large Language Models? A Statistical Framework for Auditing Behavioral Entanglement and Reweighting Verifier Ensembles


61. PRIME: Training Free Proactive Reasoning via Iterative Memory Evolution for User-Centric Agent


62. Reasoning Graphs: Deterministic Agent Accuracy through Evidence-Centric Chain-of-Thought Feedback


63. Too long; didn’t solve


64. From Papers to Property Tables: A Priority-Based LLM Workflow for Materials Data Extraction


65. Dual-Loop Control in DCVerse: Advancing Reliable Deployment of AI in Data Centers via Digital Twins



67. Trust the AI, Doubt Yourself: The Effect of Urgency on Self-Confidence in Human-AI Interaction


68. Rhizome OS-1: Rhizome’s Semi-Autonomous Operating System for Small Molecule Drug Discovery


69. ReflectRM: Boosting Generative Reward Models via Self-Reflection within a Unified Judgment Framework


70. CLEAR: Context Augmentation from Contrastive Learning of Experience via Agentic Reflection


71. ConsistRM: Improving Generative Reward Models via Consistency-Aware Self-Training


72. M-ArtAgent: Evidence-Based Multimodal Agent for Implicit Art Influence Discovery


73. Munkres’ General Topology Autoformalized in Isabelle/HOL


74. An Analysis of Artificial Intelligence Adoption in NIH-Funded Research


75. Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models


76. SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds


77. Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts


78. AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation


79. OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks


80. RewardFlow: Generate Images by Optimizing What You Reward


81. PSI: Shared State as the Missing Layer for Coherent AI-Generated Instruments in Personal AI Agents


82. What Drives Representation Steering? A Mechanistic Case Study on Steering Refusal


83. ClawBench: Can AI Agents Complete Everyday Online Tasks?


84. Differentially Private Language Generation and Identification in the Limit


85. Quantifying Explanation Consistency: The C-Score Metric for CAM-Based Explainability in Medical Image Classification


86. PIArena: A Platform for Prompt Injection Evaluation


87. Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization


88. TTVS: Boosting Self-Exploring Reinforcement Learning via Test-time Variational Synthesis


89. OVS-DINO: Open-Vocabulary Segmentation via Structure-Aligned SAM-DINO with Language Guidance


90. A Machine Learning Framework for Turbofan Health Estimation via Inverse Problem Formulation


91. CrashSight: A Phase-Aware, Infrastructure-Centric Video Benchmark for Traffic Crash Scene Understanding and Reasoning


92. HST-HGN: Heterogeneous Spatial-Temporal Hypergraph Networks with Bidirectional State Space Models for Global Fatigue Assessment


93. Small-scale photonic Kolmogorov-Arnold networks using standard telecom nonlinear modules


94. KV Cache Offloading for Context-Intensive Tasks


95. Synthetic Data for any Differentiable Target


96. Exploring Temporal Representation in Neural Processes for Multimodal Action Prediction


97. Selective Attention System (SAS): Device-Addressed Speech Detection for Real-Time On-Device Voice AI


98. Zero-shot Multivariate Time Series Forecasting Using Tabular Prior Fitted Networks


99. ADAPTive Input Training for Many-to-One Pre-Training on Time-Series Classification


100. Phantasia: Context-Adaptive Backdoors in Vision Language Models


101. TASU2: Controllable CTC Simulation for Alignment and Low-Resource Adaptation of Speech LLMs


102. A GAN and LLM-Driven Data Augmentation Framework for Dynamic Linguistic Pattern Modeling in Chinese Sarcasm Detection


103. Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems


104. Scalable Neural Decoders for Practical Fault-Tolerant Quantum Computation


105. PokeGym: A Visually-Driven Long-Horizon Benchmark for Vision-Language Models


106. InstAP: Instance-Aware Vision-Language Pre-Train for Spatial-Temporal Understanding


107. Dead Weights, Live Signals: Feedforward Graphs of Frozen Language Models


108. Lost in the Hype: Revealing and Dissecting the Performance Degradation of Medical Multimodal Large Language Models in Image Classification


109. Multi-Modal Learning meets Genetic Programming: Analyzing Alignment in Latent Space Optimization


110. HistDiT: A Structure-Aware Latent Conditional Diffusion Model for High-Fidelity Virtual Staining in Histopathology


111. Securing Retrieval-Augmented Generation: A Taxonomy of Attacks, Defenses, and Future Directions


112. DMax: Aggressive Parallel Decoding for dLLMs


113. SeLaR: Selective Latent Reasoning in Large Language Models


114. Can Vision Language Models Judge Action Quality? An Empirical Evaluation


115. CIAO - Code In Architecture Out - Automated Software Architecture Documentation with Large Language Models


116. Distributed Multi-Layer Editing for Rule-Level Knowledge in Large Language Models


117. QARIMA: A Quantum Approach To Classical Time Series Analysis


118. DBMF: A Dual-Branch Multimodal Framework for Out-of-Distribution Detection


119. Behavior-Aware Item Modeling via Dynamic Procedural Solution Representations for Knowledge Tracing


120. HyperMem: Hypergraph Memory for Long-Term Conversations


121. EditCaption: Human-Aligned Instruction Synthesis for Image Editing via Supervised Fine-Tuning and Direct Preference Optimization


122. MedVR: Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning


123. AT-ADD: All-Type Audio Deepfake Detection Challenge Evaluation Plan


124. OceanMAE: A Foundation Model for Ocean Remote Sensing


125. ViVa: A Video-Generative Value Model for Robot Reinforcement Learning


126. Face-D(^2)CL: Multi-Domain Synergistic Representation with Dual Continual Learning for Facial DeepFake Detection


127. Multimodal Reasoning with LLM for Encrypted Traffic Interpretation: A Benchmark


128. Alloc-MoE: Budget-Aware Expert Activation Allocation for Efficient Mixture-of-Experts Inference


129. LegoDiffusion: Micro-Serving Text-to-Image Diffusion Workflows


130. Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator


131. Small Vision-Language Models are Smart Compressors for Long Video Understanding


132. TADP-RME: A Trust-Adaptive Differential Privacy Framework for Enhancing Reliability of Data-Driven Systems


133. OV-Stitcher: A Global Context-Aware Framework for Training-Free Open-Vocabulary Semantic Segmentation


134. AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models


135. From Gaze to Guidance: Interpreting and Adapting to Users’ Cognitive Needs with Multimodal Gaze-Aware AI Assistants


136. Governed Capability Evolution for Embodied Agents: Safe Upgrade, Compatibility Checking, and Runtime Rollback for Embodied Capability Modules


137. 3DrawAgent: Teaching LLM to Draw in 3D with Early Contrastive Experience


138. LINE: LLM-based Iterative Neuron Explanations for Vision Models


139. PrivFedTalk: Privacy-Aware Federated Diffusion with Identity-Stable Adapters for Personalized Talking-Head Generation


140. From Universal to Individualized Actionability: Revisiting Personalization in Algorithmic Recourse


141. SearchAD: Large-Scale Rare Image Retrieval Dataset for Autonomous Driving


142. The ecosystem of machine learning competitions: Platforms, participants, and their impact on AI development


143. Show Me the Infographic I Imagine: Intent-Aware Infographic Retrieval for Authoring Support


144. LogAct: Enabling Agentic Reliability via Shared Logs


145. A Decomposition Perspective to Long-context Reasoning for LLMs


146. AtomEval: Atomic Evaluation of Adversarial Claims in Fact Verification


147. DSCA: Dynamic Subspace Concept Alignment for Lifelong VLM Editing


148. Rethinking Data Mixing from the Perspective of Large Language Models


149. TOOLCAD: Exploring Tool-Using Large Language Models in Text-to-CAD Generation with Reinforcement Learning


150. Pruning Extensions and Efficiency Trade-Offs for Sustainable Time Series Classification


151. Investigation of Automated Design of Quantum Circuits for Imaginary Time Evolution Methods Using Deep Reinforcement Learning


152. Incremental Residual Reinforcement Learning Toward Real-World Learning for Social Navigation


153. On-Policy Distillation of Language Models for Autonomous Vehicle Motion Planning


154. Large Language Model Post-Training: A Unified View of Off-Policy and On-Policy Learning


155. Same Outcomes, Different Journeys: A Trace-Level Framework for Comparing Human and GUI-Agent Behavior in Production Search Systems


156. Sinkhorn doubly stochastic attention rank decay analysis


157. Mitigating Entangled Steering in Large Vision-Language Models for Hallucination Reduction


158. Dynamic Attentional Context Scoping: Agent-Triggered Focus Sessions for Isolated Per-Agent Steering in Multi-Agent LLM Orchestration


159. AnomalyAgent: Agentic Industrial Anomaly Synthesis via Tool-Augmented Reinforcement Learning


160. TSUBASA: Improving Long-Horizon Personalization via Evolving Memory and Self-Learning with Context Distillation


161. Data Selection for Multi-turn Dialogue Instruction Tuning


162. Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition


163. FlowGuard: Towards Lightweight In-Generation Safety Detection for Diffusion Models via Linear Latent Decoding


164. PyVRP$^+$: LLM-Driven Metacognitive Heuristic Evolution for Hybrid Genetic Search in Vehicle Routing Problems


165. Task-Adaptive Retrieval over Agentic Multi-Modal Web Histories via Learned Graph Memory


166. Networking-Aware Energy Efficiency in Agentic AI Inference: A Survey


167. QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training–Inference Mismatch


168. ReRec: Reasoning-Augmented LLM-based Recommendation Assistant via Reinforcement Fine-tuning


169. Filling the Gaps: Selective Knowledge Augmentation for LLM Recommenders


170. LPM 1.0: Video-based Character Performance Model


171. Loop, Think, & Generalize: Implicit Reasoning in Recurrent-Depth Transformers


172. More Capable, Less Cooperative? When LLMs Fail At Zero-Cost Collaboration


173. PolicyLong: Towards On-Policy Context Extension


174. The Weaponization of Computer Vision: Tracing Military-Surveillance Ties through Conference Sponsorship


175. Latent Anomaly Knowledge Excavation: Unveiling Sparse Sensitive Neurons in Vision-Language Models


176. TEMPER: Testing Emotional Perturbation in Quantitative Reasoning


177. Learning Without Losing Identity: Capability Evolution for Embodied Agents


178. Toward Generalizable Graph Learning for 3D Engineering AI: Explainable Workflows for CAE Mode Shape Classification and CFD Field Prediction


179. Sensitivity-Positional Co-Localization in GQA Transformers


180. Beyond Surface Artifacts: Capturing Shared Latent Forgery Knowledge Across Modalities


181. DailyArt: Discovering Articulation from Single Static Images via Latent Dynamics


182. MIMIC-Py: An Extensible Tool for Personality-Driven Automated Game Testing with Large Language Models


183. Beyond Pedestrians: Caption-Guided CLIP Framework for High-Difficulty Video-based Person Re-Identification


184. TrajGuard: Streaming Hidden-state Trajectory Detection for Decoding-time Jailbreak Defense



186. AITH: A Post-Quantum Continuous Delegation Protocol for Human-AI Trust Establishment


187. Joint Task Offloading, Inference Optimization and UAV Trajectory Planning for Generative AI Empowered Intelligent Transportation Digital Twin


188. Reinforcement Learning with LLM-Guided Action Spaces for Synthesizable Lead Optimization


189. An Imperfect Verifier is Good Enough: Learning with Noisy Rewards


190. Optimal Decay Spectra for Linear Recurrences


191. Cognitive-Causal Multi-Task Learning with Psychological State Conditioning for Assistive Driving Perception


192. Safe Large-Scale Robust Nonlinear MPC in Milliseconds via Reachability-Constrained System Level Synthesis on the GPU


193. Exponential quantum advantage in processing massive classical data


194. Sheaf-Laplacian Obstruction and Projection Hardness for Cross-Modal Compatibility on a Modality-Independent Site


195. DIVERSED: Relaxed Speculative Decoding via Dynamic Ensemble Verification


196. Towards Real-Time Human-AI Musical Co-Performance: Accompaniment Generation with Latent Diffusion Models and MAX/MSP


197. Google, AI Literacy, and the Learning Sciences: Multiple Modes of Research, Industry, and Practice Partnerships


198. From Ground Truth to Measurement: A Statistical Framework for Human Labeling


199. DCD: Domain-Oriented Design for Controlled Retrieval-Augmented Generation


200. Don’t Measure Once: Measuring Visibility in AI Search (GEO)


201. Learning is Forgetting: LLM Training As Lossy Compression


202. Reasoning-Based Refinement of Unsupervised Text Clusters with LLMs


203. Generative Experiences for Digital Mental Health Interventions: Evidence from a Randomized Study


204. TR-EduVSum: A Turkish-Focused Dataset and Consensus Framework for Educational Video Summarization


205. MCP-DPT: A Defense-Placement Taxonomy and Coverage Analysis for Model Context Protocol Security


206. EMSDialog: Synthetic Multi-person Emergency Medical Service Dialogue Generation from Electronic Patient Care Reports via Multi-LLM Agents


207. RL-ASL: A Dynamic Listening Optimization for TSCH Networks Using Reinforcement Learning


208. The Shrinking Lifespan of LLMs in Science


209. SYN-DIGITS: A Synthetic Control Framework for Calibrated Digital Twin Simulation


210. Beyond Human-Readable: Rethinking Software Engineering Conventions for the Agentic Development Era


211. Triage: Routing Software Engineering Tasks to Cost-Effective LLM Tiers via Code Quality Signals


212. Cluster Attention for Graph Machine Learning


213. Enabling Intrinsic Reasoning over Dense Geospatial Embeddings with DFR-Gemma


214. Private Seeds, Public LLMs: Realistic and Privacy-Preserving Synthetic Data Generation


215. Active Reward Machine Inference From Raw State Trajectories


216. When Switching Algorithms Helps: A Theoretical Study of Online Algorithm Selection


217. CMP: Robust Whole-Body Tracking for Loco-Manipulation via Competence Manifold Projection


218. GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents


219. Regret-Aware Policy Optimization: Environment-Level Memory for Replay Suppression under Delayed Harm


220. GIRL: Generative Imagination Reinforcement Learning via Information-Theoretic Hallucination Control


221. SubSearch: Intermediate Rewards for Unsupervised Guided Reasoning in Complex Retrieval


222. FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios


223. Reinforcement Learning with Reward Machines for Sleep Control in Mobile Networks


224. Conservation Law Breaking at the Edge of Stability: A Spectral Theory of Non-Convex Neural Network Optimization


225. Breaking the Illusion of Identity in LLM Tooling


226. Data Warmup: Complexity-Aware Curricula for Efficient Diffusion Training


227. A Physical Agentic Loop for Language-Guided Grasping with Execution-State Monitoring


228. DSPR: Dual-Stream Physics-Residual Networks for Trustworthy Industrial Time Series Forecasting


229. Self-Calibrating LLM-Based Analog Circuit Sizing with Interpretable Design Equations


230. Playing DOOM with 1.3M Parameters: Specialized Small Models vs Large Language Models for Real-Time Game Control


231. Decisions and Deployment: The Five-Year SAHELI Project (2020-2025) on Restless Multi-Armed Bandits for Improving Maternal and Child Health


232. Latent Structure of Affective Representations in Large Language Models


233. The Role of Emotional Stimuli and Intensity in Shaping Large Language Model Behavior


234. Position Paper: From Edge AI to Adaptive Edge AI


235. Hybrid CNN-Transformer Architecture for Arabic Speech Emotion Recognition


236. Prediction Arena: Benchmarking AI Models on Real-World Prediction Markets


237. Contextual Earnings-22: A Speech Recognition Benchmark with Custom Vocabulary in the Wild