전체 AI 논문 - 2026-03-19

1. AgentFactory: A Self-Evolving Framework Through Executable Subagent Accumulation and Reuse


2. RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy


3. Governed Memory: A Production Architecture for Multi-Agent Workflows


4. Facts as First Class Objects: Knowledge Objects for Persistent LLM Memory



6. MALLES: A Multi-agent LLMs-based Economic Sandbox with Consumer Preference Alignment


7. Sensi: Learn One Thing at a Time – Curriculum-Based Test-Time Learning for LLM Game Agents


8. VeriGrey: Greybox Agent Validation


9. Per-Domain Generalizing Policies: On Learning Efficient and Robust Q-Value Functions (Extended Version with Technical Appendix)


10. Informative Semi-Factuals for XAI: The Elaborated Explanations that People Prefer


11. When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution


12. Proactive Knowledge Inquiry in Doctor-Patient Dialogue: Stateful Extraction, Belief Updating, and Path-Aware Action Planning


13. From Digital Twins to World Models:Opportunities, Challenges, and Applications for Mobile Edge General Intelligence


14. Towards Safer Large Reasoning Models by Promoting Safety Decision-Making before Chain-of-Thought Generation


15. A Progressive Visual-Logic-Aligned Framework for Ride-Hailing Adjudication


16. ShuttleEnv: An Interactive Data-Driven RL Environment for Badminton Strategy Modeling


17. Physics-informed offline reinforcement learning eliminates catastrophic fuel waste in maritime routing


18. InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning


19. Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations


20. Graph-Native Cognitive Memory for AI Agents: Formal Belief Revision Semantics for Versioned Memory Architectures


21. Draft-and-Prune: Improving the Reliability of Auto-formalization for Logical Reasoning


22. AI Scientist via Synthetic Task Scaling


23. How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment


24. Cascade-Aware Multi-Agent Routing: Spatio-Temporal Sidecars and Geometry-Switching


25. Transformers are Bayesian Networks


26. Generative AI-assisted Participatory Modeling in Socio-Environmental Planning under Deep Uncertainty


27. Unified Spatio-Temporal Token Scoring for Efficient Video VLMs


28. Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models


29. Toward Scalable Automated Repository-Level Datasets for Software Vulnerability Detection


30. TDAD: Test-Driven Agentic Development - Reducing Code Regressions in AI Coding Agents via Graph-Based Impact Analysis


31. Specification-Aware Distribution Shaping for Robotics Foundation Models


32. VideoAtlas: Navigating Long-Form Video in Logarithmic Compute


33. CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention


34. IndicSafe: A Benchmark for Evaluating Multilingual LLM Safety in South Asia


35. Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs


36. scicode-lint: Detecting Methodology Bugs in Scientific Python Code with LLM-Generated Patterns


37. RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference


38. AI-Assisted Goal Setting Improves Goal Progress Through Social Accountability


39. Differential Attention-Augmented BiomedCLIP with Asymmetric Focal Optimization for Imbalanced Multi-Label Video Capsule Endoscopy Classification


40. Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval


41. Procedural Generation of Algorithm Discovery Tasks in Machine Learning


42. How do LLMs Compute Verbal Confidence


43. Generative Control as Optimization: Time Unconditional Flow Matching for Adaptive and Robust Robotic Control


44. Text-to-Stage: Spatial Layouts from Long-form Narratives


45. CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents


46. FailureMem: A Failure-Aware Multimodal Framework for Autonomous Software Repair


47. ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation


48. Dropout Robustness and Cognitive Profiling of Transformer Models via Stochastic Inference


49. Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients


50. EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards


51. RangeAD: Fast On-Model Anomaly Detection


52. A Dual Certificate Approach to Sparsity in Infinite-Width Shallow Neural Networks


53. CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution


54. Attention Sinks Induce Gradient Sinks


55. Harm or Humor: A Multimodal, Multilingual Benchmark for Overt and Covert Harmful Humor


56. SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition


57. Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation


58. Eye image segmentation using visual and concept prompts with Segment Anything Model 3 (SAM3)


59. Can Blindfolded LLMs Still Trade? An Anonymization-First Framework for Portfolio Optimization


60. Objective Mispricing Detection for Shortlisting Undervalued Football Players via Market Dynamics and News Signals


61. WeatherReasonSeg: A Benchmark for Weather-Aware Reasoning Segmentation in Visual Language Models


62. Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models


63. Inhibitory normalization of error signals improves learning in neural circuits


64. Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards


65. FINER: MLLMs Hallucinate under Fine-grained Negative Queries


66. Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment


67. Anchoring and Rescaling Attention for Semantically Coherent Inbetweening


68. Automated Grammar-based Algebraic Multigrid Design With Evolutionary Algorithms


69. Benchmarking Reinforcement Learning via Stochastic Converse Optimality: Generating Systems with Known Optimal Policies


70. rSDNet: Unified Robust Neural Learning against Label Noise and Adversarial Attacks


71. A Contextual Help Browser Extension to Assist Digital Illiterate Internet Users


72. Edit-As-Act: Goal-Regressive Planning for Open-Vocabulary 3D Indoor Scene Editing


73. Identifying Latent Actions and Dynamics from Offline Data via Demonstrator Diversity


74. Unsupervised Symbolic Anomaly Detection


75. FoMo X: Modular Explainability Signals for Outlier Detection Foundation Models


76. FrescoDiffusion: 4K Image-to-Video with Prior-Regularized Tiled Diffusion


77. CLeAN: Continual Learning Adaptive Normalization in Dynamic Environments


78. Learning Coordinate-based Convolutional Kernels for Continuous SE(3) Equivariant and Efficient Point Cloud Analysis


79. Rel-Zero: Harnessing Patch-Pair Invariance for Robust Zero-Watermarking Against AI Editing


80. AdapTS: Lightweight Teacher-Student Approach for Multi-Class and Continual Visual Anomaly Detection


81. AirDDE: Multifactor Neural Delay Differential Equations for Air Quality Forecasting


82. KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition


83. Detecting the Machine: A Comprehensive Benchmark of AI-Generated Text Detectors Across Architectures, Domains, and Adversarial Conditions


84. QuantFL: Sustainable Federated Learning for Edge IoT via Pre-Trained Model Quantisation


85. Auto-Unrolled Proximal Gradient Descent: An AutoML Approach to Interpretable Waveform Optimization


86. UniSAFE: A Comprehensive Benchmark for Safety Evaluation of Unified Multimodal Models


87. Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning


88. VirPro: Visual-referred Probabilistic Prompt Learning for Weakly-Supervised Monocular 3D Detection


89. VLM2Rec: Resolving Modality Collapse in Vision-Language Model Embedders for Multimodal Sequential Recommendation


90. AdaZoom-GUI: Adaptive Zoom-based GUI Grounding with Instruction Refinement


91. Baguan-TS: A Sequence-Native In-Context Learning Model for Time Series Forecasting with Covariates


92. TimeAPN: Adaptive Amplitude-Phase Non-Stationarity Normalization for Time Series Forecasting


93. The Phasor Transformer: Resolving Attention Bottlenecks on the Unit Circle


94. Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare


95. Joint Degradation-Aware Arbitrary-Scale Super-Resolution for Variable-Rate Extreme Image Compression


96. CRE-T1 Preview Technical Report: Beyond Contrastive Learning for Reasoning-Intensive Retrieval


97. SCALE:Scalable Conditional Atlas-Level Endpoint transport for virtual cell perturbation prediction


98. Efficient Exploration at Scale



100. Public Profile Matters: A Scalable Integrated Approach to Recommend Citations in the Wild


101. WebPII: Benchmarking Visual PII Detection for Computer-Use Agents


102. Learning Permutation Distributions via Reflected Diffusion on Ranks


103. Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress


104. ReLMXEL: Adaptive RL-Based Memory Controller with Explainable Energy and Latency Optimization


105. Symphony: A Cognitively-Inspired Multi-Agent System for Long-Video Understanding


106. From Words to Worlds: Benchmarking Cross-Cultural Cultural Understanding in Machine Translation


107. GUIDE: GenAI Units In Digital Design Education


108. Directing the Narrative: A Finetuning Method for Controlling Coherence and Style in Story Generation


109. DANCE: Dynamic 3D CNN Pruning: Joint Frame, Channel, and Feature Adaptation for Energy Efficiency on the Edge


110. Pathology-Aware Multi-View Contrastive Learning for Patient-Independent ECG Reconstruction


111. Deployment and Evaluation of an EHR-integrated, Large Language Model-Powered Tool to Triage Surgical Patients


112. KANtize: Exploring Low-bit Quantization of Kolmogorov-Arnold Networks for Efficient Inference


113. From Drop-off to Recovery: A Mechanistic Analysis of Segmentation in MLLMs


114. TharuChat: Bootstrapping Large Language Models for a Low-Resource Language via Synthetic Data and Human Validation


115. SA-CycleGAN-2.5D: Self-Attention CycleGAN with Tri-Planar Context for Multi-Site MRI Harmonization


116. Alignment Makes Language Models Normative, Not Descriptive


117. Anonymous-by-Construction: An LLM-Driven Framework for Privacy-Preserving Text


118. Adaptive Contracts for Cost-Effective AI Delegation


119. A scalable neural bundle map for multiphysics prediction in lithium-ion battery across varying configurations


120. OPERA: Online Data Pruning for Efficient Retrieval Model Adaptation


121. Catching rationalization in the act: detecting motivated reasoning before and after CoT via activation probing


122. Towards Unsupervised Adversarial Document Detection in Retrieval Augmented Generation Systems


123. Detecting Data Poisoning in Code Generation LLMs via Black-Box, Vulnerability-Oriented Scanning


124. Generalist Multimodal LLMs Gain Biometric Expertise via Human Salience


125. PAuth - Precise Task-Scoped Authorization For Agents


126. Intent Formalization: A Grand Challenge for Reliable Coding in the Age of AI Agents


127. REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge


128. Security Assessment and Mitigation Strategies for Large Language Models: A Comprehensive Defensive Framework


129. Hidden Clones: Exposing and Fixing Family Bias in Vision-Language Model Ensembles


130. When the Specification Emerges: Benchmarking Faithfulness Loss in Long-Horizon Coding Agents


131. Knowledge Localization in Mixture-of-Experts LLMs Using Cross-Lingual Inconsistency


132. CircuitBuilder: From Polynomials to Circuits via Reinforcement Learning


133. Large Reasoning Models Struggle to Transfer Parametric Knowledge Across Scripts


134. Evaluating Ill-Defined Tasks in Large Language Models


135. Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization


136. Do Understanding and Generation Fight? A Diagnostic Study of DPO for Unified Multimodal Models


137. Dependence Fidelity and Downstream Inference Stability in Generative Models


138. Shared Representation Learning for Reference-Guided Targeted Sound Detection


139. HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning


140. LLM NL2SQL Robustness: Surface Noise vs. Linguistic Variation in Traditional and Agentic Settings


141. Empirical Recipes for Efficient and Compact Vision-Language Models


142. Implementation of tangent linear and adjoint models for neural networks based on a compiler library tool


143. The State of Generative AI in Software Development: Insights from Literature and a Developer Survey


144. Are a Thousand Words Better Than a Single Picture? Beyond Images – A Framework for Multi-Modal Knowledge Graph Dataset Enrichment


145. Continual Multimodal Egocentric Activity Recognition via Modality-Aware Novel Detection


146. DeepStage: Learning Autonomous Defense Policies Against Multi-Stage APT Campaigns


147. MSRAMIE: Multimodal Structured Reasoning Agent for Multi-instruction Image Editing


148. CineSRD: Leveraging Visual, Acoustic, and Linguistic Cues for Open-World Visual Media Speaker Diarization


149. Adversarial attacks against Modern Vision-Language Models


150. Machine intelligence supports the full chain of 2D dendrite synthesis


151. PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models


152. Embodied Foundation Models at the Edge: A Survey of Deployment Constraints and Mitigation Strategies


153. EmergeNav: Structured Embodied Inference for Zero-Shot Vision-and-Language Navigation in Continuous Environments


154. Automatic Termination Strategy of Inelastic Neutron-scattering Measurement Using Bayesian Optimization for Bin-width Selection


155. Joint Optimization of Storage and Loading for High-Performance 3D Point Cloud Data Processing


156. Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models


157. KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition


158. UNICORN: Ultrasound Nakagami Imaging via Score Matching and Adaptation for Assessing Hepatic Steatosis


159. On the Degrees of Freedom of Gridded Control Points in Learning-Based Medical Image Registration


160. Cryptographic Runtime Governance for Autonomous AI Systems: The Aegis Architecture for Verifiable Policy Enforcement


161. TDMM-LM: Bridging Facial Understanding and Animation via Language Models


162. GenLie: A Global-Enhanced Lie Detection Network under Sparsity and Semantic Interference


163. AgriChat: A Multimodal Large Language Model for Agriculture Image Understanding


164. Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs


165. Script-to-Slide Grounding: Grounding Script Sentences to Slide Objects for Automatic Instructional Video Generation


166. Facial beauty prediction fusing transfer learning and broad learning system


167. MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning


168. Music Source Restoration with Ensemble Separation and Targeted Reconstruction


169. SimulU: Training-free Policy for Long-form Simultaneous Speech-to-Speech Translation


170. Privacy and Safety Experiences and Concerns of U.S. Women Using Generative AI for Seeking Sexual and Reproductive Health Information


171. Quantizer-Aware Hierarchical Neural Codec Modeling for Speech Deepfake Detection


172. What on Earth is AlphaEarth? Hierarchical structure and functional interpretability for global land cover


173. TerraLingua: Emergence and Analysis of Open-endedness in LLM Ecologies


174. Quantum-Assisted Optimal Rebalancing with Uncorrelated Asset Selection for Algorithmic Trading Walk-Forward QUBO Scheduling via QAOA


175. From Language to Action in Arabic: Reliable Structured Tool Calling via Data-Centric Fine-Tuning


176. Social physics in the age of artificial intelligence


177. A Novel end-to-end Digital Health System Using Deep Learning-based ECG Analysis


178. Rubric-Guided Fine-tuning of SpeechLLMs for Multi-Aspect, Multi-Rater L2 Reading-Speech Assessment


179. Multi-Agent Reinforcement Learning for Dynamic Pricing: Balancing Profitability,Stability and Fairness


180. PowerModelsGAT-AI: Physics-Informed Graph Attention for Multi-System Power Flow with Continual Learning


181. A foundation model for electrodermal activity data


182. Multi-Modal Multi-Agent Reinforcement Learning for Radiology Report Generation: Radiologist-Like Workflow with Clinically Verifiable Rewards


183. Attention Guidance through Video Script: A Case Study of Object Focusing on 360° VR Video Tours


184. Disclosure By Design: Identity Transparency as a Behavioural Property of Conversational AI Models


185. Unsupervised learning for inverse problems in computed tomography