전체 AI 논문 - 2026-03-18

1. SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models


2. Internalizing Agency from Reflective Experience


3. Learning to Present: Inverse Specification Rewards for Agentic Slide Generation


4. Prompt Programming for Cultural Bias and Alignment of Large Language Models


5. Surg$Σ$: A Spectrum of Large-Scale Multimodal Data and Foundation Models for Surgical Intelligence


6. Is Conformal Factuality for RAG-based LLMs Robust? Novel Metrics and Systematic Insights


7. Beyond Accuracy: Evaluating Forecasting Models by Multi-Echelon Inventory Cost


8. Anticipatory Planning for Multimodal AI Agents


9. Nonstandard Errors in AI Agents


10. MedCL-Bench: Benchmarking stability-efficiency trade-offs and scaling in biomedical continual learning


11. Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure


12. IQuest-Coder-V1 Technical Report


13. CritiSense: Critical Digital Literacy and Resilience Against Misinformation


14. Machines acquire scientific taste from institutional traces


15. What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline


16. Domain-Independent Dynamic Programming with Constraint Propagation


17. When AI Navigates the Fog of War


18. Runtime Governance for AI Agents: Policies on Paths


19. V-DyKnow: A Dynamic Benchmark for Time-Sensitive Knowledge in Vision Language Models


20. BenchPreS: A Benchmark for Context-Aware Personalized Preference Selectivity of Persistent-Memory LLMs


21. Designing for Disagreement: Front-End Guardrails for Assistance Allocation in LLM-Enabled Robots


22. Exploring different approaches to customize language models for domain-specific text-to-code generation


23. ExpressMind: A Multimodal Pretrained Large Language Model for Expressway Operation


24. Breaking the Chain: A Causal Analysis of LLM Faithfulness to Intermediate Structures


25. Follow the Clues, Frame the Truth: Hybrid-evidential Deductive Reasoning in Open-Vocabulary Multimodal Emotion Recognition


26. RetailBench: Evaluating Long-Horizon Autonomous Decision-Making and Strategy Stability of LLM Agents in Realistic Retail Environments


27. TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas


28. Visual Distraction Undermines Moral Reasoning in Vision-Language Models


29. From Natural Language to Executable Option Strategies via Large Language Models


30. Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences


31. FactorEngine: A Program-level Knowledge-Infused Factor Mining Framework for Quantitative Investment


32. Learning to Predict, Discover, and Reason in High-Dimensional Discrete Event Sequences


33. NeSy-Route: A Neuro-Symbolic Benchmark for Constrained Route Planning in Remote Sensing


34. Adaptive Theory of Mind for LLM-based Multi-Agent Coordination


35. MOSAIC: Composable Safety Alignment with Modular Control Tokens


36. Proactive Rejection and Grounded Execution: A Dual-Stage Intent Analysis Paradigm for Safe and Efficient AIoT Smart Homes


37. Are Large Language Models Truly Smarter Than Humans?


38. SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation


39. NeuronSpark: A Spiking Neural Network Language Model with Selective State Space Dynamics


40. VIGIL: Towards Edge-Extended Agentic AI for Enterprise IT Support


41. ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning


42. A Context Alignment Pre-processor for Enhancing the Coherence of Human-LLM Dialog


43. POaaS: Minimal-Edit Prompt Optimization as a Service to Lift Accuracy and Cut Hallucinations on On-Device sLLMs


44. Enhancing Linguistic Generalization of VLA: Fine-Tuning OpenVLA via Synthetic Instruction Augmentation


45. Interpretable Context Methodology: Folder Structure as Agentic Architecture


46. IRAM-Omega-Q: A Computational Architecture for Uncertainty Regulation in Artificial Agents


47. Selective Memory for Artificial Intelligence: Write-Time Gating with Hierarchical Archiving


48. From Workflow Automation to Capability Closure: A Formal Framework for Safe and Revenue-Aware Customer Service AI


49. An Agentic Evaluation Framework for AI-Generated Scientific Code in PETSc


50. Safety is Non-Compositional: A Formal Framework for Capability-Based AI Systems


51. MAC: Multi-Agent Constitution Learning


52. Optimizing Hospital Capacity During Pandemics: A Dual-Component Framework for Strategic Patient Relocation


53. Protein Design with Agent Rosetta: A Case Study for Specialized Scientific Agents


54. Argumentative Human-AI Decision-Making: Toward AI Agents That Reason With Us, Not For Us


55. Semi-Autonomous Formalization of the Vlasov-Maxwell-Landau Equilibrium


56. Prompt Engineering for Scale Development in Generative Psychometrics


57. AsgardBench - Evaluating Visually Grounded Interactive Planning Under Minimal Feedback


58. Resilience Meets Autonomy: Governing Embodied AI in Critical Infrastructure


59. Regularized Latent Dynamics Prediction is a Strong Baseline For Behavioral Foundation Models


60. Algorithmic Trading Strategy Development and Optimisation


61. Persona-Conditioned Risk Behavior in Large Language Models: A Simulated Gambling Study with GPT-4.1


62. Prose2Policy (P2P): A Practical LLM Pipeline for Translating Natural-Language Access Policies into Executable Rego


63. CUBE: A Standard for Unifying Agent Benchmarks


64. Context-Length Robustness in Question Answering Models: A Comparative Empirical Study


65. Knowledge Graph Extraction from Biomedical Literature for Alkaptonuria Rare Disease


66. Survey of Various Fuzzy and Uncertain Decision-Making Methods


67. Theoretical Foundations of Latent Posterior Factors: Formal Guarantees for Multi-Evidence Reasoning


68. I Know What I Don’t Know: Latent Posterior Factor Models for Multi-Evidence Probabilistic Reasoning


69. Quantum-Secure-By-Construction (QSC): A Paradigm Shift For Post-Quantum Agentic Intelligence


70. A Dynamic Survey of Fuzzy, Intuitionistic Fuzzy, Neutrosophic, Plithogenic, and Extensional Sets


71. Compiled Memory: Not More Information, but More Precise Instructions for Language Agents


72. QV May Be Enough: Toward the Essence of Attention in LLMs


73. DynaTrust: Defending Multi-Agent Systems Against Sleeper Agents via Dynamic Trust Graphs


74. Did You Check the Right Pocket? Cost-Sensitive Store Routing for Memory-Augmented Agents


75. GSI Agent: Domain Knowledge Enhancement for Large Language Models in Green Stormwater Infrastructure


76. CraniMem: Cranial Inspired Gated and Bounded Memory for Agentic Systems


77. Form Follows Function: Recursive Stem Model


78. The Comprehension-Gated Agent Economy: A Robustness-First Architecture for AI Economic Agency


79. AIDABench: AI Data Analytics Benchmark


80. NextMem: Towards Latent Factual Memory for LLM-based Agents


81. Neural-Symbolic Logic Query Answering in Non-Euclidean Space


82. Demystifing Video Reasoning


83. MessyKitchens: Contact-rich object-level 3D scene reconstruction


84. ManiTwin: Scaling Data-Generation-Ready Digital Object Dataset to 100K


85. SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation


86. SOMA: Unifying Parametric Human Body Models


87. Unifying Optimization and Dynamics to Parallelize Sequential Computation: A Guide to Parallel Newton Methods for Breaking Sequential Bottlenecks


88. Real-Time Decoding of Movement Onset and Offset for Brain-Controlled Rehabilitation Exoskeleton


89. ODIN-Based CPU-GPU Architecture with Replay-Driven Simulation and Emulation


90. CABTO: Context-Aware Behavior Tree Grounding for Robot Manipulation


91. DexGrasp-Zero: A Morphology-Aligned Policy for Zero-Shot Cross-Embodiment Dexterous Grasping


92. V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising


93. InCoder-32B: Code Foundation Model for Industrial Scenarios


94. IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans


95. TurnWise: The Gap between Single- and Multi-turn Language Model Capabilities


96. Finding Common Ground in a Sea of Alternatives


97. SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding


98. Retrieving Counterfactuals Improves Visual In-Context Learning


99. Federated Learning with Multi-Partner OneFlorida+ Consortium Data for Predicting Major Postoperative Complications


100. Cost Trade-offs in Matrix Inversion Updates for Streaming Outlier Detection


101. When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making


102. Fast-WAM: Do World Action Models Need Test-time Future Imagination?


103. Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation


104. When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education



106. Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language Models


107. MLLM-based Textual Explanations for Face Comparison


108. Data-driven generalized perimeter control: Zürich case study


109. FSMC-Pose: Frequency and Spatial Fusion with Multiscale Self-calibration for Cattle Mounting Pose Estimation


110. BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization


111. REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models


112. Malicious Or Not: Adding Repository Context to Agent Skill Classification


113. Manifold-Matching Autoencoders


114. Characterizing Delusional Spirals through Human-LLM Chat Logs


115. Deep Learning-Driven Black-Box Doherty Power Amplifier with Pixelated Output Combiner and Extended Efficiency Range


116. EmoLLM: Appraisal-Grounded Cognitive-Emotional Co-Reasoning in Large Language Models


117. CompDiff: Hierarchical Compositional Diffusion for Fair and Zero-Shot Intersectional Medical Image Generation


118. DanceHA: A Multi-Agent Framework for Document-Level Aspect-Based Sentiment Analysis


119. FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data


120. Bridging the High-Frequency Data Gap: A Millisecond-Resolution Network Dataset for Advancing Time Series Foundation Models


121. Unlearning for One-Step Generative Models via Unbalanced Optimal Transport


122. DST-Net: A Dual-Stream Transformer with Illumination-Independent Feature Guidance and Multi-Scale Spatial Convolution for Low-Light Image Enhancement


123. Multi-Agent Reinforcement Learning Counteracts Delayed CSI in Multi-Satellite Systems


124. CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection


125. EngGPT2: Sovereign, Efficient and Open Intelligence


126. LenghuSky-8: An 8-Year All-Sky Cloud Dataset with Star-Aware Masks and Alt-Az Calibration for Segmentation and Nowcasting


127. An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU


128. SF-Mamba: Rethinking State Space Model for Vision


129. IndexRAG: Bridging Facts for Cross-Document Reasoning at Index Time


130. Trained Persistent Memory for Frozen Encoder–Decoder LLMs: Six Architectural Methods


131. PlotTwist: A Creative Plot Generation Framework with Small Language Models


132. Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic


133. Fanar 2.0: Arabic Generative AI Stack


134. Robust Physics-Guided Diffusion for Full-Waveform Inversion


135. Age Predictors Through the Lens of Generalization, Bias Mitigation, and Interpretability: Reflections on Causal Implications


136. FederatedFactory: Generative One-Shot Learning for Extremely Non-IID Distributed Scenarios


137. DynamicGate MLP Conditional Computation via Learned Structural Dropout and Input Dependent Gating for Functional Plasticity


138. $D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation


139. Toward Experimentation-as-a-Service in 5G/6G: The Plaza6G Prototype for AI-Assisted Trials


140. Automated identification of Ichneumonoidea wasps via YOLO-based deep learning: Integrating HiresCam for Explainable AI


141. Explainable machine learning workflows for radio astronomical data processing


142. Detecting Sentiment Steering Attacks on RAG-enabled Large Language Models


143. An Interpretable Machine Learning Framework for Non-Small Cell Lung Cancer Drug Response Analysis


144. A Human-Centred Architecture for Large Language Models-Cognitive Assistants in Manufacturing within Quality Management Systems


145. Attention-guided Evidence Grounding for Spoken Question Answering


146. VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents


147. Surrogate-Assisted Genetic Programming with Rank-Based Phenotypic Characterisation for Dynamic Multi-Mode Project Scheduling


148. AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection


149. Human/AI Collective Intelligence for Deliberative Democracy: A Human-Centred Design Approach


150. Grounding the Score: Explicit Visual Premise Verification for Reliable Vision-Language Process Reward Models


151. Visual Prompt Discovery via Semantic Exploration


152. RASLF: Representation-Aware State Space Model for Light Field Super-Resolution


153. Generative AI for Quantum Circuits and Quantum Code: A Technical Review and Taxonomy


154. CoMAI: A Collaborative Multi-Agent Framework for Robust and Equitable Interview Evaluation


155. A Scoping Review of AI-Driven Digital Interventions in Mental Health Care: Mapping Applications Across Screening, Support, Monitoring, Prevention, and Clinical Education


156. Robust Generative Audio Quality Assessment: Disentangling Quality from Spurious Correlations


157. Sample-Efficient Adaptation of Drug-Response Models to Patient Tumors under Strong Biological Domain Shift


158. 360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method


159. MemX: A Local-First Long-Term Memory System for AI Assistants


160. Open-Source Reproduction and Explainability Analysis of Corrective Retrieval Augmented Generation


161. Homogeneous and Heterogeneous Consistency progressive Re-ranking for Visible-Infrared Person Re-identification


162. DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay


163. GATS: Gaussian Aware Temporal Scaling Transformer for Invariant 4D Spatio-Temporal Point Cloud Representation


164. HIPO: Instruction Hierarchy via Constrained Reinforcement Learning


165. Structure-Aware Multimodal LLM Framework for Trustworthy Near-Field Beam Prediction


166. When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems


167. SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding


168. Functorial Neural Architectures from Higher Inductive Types


169. PathGLS: Evaluating Pathology Vision-Language Models without Ground Truth through Multi-Dimensional Consistency


170. ASDA: Automated Skill Distillation and Adaptation for Financial Reasoning


171. RepoReviewer: A Local-First Multi-Agent Architecture for Repository-Level Code Review


172. Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization


173. Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective


174. LICA: Layered Image Composition Annotations for Graphic Design Research


175. Diffusion Models for Joint Audio-Video Generation


176. Parallel In-context Learning for Large Vision Language Models


177. CounterRefine: Answer-Conditioned Counterevidence Retrieval for Inference-Time Knowledge Repair in Factual Question Answering


178. RecBundle: A Next-Generation Geometric Paradigm for Explainable Recommender Systems


179. Towards the Vision-Sound-Language-Action Paradigm: The HEAR Framework for Sound-Centric Manipulation


180. Interact3D: Compositional 3D Generation of Interactive Objects


181. SEAHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Southeast Asia


182. Resource Consumption Threats in Large Language Models


183. Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models


184. Collaborative Temporal Feature Generation via Critic-Free Reinforcement Learning for Cross-User Sensor-Based Activity Recognition


185. Residual Stream Duality in Modern Transformer Architectures


186. Understanding Moral Reasoning Trajectories in Large Language Models: Toward Probing-Based Explainability


187. FlatLands: Generative Floormap Completion From a Single Egocentric View


188. Evaluating Agentic Optimization on Large Codebases


189. RadAnnotate: Large Language Models for Efficient and Reliable Radiology Report Annotation


190. The Midas Touch in Gaze vs. Hand Pointing: Modality-Specific Failure Modes and Implications for XR Interfaces


191. Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech


192. Aligning Paralinguistic Understanding and Generation in Speech LLMs via Multi-Task Reinforcement Learning


193. Standardizing Medical Images at Scale for AI


194. 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models


195. MoLoRA: Composable Specialization via Per-Token Adapter Routing


196. ExpertGen: Scalable Sim-to-Real Expert Policy Learning from Imperfect Behavior Priors


197. MobileLLM-Flash: Latency-Guided On-Device LLM Design for Industry Scale


198. A Family of LLMs Liberated from Static Vocabularies


199. Data-Local Autonomous LLM-Guided Neural Architecture Search for Multiclass Multimodal Time-Series Classification


200. Evaluating Causal Discovery Algorithms for Path-Specific Fairness and Utility in Healthcare


201. VIBEPASS: Can Vibe Coders Really Pass the Vibe Check?


202. Auto Researching, not hyperparameter tuning: Convergence Analysis of 10,000 Experiments


203. The Agentic Researcher: A Practical Guide to AI-Assisted Research in Mathematics and Machine Learning


204. Federated Learning for Privacy-Preserving Medical AI


205. The Internet of Physical AI Agents: Interoperability, Longevity, and the Cost of Getting It Wrong


206. COGNAC at SemEval-2026 Task 5: LLM Ensembles for Human-Level Word Sense Plausibility Rating in Challenging Narratives


207. PhasorFlow: A Python Library for Unit Circle Based Computing


208. Electrodermal Activity as a Unimodal Signal for Aerobic Exercise Detection in Wearable Sensors


209. Counteractive RL: Rethinking Core Principles for Efficient and Scalable Deep Reinforcement Learning


210. Interpretative Interfaces: Designing for AI-Mediated Reading Practices and the Knowledge Commons


211. FlashSampling: Fast and Memory-Efficient Exact Sampling


212. Informationally Compressive Anonymization: Non-Degrading Sensitive Input Protection for Privacy-Preserving Supervised Machine Learning


213. When Stability Fails: Hidden Failure Modes Of LLMS in Data-Constrained Scientific Decision-Making


214. Hypothesis Class Determines Explanation: Why Accurate Models Disagree on Feature Attribution


215. Don’t Trust Stubborn Neighbors: A Security Framework for Agentic Networks


216. OMNIFLOW: A Physics-Grounded Multimodal Agent for Generalized Scientific Reasoning


217. Parallelised Differentiable Straightest Geodesics for 3D Meshes


218. Morphemes Without Borders: Evaluating Root-Pattern Morphology in Arabic Tokenizers and LLMs


219. CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving


220. Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation


221. You’ve Got a Golden Ticket: Improving Generative Robot Policies With A Single Noise Vector


222. ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems


223. MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification


224. Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models


225. A Framework and Prototype for a Navigable Map of Datasets in Engineering Design and Systems Engineering


226. How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition


227. Embedding-Aware Feature Discovery: Bridging Latent Representations and Interpretable Features in Event Sequences


228. LLM-Driven Discovery of High-Entropy Catalysts via Retrieval-Augmented Generation


229. Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning


230. SEMAG: Self-Evolutionary Multi-Agent Code Generation


231. This Is Taking Too Long - Investigating Time as a Proxy for Energy Consumption of LLMs


232. Tackling Over-smoothing on Hypergraphs: A Ricci Flow-guided Neural Diffusion Approach


233. BadLLM-TG: A Backdoor Defender powered by LLM Trigger Generator


234. Loosely-Structured Software: Engineering Context, Structure, and Evolution Entropy in Runtime-Rewired Multi-Agent Systems


235. Transition Flow Matching


236. Evidential Domain Adaptation for Remaining Useful Life Prediction with Incomplete Degradation


237. DASH: Dynamic Audio-Driven Semantic Chunking for Efficient Omnimodal Token Compression


238. State-Dependent Safety Failures in Multi-Turn Language Model Interaction


239. IdentityGuard: Context-Aware Restriction and Provenance for Personalized Synthesis


240. Spectral Edge Dynamics of Training Trajectories: Signal–Noise Geometry Across Scales


241. Automated Self-Testing as a Quality Gate: Evidence-Driven Release Management for LLM Applications


242. DRCY: Agentic Hardware Design Reviews


243. Quantum Amplitude Estimation for Catastrophe Insurance Tail-Risk Pricing: Empirical Convergence and NISQ Noise Analysis


244. OrthoAI v2: From Single-Agent Segmentation to Dual-Agent Treatment Planning for Clear Aligners


245. Attribution-Guided Model Rectification of Unreliable Neural Network Behaviors


246. Beyond Reward Suppression: Reshaping Steganographic Communication Protocols in MARL via Dynamic Representational Circuit Breaking


247. Discovering the Hidden Role of Gini Index In Prompt-based Classification


248. Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context


249. A federated learning framework with knowledge graph and temporal transformer for early sepsis prediction in multi-center ICUs


250. Steering Frozen LLMs: Adaptive Social Alignment via Online Prompt Routing


251. Alternating Reinforcement Learning with Contextual Rubric Rewards


252. XLinear: Frequency-Enhanced MLP with CrossFilter for Robust Long-Range Forecasting


253. Exploring the Use of VLMs for Navigation Assistance for People with Blindness and Low Vision


254. Finder: A Multimodal AI-Powered Search Framework for Pharmaceutical Data Retrieval


255. SAC-NeRF: Adaptive Ray Sampling for Neural Radiance Fields via Soft Actor-Critic Reinforcement Learning


256. One Operator to Rule Them All? On Boundary-Indexed Operator Families in Neural PDE Solvers