전체 AI 논문 - 2026-03-25

1. Mecha-nudges for Machines


2. Bilevel Autoresearch: Meta-Autoresearching Itself


3. Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies


4. RelayS2S: A Dual-Path Speculative Generation for Real-Time Dialogue


5. LLM Olympiad: Why Model Evaluation Needs a Sealed Exam


6. Online library learning in human visual puzzle solving


7. MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation


8. PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task Environments


9. SAiW: Source-Attributable Invisible Watermarking for Proactive Deepfake Defense


10. Describe-Then-Act: Proactive Agent Steering via Distilled Language-Action World Models


11. Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment


12. MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models


13. Minibal: Balanced Game-Playing Without Opponent Modeling


14. Can Large Language Models Reason and Optimize Under Constraints?


15. On the use of Aggregation Operators to improve Human Identification using Dental Records


16. JFTA-Bench: Evaluate LLM’s Ability of Tracking and Analyzing Malfunctions Using Fault Trees



18. PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference


19. Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning


20. Ran Score: a LLM-based Evaluation Score for Radiology Report Generation


21. ProGRank: Probe-Gradient Reranking to Defend Dense-Retriever RAG from Corpus Poisoning


22. Separating Diagnosis from Control: Auditable Policy Adaptation in Agent-Based Simulations with LLM-Based Diagnostics


23. Continuous Optimization for Satisfiability Modulo Theories on Linear Real Arithmetic


24. Dynamical Systems Theory Behind a Hierarchical Reasoning Model


25. Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories


26. CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models


27. PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal


28. Improving Safety Alignment via Balanced Direct Preference Optimization


29. Empirical Comparison of Agent Communication Protocols for Task Orchestration


30. Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts


31. Reliable Classroom AI via Neuro-Symbolic Multimodal Reasoning


32. ABSTRAL: Automatic Design of Multi-Agent Systems Through Iterative Refinement and Topology Optimization


33. AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model


34. Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases


35. CLiGNet: Clinical Label-Interaction Graph Network for Medical Specialty Classification from Clinical Transcriptions


36. Beyond Binary Correctness: Scaling Evaluation of Long-Horizon Agents on Subjective Enterprise Tasks


37. HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment


38. MuQ-Eval: An Open-Source Per-Sample Quality Metric for AI Music Generation Evaluation


39. Benchmarking Multi-Agent LLM Architectures for Financial Document Processing: A Comparative Study of Orchestration Patterns, Cost-Accuracy Tradeoffs and Production Scaling Strategies


40. Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature


41. Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning


42. Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length


43. AI Mental Models: Learned Intuition and Deliberation in a Bounded Neural Architecture


44. Maximum Entropy Relaxation of Multi-Way Cardinality Constraints for Synthetic Population Generation


45. Computational Arbitrage in AI Model Markets


46. From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents


47. STEM Agent: A Self-Adapting, Tool-Enabled, Extensible Architecture for Multi-Protocol AI Agent Systems


48. Session Risk Memory (SRM): Temporal Authorization for Deterministic Pre-Execution Safety Gates


49. Intelligence Inertia: Physical Principles and Applications


50. Dynamic Fusion-Aware Graph Convolutional Neural Network for Multimodal Emotion Recognition in Conversations


51. The Efficiency Attenuation Phenomenon: A Computational Challenge to the Language of Thought Hypothesis


52. Memory Bear AI Memory Science Engine for Multimodal Affective Intelligence: A Technical Report


53. MedObvious: Exposing the Medical Moravec’s Paradox in VLMs via Clinical Triage


54. VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions


55. Failure of contextual invariance in gender inference with large language models


56. ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains


57. VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs


58. InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting


59. Code Review Agent Benchmark


60. 3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding


61. Evaluating LLM-Based Test Generation Under Software Evolution


62. Targeted Adversarial Traffic Generation : Black-box Approach to Evade Intrusion Detection Systems in IoT Networks


63. Biased Error Attribution in Multi-Agent Human-AI Systems Under Delayed Feedback


64. SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling


65. Planning over MAPF Agent Dependencies via Multi-Dependency PIBT


66. Graph Energy Matching: Transport-Aligned Energy-Based Modeling for Graph Generation


67. Natural Language Interfaces for Spatial and Temporal Databases: A Comprehensive Overview of Methods, Taxonomy, and Future Directions


68. Contrastive Metric Learning for Point Cloud Segmentation in Highly Granular Detectors


69. Edge Radar Material Classification Under Geometry Shifts


70. Leveraging LLMs and Social Media to Understand User Perception of Smartphone-Based Earthquake Early Warnings


71. WISTERIA: Weak Implicit Signal-based Temporal Relation Extraction with Attention


72. Unilateral Relationship Revision Power in Human-AI Companion Interaction


73. Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression


74. Designing Agentic AI-Based Screening for Portfolio Investment


75. A Comparative Study of Machine Learning Models for Hourly Forecasting of Air Temperature and Relative Humidity


76. Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook


77. A Multimodal Framework for Human-Multi-Agent Interaction


78. Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs


79. SafeSeek: Universal Attribution of Safety Circuits in Language Models


80. AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN


81. A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling


82. Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning


83. General Machine Learning: Theory for Learning Under Variable Regimes


84. ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment


85. Reasoning over Semantic IDs Enhances Generative Recommendation


86. Robust Safety Monitoring of Language Models via Activation Watermarking


87. Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy


88. Can an LLM Detect Instances of Microservice Infrastructure Patterns?


89. AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing


90. Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution


91. Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review


92. DBAutoDoc: Automated Discovery and Documentation of Undocumented Database Schemas via Statistical Analysis and Iterative LLM Refinement


93. MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates


94. Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation


95. Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts


96. HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling


97. YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable object detection and trustworthy multimodal AI in computer vision perception


98. Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation


99. Concept-based explanations of Segmentation and Detection models in Natural Disaster Management


100. A Sobering Look at Tabular Data Generation via Probabilistic Circuits


101. AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents


102. Can Graph Foundation Models Generalize Over Architecture?


103. DariMis: Harm-Aware Modeling for Dari Misinformation Detection on YouTube


104. Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees


105. The EU AI Act and the Rights-based Approach to Technological Governance


106. EVA: Efficient Reinforcement Learning for End-to-End Video Agent


107. From the AI Act to a European AI Agency: Completing the Union’s Regulatory Architecture


108. ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling


109. Off-Policy Evaluation and Learning for Survival Outcomes under Censoring


110. Confidence Calibration under Ambiguous Ground Truth


111. Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models


112. Agent-Sentry: Bounding LLM Agents via Execution Provenance


113. The Coordinate System Problem in Persistent Structural Memory for Neural Architectures


114. Avoiding Over-smoothing in Social Media Rumor Detection with Pre-trained Propagation Tree Transformer


115. Agent Audit: A Security Analysis System for LLM Agent Applications


116. UniQueR: Unified Query-based Feedforward 3D Reconstruction


117. UAV-DETR: DETR for Anti-Drone Target Detection


118. URA-Net: Uncertainty-Integrated Anomaly Perception and Restoration Attention Network for Unsupervised Anomaly Detection


119. TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment


120. When AI Shows Its Work, Is It Actually Working? Step-Level Evaluation Reveals Frontier Language Models Frequently Bypass Their Own Reasoning


121. Focus, Don’t Prune: Identifying Instruction-Relevant Regions for Information-Rich Image Understanding


122. PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding


123. Quantum Random Forest for the Regression Problem


124. Exposure-Normalized Bed and Chair Fall Rates via Continuous AI Monitoring


125. KARMA: Knowledge-Action Regularized Multimodal Alignment for Personalized Search at Taobao


126. From Arithmetic to Logic: The Resilience of Logic and Lookup-Based Neural Networks Under Parameter Bit-Flips


127. From Overload to Convergence: Supporting Multi-Issue Human-AI Negotiation with Bayesian Visualization



129. KALAVAI: Predicting When Independent Specialist Fusion Works – A Quantitative Model for Post-Hoc Cooperative LLM Training


130. PopResume: Causal Fairness Evaluation of LLM/VLM Resume Screeners with Population-Representative Dataset


131. WiFi2Cap: Semantic Action Captioning from Wi-Fi CSI via Limb-Level Semantic Alignment


132. Vision-based Deep Learning Analysis of Unordered Biomedical Tabular Datasets via Optimal Spatial Cartography


133. Generalizing Dynamics Modeling More Easily from Representation Perspective


134. AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research


135. Learning to Trust: How Humans Mentally Recalibrate AI Confidence Signals


136. LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation


137. Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion


138. To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models


139. Causal Discovery in Action: Learning Chain-Reaction Mechanisms from Interventions


140. Do Consumers Accept AIs as Moral Compliance Agents?


141. Language Models Can Explain Visual Features via Steering


142. flexvec: SQL Vector Retrieval with Programmatic Embedding Modulation


143. Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?


144. STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving


145. Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos


146. GraphRAG for Engineering Diagrams: ChatP&ID Enables LLM Interaction with P&IDs


147. LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface


148. High Resolution Flood Extent Detection Using Deep Learning with Random Forest Derived Training Labels


149. Do Large Language Models Reduce Research Novelty? Evidence from Information Systems Journals


150. Tiny Inference-Time Scaling with Latent Verifiers


151. Cognitive Training for Language Models: Towards General Capabilities via Cross-Entropy Games


152. Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures


153. Stability-Preserving Online Adaptation of Neural Closed-loop Maps


154. Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing


155. LLM-guided headline rewriting for clickability enhancement without clickbait


156. Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs


157. CaP-X: A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation


158. Latent Style-based Quantum Wasserstein GAN for Drug Design


159. Learning When to Act: Interval-Aware Reinforcement Learning with Predictive Temporal Structure


160. Symbolic Graph Networks for Robust PDE Discovery from Noisy Sparse Data


161. Instruction-Tuned, but Not More Verifiable Instruction-Following: A Cross-Task Diagnosis for LoRA Adapters


162. Abnormalities and Disease Detection in Gastro-Intestinal Tract Images


163. AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access


164. Three Creates All: You Only Sample 3 Steps


165. Rethinking Multimodal Fusion for Time Series: Auxiliary Modalities Need Constrained Fusion


166. FAAR: Format-Aware Adaptive Rounding for NVFP4


167. SynLeaF: A Dual-Stage Multimodal Fusion Framework for Synthetic Lethality Prediction Across Pan- and Single-Cancer Contexts


168. When Visuals Aren’t the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations


169. Reasoner-Executor-Synthesizer: Scalable Agentic Architecture with Static O(1) Context Window


170. Modeling Quantum Federated Autoencoder for Anomaly Detection in IoT Networks


171. Q-AGNN: Quantum-Enhanced Attentive Graph Neural Network for Intrusion Detection


172. MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives


173. Early Discoveries of Algorithmist I: Promise of Provable Algorithm Synthesis at Scale


174. Unveiling the Mechanism of Continuous Representation Full-Waveform Inversion: A Wave Based Neural Tangent Kernel Framework


175. Bridging neuroscience and AI: adaptive, culturally sensitive technologies transforming aphasia rehabilitation


176. WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement


177. First-Mover Bias in Gradient Boosting Explanations: Mechanism, Detection, and Resolution



179. Graphs RAG at Scale: Beyond Retrieval-Augmented Generation With Labeled Property Graphs and Resource Description Framework for Complex and Unknown Search Spaces


180. Causal Direct Preference Optimization for Distributionally Robust Generative Recommendation


181. Graph Signal Processing Meets Mamba2: Adaptive Filter Bank via Delta Modulation


182. Large Language Models for Missing Data Imputation: Understanding Behavior, Hallucination Effects, and Control Mechanisms


183. Conformal Risk Control for Safety-Critical Wildfire Evacuation Mapping: A Comparative Study of Tabular, Spatial, and Graph-Based Models


184. Trained Persistent Memory for Frozen Decoder-Only LLMs


185. Beyond the Mean: Distribution-Aware Loss Functions for Bimodal Regression


186. AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI


187. A Direct Classification Approach for Reliable Wind Ramp Event Forecasting under Severe Class Imbalance


188. Hybrid Associative Memories


189. DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression


190. A Multi-Task Targeted Learning Framework for Lithium-Ion Battery State-of-Health and Remaining Useful Life


191. AEGIS: An Operational Infrastructure for Post-Market Governance of Adaptive Medical AI Under US and EU Regulations


192. From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs


193. Sparsely-Supervised Data Assimilation via Physics-Informed Schrödinger Bridge


194. Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning


195. ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography


196. Emergency Preemption Without Online Exploration: A Decision Transformer Approach


197. Enhancing AI-Based Tropical Cyclone Track and Intensity Forecasting via Systematic Bias Correction


198. A Multi-Modal CNN-LSTM Framework with Multi-Head Attention and Focal Loss for Real-Time Elderly Fall Detection


199. UniFluids: Unified Neural Operator Learning with Conditional Flow-matching



201. Sample Transform Cost-Based Training-Free Hallucination Detector for Large Language Models


202. Latent Semantic Manifolds in Large Language Models


203. Scaling Attention via Feature Sparsity


204. Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores


205. Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs


206. Efficient Embedding-based Synthetic Data Generation for Complex Reasoning Tasks


207. TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs


208. Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning


209. MERIT: Memory-Enhanced Retrieval for Interpretable Knowledge Tracing


210. Evaluating Prompting Strategies for Chart Question Answering with Large Language Models


211. Founder effects shape the evolutionary dynamics of multimodality in open LLM families


212. Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and Large Language Models