All AI Papers - 2026-03-27

1. Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment


2. Back to Basics: Revisiting ASR in the Age of Voice Agents


3. R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning


4. Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?


5. Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?


6. Voxtral TTS


7. EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents


8. Retraining as Approximate Bayesian Inference


9. Cross-Model Disagreement as a Label-Free Correctness Signal


10. Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation


11. Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models


12. Does Structured Intent Representation Generalize? A Cross-Language, Cross-Model Empirical Study of 5W3H Prompting


13. 4OPS: Structural Difficulty Modeling in Integer Arithmetic Puzzles


14. Agentic Trust Coordination for Federated Learning through Adaptive Thresholding and Autonomous Decision Making in Sustainable and Resilient Industrial Networks


15. Macroscopic Characteristics of Mixed Traffic Flow with Deep Reinforcement Learning Based Automated and Human-Driven Vehicles


16. Evaluating Language Models for Harmful Manipulation


17. DAGverse: Building Document-Grounded Semantic DAGs from Scientific Papers


18. SliderQuant: Accurate Post-Training Quantization for LLMs


19. A Gait Foundation Model Predicts Multi-System Health Phenotypes from 3D Skeletal Motion


20. Distribution and Clusters Approximations as Abstract Domains in Probabilistic Abstract Interpretation to Neural Network Analysis


21. Probabilistic Abstract Interpretation on Neural Networks via Grids Approximation


22. The Competence Shadow: Theory and Bounds of AI Assistance in Safety Engineering


23. Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills


24. UniAI-GraphRAG: Synergizing Ontology-Guided Extraction, Multi-Dimensional Clustering, and Dual-Channel Fusion for Robust Multi-Hop Reasoning


25. RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following


26. When Sensing Varies with Contexts: Context-as-Transform for Tactile Few-Shot Class-Incremental Learning


27. ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents


28. Sparse Visual Thought Circuits in Vision-Language Models


29. MP-MoE: Matrix Profile-Guided Mixture of Experts for Precipitation Forecasting


30. Mechanistically Interpreting Compression in Vision-Language Models


31. From Stateless to Situated: Building a Psychological World for LLM-Based Emotional Support


32. System-Anchored Knee Estimation for Low-Cost Context Window Selection in PDE Forecasting


33. A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures


34. Rethinking Failure Attribution in Multi-Agent Systems: A Multi-Perspective Benchmark and Evaluation


35. The Anatomy of Uncertainty in LLMs


36. Design Once, Deploy at Scale: Template-Driven ML Development for Large Model Ecosystems


37. Can MLLMs Read Students’ Minds? Unpacking Multimodal Error Analysis in Handwritten Math


38. Shopping with a Platform AI Assistant: Who Adopts, When in the Journey, and What For


39. FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol


40. Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with Machine Learning and Transformers


41. LogitScope: A Framework for Analyzing LLM Uncertainty Through Information Metrics


42. On the Foundations of Trustworthy Artificial Intelligence


43. How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning


44. SentinelAI: A Multi-Agent Framework for Structuring and Linking NG9-1-1 Emergency Incident Data


45. Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts


46. ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing


47. Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineering Design


48. Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach


49. Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour


50. AutoSAM: an Agentic Framework for Automating Input File Generation for the SAM Code with Multi-Modal Retrieval-Augmented Generation


51. When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs


52. ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence


53. Vega: Learning to Drive with Natural Language Instructions


54. Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving


55. PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference


56. PixelSmile: Toward Fine-Grained Facial Expression Editing


57. Natural-Language Agent Harnesses


58. Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models


59. Neural Network Conversion of Machine Learning Pipelines


60. The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase


61. A Unified Memory Perspective for Probabilistic Trustworthy AI


62. Just Zoom In: Cross-View Geo-Localization via Autoregressive Zooming


63. Measuring What Matters – or What’s Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors


64. A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots


65. Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers


66. Visual or Textual: Effects of Explanation Format and Personal Characteristics on the Perception of Explanations in an Educational Recommender System


67. Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification


68. DeepFAN, a transformer-based deep learning model for human-artificial intelligence collaborative assessment of incidental pulmonary nodules in CT scans: a multi-reader, multi-case trial


69. TAAC: A gate into Trustable Audio Affective Computing


70. Are LLMs Overkill for Databases?: A Study on the Finiteness of SQL


71. Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes


72. CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild


73. NERO-Net: A Neuroevolutionary Approach for the Design of Adversarially Robust CNNs


74. Challenges in Hyperspectral Imaging for Autonomous Driving: The HSI-Drive Case


75. Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification


76. Interpretable PM2.5 Forecasting for Urban Air Quality: A Comparative Study of Operational Time-Series Models


77. Maximum Entropy Behavior Exploration for Sim2Real Zero-Shot Reinforcement Learning


78. Temporally Decoupled Diffusion Planning for Autonomous Driving


79. From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild


80. Decidable By Construction: Design-Time Verification for Trustworthy AI


81. System Design for Maintaining Internal State Consistency in Long-Horizon Robotic Tabletop Games


82. Shape and Substance: Dual-Layer Side-Channel Attacks on Local Vision-Language Models


83. A Causal Framework for Evaluating ICU Discharge Strategies


84. GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs


85. Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics


86. Image Rotation Angle Estimation: Comparing Circular-Aware Methods


87. Adaptive Chunking: Optimizing Chunking-Method Selection for RAG


88. How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models


89. AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer’s Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study


90. Revealing the influence of participant failures on model quality in cross-silo Federated Learning


91. CSI-tuples-based 3D Channel Fingerprints Construction Assisted by MultiModal Learning


92. CRAFT: Grounded Multi-Agent Coordination Under Partial Information


93. MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation


94. Does Explanation Correctness Matter? Linking Computational XAI Evaluation to Human Understanding


95. Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language Models


96. FEAST: Fully Connected Expressive Attention for Spatial Transcriptomics


97. FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA


98. WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing


99. A Wireless World Model for AI-Native 6G Networks


100. Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction


101. A Decade-Scale Benchmark Evaluating LLMs’ Clinical Practice Guidelines Detection and Adherence in Multi-turn Conversations


102. Probing the Lack of Stable Internal Beliefs in LLMs


103. Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model


104. Knowledge-Guided Adversarial Training for Infrared Object Detection via Thermal Radiation Modeling


105. PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems


106. Vision Hopfield Memory Networks


107. Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models


108. Goodness-of-pronunciation without phoneme time alignment


109. Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence


110. FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation


111. SAVe: Self-Supervised Audio-visual Deepfake Detection Exploiting Visual Artifacts and Audio-visual Misalignment


112. Reinforcement learning for quantum processes with memory


113. MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation


114. Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory


115. MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness


116. Layer-Specific Lipschitz Modulation for Fault-Tolerant Multimodal Representation Learning


117. From Logic Monopoly to Social Contract: Separation of Power and the Institutional Foundations for Autonomous Agent Economies


118. Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization


119. Pixelis: Reasoning in Pixels, from Seeing to Acting


120. Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization


121. An Explainable Ensemble Learning Framework for Crop Classification with Optimized Feature Pyramids and Deep Networks


122. TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization


123. The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities


124. Closing the Confidence-Faithfulness Gap in Large Language Models


125. Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models


126. Few TensoRF: Enhance the Few-shot on Tensorial Radiance Fields


127. Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning


128. Efficient Detection of Bad Benchmark Items with Novel Scalability Coefficients


129. Learning Rollout from Sampling: An R1-Style Tokenized Traffic Simulation Model


130. Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators


131. Subject-Specific Low-Field MRI Synthesis via a Neural Operator


132. Self-Corrected Image Generation with Explainable Latent Rewards


133. Toward domain-specific machine translation and quality estimation systems


134. Evaluating adaptive and generative AI-based feedback and recommendations in a knowledge-graph-integrated programming learning system


135. TIGFlow-GRPO: Trajectory Forecasting via Interaction-Aware Flow Matching and Reward-Driven Optimization


136. CVA: Context-aware Video-text Alignment for Video Temporal Grounding


137. Shaping the Future of Mathematics in the Age of AI


138. Integrated Multi-Drone Task Allocation, Sequencing, and Optimal Trajectory Generation in Obstacle-Rich 3D Environments


139. Sovereign AI at the Front Door of Care: A Physically Unidirectional Architecture for Secure Clinical Intelligence


140. LogSigma at SemEval-2026 Task 3: Uncertainty-Weighted Multitask Learning for Dimensional Aspect-Based Sentiment Analysis


141. Surrogates, Spikes, and Sparsity: Performance Analysis and Characterization of SNN Hyperparameters on Hardware


142. More Than “Means to an End”: Supporting Reasoning with Transparently Designed AI Data Science Processes


143. AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective


144. Gaze patterns predict preference and confidence in pairwise AI image evaluation


145. NeuroVLM-Bench: Evaluation of Vision-Enabled Large Language Models for Clinical Reasoning in Neurological Disorders


146. Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models


147. A Practical Guide Towards Interpreting Time-Series Deep Clinical Predictive Models: A Reproducibility Study


148. Learning From Developers: Towards Reliable Patch Validation at Scale for Linux


149. Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Counting


150. FODMP: Fast One-Step Diffusion of Movement Primitives Generation for Time-Dependent Robot Actions


151. GoldiCLIP: The Goldilocks Approach for Balancing Explicit Supervision for Language-Image Pretraining


152. Dissecting Model Failures in Abdominal Aortic Aneurysm Segmentation through Explainability-Driven Analysis


153. AIP: Agent Identity Protocol for Verifiable Delegation Across MCP and A2A


154. From Untestable to Testable: Metamorphic Testing in the Age of LLMs


155. Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset


156. SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks


157. Pseudo Label NCF for Sparse OHC Recommendation: Dual Representation Learning and the Separability Accuracy Trade off


158. Grokking as a Falsifiable Finite-Size Transition


159. Decentralized Task Scheduling in Distributed Systems: A Deep Reinforcement Learning Approach


160. Is Geometry Enough? An Evaluation of Landmark-Based Gaze Estimation


161. Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models


162. Reconstructing Spiking Neural Networks Using a Single Neuron with Autapses


163. When Consistency Becomes Bias: Interviewer Effects in Semi-Structured Clinical Interviews


164. Experiential Reflective Learning for Self-Improving LLM Agents


165. DyMRL: Dynamic Multispace Representation Learning for Multimodal Event Forecasting in Knowledge Graph


166. Dual-Graph Multi-Agent Reinforcement Learning for Handover Optimization


167. TRAJEVAL: Decomposing Code Agent Trajectories for Fine-Grained Diagnosis


168. Sketch2Simulation: Automating Flowsheet Generation via Multi Agent Large Language Models


169. Causal AI For AMS Circuit Design: Interpretable Parameter Effects Analysis


170. Fusion Learning from Dynamic Functional Connectivity: Combining the Amplitude and Phase of fMRI Signals to Identify Brain Disorders


171. MuViS: Multimodal Virtual Sensing Benchmark


172. FED-HARGPT: A Hybrid Centralized-Federated Approach of a Transformer-based Architecture for Human Context Recognition


173. A Learnable SIM Paradigm: Fundamentals, Training Techniques, and Applications


174. X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs


175. Model2Kernel: Model-Aware Symbolic Execution For Safe CUDA Kernels


176. Malicious LLM-Based Conversational AI Makes Users Reveal Personal Information


177. History of generative Artificial Intelligence (AI) chatbots: past, present, and future development