All AI Papers - 2026-03-12

1. A Hybrid Knowledge-Grounded Framework for Safety and Traceability in Prescription Verification


2. Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization


3. Emulating Clinician Cognition via Self-Evolving Deep Clinical Research


4. FAME: Formal Abstract Minimal Explanation for Neural Networks


5. Trajectory-Informed Memory Generation for Self-Improving Agent Systems


6. Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning


7. CUAAudit: Meta-Evaluation of Vision-Language Models as Auditors of Autonomous Computer-Use Agents


8. Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents


9. IH-Challenge: A Training Dataset to Improve Instruction Hierarchy on Frontier LLMs


10. Resource-constrained Amazons chess decision framework integrating large language models and graph attention


11. Verbalizing LLM’s Higher-order Uncertainty via Imprecise Probabilities


12. Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability


13. HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation


14. Hybrid Self-evolving Structured Memory for GUI Agents


15. Agentic Control Center for Data Product Optimization


16. COMIC: Agentic Sketch Comedy Generation


17. LiTo: Surface Light Field Tokenization


18. Neural Field Thermal Tomography: A Differentiable Physics Framework for Non-Destructive Evaluation


19. V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation


20. Instruction set for the representation of graphs


21. Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style


22. RCTs & Human Uplift Studies: Methodological Challenges and Practical Solutions for Frontier AI Evaluation


23. Artificial Intelligence as a Catalyst for Innovation in Software Engineering


24. GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations


25. Contact Coverage-Guided Exploration for General-Purpose Dexterous Manipulation


26. Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control


27. Historical Consensus: Preventing Posterior Collapse via Iterative Selection of Gaussian Mixture Priors


28. When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS


29. LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation


30. Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models


31. Continuous Diffusion Transformers for Designing Synthetic Regulatory Elements


32. An Extreme Multi-label Text Classification (XMTC) Library Dataset: What if we took “Use of Practical AI in Digital Libraries” seriously?


33. GRACE: A Unified 2D Multi-Robot Path Planning Simulator & Benchmark for Grid, Roadmap, And Continuous Environments


34. $V_{0.5}$: Generalist Value Model as a Prior for Sparse RL Rollouts


35. Semantic Landmark Particle Filter for Robot Localisation in Vineyards


36. Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis


37. Human Presence Detection via Wi-Fi Range-Filtered Doppler Spectrum on Commodity Laptops


38. On the Reliability of Cue Conflict and Beyond


39. BALD-SAM: Disagreement-based Active Prompting in Interactive Segmentation


40. Speaker Verification with Speech-Aware LLMs: Evaluation and Augmentation


41. Protein Counterfactuals via Diffusion-Guided Latent Optimization


42. Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services


43. Towards Intelligent Spectrum Management: Spectrum Demand Estimation Using Graph Neural Networks


44. AI-Enhanced Spatial Cellular Traffic Demand Prediction with Contextual Clustering and Error Correction for 5G/6G Planning


45. Taking Shortcuts for Categorical VQA Using Super Neurons


46. Deep Randomized Distributed Function Computation (DeepRDFC): Neural Distributed Channel Simulation


47. CUPID: A Plug-in Framework for Joint Aleatoric and Epistemic Uncertainty Estimation with a Single Model


48. Towards Robust Speech Deepfake Detection via Human-Inspired Reasoning


49. UAV traffic scene understanding: A cross-spectral guided approach and a unified benchmark


50. Probabilistic Verification of Voice Anti-Spoofing Models


51. AlphaFlowTSE: One-Step Generative Target Speaker Extraction via Conditional AlphaFlow


52. Structured Linked Data as a Memory Layer for Agent-Orchestrated Retrieval


53. EvoSchema: Towards Text-to-SQL Robustness Against Schema Evolution


54. RandMark: On Random Watermarking of Visual Foundation Models


55. Repurposing Backdoors for Good: Ephemeral Intrinsic Proofs for Verifiable Aggregation in Cross-silo Federated Learning


56. Contract And Conquer: How to Provably Compute Adversarial Examples for a Black-Box Model?


57. A Platform-Agnostic Multimodal Digital Human Modelling Framework: Neurophysiological Sensing in Game-Based Interaction


58. Are Video Reasoning Models Ready to Go Outside?


59. Interleaving Scheduling and Motion Planning with Incremental Learning of Symbolic Space-Time Motion Abstractions


60. Detecting and Eliminating Neural Network Backdoors Through Active Paths with Application to Intrusion Detection


61. Reinforcement Learning with Conditional Expectation Reward


62. Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction


63. Gradient Flow Drifting: Generative Modeling via Wasserstein Gradient Flows of KDE-Approximated Divergences


64. Towards Cognitive Defect Analysis in Active Infrared Thermography with Vision-Text Cues


65. SCORE: Replacing Layer Stacking with Contractive Recurrent Depth


66. Prompting with the human-touch: evaluating model-sensitivity of foundation models for musculoskeletal CT segmentation


67. UAV-MARL: Multi-Agent Reinforcement Learning for Time-Critical and Dynamic Medical Supply Delivery


68. Naïve Exposure of Generative AI Capabilities Undermines Deepfake Detection


69. JEDI: Jointly Embedded Inference of Neural Dynamics


70. Learning to Negotiate: Multi-Agent Deliberation for Collective Value Alignment in LLMs


71. Aligning Large Language Models with Searcher Preferences


72. Modeling Stage-wise Evolution of User Interests for News Recommendation


73. G-STAR: End-to-End Global Speaker-Tracking Attributed Recognition


74. UniPINN: A Unified PINN Framework for Multi-task Learning of Diverse Navier-Stokes Equations


75. FAR-Dex: Few-shot Data Augmentation and Adaptive Residual Policy Refinement for Dexterous Manipulation


76. The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training


77. Domain-Adaptive Health Indicator Learning with Degradation-Stage Synchronized Sampling and Cross-Domain Autoencoder


78. Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks


79. Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression


80. Designing Service Systems from Textual Evidence


81. On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD


82. Safe Probabilistic Planning for Human-Robot Interaction using Conformal Risk Control


83. Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design


84. Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning


85. Reactive Writers: How Co-Writing with AI Changes How We Engage with Ideas


86. Few-Shot Adaptation to Non-Stationary Environments via Latent Trend Embedding for Robotics


87. Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems


88. Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking


89. Utility Function is All You Need: LLM-based Congestion Control


90. Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck


91. Federated Active Learning Under Extreme Non-IID and Global Class Imbalance


92. Overcoming Visual Clutter in Vision Language Action Models via Concept-Gated Visual Distillation


93. Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers


94. PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner


95. NasoVoce: A Nose-Mounted Low-Audibility Speech Interface for Always-Available Speech Interaction


96. Is this Idea Novel? An Automated Benchmark for Judgment of Research Ideas


97. Simulation-in-the-Reasoning (SiR): A Conceptual Framework for Empirically Grounded AI in Autonomous Transportation


98. Quantum entanglement provides a competitive advantage in adversarial games


99. Conversational AI-Enhanced Exploration System to Query Large-Scale Digitised Collections of Natural History Museums


100. Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework


101. Joint Imaging-ROI Representation Learning via Cross-View Contrastive Alignment for Brain Disorder Classification


102. DUCTILE: Agentic LLM Orchestration of Engineering Analysis in Product Development Practice


103. Intrinsic Numerical Robustness and Fault Tolerance in a Neuromorphic Algorithm for Scientific Computing


104. Learning from Radio using Variational Quantum RF Sensing


105. Rethinking the Harmonic Loss via Non-Euclidean Distance Layers


106. Robotic Ultrasound Makes CBCT Alive


107. A Diffusion Analysis of Policy Gradient for Stochastic Bandits


108. Multilingual AI-Driven Password Strength Estimation with Similarity-Based Detection


109. Delta-K: Boosting Multi-Instance Generation via Cross-Attention Augmentation


110. Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models


111. MCP-in-SoS: Risk assessment framework for open-source MCP servers


112. Compatibility at a Cost: Systematic Discovery and Exploitation of MCP Clause-Compliance Vulnerabilities


113. Mashup Learning: Faster Finetuning by Remixing Past Checkpoints


114. Social Knowledge for Cross-Domain User Preference Modeling


115. The Generation-Recognition Asymmetry: Six Dimensions of a Fundamental Divide in Formal Language Theory


116. AR-VLA: True Autoregressive Action Expert for Vision-Language-Action Models


117. Lost in the Middle at Birth: An Exact Theory of Transformer Position Bias


118. CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR


119. Hardware Efficient Approximate Convolution with Tunable Error Tolerance for CNNs


120. Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models


121. Equivariant Asynchronous Diffusion: An Adaptive Denoising Schedule for Accelerated Molecular Conformation Generation


122. Execution Is the New Attack Surface: Survivability-Aware Agentic Crypto Trading with OpenClaw-Style Local Executors


123. Multi-Stream Perturbation Attack: Breaking Safety Alignment of Thinking LLMs Through Concurrent Task Interference


124. ES-dLLM: Efficient Inference for Diffusion Large Language Models by Early-Skipping


125. KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization


126. Digging Deeper: Learning Multi-Level Concept Hierarchies


127. Amnesia: Adversarial Semantic Layer Specific Activation Steering in Large Language Models


128. TASER: Task-Aware Spectral Energy Refine for Backdoor Suppression in UAV Swarms Decentralized Federated Learning


129. Marginals Before Conditionals


130. Why LLMs Fail: A Failure Analysis and Partial Success Measurement for Automated Security Patch Generation


131. Dissecting Chronos: Sparse Autoencoders Reveal Causal Feature Hierarchies in Time Series Foundation Models


132. ADVERSA: Measuring Multi-Turn Guardrail Degradation and Judge Reliability in Large Language Models


133. HTMuon: Improving Muon via Heavy-Tailed Spectral Correction


134. The Epistemic Support-Point Filter: Jaynesian Maximum Entropy Meets Popperian Falsification


135. Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead


136. Tool Receipts, Not Zero-Knowledge Proofs: Practical Hallucination Detection for AI Agents


137. SBOMs into Agentic AIBOMs: Schema Extensions, Agentic Orchestration, and Reproducibility Evaluation


138. Training Language Models via Neural Cellular Automata


139. Where Do Flow Semantics Reside? A Protocol-Native Tabular Pretraining Paradigm for Encrypted Traffic Classification


140. InFusionLayer: a CFA-based ensemble tool to generate new classifiers for learning and modeling


141. Revisiting Sharpness-Aware Minimization: A More Faithful and Effective Implementation


142. Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction


143. Gated Adaptation for Continual Learning in Human Activity Recognition


144. Safety Under Scaffolding: How Evaluation Conditions Shape Measured Safety


145. AMB-DSGDN: Adaptive Modality-Balanced Dynamic Semantic Graph Differential Network for Multimodal Emotion Recognition


146. Targeted Bit-Flip Attacks on LLM-Based Agents


147. Evaluating Progress in Graph Foundation Models: A Comprehensive Benchmark and New Insights


148. HTM-EAR: Importance-Preserving Tiered Memory with Hybrid Routing under Saturation


149. Architecture-Aware LLM Inference Optimization on AMD Instinct GPUs: A Comprehensive Benchmark and Deployment Study


150. The DMA Streaming Framework: Kernel-Level Buffer Orchestration for High-Performance AI Data Paths


151. How to Count AIs: Individuation and Liability for AI Agents


152. A Governance and Evaluation Framework for Deterministic, Rule-Based Clinical Decision Support in Empiric Antibiotic Prescribing


153. RedFuser: An Automatic Operator Fusion Framework for Cascaded Reductions on AI Accelerators


154. Defining AI Models and AI Systems: A Framework to Resolve the Boundary Problem


155. Prompts and Prayers: the Rise of GPTheology


156. DeliberationBench: A Normative Benchmark for the Influence of Large Language Models on Users’ Views


157. Assessing Cognitive Biases in LLMs for Judicial Decision Support: Virtuous Victim and Halo Effects


158. Measuring and Eliminating Refusals in Military Large Language Models


159. FERRET: Framework for Expansion Reliant Red Teaming


160. Personalized Group Relative Policy Optimization for Heterogenous Preference Alignment


161. GATech at AbjadMed: Bidirectional Encoders vs. Causal Decoders: Insights from 82-Class Arabic Medical Classification


162. SENS-ASR: Semantic Embedding injection in Neural-transducer for Streaming Automatic Speech Recognition


163. SpreadsheetArena: Decomposing Preference in LLM Generation of Spreadsheet Workbooks


164. Leveraging Wikidata for Geographically Informed Sociocultural Bias Dataset Creation: Application to Latin America


165. A Retrieval-Augmented Language Assistant for Unmanned Aircraft Safety Assessment and Regulatory Compliance


166. Automated evaluation of LLMs for effective machine translation of Mandarin Chinese to English


167. Empathy Is Not What Changed: Clinical Assessment of Psychological Safety Across GPT Model Generations


168. There Are No Silly Questions: Evaluation of Offline LLM Capabilities from a Turkish Perspective


169. Context Over Compute: Human-in-the-Loop Outperforms Iterative Chain-of-Thought Prompting in Interview Answer Quality


170. Evaluating Adjective-Noun Compositionality in LLMs: Functional vs Representational Perspectives


171. CEI: A Benchmark for Evaluating Pragmatic Reasoning in Language Models


172. TAMUSA-Chat: A Domain-Adapted Large Language Model Conversational System for Research and Responsible Deployment


173. PoultryLeX-Net: Domain-Adaptive Dual-Stream Transformer Architecture for Large-Scale Poultry Stakeholder Modeling


174. A Two-Stage Architecture for NDA Analysis: LLM-based Segmentation and Transformer-based Clause Classification



176. Causally Grounded Mechanistic Interpretability for LLMs with Faithful Natural-Language Explanations


177. Evolving Demonstration Optimization for Chain-of-Thought Feature Transformation


178. Quantifying Hallucinations in Large Language Models on Medical Textbooks


179. The Dunning-Kruger Effect in Large Language Models: An Empirical Study of Confidence Calibration


180. MoE-SpAc: Efficient MoE Inference Based on Speculative Activation Utility in Heterogeneous Edge Scenarios


181. AraModernBERT: Transtokenized Initialization and Long-Context Encoder Modeling for Arabic


182. Explainable LLM Unlearning Through Reasoning


183. One Model, Many Skills: Parameter-Efficient Fine-Tuning for Multitask Code Analysis


184. Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards