All AI Papers - 2026-03-06

1. The Spike, the Sparse and the Sink: Anatomy of Massive Activations and Attention Sinks


2. Towards Provably Unbiased LLM Judges via Bias-Bounded Evaluation


3. Distributed Partial Information Puzzles: Examining Common Ground Construction Under Epistemic Asymmetry


4. Dissociating Direct Access from Inference in AI Introspection


5. Judge Reliability Harness: Stress Testing the Reliability of LLM Judges



7. PACE: A Personalized Adaptive Curriculum Engine for 9-1-1 Call-taker Training


8. Ailed: A Psyche-Driven Chess Engine with Dynamic Emotional Modulation


9. Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned


10. UniSTOK: Uniform Inductive Spatio-Temporal Kriging


11. WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces


12. STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks


13. X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes


14. GCAgent: Enhancing Group Chat Communication through Dialogue Agents System


15. Reclaiming Lost Text Layers for Source-Free Cross-Domain Few-Shot Learning


16. AI+HW 2035: Shaping the Next Decade


17. KARL: Knowledge Agents via Reinforcement Learning


18. MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus


19. Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning


20. Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile


21. WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents


22. Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination


23. The Trilingual Triad Framework: Integrating Design, AI, and Domain Knowledge in No-code AI Smart City Course


24. AegisUI: Behavioral Anomaly Detection for Structured User Interface Protocols in AI Agent Systems


25. Survive at All Costs: Exploring LLM’s Risky Behaviors under Survival Pressure


26. S5-SHB Agent: Society 5.0 enabled Multi-model Agentic Blockchain Framework for Smart Home


27. Measuring the Fragility of Trust: Devising Credibility Index via Explanation Stability (CIES) for Business Decision Support Systems


28. BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human Decision-Making in Computational Psychiatry


29. Rethinking Representativeness and Diversity in Dynamic Data Selection


30. Retrieval-Augmented Generation with Covariate Time Series


31. TimeWarp: Evaluating Web Agents by Revisiting the Past


32. Knowledge-informed Bidding with Dual-process Control for Online Advertising


33. Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in LLM Multi-Agent Systems


34. EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection


35. Authorize-on-Demand: Dynamic Authorization with Legality-Aware Intellectual Property Protection for VLMs


36. Differentially Private Multimodal In-Context Learning


37. Bounded State in an Infinite Horizon: Proactive Hierarchical Memory for Ad-Hoc Recall over Streaming Dialogues


38. SEA-TS: Self-Evolving Agent for Autonomous Code Generation of Time Series Forecasting Algorithms


39. K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation


40. Causally Robust Reward Learning from Reason-Augmented Preference Feedback


41. On Multi-Step Theorem Prediction via Non-Parametric Structural Priors


42. Design Behaviour Codes (DBCs): A Taxonomy-Driven Layered Governance Benchmark for Large Language Models


43. VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment


44. LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks


45. EchoGuard: An Agentic Framework with Knowledge-Graph Memory for Detecting Manipulative Communication in Longitudinal Dialogue


46. Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling


47. Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction


48. MOOSEnger – a Domain-Specific AI Agent for the MOOSE Ecosystem


49. Evaluating the Search Agent in a Parallel World


50. HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel


51. Visioning Human-Agentic AI Teaming: Continuity, Tension, and Future Research


52. CONE: Embeddings for Complex Numerical Data Preserving Unit and Variable Semantics


53. Memory as Ontology: A Constitutional Memory Architecture for Persistent Digital Citizens


54. Interactive Benchmarks


55. Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery


56. From Offline to Periodic Adaptation for Pose-Based Shoplifting Detection in Real-world Retail Security


57. Model Medicine: A Clinical Framework for Understanding, Diagnosing, and Treating AI Models


58. Using Vision + Language Models to Predict Item Difficulty


59. When Agents Persuade: Propaganda Generation and Mitigation in LLMs


60. Towards automated data analysis: A guided framework for LLM-based risk estimation


61. ECG-MoE: Mixture-of-Expert Electrocardiogram Foundation Model


62. Self-Attribution Bias: When AI Monitors Go Easy on Themselves


63. Adaptive Memory Admission Control for LLM Agents


64. Discovering mathematical concepts through a multi-agent system


65. Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding


66. Capability Thresholds and Manufacturing Topology: How Embodied Intelligence Triggers Phase Transitions in Economic Geography


67. SkillNet: Create, Evaluate, and Connect AI Skills


68. RoboPocket: Improve Robot Policies Instantly with Your Phone


69. POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation


70. Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation


71. Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought


72. SurvHTE-Bench: A Benchmark for Heterogeneous Treatment Effect Estimation in Survival Analysis


73. Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval


74. RealWonder: Real-Time Physical Action-Conditioned Video Generation


75. Residual RL–MPC for Robust Microrobotic Cell Pushing Under Time-Varying Flow


76. Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model


77. SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning


78. Ensembling Language Models with Sequential Monte Carlo


79. RelaxFlow: Text-Driven Amodal 3D Generation


80. MobileFetalCLIP: Selective Repulsive Knowledge Distillation for Mobile Fetal Ultrasound Analysis


81. The Spatial and Temporal Resolution of Motor Intention in Multi-Target Prediction



83. GALACTIC: Global and Local Agnostic Counterfactuals for Time-series Clustering


84. PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration


85. Latent-Mark: An Audio Watermark Robust to Neural Resynthesis


86. Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution


87. WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation


88. Whispering to a Blackbox: Bootstrapping Frozen OCR with Visual Prompts


89. Visual-Informed Speech Enhancement Using Attention-Based Beamforming


90. Recursive Inference Machines for Neural Reasoning


91. Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards


92. Not All Trust is the Same: Effects of Decision Workflow and Explanations in Human-AI Decision Making


93. The Geometric Inductive Bias of Grokking: Bypassing Phase Transitions via Architectural Topology


94. SPyCer: Semi-Supervised Physics-Guided Contextual Attention for Near-Surface Air Temperature Estimation from Satellite Imagery


95. Early Warning of Intraoperative Adverse Events via Transformer-Driven Multi-Label Learning


96. Balancing Coverage and Draft Latency in Vocabulary Trimming for Faster Speculative Decoding


97. Stable-LoRA: Stabilizing Feature Learning of Low-Rank Adaptation


98. Escaping the Hydrolysis Trap: An Agentic Workflow for Inverse Design of Durable Photocatalytic Covalent Organic Frameworks


99. Logi-PAR: Logic-Infused Patient Activity Recognition via Differentiable Rule



101. C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reasoning


102. Lifelong Language-Conditioned Robotic Manipulation Learning


103. SSR-GS: Separating Specular Reflection in Gaussian Splatting for Glossy Surface Reconstruction


104. Federated Causal Discovery Across Heterogeneous Datasets under Latent Confounding


105. Recurrent Graph Neural Networks and Arithmetic Circuits


106. Particle-Guided Diffusion for Gas-Phase Reaction Kinetics


107. LBM: Hierarchical Large Auto-Bidding Model via Reasoning and Acting


108. Measuring the Redundancy of Decoder Layers in SpeechLLMs


109. FedBCD: Communication-Efficient Accelerated Block Coordinate Gradient Descent for Federated Learning


110. UniPAR: A Unified Framework for Pedestrian Attribute Recognition


111. SPIRIT: Perceptive Shared Autonomy for Robust Robotic Manipulation under Deep Learning Uncertainty


112. ARC-TGI: Human-Validated Task Generators with Reasoning Chain Templates for ARC-AGI


113. GEM-TFL: Bridging Weak and Full Supervision for Forgery Localization through EM-Guided Decomposition and Temporal Refinement


114. Axiomatic On-Manifold Shapley via Optimal Generative Flows


115. Aura: Universal Multi-dimensional Exogenous Integration for Aviation Time Series


116. Cyber Threat Intelligence for Artificial Intelligence Systems


117. A 360-degree Multi-camera System for Blue Emergency Light Detection Using Color Attention RT-DETR and the ABLDataset


118. MUTEX: Leveraging Multilingual Transformers and Conditional Random Fields for Enhanced Urdu Toxic Span Detection


119. Poisoning the Inner Prediction Logic of Graph Neural Networks for Clean-Label Backdoor Attacks


120. Debiasing Sequential Recommendation with Time-aware Inverse Propensity Scoring



122. 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding


123. Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation


124. MPCEval: A Benchmark for Multi-Party Conversation Generation


125. When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger


126. Location-Aware Pretraining for Medical Difference Visual Question Answering


127. BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning


128. EVMbench: Evaluating AI Agents on Smart Contract Security


129. VPWEM: Non-Markovian Visuomotor Policy with Working and Episodic Memory


130. Deterministic Preprocessing and Interpretable Fuzzy Banding for Cost-per-Student Reporting from Extracted Records


131. AgentSCOPE: Evaluating Contextual Privacy Across Agentic Workflows


132. Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models


133. FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation


134. DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization


135. Interpretable Pre-Release Baseball Pitch Type Anticipation from Broadcast 3D Kinematics


136. An Approach to Simultaneous Acquisition of Real-Time MRI Video, EEG, and Surface EMG for Articulatory, Brain, and Muscle Activity During Speech Production


137. SCoUT: Scalable Communication via Utility-Guided Temporal Grouping in Multi-Agent Reinforcement Learning


138. Multilevel Training for Kolmogorov Arnold Networks


139. On the Strengths and Weaknesses of Data for Open-set Embodied Assistance


140. Meta-D: Metadata-Aware Architectures for Brain Tumor Analysis and Missing-Modality Segmentation


141. Attention’s Gravitational Field: A Power-Law Interpretation of Positional Correlation


142. Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual Representation


143. Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm


144. Comparative Evaluation of Traditional Methods and Deep Learning for Brain Glioma Imaging. Review Paper


145. LAW & ORDER: Adaptive Spatial Weighting for Medical Diffusion and Segmentation


146. TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings


147. MADCrowner: Margin Aware Dental Crown Design with Template Deformation and Refinement


148. DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction


149. Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary


150. Stacked from One: Multi-Scale Self-Injection for Context Window Extension


151. DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval


152. Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild


153. AI-Assisted Moot Courts: Simulating Justice-Specific Questioning in Oral Arguments


154. Probabilistic Dreaming for World Models


155. When Denoising Hinders: Revisiting Zero-Shot ASR with SAM-Audio and Whisper


156. Detection of Illicit Content on Online Marketplaces using Large Language Models


157. Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement


158. Why the Brain Consolidates: Predictive Forgetting for Optimal Generalisation


159. Optimizing Language Models for Crosslingual Knowledge Consistency


160. Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks


161. Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector


162. GIANT - Global Path Integration and Attentive Graph Networks for Multi-Agent Trajectory Planning


163. When Sensors Fail: Temporal Sequence Models for Robust PPO under Sensor Drift


164. RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies


165. Vibe Code Bench: Evaluating AI Models on End-to-End Web Application Development


166. Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning


167. Why Do Neural Networks Forget: A Study of Collapse in Continual Learning


168. How Professional Visual Artists are Negotiating Generative AI in the Workplace


169. Invariant Causal Routing for Governing Social Norms in Online Market Economies


170. Still Fresh? Evaluating Temporal Drift in Retrieval Benchmarks


171. Projected Hessian Learning: Fast Curvature Supervision for Accurate Machine-Learning Interatomic Potentials


172. Augmenting representations with scientific papers


173. Activity Recognition from Smart Insole Sensor Data Using a Circular Dilated CNN


174. From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration


175. Towards Explainable Deep Learning for Ship Trajectory Prediction in Inland Waterways


176. Understanding the Dynamics of Demonstration Conflict in In-Context Learning


177. MAD-SmaAt-GNet: A Multimodal Advection-Guided Neural Network for Precipitation Nowcasting


178. VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling


179. Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks


180. Learning Unified Distance Metric for Heterogeneous Attribute Data Clustering


181. Large Language Models as Bidding Agents in Repeated HetNet Auction


182. Query Disambiguation via Answer-Free Context: Doubling Performance on Humanity’s Last Exam


183. Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models


184. A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science


185. On Emergences of Non-Classical Statistical Characteristics in Classical Neural Networks


186. MPBMC: Multi-Property Bounded Model Checking with GNN-guided Clustering


187. An Explainable Ensemble Framework for Alzheimer’s Disease Prediction Using Structured Clinical and Cognitive Data


188. vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models


189. AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems


190. A systematic approach to answering the easy problems of consciousness based on an executable cognitive system


191. CogGen: Cognitive-Load-Informed Fully Unsupervised Deep Generative Modeling for Compressively Sampled MRI Reconstruction


192. ASFL: An Adaptive Model Splitting and Resource Allocation Framework for Split Federated Learning


193. ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation


194. Uncertainty-Calibrated Spatiotemporal Field Diffusion with Sparse Supervision


195. What Is Missing: Interpretable Ratings for Large Language Model Outputs


196. Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Devices


197. Thin Keys, Full Values: Reducing KV Cache via Low-Dimensional Attention Selection


198. Delta-Crosscoder: Robust Crosscoder Model Diffing in Narrow Fine-Tuning Regimes


199. Generating Realistic, Protocol-Compliant Maritime Radio Dialogues using Self-Instruct and Low-Rank Adaptation


200. FedEMA-Distill: Exponential Moving Average Guided Knowledge Distillation for Robust Federated Learning


201. Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?


202. Context-Dependent Affordance Computation in Vision-Language Models


203. Decorrelating the Future: Joint Frequency Domain Learning for Spatio-temporal Forecasting


204. Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries


205. One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache


206. SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models


207. Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework


208. Semantic Containment as a Fundamental Property of Emergent Misalignment


209. CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG Models


210. Lost in Translation: How Language Re-Aligns Vision for Cross-Species Pathology


211. FinRetrieval: A Benchmark for Financial Data Retrieval by AI Agents


212. A theoretical model of dynamical grammatical gender shifting based on set-valued set function