All AI Papers - 2026-05-07

1. LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents


2. Executable World Models for ARC-AGI-3 in the Era of Coding Agents


3. Position: Embodied AI Requires a Privacy-Utility Trade-off


4. Uno-Orchestra: Parsimonious Agent Routing via Selective Delegation


5. On-line Learning in Tree MDPs by Treating Policies as Bandit Arms


6. A Foundation Model for Zero-Shot Logical Rule Induction


7. Curated AI beats frontier LLMs at pharma asset discovery


8. Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games


9. DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents


10. AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use


11. Reward-Decomposed Reinforcement Learning for Immersive Video Role-Playing


12. Budget-aware Auto Optimizer Configurator


13. AuditRepairBench: A Paired-Execution Trace Corpus for Evaluator-Channel Ranking Instability in Agent Repair


14. SensingAgents: A Multi-Agent Collaborative Framework for Robust IMU Activity Recognition


15. From Parameter Dynamics to Risk Scoring: Quantifying Sample-Level Safety Degradation in LLM Fine-tuning


16. How Does Thinking Mode Change LLM Moral Judgments? A Controlled Instant-vs-Thinking Comparison Across Five Frontier Models


17. Deployment-Relevant Alignment Cannot Be Inferred from Model-Level Evaluation Alone


18. When Context Hurts: The Crossover Effect of Knowledge Transfer on Multi-Agent Design Exploration


19. The Scaling Properties of Implicit Deductive Reasoning in Transformers


20. Agent Island: A Saturation- and Contamination-Resistant Benchmark from Multiagent Games


21. Parallel Prefix Verification for Speculative Generation


22. Temporal Reasoning Is Not the Bottleneck: A Probabilistic Inconsistency Framework for Neuro-Symbolic QA


23. Pro$^2$Assist: Continuous Step-Aware Proactive Assistance with Multimodal Egocentric Perception for Long-Horizon Procedural Tasks


24. ANDRE: An Attention-based Neuro-symbolic Differentiable Rule Extractor


25. Actionable Real-Time Modeling of Surgical Team Dynamics via Time-Expanded Interaction Graphs


26. Regularized Centered Emphatic Temporal Difference Learning


27. LCM: Lossless Context Management


28. Taming Outlier Tokens in Diffusion Transformers


29. Grokability in five inequalities


30. Almost-Orthogonality in Lp Spaces: A Case Study with Grok


31. When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning


32. Design Conductor 2.0: An agent builds a TurboQuant inference accelerator in 80 hours


33. The First Token Knows: Single-Decode Confidence for Hallucination Detection


34. Geometry-Aware State Space Model: A New Paradigm for Whole-Slide Image Representation


35. PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation


36. Aes3D: Aesthetic Assessment in 3D Gaussian Splatting


37. Superposition Is Not Necessary: A Mechanistic Interpretability Analysis of Transformer Representations for Time Series Forecasting


38. What Matters in Practical Learned Image Compression


39. Joint Treatment Effect Estimation from Incomplete Healthcare Data: Temporal Causal Normalizing Flows with LLM-driven Evolutionary MNAR Imputation


40. Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning


41. On the Wasserstein Gradient Flow Interpretation of Drifting Models


42. LineRides: Line-Guided Reinforcement Learning for Bicycle Robot Stunts


43. Building informative materials datasets beyond targeted objectives


44. Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement


45. Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics


46. Driver-WM: A Driver-Centric Traffic-Conditioned Latent World Model for In-Cabin Dynamics Rollout


47. Think-Aloud Reshapes Automated Cognitive Model Discovery Beyond Behavior


48. Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models


49. Look Once, Beam Twice: Camera-Primed Real-Time Double-Directional mmWave Beam Management for Vehicular Connectivity


50. The Impossibility Triangle of Long-Context Modeling


51. SoK: Robustness in Large Language Models against Jailbreak Attacks


52. Adaptive Learning Strategies for AoA-Based Outdoor Localization: A Comprehensive Framework


53. Direct Product Flow Matching: Decoupling Radial and Angular Dynamics for Few-Shot Adaptation


54. Piper: Efficient Large-Scale MoE Training via Resource Modeling and Pipelined Hybrid Parallelism


55. Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization


56. Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models


57. Misaligned by Reward: Socially Undesirable Preferences in LLMs


58. Federated Learning for Early Prediction of EV Charging Demand


59. Architectural Constraints Alignment in AI-assisted, Platform-based Service Development


60. Why Geometric Continuity Emerges in Deep Neural Networks: Residual Connections and Rotational Symmetry Breaking


61. Skill Neologisms: Towards Skill-based Continual Learning


62. Reliable Modeling of Distribution Shifts via Displacement-Reshaped Optimal Transport


63. EP-GRPO: Entropy-Progress Aligned Group Relative Policy Optimization with Implicit Process Guidance


64. DART: A Vision-Language Foundation Model for Comprehensive Rope Condition Monitoring


65. Modular Reinforcement Learning For Cooperative Swarms


66. When Does Gene Regulatory Network Inference Break? A Controlled Diagnostic Study of Causal and Correlational Methods on Single-Cell Data


67. Evolving Idea Graphs with Learnable Edits-and-Commits for Multi-Agent Scientific Ideation


68. Delta-Based Neural Architecture Search: LLM Fine-Tuning via Code Diffs


69. On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference


70. Storage Is Not Memory: A Retrieval-Centered Architecture for Agent Recall


71. FairEnc: A Fair Vision-Language Model with Fair Vision and Text Encoders for Glaucoma Detection


72. A Harmonic Mean Formulation of Average Reward Reinforcement Learning in SMDPs


73. Anticipating Innovation Using Large Language Models


74. Assessing Cognitive Effort in L2 Idiomatic Processing: An Eye-Tracking Dataset


75. Quantile-Free Uncertainty Quantification in Graph Neural Networks


76. StoryAlign: Evaluating and Training Reward Models for Story Generation


77. Beyond Seeing Is Believing: On Crowdsourced Detection of Audiovisual Deepfakes


78. Cognitive Twins: Investigating Personalized Thinking Model Building and Its Performance Enhancement with Human-in-the-Loop


79. Gyan: An Explainable Neuro-Symbolic Language Model


80. Hybrid Congestion Classification Framework Using Flow-Guided Attention and Empirical Mode Decomposition


81. Knowledge-Free Correlated Agreement for Incentivizing Federated Learning


82. AICoFe: Implementation and Deployment of an AI-Based Collaborative Feedback System for Higher Education


83. AISSA: Implementation and Deployment of an AI-based Student Slides Analysis tool for Academic Presentations


84. From Beats to Breaches: How Offensive AI Infers Sensitive User Information from Playlists


85. Exact Dual Geometry of SOC-ICNN Value Functions


86. FaithfulFaces: Pose-Faithful Facial Identity Preservation for Text-to-Video Generation


87. Sparse Tokens Suffice: Jailbreaking Audio Language Models via Token-Aware Gradient Optimization


88. Average Attention Transformers and Arithmetic Circuits


89. Multi-Level Bidirectional Biomimetic Learning for EEG-Based Visual Decoding


90. CodeEvolve: LLM-Driven Evolutionary Optimization with Runtime-Enriched Target Selection for Multi-Language Code Enhancement


91. Gradients with Respect to Semantics Preserving Embeddings Tell the Uncertainty of Large Language Models


92. Library learning with e-graphs on jazz harmony


93. Guidelines for Designing AI Technologies to Support Adult Learning



95. VocalParse: Towards Unified and Scalable Singing Voice Transcription with Large Audio Language Models


96. Reference-based Category Discovery: Unsupervised Object Detection with Category Awareness


97. A Queueing-Theoretic Framework for Stability Analysis of LLM Inference with KV Cache Memory Constraints


98. HeterSEED: Semantics-Structure Decoupling for Heterogeneous Graph Learning under Heterophily


99. From Diffusion to Rectified Flow: Rethinking Text-Based Segmentation


100. Dream-MPC: Gradient-Based Model Predictive Control with Latent Imagination


101. Efficient Geometry-Controlled High-Resolution Satellite Image Synthesis


102. Stage-adaptive audio diffusion modeling


103. RLearner-LLM: Balancing Logical Grounding and Fluency in Large Language Models via Hybrid Direct Preference Optimization


104. Accountable Agents in Software Engineering: An Analysis of Terms of Service and a Research Roadmap


105. SADE: Symptom-Aware Diagnostic Escalation for LLM-Based Network Troubleshooting


106. RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation


107. DAO-enabled decentralized physical AI: A new paradigm for human-machine collaboration


108. Predictive and Prescriptive AI toward Optimizing Wildfire Suppression


109. Ilov3Splat: Instance-Level Open-Vocabulary 3D Scene Understanding in Gaussian Splatting


110. JASTIN: Aligning LLMs for Zero-Shot Audio and Speech Evaluation via Natural Language Instructions


111. SpecPL: Disentangling Spectral Granularity for Prompt Learning


112. DiffCap-Bench: A Comprehensive, Challenging, Robust Benchmark for Image Difference Captioning


113. Example-Based Object Detection


114. Harnessing Linguistic Dissimilarity for Language Generalization on Unseen Low-Resource Varieties


115. Pen-Strategist: A Reasoning Framework for Penetration Testing Strategy Formation and Analysis


116. CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation


117. A Hybrid Method for Low-Resource Named Entity Recognition


118. CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training


119. Stabilizing LLM Supervised Fine-Tuning via Explicit Distributional Control


120. StableI2I: Spotting Unintended Changes in Image-to-Image Transition


121. GEM: Graph-Enhanced Mixture-of-Experts with ReAct Agents for Dialogue State Tracking


122. Dissociating spatial frequency reliance from adversarial robustness advantages in neurally guided deep convolutional neural networks


123. Joint Optimization of Trajectory Control, Resource Allocation, and Task Offloading for Multi-UAV-Assisted IoV


124. Towards Robust LLM Post-Training: Automatic Failure Management for Reinforcement Fine-Tuning


125. FLUID: Continuous-Time Hyperconnected Sparse Transformer for Sink-Free Learning


126. Demystifying Manifold Constraints in LLM Pre-training


127. Evaluation Cards for XAI Metrics


128. Detecting Deepfakes via Hamiltonian Dynamics


129. Critical Windows of Complexity Control: When Transformers Decide to Reason or Memorize


130. Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery


131. Worst-Case Discovery and Runtime Protection for RL-Based Network Controllers


132. Extending Differential Temporal Difference Methods for Episodic Problems


133. Mitigating Label Shift in Tabular In-Context Learning via Test-Time Posterior Adjustment


134. Coral: Cost-Efficient Multi-LLM Serving over Heterogeneous Cloud GPUs


135. Efficiently Aligning Language Models with Online Natural Language Feedback


136. Budgeted LoRA: Distillation as Structured Compute Allocation for Efficient Inference


137. Resilient AI Supercomputer Networking using MRC and SRv6


138. NoisyCausal: A Benchmark for Evaluating Causal Reasoning Under Structured Noise


139. Memory as a Markov Matrix: Sample Efficient Knowledge Expansion via Token-to-Dictionary Mapping


140. SWAN: Semantic Watermarking with Abstract Meaning Representation


141. LLMs Uncertainty Quantification via Adaptive Conformal Semantic Entropy


142. A Mean Curvature Approach to Boundary Detection: Geometric Insights for Unsupervised Learning


143. Layerwise LQR for Geometry-Aware Optimization of Deep Networks


144. ARMATA: Auto-Regressive Multi-Agent Task Assignment


145. Self-Prompting Small Language Models for Privacy-Sensitive Clinical Information Extraction


146. Predict-then-Diffuse: Adaptive Response Length for Compute-Budgeted Inference in Diffusion LLMs


147. Undetectable Backdoors in Model Parameters: Hiding Sparse Secrets in High Dimensions


148. Deep Wave Network for Modeling Multi-Scale Physical Dynamics


149. MedFabric and EtHER: A Data-Centric Framework for Word-Level Fabrication Generation and Detection in Medical LLMs


150. Frontier Lag: A Bibliometric Audit of Capability Misrepresentation in Academic AI Evaluation


151. A Dialogue-Based Framework for Correcting Multimodal Errors in AI-Assisted STEM Education


152. Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation


153. ProtDBench: A Unified Benchmark of Protein Binder Design and Evaluation


154. Learning reveals invisible structure in low-rank RNNs


155. Resource Utilization of Differentiable Logic Gate Networks Deployed on FPGAs


156. TSCG: Deterministic Tool-Schema Compilation for Agentic LLM Deployments


157. Meta-LegNet: A Transferable and Interpretable Framework for Surface Adsorption Prediction via Self-Defined Adsorption-Environment Learning


158. Are Multimodal LLMs Ready for Clinical Dermatology? A Real-World Evaluation in Dermatology


159. CTM-AI: A Blueprint for General AI Inspired by a Model of Consciousness


160. Evaluating Patient Safety Risks in Generative AI: Development and Validation of a FMECA Framework for Generated Clinical Content


161. FASQ: Flexible Accelerated Subspace Quantization for Calibration-Free LLM Compression


162. AsymmetryZero: A Framework for Operationalizing Human Expert Preferences as Semantic Evals


163. Time series causal discovery with variable lags



165. Efficient Handwriting-Based Alzheimer's Disease Diagnosis Using a Low-Rank Mixture of Experts Deep Learning Framework


166. Validity-Calibrated Reasoning Distillation


167. Balanced Aggregation: Understanding and Fixing Aggregation Bias in GRPO


168. A Regulatory Governance Framework for AI-Driven Financial Fraud Detection in U.S. Banking: Integrating OCC, SR 11-7, CFPB, and FinCEN Compliance Requirements for Model Development, Validation, and Monitoring Lifecycles


169. RetentiveKV: State-Space Memory for Uncertainty-Aware Multimodal KV Cache Eviction


170. A Physics-Aware Framework for Short-Term GPU Power Forecasting of AI Data Centers


171. Confronting Label Indeterminacy in Automated Bail Decisions


172. Sparse Autoencoder Decomposition of Clinical Sequence Model Representations: Feature Complexity, Task Specialisation, and Mortality Prediction


173. FlatASCEND: Autoregressive Clinical Sequence Generation with Continuous Time Prediction and Association-Based Pharmacological Testing


174. Toward Human-AI Complementarity Across Diverse Tasks


175. LAWS: Learning from Actual Workloads Symbolically – A Self-Certifying Parametrized Cache Architecture for Neural Inference, Robotics, and Edge Deployment


176. Designing a double deep reinforcement learning selection tool for resilient demand prediction


177. Investigating Trustworthiness of Nonparametric Deep Survival Models for Alzheimer’s Disease Progression Analysis


178. EdgeRazor: A Lightweight Framework for Large Language Models via Mixed-Precision Quantization-Aware Distillation


179. Lookahead Drifting Model


180. MP-ISMoE: Mixed-Precision Interactive Side Mixture-of-Experts for Efficient Transfer Learning



182. Transformation Categorization Based on Group Decomposition Theory Using Parameter Division


183. The Reasoning Trap: An Information-Theoretic Bound on Closed-System Multi-Step LLM Reasoning


184. Modeling Subjective Urban Perception with Human Gaze


185. Interpreting Manifolds and Graph Neural Embeddings from Internet of Things Traffic Flows


186. Learning Reconstructive Embeddings in Reproducing Kernel Hilbert Spaces via the Representer Theorem


187. A large language model-type architecture for high-dimensional molecular potential energy surfaces


188. Analogy between Boltzmann machines and Feynman path integrals