전체 AI 논문 - 2026-02-11

1. Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning


2. CODE-SHARP: Continuous Open-ended Discovery and Evolution of Skills as Hierarchical Reward Programs


3. Chain of Mindset: Reasoning with Adaptive Cognitive Modes


4. Discovering High Level Patterns from Simulation Traces


5. ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference


6. Closing Reasoning Gaps in Clinical Agents with Differential Reasoning Learning


7. Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?


8. Efficient Unsupervised Environment Design through Hierarchical Policy Representation Learning


9. Would a Large Language Model Pay Extra for a View? Inferring Willingness to Pay from Subjective Choices


10. Symbolic Pattern Temporal Numeric Planning with Intermediate Conditions and Effects


11. GHS-TDA: A Synergistic Reasoning Framework Integrating Global Hypothesis Space with Topological Data Analysis


12. ClinAlign: Scaling Healthcare Alignment from Clinician Preference


13. FLINGO – Instilling ASP Expressiveness into Linear Integer Constraints


14. Detecting radar targets swarms in range profiles with a partially complex-valued neural network


15. Autoregressive Direct Preference Optimization


16. Computing Conditional Shapley Values Using Tabular Foundation Models


17. Bridging Efficiency and Transparency: Explainable CoT Compression in Multimodal Large Reasoning Models


18. SpotAgent: Grounding Visual Geo-localization in Large Vision-Language Models through Agentic Reasoning


19. P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads


20. Image Quality in the Era of Artificial Intelligence


21. Not-in-Perspective: Towards Shielding Google’s Perspective API Against Adversarial Negation Attacks


22. Auditing Multi-Agent LLM Reasoning Trees Outperforms Majority Vote and LLM-as-Judge


23. Measuring Dataset Diversity from a Geometric Perspective


24. Human Control Is the Anchor, Not the Answer: Early Divergence of Oversight in Agentic AI Communities


25. FlyAOC: Evaluating Agentic Ontology Curation of Drosophila Scientific Knowledge Bases


26. CoMMa: Contribution-Aware Medical Multi-Agents From A Game-Theoretic Perspective


27. PABU: Progress-Aware Belief Update for Efficient LLM Agents


28. Uncertainty-Aware Multimodal Emotion Recognition through Dirichlet Parameterization


29. A Small-Scale System for Autoregressive Program Synthesis Enabling Controlled Experimentation


30. Biases in the Blind Spot: Detecting What LLMs Fail to Mention


31. Olaf-World: Orienting Latent Actions for Video World Modeling


32. Step-resolved data attribution for looped transformers


33. Causality in Video Diffusers is Separable from Denoising


34. Anagent For Enhancing Scientific Table & Figure Analysis


35. Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization


36. Optimistic World Models: Efficient Exploration in Model-Based Deep Reinforcement Learning


37. Fake-HR1: Rethinking reasoning of vision language model for synthetic image detection


38. Decoupled Reasoning with Implicit Fact Tokens (DRIFT): A Dual-Model Framework for Efficient Long-Context Inference


39. ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning


40. Kunlun: Establishing Scaling Laws for Massive-Scale Recommendation Systems through Unified Architecture Design


41. RoboSubtaskNet: Temporal Sub-task Segmentation for Human-to-Robot Skill Transfer in Real-World Environments


42. A Collaborative Safety Shield for Safe and Efficient CAV Lane Changes in Congested On-Ramp Merging


43. A Unified Assessment of the Poverty of the Stimulus Argument for Neural Language Models


44. Empirical Stability Analysis of Kolmogorov-Arnold Networks in Hard-Constrained Recurrent Physics-Informed Discovery


45. Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions


46. Online Monitoring Framework for Automotive Time Series Data using JEPA Embeddings


47. Coupled Inference in Diffusion Models for Semantic Decomposition


48. Supervised Metric Regularization Through Alternating Optimization for Multi-Regime Physics-Informed Neural Networks


49. Drug Release Modeling using Physics-Informed Neural Networks


50. Bladder Vessel Segmentation using a Hybrid Attention-Convolution Framework


51. Instruct2Act: From Human Instruction to Actions Sequencing and Execution via Robot Action Network for Robotic Manipulation


52. Unbalanced optimal transport for robust longitudinal lesion evolution with registration-aware and appearance-guided priors


53. Monocular Normal Estimation via Shading Sequence Estimation


54. LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations


55. SARS: A Novel Face and Body Shape and Appearance Aware 3D Reconstruction System extends Morphable Models


56. Self-Regulated Reading with AI Support: An Eight-Week Study with Students


57. Routing, Cascades, and User Choice for LLMs


58. TaCo: A Benchmark for Lossless and Lossy Codecs of Heterogeneous Tactile Data


59. Code2World: A GUI World Model via Renderable Code Generation


60. Hybrid Responsible AI-Stochastic Approach for SLA Compliance in Multivendor 6G Networks


61. Text summarization via global structure awareness


62. A Controlled Study of Double DQN and Dueling DQN Under Cross-Environment Transfer


63. Decomposing Reasoning Efficiency in Large Language Models


64. Flexible Entropy Control in RLVR with Gradient-Preserving Perspective


65. Explainability in Generative Medical Diffusion Models: A Faithfulness-Based Analysis on MRI Synthesis


66. Grounding LTL Tasks in Sub-Symbolic RL Environments for Zero-Shot Generalization


67. ExO-PPO: an Extended Off-policy Proximal Policy Optimization Algorithm


68. From Lightweight CNNs to SpikeNets: Benchmarking Accuracy-Energy Tradeoffs with Pruned Spiking SqueezeNet


69. Physics-informed diffusion models in spectral space


70. Maastricht University at AMIYA: Adapting LLMs for Dialectal Arabic using Fine-tuning and MBR Decoding


71. GenSeg-R1: RL-Driven Vision-Language Grounding for Fine-Grained Referring Segmentation


72. Resilient Class-Incremental Learning: on the Interplay of Drifting, Unlabelled and Imbalanced Data Streams


73. Administrative Law’s Fourth Settlement: AI and the Capability-Accountability Trap


74. MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering


75. Stop Testing Attacks, Start Diagnosing Defenses: The Four-Checkpoint Framework Reveals Where LLM Safety Breaks


76. AnyTouch 2: General Optical Tactile Representation Learning For Dynamic Tactile Perception


77. With Argus Eyes: Assessing Retrieval Gaps via Uncertainty Scoring to Detect and Remedy Retrieval Blind Spots


78. AGMark: Attention-Guided Dynamic Watermarking for Large Vision-Language Models


79. Why the Counterintuitive Phenomenon of Likelihood Rarely Appears in Tabular Anomaly Detection with Deep Generative Models?


80. On the Optimal Reasoning Length for RL-Trained Language Models


81. Context-Aware Counterfactual Data Augmentation for Gender Bias Mitigation in Language Models


82. MieDB-100k: A Comprehensive Dataset for Medical Image Editing


83. Mitigating the Likelihood Paradox in Flow-based OOD Detection via Entropy Manipulation


84. Aligning Tree-Search Policies with Fixed Token Budgets in Test-Time Scaling of LLMs


85. Predictive Query Language: A Domain-Specific Language for Predictive Modeling on Relational Databases


86. LEMUR: A Corpus for Robust Fine-Tuning of Multilingual Law Embedding Models for Retrieval


87. ECG-IMN: Interpretable Mesomorphic Neural Networks for 12-Lead Electrocardiogram Interpretation


88. Comprehensive Comparison of RAG Methods Across Multi-Domain Conversational QA


89. Learning to Discover Iterative Spectral Algorithms


90. EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies


91. Seeing the Goal, Missing the Truth: Human Accountability for AI Bias


92. Beware of the Batch Size: Hyperparameter Bias in Evaluating LoRA


93. Listen to the Layers: Mitigating Hallucinations with Inter-Layer Disagreement


94. ArtifactLens: Hundreds of Labels Are Enough for Artifact Detection with VLMs


95. NOWJ @BioCreative IX ToxHabits: An Ensemble Deep Learning Approach for Detecting Substance Use and Contextual Information in Clinical Texts


96. AlgoVeri: An Aligned Benchmark for Verified Code Generation on Classical Algorithms


97. SWE-AGI: Benchmarking Specification-Driven Software Construction with MoonBit in the Era of Autonomous Agents


98. Conceptual Cultural Index: A Metric for Cultural Specificity via Relative Generality


99. Evaluating Social Bias in RAG Systems: When External Context Helps and Reasoning Hurts


100. Diffusion-Guided Pretraining for Brain Graph Foundation Models


101. A Behavioral Fingerprint for Large Language Models: Provenance Tracking via Refusal Vectors


102. Autonomous Action Runtime Management(AARM):A System Specification for Securing AI-Driven Actions at Runtime


103. Sci-VLA: Agentic VLA Inference Plugin for Long-Horizon Tasks in Scientific Experiments


104. Beyond Input-Output: Rethinking Creativity through Design-by-Analogy in Human-AI Collaboration


105. LARV: Data-Free Layer-wise Adaptive Rescaling Veneer for Model Merging


106. Accelerating Post-Quantum Cryptography via LLM-Driven Hardware-Software Co-Design


107. Squeezing More from the Stream : Learning Representation Online for Streaming Reinforcement Learning


108. The Critical Horizon: Inspection Design Principles for Multi-Stage Operations and Deep Reasoning


109. LLMAC: A Global and Explainable Access Control Framework with Large Language Model


110. Contractual Deepfakes: Can Large Language Models Generate Contracts?


111. BiasScope: Towards Automated Detection of Bias in LLM-as-a-Judge Evaluation


112. Surrogate-Guided Quantum Discovery in Black-Box Landscapes with Latent-Quadratic Interaction Embedding Transformers


113. Behavioral Economics of AI: LLM Biases and Corrections


114. AgentCgroup: Understanding and Controlling OS Resources of AI Agents


115. Kyrtos: A methodology for automatic deep analysis of graphic charts with curves in technical documents


116. Beyond Uniform Credit: Causal Credit Assignment for Policy Optimization


117. GAFR-Net: A Graph Attention and Fuzzy-Rule Network for Interpretable Breast Cancer Image Classification


118. SnareNet: Flexible Repair Layers for Neural Networks with Hard Constraints


119. A Deep Multi-Modal Method for Patient Wound Healing Assessment


120. Clarifying Shampoo: Adapting Spectral Descent to Stochasticity and the Parameter Trajectory


121. Don’t Shoot The Breeze: Topic Continuity Model Using Nonlinear Naive Bayes With Attention


122. Empowering Contrastive Federated Sequential Recommendation with LLMs


123. X-Mark: Saliency-Guided Robust Dataset Ownership Verification for Medical Imaging


124. Effective Reasoning Chains Reduce Intrinsic Dimensionality


125. STaR: Scalable Task-Conditioned Retrieval for Long-Horizon Multimodal Robot Memory


126. VLM-Guided Iterative Refinement for Surgical Image Segmentation with Foundation Models


127. Do Neural Networks Lose Plasticity in a Gradually Changing World?


128. MUZZLE: Adaptive Agentic Red-Teaming of Web Agents Against Indirect Prompt Injection Attacks


129. A Lightweight Multi-View Approach to Short-Term Load Forecasting


130. CausalGDP: Causality-Guided Diffusion Policies for Reinforcement Learning


131. Genocide by Algorithm in Gaza: Artificial Intelligence, Countervailing Responsibility, and the Corruption of Public Discourse


132. Gradient Residual Connections


133. AIDev: Studying AI Coding Agents on GitHub


134. $n$-Musketeers: Reinforcement Learning Shapes Collaboration Among Language Models


135. Quantifying Epistemic Uncertainty in Diffusion Models


136. What do Geometric Hallucination Detection Metrics Actually Measure?


137. A Hybrid Deterministic Framework for Named Entity Extraction in Broadcast News Video


138. SceneSmith: Agentic Generation of Simulation-Ready Indoor Scenes


139. Benchmarking the Energy Savings with Speculative Decoding Strategies


140. Distributed Hybrid Parallelism for Large Language Models: Comparative Study and System Design Guide


141. UI-Venus-1.5 Technical Report


142. DMamba: Decomposition-enhanced Mamba for Time Series Forecasting


143. Looping Back to Move Forward: Recursive Transformers for Efficient and Flexible Large Multimodal Models


144. Framework for Integrating Zero Trust in Cloud-Based Endpoint Security for Critical Infrastructure


145. Learning to Remember, Learn, and Forget in Attention-Based Models


146. DRAGON: Robust Classification for Very Large Collections of Software Repositories


147. NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control


148. AntigenLM: Structure-Aware DNA Language Modeling for Influenza


149. Spectral Disentanglement and Enhancement: A Dual-domain Contrastive Framework for Representation Learning


150. Enhanced Graph Transformer with Serialized Graph Tokens


151. Predicting Open Source Software Sustainability with Deep Temporal Neural Hierarchical Architectures and Explainable AI


152. scBench: Evaluating AI Agents on Single-Cell RNA-seq Analysis


153. Persistent Entropy as a Detector of Phase Transitions


154. RuleFlow : Generating Reusable Program Optimizations with LLMs


155. SAS-Net: Scene-Appearance Separation Network for Robust Spatiotemporal Registration in Bidirectional Photoacoustic Microscopy


156. DSFlow: Dual Supervision and Step-Aware Architecture for One-Step Flow Matching Speech Synthesis


157. Soft Clustering Anchors for Self-Supervised Speech Representation Learning in Joint Embedding Prediction Architectures


158. Efficient Distance Pruning for Process Suffix Comparison in Prescriptive Process Monitoring


159. Scaling GraphLLM with Bilevel-Optimized Sparse Querying


160. E2CAR: An Efficient 2D-CNN Framework for Real-Time EEG Artifact Removal on Edge Devices


161. Recovering Whole-Brain Causal Connectivity under Indirect Observation with Applications to Human EEG and fMRI


162. Federated Learning for Surgical Vision in Appendicitis Classification: Results of the FedSurg EndoVis 2024 Challenge