All AI Papers - 2026-02-05

1. Fluid Representations in Reasoning Models


2. Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing


3. Are AI Capabilities Increasing Exponentially? A Competing Hypothesis


4. Agentic AI in Healthcare & Medicine: A Seven-Dimensional Taxonomy for Empirical Evaluation of LLM-based Agents


5. WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning


6. Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration


7. From Competition to Collaboration: Designing Sustainable Mechanisms Between LLMs and Online Forums


8. ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control


9. Digital Twins & ZeroConf AI: Structuring Automated Intelligent Pipelines for Industrial Applications


10. From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents


11. Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning



13. InterPReT: Interactive Policy Restructuring and Training Enable Effective Imitation Learning from Laypersons


14. Steering LLMs via Scalable Interactive Oversight


15. OMG-Agent: Toward Robust Missing Modality Generation with Decoupled Coarse-to-Fine Agentic Workflows


16. Interfaze: The Future of AI is built on Task-Specific Small Models


17. Scaling In-Context Online Learning Capability of LLMs via Cross-Episode Meta-RL


18. Axiomatic Foundations of Counterfactual Explanations


19. When AI Persuades: Adversarial Explanation Attacks on Human Trust in AI-Assisted Decision Making


20. Monitorability as a Free Gift: How RLVR Spontaneously Aligns Reasoning


21. Adaptive Test-Time Compute Allocation via Learned Heuristics over Categorical Structure


22. Active Epistemic Control for Query-Efficient Verified Planning


23. AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent


24. Enhancing Mathematical Problem Solving in LLMs through Execution-Driven Reasoning Augmentation


25. Knowledge Model Prompting Increases LLM Performance on Planning Tasks


26. Protein Autoregressive Modeling via Multiscale Structure Generation


27. Contrastive Continual Learning for Model Adaptability in Internet of Things


28. Rethinking the Trust Region in LLM Reinforcement Learning


29. Multi-layer Cross-Attention is Provably Optimal for Multi-modal In-context Learning


30. CRoSS: A Continual Robotic Simulation Suite for Scalable Reinforcement Learning with High Task Diversity and Realistic Physics Simulation


31. Subliminal Effects in Your Data: A General Mechanism via Log-Linearity


32. From Evaluation to Design: Using Potential Energy Surface Smoothness Metrics to Guide Machine Learning Interatomic Potential Architectures


33. El Agente Quntur: A research collaborator agent for quantum chemistry


34. El Agente Estructural: An Artificially Intelligent Molecular Editor


35. It’s not a Lottery, it’s a Race: Understanding How Gradient Descent Adapts the Network’s Capacity to the Task


36. Safe Urban Traffic Control via Uncertainty-Aware Conformal Prediction and World-Model Reinforcement Learning


37. Toward Reliable and Explainable Nail Disease Classification: Leveraging Adversarial Training and Grad-CAM Visualization


38. SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization


39. Beyond Rewards in Reinforcement Learning for Cyber Defence


40. Skin Tokens: A Learned Compact Representation for Unified Autoregressive Rigging


41. Team, Then Trim: An Assembly-Line LLM Framework for High-Quality Tabular Data Generation


42. Billion-Scale Graph Foundation Models


43. Active Asymmetric Multi-Agent Multimodal Learning under Uncertainty


44. When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?


45. Comparative Insights on Adversarial Machine Learning from Industry and Academia: A User-Study Approach


46. Exploiting contextual information to improve stance detection in informal political discourse with LLMs


47. Alignment Drift in Multimodal LLMs: A Two-Phase, Longitudinal Evaluation of Harm Across Eight Model Releases


48. From Data to Behavior: Predicting Unintended Model Behaviors Before Training


49. Supporting software engineering tasks with agentic AI: Demonstration on document retrieval and test scenario generation


50. Identifying Intervenable and Interpretable Features via Orthogonality Regularization


51. Adaptive Prompt Elicitation for Text-to-Image Generation


52. SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation


53. Addressing Corpus Knowledge Poisoning Attacks on RAG Using Sparse Attention


54. DRMOT: A Dataset and Framework for RGBD Referring Multi-Object Tracking


55. Audio ControlNet for Fine-Grained Audio Generation and Editing


56. Let Experts Feel Uncertainty: A Multi-Expert Label Distribution Approach to Probabilistic Time Series Forecasting


57. Overstating Attitudes, Ignoring Networks: LLM Biases in Simulating Misinformation Susceptibility


58. Delving into Muon and Beyond: Deep Analysis and Extensions


59. Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design


60. Towards Structured, State-Aware, and Execution-Grounded Reasoning for Software Engineering Agents


61. A Human-Centered Privacy Approach (HCP) to AI


62. RexBERT: Context Specialized Bidirectional Encoders for E-commerce


63. VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration


64. Trust The Typical


65. Dual Mind World Model Inspired Network Digital Twin for Access Scheduling


66. OmniRad: A Radiological Foundation Model for Multi-Task Medical Image Analysis


67. Continual Learning through Control Minimization


68. LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding


69. SLUM-i: Semi-supervised Learning for Urban Mapping of Informal Settlements and Data Quality Benchmarking


70. Learning the Value Systems of Agents with Preference-based and Inverse Reinforcement Learning


71. BrainVista: Modeling Naturalistic Brain Dynamics as Multimodal Next-Token Prediction


72. Discovering Mechanistic Models of Neural Activity: System Identification in an in Silico Zebrafish


73. LLM-Empowered Cooperative Content Caching in Vehicular Fog Caching-Assisted Platoon Networks


74. Is Micro Domain-Adaptive Pre-Training Effective for Real-World Operations? Multi-Step Evaluation Reveals Potential and Bottlenecks


75. Growth First, Care Second? Tracing the Landscape of LLM Value Preferences in Everyday Dilemmas


76. RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models


77. Mixture of Masters: Sparse Chess Language Models with Player Routing


78. No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data


79. SPEAR: An Engineering Case Study of Multi-Agent Coordination for Smart Contract Auditing


80. EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL


81. Med-MMFL: A Multimodal Federated Learning Benchmark in Healthcare


82. History-Guided Iterative Visual Reasoning with Self-Correction


83. Performative Learning Theory


84. Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts


85. LoRDO: Distributed Low-Rank Optimization with Infrequent Communication


86. Blockchain Federated Learning for Sustainable Retail: Reducing Waste through Collaborative Demand Forecasting


87. Enabling Real-Time Colonoscopic Polyp Segmentation on Commodity CPUs via Ultra-Lightweight Architecture


88. Beyond KL Divergence: Policy Optimization with Flexible Bregman Divergences for LLM Reasoning


89. SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration


90. Counterfactual Explanations for Hypergraph Neural Networks


91. VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image


92. UnMaskFork: Test-Time Scaling for Masked Diffusion via Deterministic Action Branching


93. Explicit Uncertainty Modeling for Active CLIP Adaptation with Dual Prompt Tuning


94. Fine-tuning Pre-trained Vision-Language Models in a Human-Annotation-Free Manner


95. Efficient Equivariant High-Order Crystal Tensor Prediction via Cartesian Local-Environment Many-Body Coupling


96. DeFrame: Debiasing Large Language Models Against Framing Effects


97. Beyond Static Cropping: Layer-Adaptive Visual Localization and Decoding Enhancement


98. Revisiting Prompt Sensitivity in Large Language Models for Text Classification: The Role of Prompt Underspecification


99. ProxyWar: Dynamic Assessment of LLM Code Generation in Game Arenas


100. How Few-shot Demonstrations Affect Prompt-based Defenses Against LLM Jailbreak Attacks


101. Disentangling Causal Importance from Emergent Structure in Multi-Expert Orchestration


102. Contextual Drag: How Errors in the Context Affect LLM Reasoning


103. Multi Objective Design Optimization of Non Pneumatic Passenger Car Tires Using Finite Element Modeling, Machine Learning, and Particle swarm Optimization and Bayesian Optimization Algorithms


104. SkeletonGaussian: Editable 4D Generation through Gaussian Skeletonization


105. Thickening-to-Thinning: Reward Shaping via Human-Inspired Learning Dynamics for LLM Reasoning


106. From Dead Neurons to Deep Approximators: Deep Bernstein Networks as a Provable Alternative to Residual Layers


107. AppleVLM: End-to-end Autonomous Driving with Advanced Perception and Planning-Enhanced Vision-Language Models


108. ACIL: Active Class Incremental Learning for Image Classification


109. RAPO: Risk-Aware Preference Optimization for Generalizable Safe Reasoning


110. OAT: Ordered Action Tokenization


111. Language Models Struggle to Use Representations Learned In-Context


112. SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models



114. From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents


115. Natural Language Instructions for Scene-Responsive Human-in-the-Loop Motion Planning in Autonomous Driving using Vision-Language-Action Models


116. HoloEv-Net: Efficient Event-based Action Recognition via Holographic Spatial Embedding and Global Spectral Gating


117. Topology-Aware Revival for Efficient Sparse Training


118. Improving 2D Diffusion Models for 3D Medical Imaging with Inter-Slice Consistent Stochasticity


119. Pruning for Generalization: A Transfer-Oriented Spatiotemporal Graph Framework


120. MA3DSG: Multi-Agent 3D Scene Graph Generation for Large-Scale Indoor Environments


121. JSynFlow: Japanese Synthesised Flowchart Visual Question Answering Dataset built with Large Language Models


122. KGLAMP: Knowledge Graph-guided Language model for Adaptive Multi-robot Planning and Replanning


123. From Lemmas to Dependencies: What Signals Drive Light Verbs Classification?


124. Scalable Explainability-as-a-Service (XaaS) for Edge AI Systems


125. Toward Effective Multimodal Graph Foundation Model: A Divide-and-Conquer Based Approach


126. Tinker Tales: Supporting Child-AI Collaboration through Co-Creative Storytelling with Educational Scaffolding


127. DMS2F-HAD: A Dual-branch Mamba-based Spatial-Spectral Fusion Network for Hyperspectral Anomaly Detection


128. A computational account of dreaming: learning and memory consolidation


129. Structure-Informed Estimation for Pilot-Limited MIMO Channels via Tensor Decomposition


130. Principles of Lipschitz continuity in neural networks


131. On the Credibility of Evaluating LLMs using Survey Questions


132. PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models


133. Understanding and Guiding Layer Placement in Parameter-Efficient Fine-Tuning of Large Language Models


134. PromptSplit: Revealing Prompt-Level Disagreement in Generative Models


135. Rational ANOVA Networks


136. When Chains of Thought Don’t Matter: Causal Bypass in Large Language Models


137. DeXposure-FM: A Time-series, Graph Foundation Model for Credit Exposures and Stability on Decentralized Financial Networks


138. Transformers perform adaptive partial pooling


139. Fixed Budget is No Harder Than Fixed Confidence in Best-Arm Identification up to Logarithmic Factors


140. Structural shifts in institutional participation and collaboration within the AI arXiv preprint research ecosystem


141. Semantic Rate Distortion and Posterior Design: Compute Constraints, Multimodality, and Strategic Inference


142. Linguistic Blind Spots in Clinical Decision Extraction


143. First-Principles AI finds crystallization of fractional quantum Hall liquids


144. WIND: Weather Inverse Diffusion for Zero-Shot Atmospheric Modeling


145. SpecMD: A Comprehensive Study On Speculative Expert Prefetching


146. Phaedra: Learning High-Fidelity Discrete Tokenization for the Physical Science


147. Entropy-Aware Structural Alignment for Zero-Shot Handwritten Chinese Character Recognition


148. HY3D-Bench: Generation of 3D Assets


149. GeoIB: Geometry-Aware Information Bottleneck via Statistical-Manifold Compression


150. All-Atom GPCR-Ligand Simulation via Residual Isometric Latent Flow


151. Byzantine Machine Learning: MultiKrum and an optimal notion of robustness


152. Vision Transformers for Zero-Shot Clustering of Animal Images: A Comparative Benchmarking Study


153. Audit After Segmentation: Reference-Free Mask Quality Assessment for Language-Referred Audio-Visual Segmentation


154. Sounding Highlights: Dual-Pathway Audio Encoders for Audio-Visual Video Highlight Detection


155. Explainable Computer Vision Framework for Automated Pore Detection and Criticality Assessment in Additive Manufacturing


156. PriorProbe: Recovering Individual-Level Priors for Personalizing Neural Networks in Facial Expression Recognition


157. DiGAN: Diffusion-Guided Attention Network for Early Alzheimer’s Disease Detection


158. TruKAN: Towards More Efficient Kolmogorov-Arnold Networks Using Truncated Power Functions


159. GOPO: Policy Optimization using Ranked Rewards


160. Reversible Deep Learning for 13C NMR in Chemoinformatics: On Structures and Spectra


161. Decoding Ambiguous Emotions with Test-Time Scaling in Audio-Language Models


162. Understanding the Impact of Differentially Private Training on Memorization of Long-Tailed Data


163. Benchmarking Automatic Speech Recognition for Indian Languages in Agricultural Contexts


164. PaperX: A Unified Framework for Multimodal Academic Presentation Generation with Scholar DAG


165. Perceptions of AI-CBT: Trust and Barriers in Chinese Postgrads


166. WebAccessVL: Making an Accessible Web via Violation-Conditioned VLM


167. HybridQuestion: Human-AI Collaboration for Identifying High-Impact Research Questions


168. Merged ChemProt-DrugProt for Relation Extraction from Biomedical Literature