Key LLM-Related Papers - 2026-02-05

1. Fluid Representations in Reasoning Models


2. Agentic AI in Healthcare & Medicine: A Seven-Dimensional Taxonomy for Empirical Evaluation of LLM-based Agents


3. WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning


4. ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control


5. From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents


6. Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning



8. Steering LLMs via Scalable Interactive Oversight


9. Interfaze: The Future of AI is built on Task-Specific Small Models


10. Scaling In-Context Online Learning Capability of LLMs via Cross-Episode Meta-RL


11. When AI Persuades: Adversarial Explanation Attacks on Human Trust in AI-Assisted Decision Making


12. Adaptive Test-Time Compute Allocation via Learned Heuristics over Categorical Structure


13. Active Epistemic Control for Query-Efficient Verified Planning


14. AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent


15. Enhancing Mathematical Problem Solving in LLMs through Execution-Driven Reasoning Augmentation


16. Knowledge Model Prompting Increases LLM Performance on Planning Tasks


17. Rethinking the Trust Region in LLM Reinforcement Learning


18. Subliminal Effects in Your Data: A General Mechanism via Log-Linearity


19. El Agente Estructural: An Artificially Intelligent Molecular Editor


20. Team, Then Trim: An Assembly-Line LLM Framework for High-Quality Tabular Data Generation


21. When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?


22. Exploiting contextual information to improve stance detection in informal political discourse with LLMs


23. Alignment Drift in Multimodal LLMs: A Two-Phase, Longitudinal Evaluation of Harm Across Eight Model Releases


24. From Data to Behavior: Predicting Unintended Model Behaviors Before Training


25. Supporting software engineering tasks with agentic AI: Demonstration on document retrieval and test scenario generation


26. Identifying Intervenable and Interpretable Features via Orthogonality Regularization


27. Adaptive Prompt Elicitation for Text-to-Image Generation


28. SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation


29. Addressing Corpus Knowledge Poisoning Attacks on RAG Using Sparse Attention


30. Overstating Attitudes, Ignoring Networks: LLM Biases in Simulating Misinformation Susceptibility


31. Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design


32. VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration


33. Trust The Typical


34. LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding


35. Discovering Mechanistic Models of Neural Activity: System Identification in an in Silico Zebrafish


36. LLM-Empowered Cooperative Content Caching in Vehicular Fog Caching-Assisted Platoon Networks


37. Is Micro Domain-Adaptive Pre-Training Effective for Real-World Operations? Multi-Step Evaluation Reveals Potential and Bottlenecks


38. Growth First, Care Second? Tracing the Landscape of LLM Value Preferences in Everyday Dilemmas


39. RASA: Routing-Aware Safety Alignment for Mixture-of-Experts Models


40. Mixture of Masters: Sparse Chess Language Models with Player Routing


41. EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL


42. History-Guided Iterative Visual Reasoning with Self-Correction


43. Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts


44. Beyond KL Divergence: Policy Optimization with Flexible Bregman Divergences for LLM Reasoning


45. UnMaskFork: Test-Time Scaling for Masked Diffusion via Deterministic Action Branching


46. Explicit Uncertainty Modeling for Active CLIP Adaptation with Dual Prompt Tuning


47. Fine-tuning Pre-trained Vision-Language Models in a Human-Annotation-Free Manner


48. DeFrame: Debiasing Large Language Models Against Framing Effects


49. Beyond Static Cropping: Layer-Adaptive Visual Localization and Decoding Enhancement


50. Revisiting Prompt Sensitivity in Large Language Models for Text Classification: The Role of Prompt Underspecification


51. ProxyWar: Dynamic Assessment of LLM Code Generation in Game Arenas


52. How Few-shot Demonstrations Affect Prompt-based Defenses Against LLM Jailbreak Attacks


53. Disentangling Causal Importance from Emergent Structure in Multi-Expert Orchestration


54. Contextual Drag: How Errors in the Context Affect LLM Reasoning


55. Thickening-to-Thinning: Reward Shaping via Human-Inspired Learning Dynamics for LLM Reasoning


56. AppleVLM: End-to-end Autonomous Driving with Advanced Perception and Planning-Enhanced Vision-Language Models


57. RAPO: Risk-Aware Preference Optimization for Generalizable Safe Reasoning


58. Language Models Struggle to Use Representations Learned In-Context



60. From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents


61. JSynFlow: Japanese Synthesised Flowchart Visual Question Answering Dataset built with Large Language Models


62. KGLAMP: Knowledge Graph-guided Language model for Adaptive Multi-robot Planning and Replanning


63. On the Credibility of Evaluating LLMs using Survey Questions


64. Understanding and Guiding Layer Placement in Parameter-Efficient Fine-Tuning of Large Language Models


65. When Chains of Thought Don’t Matter: Causal Bypass in Large Language Models


66. Transformers perform adaptive partial pooling


67. Structural shifts in institutional participation and collaboration within the AI arXiv preprint research ecosystem


68. Semantic Rate Distortion and Posterior Design: Compute Constraints, Multimodality, and Strategic Inference


69. Audit After Segmentation: Reference-Free Mask Quality Assessment for Language-Referred Audio-Visual Segmentation


70. GOPO: Policy Optimization using Ranked Rewards


71. Decoding Ambiguous Emotions with Test-Time Scaling in Audio-Language Models


72. WebAccessVL: Making an Accessible Web via Violation-Conditioned VLM


73. HybridQuestion: Human-AI Collaboration for Identifying High-Impact Research Questions