Key LLM-Related Papers - 2026-02-26

1. Language Models Exhibit Inconsistent Biases Towards Algorithmic Agents and Human Experts


2. Semantic Partial Grounding via LLMs


3. ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence on Mobile Devices


4. Distill and Align Decomposition for Enhanced Claim Verification


5. Prompt Architecture Determines Reasoning Quality: A Variable Isolation Study on the Car Wash Problem


6. ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning


7. Beyond Refusal: Probing the Limits of Agentic Self-Correction for Semantic Sensitive Information


8. A Hierarchical Multi-Agent System for Autonomous Discovery in Geoscientific Data Archives


9. Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets


10. Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual


11. When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models


12. NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors


13. SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents


14. Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models


15. Enhancing LLM-Based Test Generation by Eliminating Covered Code


16. Hidden Topics: Measuring Sensitive AI Beliefs with List Experiments


17. DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs


18. An Evaluation of Context Length Extrapolation in Long Code via Positional Embeddings and Efficient Attention


19. Beyond Static Artifacts: A Forensic Benchmark for Video Deepfake Reasoning in Vision Language Models


20. Generalisation of RLHF under Reward Shift and Clipped KL Regularisation


21. Two-Stage Active Distribution Network Voltage Control via LLM-RL Collaboration: A Hybrid Knowledge-Data-Driven Approach


22. SurGo-R1: Benchmarking and Modeling Contextual Reasoning for Operative Zone in Surgical Video


23. Dynamic Multimodal Activation Steering for Hallucination Mitigation in Large Vision-Language Models


24. Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning


25. Sparsity Induction for Accurate Post-Training Pruning of Large Language Models


26. PPCR-IM: A System for Multi-layer DAG-based Public Policy Consequence Reasoning and Social Indicator Mapping


27. Virtual Biopsy for Intracranial Tumors Diagnosis on MRI


28. Structurally Aligned Subtask-Level Memory for Software Engineering Agents


29. Revisiting RAG Retrievers: An Information Theoretic Benchmark


30. One Brain, Omni Modalities: Towards Unified Non-Invasive Brain Decoding with Large Language Models


31. Training Generalizable Collaborative Agents via Strategic Risk Aversion


32. GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning


33. Revisiting Text Ranking in Deep Research


34. Adversarial Intent is a Latent Variable: Stateful Trust Inference for Securing Multimodal Agentic RAG


35. MINAR: Mechanistic Interpretability for Neural Algorithmic Reasoning


36. Causal Decoding for Hallucination-Resistant Multimodal Large Language Models


37. Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning


38. Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages


39. Alignment-Weighted DPO: A principled reasoning approach to improve safety alignment


40. Equitable Evaluation via Elicitation


41. Group Orthogonalized Policy Optimization: Group Policy Optimization as Orthogonal Projection in Hilbert Space


42. A General Equilibrium Theory of Orchestrated AI Agent Systems


43. AgenticTyper: Automated Typing of Legacy Software Projects Using Agentic AI


44. ImpRIF: Stronger Implicit Reasoning Leads to Better Complex Instruction Following


45. Budget-Aware Agentic Routing via Boundary-Guided Training


46. Make Every Draft Count: Hidden State based Speculative Decoding


47. Measuring Pragmatic Influence in Large Language Model Instructions


48. Task-Aware LoRA Adapter Composition via Similarity Retrieval in Vector Databases


49. Latent Context Compilation: Distilling Long Context into Compact Portable Memory


50. Reasoning-Based Personalized Generation for Users with Sparse Data


51. EPSVec: Efficient and Private Synthetic Data Generation via Dataset Vectors


52. EQ-5D Classification Using Biomedical Entity-Enriched Pre-trained Language Models and Multiple Instance Learning


53. Inference-time Alignment via Sparse Junction Steering