Key LLM-Related Papers - 2026-02-09

1. AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents


2. From Features to Actions: Explainability in Traditional and Agentic AI Systems


3. LLM Active Alignment: A Nash Equilibrium Perspective


4. POP: Online Structural Pruning Enables Efficient Inference of Large Foundation Models


5. Wild Guesses and Mild Guesses in Active Concept Learning


6. Same Answer, Different Representations: Hidden instability in VLMs


7. SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees


8. AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research


9. LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models


10. HyPER: Bridging Exploration and Exploitation for Scalable LLM Reasoning with Hypothesis Path Expansion and Reduction


11. JADE: Expert-Grounded Dynamic Evaluation for Open-Ended Professional Tasks


12. AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents


13. Intrinsic Stability Limits of Autoregressive Reasoning: Structural Consequences for Long-Horizon Execution


14. Trifuse: Enhancing Attention-Based GUI Grounding via Multimodal Fusion


15. Do LLMs Act Like Rational Agents? Measuring Belief Coherence in Probabilistic Decision Making


16. Large Language Model Reasoning Failures


17. Jackpot: Optimal Budgeted Rejection Sampling for Extreme Actor-Policy Mismatch Reinforcement Learning


18. Learning a Generative Meta-Model of LLM Activations


19. Endogenous Resistance to Activation Steering in Language Models


20. Halluverse-M^3: A multitask multilingual benchmark for hallucination in LLMs


21. TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering


22. TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code


23. The Quantum Sieve Tracer: A Hybrid Framework for Layer-Wise Activation Tracing in Large Language Models


24. The Representational Geometry of Number


25. Bridging 6G IoT and AI: LLM-Based Efficient Approach for Physical Layer’s Optimization Tasks


26. On the Identifiability of Steering Vectors in Large Language Models


27. Generating Data-Driven Reasoning Rubrics for Domain-Adaptive Reward Modeling


28. Next-generation cyberattack detection with large language models: anomaly analysis across heterogeneous logs


29. A Unified Framework for LLM Watermarks


30. GhostCite: A Large-Scale Analysis of Citation Validity in the Age of Large Language Models


31. compar:IA: The French Government’s LLM arena to collect French-language human prompts and preference data


32. Not All Layers Need Tuning: Selective Layer Restoration Recovers Diversity


33. Scaling Speech Tokenizers with Diffusion Autoencoders


34. Personality as Relational Infrastructure: User Perceptions of Personality-Trait-Infused LLM Messaging


35. AgentStepper: Interactive Debugging of Software Development Agents


36. SPARC: Separating Perception And Reasoning Circuits for Test-time Scaling of VLMs


37. Malicious Agent Skills in the Wild: A Large-Scale Security Empirical Study


38. Completing Missing Annotation: Multi-Agent Debate for Accurate and Scalable Relevant Assessment for IR Benchmarks


39. Revisiting the Shape Convention of Transformer Language Models


40. Improve Large Language Model Systems with User Logs


41. Principle-Evolvable Scientific Discovery via Uncertainty Minimization


42. CORE: Comprehensive Ontological Relation Evaluation for Large Language Models


43. TrajAD: Trajectory Anomaly Detection for Trustworthy LLM Agents


44. TrailBlazer: History-Guided Reinforcement Learning for Black-Box LLM Jailbreaking


45. A methodology for analyzing financial needs hierarchy from social discussions using LLM


46. Revisiting Salient Object Detection from an Observer-Centric Perspective


47. Training Data Selection with Gradient Orthogonality for Efficient Domain Adaptation


48. SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass


49. Can Post-Training Transform LLMs into Causal Reasoners?


50. The Condensate Theorem: Transformers are O(n), Not $O(n^2)$


51. Can One-sided Arguments Lead to Response Change in Large Language Models?


52. GRP-Obliteration: Unaligning LLMs With a Single Unlabeled Prompt


53. Steering Safely or Off a Cliff? Rethinking Specificity and Robustness in Inference-Time Interventions


54. RuleSmith: Multi-Agent LLMs for Automated Game Balancing


55. Personagram: Bridging Personas and Product Design for Creative Ideation with Multimodal LLMs


56. Generics in science communication: Misaligned interpretations across laypeople, scientists, and large language models


57. Stop the Flip-Flop: Context-Preserving Verification for Fast Revocable Diffusion Decoding


58. Protean Compiler: An Agile Framework to Drive Fine-grain Phase Ordering


59. SVRepair: Structured Visual Reasoning for Automated Program Repair


60. Communication Enhances LLMs’ Stability in Strategic Thinking


61. Allocate Marginal Reviews to Borderline Papers Using LLM Comparative Ranking


62. Analyzing Diffusion and Autoregressive Vision Language Models in Multimodal Embedding Space


63. Recontextualizing Famous Quotes for Brand Slogan Generation


64. EUGens: Efficient, Unified, and General Dense Layers