LLM 관련 주요 논문 - 2025-12-05

1. Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol


2. A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA)


3. RoCo: Role-Based LLMs Collaboration for Automatic Heuristic Design


4. DeepRule: An Integrated Framework for Automated Business Rule Generation via Deep Predictive Modeling and Hybrid Search Optimization


5. EnCompass: Enhancing Agent Programming with Search Over Program Execution Paths


6. Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia


7. When Do Symbolic Solvers Enhance Reasoning in Large Language Models?


8. SkillFactory: Self-Distillation For Learning Cognitive Behaviors


9. MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking


10. Jina-VLM: Small Multilingual Vision Language Model


11. Large Language Models for Limited Noisy Data: A Gravitational Wave Identification Study


12. DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation


13. Sponsored Questions and How to Auction Them


14. BERnaT: Basque Encoders for Representing Natural Textual Diversity


15. DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training


16. AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition


17. In-Context Representation Hijacking


18. Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective


19. Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMs


20. AlignCheck: a Semantic Open-Domain Metric for Factual Consistency Assessment


21. The promising potential of vision language models for the generation of textual weather forecasts


22. SELF: A Robust Singular Value and Eigenvalue Approach for LLM Fingerprinting


23. KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing


24. State Space Models for Bioacoustics: A comparative Evaluation with Transformers


25. Dynamic Content Moderation in Livestreams: Combining Supervised Classification with MLLM-Boosted Similarity Matching


26. V-ITI: Mitigating Hallucinations in Multimodal Large Language Models via Visual Inference-Time Intervention


27. AsymPuzl: An Asymmetric Puzzle for multi-agent cooperation


28. Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models


29. Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles


30. BookRAG: A Hierarchical Structure-aware Index-based Approach for Retrieval-Augmented Generation on Complex Documents


31. UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs


32. Idea-Gated Transformers: Enforcing Semantic Coherence via Differentiable Vocabulary Pruning


33. Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs


34. Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases


35. InvertiTune: High-Quality Data Synthesis for Cost-Effective Single-Shot Text-to-Knowledge Graph Generation


36. Plantain: Plan-Answer Interleaved Reasoning


37. Lost in Modality: Evaluating the Effectiveness of Text-Based Membership Inference Attacks on Large Multimodal Models


38. E-valuator: Reliable Agent Verifiers with Sequential Hypothesis Testing


39. ALARM: Automated MLLM-Based Anomaly Detection in Complex-EnviRonment Monitoring with Uncertainty Quantification


40. Ensemble Privacy Defense for Knowledge-Intensive LLMs against Membership Inference Attacks


41. When Harmful Content Gets Camouflaged: Unveiling Perception Failure of LVLMs with CamHarmTI


42. Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation


43. Alleviating Choice Supportive Bias in LLM with Reasoning Dependency Generation


44. AtomDisc: An Atom-level Tokenizer that Boosts Molecular LLMs and Reveals Structure–Property Associations


45. Echoes of AI Harms: A Human-LLM Synergistic Framework for Bias-Driven Harm Anticipation


46. A note on the impossibility of conditional PAC-efficient reasoning in large language models


47. Mitigating hallucinations and omissions in LLMs for invertible problems: An application to hardware logic design automation


48. Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models