LLM 관련 주요 논문 - 2026-01-21

1. Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning


2. Exploring LLM Features in Predictive Process Monitoring for Small-Scale Event-Logs


3. AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems


4. XChoice: Explainable Evaluation of AI-Human Alignment in LLM-based Constrained Choice Decision Making


5. Beyond Model Scaling: Test-Time Intervention for Efficient Deep Reasoning


6. TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech


7. Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems


8. ReCreate: Reasoning and Creating Domain Agents Driven by Experience


9. AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts


10. AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing


11. Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration


12. CTHA: Constrained Temporal Hierarchical Architecture for Stable Multi-Agent LLM Systems


13. ORBITFLOW: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration


14. Building AI Agents to Improve Job Referral Requests to Strangers


15. Do You Trust Me? Cognitive-Affective Signatures of Trustworthiness in Large Language Models


16. Building Production-Ready Probes For Gemini


17. MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention across Vision-Language Models


18. Hierarchical Orthogonal Residual Spread for Precise Massive Editing in Large Language Models


19. Relational Linearity is a Predictor of Hallucinations


20. Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences


21. Institutional AI: Governing LLM Collusion in Multi-Agent Cournot Markets via Public Governance Graphs


22. Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding


23. How Much Would a Clinician Edit This Draft? Evaluating LLM Alignment for Patient Message Response Drafting


24. X-Distill: Cross-Architecture Vision Distillation for Visuomotor Learning


25. Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation


26. FactCorrector: A Graph-Inspired Approach to Long-Form Factuality Correction of Large Language Models


27. SDFLoRA: Selective Dual-Module LoRA for Federated Fine-tuning with Heterogeneous Clients


28. FAQ: Mitigating Quantization Error via Regenerating Calibration Data with Family-Aware Quantization


29. SD-RAG: A Prompt-Injection-Resilient Framework for Selective Disclosure in Retrieval-Augmented Generation


30. Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration


31. Learn Before Represent: Bridging Generative and Contrastive Learning for Domain-Specific LLM Embeddings


32. ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development


33. H-AIM: Orchestrating LLMs, PDDL, and Behavior Trees for Hierarchical Multi-Robot Planning


34. Predicting Biased Human Decision-Making with Large Language Models in Conversational Settings


35. Spectral Characterization and Mitigation of Sequential Knowledge Editing Collapse


36. Combating Spurious Correlations in Graph Interpretability via Self-Reflection


37. Finding the Translation Switch: Discovering and Exploiting the Task-Initiation Features in LLMs


38. When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs


39. Steering Language Models Before They Speak: Logit-Level Interventions


40. Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents


41. Multi-Stage Patient Role-Playing Framework for Realistic Clinical Interactions


42. PatientVLM Meets DocVLM: Pre-Consultation Dialogue Between Vision-Language Models for Efficient Diagnosis


43. Selecting Language Models for Social Science: Start Small, Start Open, and Validate


44. Can Vision-Language Models Understand Construction Workers? An Exploratory Study


45. Towards Reliable ML Feature Engineering via Planning in Constrained-Topology of LLM Agents


46. Digital Metabolism: Decoupling Logic from Facts via Regenerative Unlearning – Towards a Pure Neural Logic Core


47. LogicLens: Leveraging Semantic Code Graph to explore Multi Repository large systems


48. Unifying Speech Recognition, Synthesis and Conversion with Autoregressive Transformers


49. DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion


50. EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting