LLM 관련 주요 논문 - 2026-01-19

1. Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning


2. Exploring LLM Features in Predictive Process Monitoring for Small-Scale Event-Logs


3. AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems


4. XChoice: Explainable Evaluation of AI-Human Alignment in LLM-based Constrained Choice Decision Making


5. Beyond Model Scaling: Test-Time Intervention for Efficient Deep Reasoning


6. TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech


7. Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems


8. ReCreate: Reasoning and Creating Domain Agents Driven by Experience


9. AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts


10. AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing


11. Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration


12. CTHA: Constrained Temporal Hierarchical Architecture for Stable Multi-Agent LLM Systems


13. ORBITFLOW: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration


14. Building AI Agents to Improve Job Referral Requests to Strangers


15. Do You Trust Me? Cognitive-Affective Signatures of Trustworthiness in Large Language Models


16. Building Production-Ready Probes For Gemini


17. MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention across Vision-Language Models


18. Hierarchical Orthogonal Residual Spread for Precise Massive Editing in Large Language Models


19. Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences


20. Institutional AI: Governing LLM Collusion in Multi-Agent Cournot Markets via Public Governance Graphs


21. Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding


22. How Much Would a Clinician Edit This Draft? Evaluating LLM Alignment for Patient Message Response Drafting


23. X-Distill: Cross-Architecture Vision Distillation for Visuomotor Learning


24. Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation


25. FactCorrector: A Graph-Inspired Approach to Long-Form Factuality Correction of Large Language Models


26. SDFLoRA: Selective Dual-Module LoRA for Federated Fine-tuning with Heterogeneous Clients


27. FAQ: Mitigating Quantization Error via Regenerating Calibration Data with Family-Aware Quantization


28. SD-RAG: A Prompt-Injection-Resilient Framework for Selective Disclosure in Retrieval-Augmented Generation


29. Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration


30. Learn Before Represent: Bridging Generative and Contrastive Learning for Domain-Specific LLM Embeddings


31. ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development


32. H-AIM: Orchestrating LLMs, PDDL, and Behavior Trees for Hierarchical Multi-Robot Planning


33. Predicting Biased Human Decision-Making with Large Language Models in Conversational Settings


34. Spectral Characterization and Mitigation of Sequential Knowledge Editing Collapse


35. Combating Spurious Correlations in Graph Interpretability via Self-Reflection


36. Finding the Translation Switch: Discovering and Exploiting the Task-Initiation Features in LLMs


37. When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs


38. Steering Language Models Before They Speak: Logit-Level Interventions


39. Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents


40. Multi-Stage Patient Role-Playing Framework for Realistic Clinical Interactions


41. PatientVLM Meets DocVLM: Pre-Consultation Dialogue Between Vision-Language Models for Efficient Diagnosis


42. Selecting Language Models for Social Science: Start Small, Start Open, and Validate


43. Can Vision-Language Models Understand Construction Workers? An Exploratory Study


44. Towards Reliable ML Feature Engineering via Planning in Constrained-Topology of LLM Agents


45. Digital Metabolism: Decoupling Logic from Facts via Regenerative Unlearning – Towards a Pure Neural Logic Core


46. LogicLens: Leveraging Semantic Code Graph to explore Multi Repository large systems


47. Unifying Speech Recognition, Synthesis and Conversion with Autoregressive Transformers


48. DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion


49. EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting