LLM 관련 주요 논문 - 2026-01-05

1. A Vision-and-Knowledge Enhanced Large Language Model for Generalizable Pedestrian Crossing Behavior Inference


2. DA-DPO: Cost-efficient Difficulty-aware Preference Optimization for Reducing MLLM Hallucinations


3. Bio-inspired Agentic Self-healing Framework for Resilient Distributed Computing Continuum Systems


4. Will LLM-powered Agents Bias Against Humans? Exploring the Belief-Dependent Vulnerability


5. FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems


6. An AI Monkey Gets Grapes for Sure – Sphere Neural Networks for Reliable Decision-Making


7. Explicit Abstention Knobs for Predictable Reliability in Video Question Answering


8. Constructing a Neuro-Symbolic Mathematician from First Principles


9. Ask, Clarify, Optimize: Human-LLM Agent Collaboration for Smarter Inventory Control


10. Mortar: Evolving Mechanics for Automatic Game Design


11. The Agentic Leash: Extracting Causal Feedback Fuzzy Cognitive Maps with LLMs


12. Finetuning Large Language Models for Automated Depression Screening in Nigerian Pidgin English: GENSCORE Pilot Study


13. Reasoning in Action: MCTS-Driven Knowledge Retrieval for Large Language Models


14. Geometry of Reason: Spectral Signatures of Valid Mathematical Reasoning


15. LLM Agents for Combinatorial Efficient Frontiers: Investment Portfolio Optimization


16. Exploring the Performance of Large Language Models on Subjective Span Identification Tasks


17. Detecting Performance Degradation under Data Shift in Pathology Vision-Language Model


18. QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models


19. Fast-weight Product Key Memory


20. HFedMoE: Resource-aware Heterogeneous Federated Learning with Mixture-of-Experts


21. Improving Scientific Document Retrieval with Academic Concept Index


22. Cracking IoT Security: Can LLMs Outsmart Static Analysis Tools?


23. ECR: Manifold-Guided Semantic Cues for Compact Language Models


24. Trajectory Guard – A Lightweight, Sequence-Aware Model for Real-Time Anomaly Detection in Agentic AI


25. MotionPhysics: Learnable Motion Distillation for Text-Guided Simulation


26. Multi-Agent Coordinated Rename Refactoring


27. MAESTRO: Multi-Agent Evaluation Suite for Testing, Reliability, and Observability


28. Defensive M2S: Training Guardrail Models on Compressed Multi-turn Conversations


29. Language as Mathematical Structure: Examining Semantic Field Theory Against Language Games


30. Do LLMs Judge Distantly Supervised Named Entity Labels Well? Constructing the JudgeWEL Dataset


31. In Line with Context: Repository-Level Code Generation via Context Inlining


32. Robust Uncertainty Quantification for Factual Generation of Large Language Models


33. Can Large Language Models Still Explain Themselves? Investigating the Impact of Quantization on Self-Explanations


34. FaithSCAN: Model-Driven Single-Pass Hallucination Detection for Faithful Visual Question Answering


35. Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity


36. Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation


37. An Empirical Evaluation of LLM-Based Approaches for Code Vulnerability Detection: RAG, SFT, and Dual-Agent Systems


38. JP-TL-Bench: Anchored Pairwise LLM Evaluation for Bidirectional Japanese-English Translation


39. FCMBench: A Comprehensive Financial Credit Multimodal Benchmark for Real-world Applications


40. Large Empirical Case Study: Go-Explore adapted for AI Red Team Testing