LLM 관련 주요 논문 - 2025-09-22

1. Structured Information for Improving Spatial Relationships in Text-to-Image Generation


2. EHR-MCP: Real-world Evaluation of Clinical Information Retrieval by Large Language Models via Model Context Protocol


3. CCrepairBench: A High-Fidelity Benchmark and Reinforcement Learning Framework for C++ Compilation Repair


4. MicroRCA-Agent: Microservice Root Cause Analysis Method Based on Large Language Model Agents


5. Diagnostics of cognitive failures in multi-agent expert systems using dynamic evaluation protocols and subsequent mutation of the processing context


6. Knowledge-Driven Hallucination in Large Language Models: An Empirical Study on Process Modeling


7. RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation


8. CultureScope: A Dimensional Lens for Probing Cultural Understanding in LLMs


9. Robust Vision-Language Models via Tensor Decomposition: A Defense Against Adversarial Attacks


10. DiffusionNFT: Online Diffusion Reinforcement with Forward Process


11. Beyond Pointwise Scores: Decomposed Criteria-Based Evaluation of LLM Responses


12. See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model


13. Compose by Focus: Scene Graph-based Atomic Skills


14. Think, Verbalize, then Speak: Bridging Complex Thoughts and Comprehensible Speech


15. BEFT: Bias-Efficient Fine-Tuning of Language Models


16. The Alignment Bottleneck



18. Foundation Models as World Models: A Foundational Study in Text-Based GridWorlds


19. Re-FRAME the Meeting Summarization SCOPE: Fact-Based Summarization and Personalization via Questions


20. Distribution-Aligned Decoding for Efficient LLM Task Adaptation


21. Best-of-L: Cross-Lingual Reward Modeling for Mathematical Reasoning


22. CIDER: A Causal Cure for Brand-Obsessed Text-to-Image Models


23. Monte Carlo Tree Diffusion with Multiple Experts for Protein Design


24. On Optimal Steering to Achieve Exact Fairness


25. Once Upon a Time: Interactive Learning for Storytelling with Small Language Models


26. KITE: Kernelized and Information Theoretic Exemplars for In-Context Learning


27. SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models


28. Information Geometry of Variational Bayes


29. DivLogicEval: A Framework for Benchmarking Logical Reasoning Evaluation in Large Language Models


30. LiteLong: Resource-Efficient Long-Context Data Synthesis for LLMs



32. Reward Hacking Mitigation using Verifiable Composite Rewards


33. Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining


34. How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages


35. The (Short-Term) Effects of Large Language Models on Unemployment and Earnings


36. SmolRGPT: Efficient Spatial Reasoning for Warehouse Environments with 600M Parameters


37. Comparing Computational Pathology Foundation Models using Representational Similarity Analysis


38. PILOT: Steering Synthetic Data Generation with Psychological & Linguistic Output Targeting


39. ORCA: Agentic Reasoning For Hallucination and Adversarial Robustness in Vision-Language Models


40. Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing


41. Collective Voice: Recovered-Peer Support Mediated by An LLM-Based Chatbot for Eating Disorder Recovery


42. Evaluating the Limitations of Local LLMs in Solving Complex Programming Challenges


43. Modeling Transformers as complex networks to analyze learning dynamics


44. Emotion-Aware Speech Generation with Character-Specific Voices for Comics


45. Walk and Read Less: Improving the Efficiency of Vision-and-Language Navigation via Tuning-Free Multimodal Token Pruning


46. Causal Reasoning Elicits Controllable 3D Scene Generation


47. Synthetic bootstrapped pretraining