Major LLM-Related Papers - 2025-08-21

1. MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers


2. Entropy-Constrained Strategy Optimization in Urban Floods: A Multi-Agent Framework with LLM and Knowledge Graph Integration


3. LeanGeo: Formalizing Competitional Geometry problems in Lean


4. Who Sees What? Structured Thought-Action Sequences for Epistemic Reasoning in LLMs


5. Automated Optimization Modeling through Expert-Guided Large Language Model Reasoning


6. Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs


7. Long Chain-of-Thought Reasoning Across Languages


8. Evaluating Retrieval-Augmented Generation vs. Long-Context Input for Clinical Reasoning over EHRs


9. TransLLM: A Unified Multi-Task Foundation Framework for Urban Transportation via Learnable Prompting


10. PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning


11. Reliable generation of isomorphic physics problems using ChatGPT with prompt-chaining and tool use


12. Cross-Modality Controlled Molecule Generation with Diffusion Language Model


13. Evaluating Multilingual and Code-Switched Alignment in LLMs via Synthetic Natural Language Inference


14. Transplant Then Regenerate: A New Paradigm for Text Data Augmentation


15. ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine


16. ELATE: Evolutionary Language model for Automated Time-series Engineering


17. Can LLM Agents Solve Collaborative Tasks? A Study on Urgency-Aware Planning and Coordination


18. Towards LLM-generated explanations for Component-based Knowledge Graph Question Answering Systems


19. Adaptively Robust LLM Inference Optimization under Prediction Uncertainty


20. Post-hoc LLM-Supported Debugging of Distributed Processes


21. In2x at WMT25 Translation Task


22. NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model


23. Cognitive Surgery: The Awakening of Implicit Territorial Awareness in LLMs


24. DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement


25. Credence Calibration Game? Calibrating Large Language Models through Structured Play


26. ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students’ Cognitive Abilities


27. Organ-Agents: Virtual Human Physiology Simulator via LLMs