Key LLM Papers - 2025-08-20

1. ChronoLLM: Customizing Language Models for Physics-Based Simulation Code Generation


2. The Collaboration Paradox: Why Generative AI Requires Both Strategic Intelligence and Operational Stability in Supply Chain Management


3. Structured Agentic Workflows for Financial Time-Series Modeling with LLMs and Reflective Feedback


4. Improved Generalized Planning with LLMs through Strategy Refinement and Reflection


5. Expertise-aware Multi-LLM Recruitment and Collaboration for Medical Decision-Making


6. CausalPlan: Empowering Efficient LLM Multi-Agent Collaboration Through Causality-Driven Planning


7. Neuro-Symbolic Artificial Intelligence: Towards Improving the Reasoning Abilities of Large Language Models


8. MHSNet: An MoE-based Hierarchical Semantic Representation Network for Accurate Duplicate Resume Detection with Large Language Model


9. Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation


10. Toward Better EHR Reasoning in LLMs: Reinforcement Learning with Expert Attention Guidance


11. LM Agents May Fail to Act on Their Own Risk Knowledge


12. Discrete Optimization of Min-Max Violation and its Applications Across Computational Sciences


13. Unintended Misalignment from Agentic Fine-Tuning: Risks and Mitigation


14. Ask Good Questions for Large Language Models


15. Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation


16. Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization


17. RotBench: Evaluating Multimodal Large Language Models on Identifying Image Rotation


18. Learning to Use AI for Learning: How Can We Effectively Teach and Measure Prompting Literacy for K-12 Students?


19. Prompt Orchestration Markup Language


20. InPars+: Supercharging Synthetic Data Generation for Information Retrieval Systems


21. Extracting Structured Requirements from Unstructured Building Technical Specifications for Building Information Modeling


22. The illusion of a perfect metric: Why evaluating AI’s words is harder than it looks


23. Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs


24. BetaWeb: Towards a Blockchain-enabled Trustworthy Agentic Web


25. Agentic DraCor and the Art of Docstring Engineering: Evaluating MCP-empowered LLM Usage of the DraCor API


26. COMPASS: A Multi-Dimensional Benchmark for Evaluating Code Generation in Large Language Models


27. Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration


28. Mitigating Cross-Image Information Leakage in LVLMs for Multi-Image Tasks


29. Prediction is not Explanation: Revisiting the Explanatory Capacity of Mapping Embeddings


30. Generics and Default Reasoning in Large Language Models


31. Input Time Scaling


32. Who Gets the Mic? Investigating Gender Bias in the Speaker Assignment of a Speech-LLM


33. A Comparative Study of Decoding Strategies in Medical Text Generation


34. Evaluating Open-Source Vision Language Models for Facial Emotion Recognition against Traditional Deep Learning Models


35. ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs


36. LLM-Enhanced Linear Autoencoders for Recommendation


37. STER-VLM: Spatio-Temporal With Enhanced Reference Vision-Language Models


38. Structured Prompting and Multi-Agent Knowledge Distillation for Traffic Video Interpretation and Risk Inference


39. Mitigating Easy Option Bias in Multiple-Choice Question Answering


40. ALIGN: Word Association Learning for Cross-Cultural Generalization in Large Language Models


41. AdaptJobRec: Enhancing Conversational Career Recommendation through an LLM-Powered Agentic System