LLM 관련 주요 논문 - 2025-09-11

1. Narrative-Guided Reinforcement Learning: A Platform for Studying Language Model Influence on Decision Making


2. No-Knowledge Alarms for Misaligned LLMs-as-Judges


3. TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making


4. Co-Investigator AI: The Rise of Agentic AI for Smarter, Trustworthy AML Compliance Narratives


5. Exploratory Retrieval-Augmented Planning For Continual Embodied Instruction Following


6. EnvX: Agentize Everything with Agentic AI


7. A Survey of Reinforcement Learning for Large Reasoning Models


8. Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation


9. Scaling Truth: The Confidence Paradox in AI Fact-Checking


10. AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning


11. X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates


12. Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations


13. Memorization in Large Language Models in Medicine: Prevalence, Characteristics, and Implications


14. MESH – Understanding Videos Like Human: Measuring Hallucinations in Large Video Models


15. Agents of Discovery


16. HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants


17. Send to which account? Evaluation of an LLM-based Scambaiting System


18. A Structured Review of Underwater Object Detection Challenges and Solutions: From Traditional to Large Vision Language Models


19. Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics


20. An Iterative LLM Framework for SIBT utilizing RAG-based Adaptive Weight Optimization


21. Efficient Decoding Methods for Language Models on Encrypted Data


22. Low-Resource Fine-Tuning for Multi-Task Structured Information Extraction with a Billion-Parameter Instruction-Tuned Model


23. So let's replace this phrase with insult... Lessons learned from generation of toxic texts with LLMs


24. Toward Subtrait-Level Model Explainability in Automated Writing Evaluation


25. Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism


26. Retrieval-Augmented VLMs for Multimodal Melanoma Diagnosis


27. Accelerating Reinforcement Learning Algorithms Convergence using Pre-trained Large Language Models as Tutors With Advice Reusing


28. Interpretable Physics Reasoning and Performance Taxonomy in Vision-Language Models


29. A Systematic Survey on Large Language Models for Evolutionary Optimization: From Modeling to Solving


30. Strategies for Improving Communication Efficiency in Distributed and Federated Learning: Compression, Local Training, and Personalization


31. Componentization: Decomposing Monolithic LLM Responses into Manipulable Semantic Units


32. XML Prompting as Grammar-Constrained Interaction: Fixed-Point Semantics, Convergence Guarantees, and Human-AI Protocols


33. From Limited Data to Rare-event Prediction: LLM-powered Feature Engineering and Multi-model Learning in Venture Capital


34. LALM-Eval: An Open-Source Toolkit for Holistic Evaluation of Large Audio Language Models



36. MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values


37. Measuring and mitigating overreliance is necessary for building human-compatible AI


38. Bilingual Word Level Language Identification for Omotic Languages


39. ToDMA: Large Model-Driven Token-Domain Multiple Access for Semantic Communications