Key LLM-Related Papers - 2025-07-29

1. A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence


2. GenoMAS: A Multi-Agent Framework for Scientific Discovery via Code-Driven Gene Expression Analysis


3. MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them


4. Core Safety Values for Provably Corrigible Agents


5. On the Limits of Hierarchically Embedded Logic in Classical Neural Networks


6. MMGraphRAG: Bridging Vision and Language with Interpretable Multimodal Knowledge Graphs


7. evalSmarT: An LLM-Based Framework for Evaluating Smart Contract Generated Comments


8. Enhancing Large Multimodal Models with Adaptive Sparsity and KV Cache Compression


9. MeLA: A Metacognitive LLM-Driven Architecture for Automatic Heuristic Design


10. Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition


11. Memorization in Fine-Tuned Large Language Models


12. Security Tensors as a Cross-Modal Bridge: Extending Text-Aligned Safety to Vision in LVLM


13. SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment


14. Your AI, Not Your View: The Bias of LLMs in Investment Analysis


15. Mind the Gap: Conformative Decoding to Improve Output Diversity of Instruction-Tuned Large Language Models


16. Dissecting Persona-Driven Reasoning in Language Models via Activation Patching


17. FRED: Financial Retrieval-Enhanced Detection and Editing of Hallucinations in Language Models


18. FHSTP@EXIST 2025 Benchmark: Sexism Detection with Transparent Speech Concept Bottleneck Models


19. Pareto-Grid-Guided Large Language Models for Fast and High-Quality Heuristics Design in Multi-Objective Combinatorial Optimization


20. MediQAl: A French Medical Question Answering Dataset for Knowledge and Reasoning Evaluation


21. HAMLET-FFD: Hierarchical Adaptive Multi-modal Learning Embeddings Transformation for Face Forgery Detection


22. Music Arena: Live Evaluation for Text-to-Music


23. Aligning Large Language Model Agents with Rational and Moral Preferences: A Supervised Fine-Tuning Approach


24. Text2VLM: Adapting Text-Only Datasets to Evaluate Alignment Training in Visual Language Models


25. MIMII-Agent: Leveraging LLMs with Function Calling for Relative Evaluation of Anomalous Sound Detection


26. Ontology-Enhanced Knowledge Graph Completion using Large Language Models


27. TransPrune: Token Transition Pruning for Efficient Large Vision-Language Model


28. Beyond Interactions: Node-Level Graph Generation for Knowledge-Free Augmentation in Recommender Systems


29. Enhancing Hallucination Detection via Future Context


30. T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation


31. Kimi K2: Open Agentic Intelligence


32. Enhancing Spatial Reasoning through Visual and Textual Thinking


33. The Xeno Sutra: Can Meaning and Value be Ascribed to an AI-Generated “Sacred” Text?


34. AQUA: A Large Language Model for Aquaculture & Fisheries


35. LLMs-guided adaptive compensator: Bringing Adaptivity to Automatic Control Systems with Large Language Models


36. Speaking in Words, Thinking in Logic: A Dual-Process Framework in QA Systems