LLM 관련 주요 논문 - 2025-09-08

1. LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation


2. Internet 3.0: Architecture for a Web-of-Agents with it’s Algorithm for Ranking Agents


3. SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing


4. OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration


5. Cloning a Conversational Voice AI Agent from Call\,Recording Datasets for Telesales


6. Collaboration and Conflict between Humans and Language Models through the Lens of Game Theory


7. TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models


8. What-If Analysis of Large Language Models: Explore the Game World Using Proactive Thinking


9. Language-Driven Hierarchical Task Structures as Explicit World Models for Multi-Agent Learning


10. Towards Personalized Explanations for Health Simulations: A Mixed-Methods Framework for Stakeholder-Centric Summarization


11. Maestro: Joint Graph & Config Optimization for Reliable AI Agents


12. The Ethical Compass of the Machine: Evaluating Large Language Models for Decision Support in Construction Project Management


13. Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining


14. SpikingBrain Technical Report: Spiking Brain-inspired Large Models


15. Scaling Performance of Large Language Model Pretraining


16. CURE: Controlled Unlearning for Robust Embeddings – Mitigating Conceptual Shortcuts in Pre-Trained Language Models


17. HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models


18. AI Agents for Web Testing: A Case Study in the Wild


19. GenAI-based test case generation and execution in SDV platform


20. LLM Enabled Multi-Agent System for 6G Networks: Framework and Method of Dual-Loop Edge-Terminal Collaboration


21. Artificial intelligence for representing and characterizing quantum systems


22. PLaMo 2 Technical Report


23. Enhancing Diversity in Large Language Models via Determinantal Point Processes


24. The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models


25. Decoders Laugh as Loud as Encoders


26. FloodVision: Urban Flood Depth Estimation Using Foundation Vision-Language Models and Domain Knowledge Graph


27. MCANet: A Multi-Scale Class-Specific Attention Network for Multi-Label Post-Hurricane Damage Assessment using UAV Imagery


28. A Study of Large Language Models for Patient Information Extraction: Model Architecture, Fine-Tuning Strategy, and Multi-task Instruction Tuning


29. SePA: A Search-enhanced Predictive Agent for Personalized Health Coaching


30. KERAG: Knowledge-Enhanced Retrieval-Augmented Generation for Advanced Question Answering


31. ODKE+: Ontology-Guided Open-Domain Knowledge Extraction with LLMs


32. Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs


33. Scaling Environments for Organoid Intelligence with LLM-Automated Design and Plasticity-Based Evaluation


34. Schema Inference for Tabular Data Repositories Using Large Language Models


35. Sample-efficient Integration of New Modalities into Large Language Models


36. Manipulating Transformer-Based Models: Controllability, Steerability, and Robust Interventions


37. Emergent Social Dynamics of LLM Agents in the El Farol Bar Problem


38. Quantized Large Language Models in Biomedical Natural Language Processing: Evaluation and Recommendation


39. Mitigation of Gender and Ethnicity Bias in AI-Generated Stories through Model Explanations


40. From Silent Signals to Natural Language: A Dual-Stage Transformer-LLM Approach


41. Behavioral Fingerprinting of Large Language Models


42. VaccineRAG: Boosting Multimodal Large Language Models’ Immunity to Harmful RAG Samples


43. Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts


44. DeepTRACE: Auditing Deep Research AI Systems for Tracking Reliability Across Citations and Evidence


45. Where Should I Study? Biased Language Models Decide! Evaluating Fairness in LMs for Academic Recommendations


46. Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate


47. Serialized Output Prompting for Large Language Model-based Multi-Talker Speech Recognition


48. DecMetrics: Structured Claim Decomposition Scoring for Factually Consistent LLM Outputs


49. Energy Landscapes Enable Reliable Abstention in Retrieval-Augmented Large Language Models for Healthcare


50. Narrative-to-Scene Generation: An LLM-Driven Pipeline for 2D Game Environments


51. No Clustering, No Routing: How Transformers Actually Process Rare Tokens


52. Training Text-to-Molecule Models with Context-Aware Tokenization


53. ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute


54. Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling


55. SpeechLLM: Unified Speech and Language Model for Enhanced Multi-Task Understanding in Low Resource Settings


56. RECAP: REwriting Conversations for Intent Understanding in Agentic Planning


57. MOSAIC: A Multilingual, Taxonomy-Agnostic, and Computationally Efficient Approach for Radiological Report Classification


58. COCORELI: Cooperative, Compositional Reconstitution \& Execution of Language Instructions


59. Multi-Modal Vision vs. Text-Based Parsing: Benchmarking LLM Strategies for Invoice Processing


60. Evaluating Large Language Models for Financial Reasoning: A CFA-Based Benchmark Study


61. Enhancing LLM Efficiency: Targeted Pruning for Prefill-Decode Disaggregation in Inference


62. Just-in-time and distributed task representations in language models


63. Emotionally-Aware Agents for Dispute Resolution


64. Can Multiple Responses from an LLM Reveal the Sources of Its Uncertainty?


65. CoCoNUTS: Concentrating on Content while Neglecting Uninformative Textual Styles for AI-Generated Peer Review Detection


66. Efficient Training-Free Online Routing for High-Volume Multi-LLM Serving