LLM 관련 주요 논문 - 2025-11-18

1. Experience-Guided Adaptation of Inference-Time Reasoning Strategies


2. CURENet: Combining Unified Representations for Efficient Chronic Disease Prediction


3. MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism


4. EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment


5. AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery


6. UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios


7. STaR: Towards Cognitive Table Reasoning via Slow-Thinking Large Language Models


8. Multi-agent Undercover Gaming: Hallucination Removal via Counterfactual Test for Multimodal Reasoning


9. Key Decision-Makers in Multi-Agent Debates: Who Holds the Power?


10. AI Agent-Driven Framework for Automated Product Knowledge Graph Construction in E-Commerce


11. LLM enhanced graph inference for long-term disease progression modelling


12. HARNESS: Human-Agent Risk Navigation and Event Safety System for Proactive Hazard Forecasting in High-Risk DOE Environments


13. From Efficiency to Adaptivity: A Deeper Look at Adaptive Reasoning in Large Language Models


14. Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy


15. PAS : Prelim Attention Score for Detecting Object Hallucinations in Large Vision–Language Models


16. Benchmarking Visual LLMs Resilience to Unanswerable Questions on Visually Rich Documents


17. Privacy Challenges and Solutions in Retrieval-Augmented Generation-Enhanced LLMs for Healthcare Chatbots: A Review of Applications, Risks, and Future Directions


18. M-DAIGT: A Shared Task on Multi-Domain Detection of AI-Generated Text


19. LAET: A Layer-wise Adaptive Ensemble Tuning Framework for Pretrained Language Models


20. iMAD: Intelligent Multi-Agent Debate for Efficient and Accurate LLM Inference


21. AUVIC: Adversarial Unlearning of Visual Concepts for Multi-modal Large Language Models


22. KGQuest: Template-Driven QA Generation from Knowledge Graphs with LLM-Based Refinement


23. Refine and Align: Confidence Calibration through Multi-Agent Interaction in VQA


24. Utilizing LLMs for Industrial Process Automation: A Case Study on Modifying RAPID Programs


25. VIDEOP2R: Video Understanding from Perception to Reasoning


26. S2D-ALIGN: Shallow-to-Deep Auxiliary Learning for Anatomically-Grounded Radiology Report Generation


27. CrossMed: A Multimodal Cross-Task Benchmark for Compositional Generalization in Medical Imaging


28. AirCopBench: A Benchmark for Multi-drone Collaborative Embodied Perception and Reasoning


29. Data Poisoning Vulnerabilities Across Healthcare AI Architectures: A Security Threat Analysis


30. Automata-Based Steering of Large Language Models for Diverse Structured Generation


31. VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models


32. DialogGraph-LLM: Graph-Informed LLMs for End-to-End Audio Dialogue Intent Recognition


33. When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets


34. DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains



36. Synthetic Voices, Real Threats: Evaluating Large Text-to-Speech Models in Generating Harmful Audio


37. Evaluating Large Language Models on Rare Disease Diagnosis: A Case Study using House M.D


38. Automated Analysis of Learning Outcomes and Exam Questions Based on Bloom’s Taxonomy


39. Expert-Guided Prompting and Retrieval-Augmented Generation for Emergency Medical Service Question Answering


40. CLIPPan: Adapting CLIP as A Supervisor for Unsupervised Pansharpening


41. A Multifaceted Analysis of Negative Bias in Large Language Models through the Lens of Parametric Knowledge


42. Short-Window Sliding Learning for Real-Time Violence Detection via LLM-based Auto-Labeling


43. HPCAgentTester: A Multi-Agent LLM Approach for Enhanced HPC Unit Test Generation


44. Leveraging Parameter Space Symmetries for Reasoning Skill Transfer in LLMs


45. The Map of Misbelief: Tracing Intrinsic and Extrinsic Hallucinations Through Attention Patterns


46. PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization


47. BadThink: Triggered Overthinking Attacks on Chain-of-Thought Reasoning in Large Language Models


48. Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging


49. Evaluating from Benign to Dynamic Adversarial: A Squid Game for Large Language Models


50. Equilibrium Dynamics and Mitigation of Gender Bias in Synthetically Generated Data


51. Who Gets the Reward, Who Gets the Blame? Evaluation-Aligned Training Signals for Multi-LLM Agents


52. Pre-Attention Expert Prediction and Prefetching for Mixture-of-Experts Large Language Models


53. Learn to Select: Exploring Label Distribution Divergence for In-Context Demonstration Selection in Text Classification


54. Continual Learning of Domain Knowledge from Human Feedback in Text-to-SQL


55. Towards Fine-Grained Code-Switch Speech Translation with Semantic Space Alignment


56. Evaluating LLM Understanding via Structured Tabular Decision Simulations


57. Guarding the Meaning: Self-Supervised Training for Semantic Robustness in Guard Models


58. Evaluating Modern Large Language Models on Low-Resource and Morphologically Rich Languages:A Cross-Lingual Benchmark Across Cantonese, Japanese, and Turkish


59. Test-Time Steering for Lossless Text Compression via Weighted Product of Experts


60. Evaluating Open-Weight Large Language Models for Structured Data Extraction from Narrative Medical Reports Across Multiple Use Cases and Languages


61. Preference Orchestrator: Prompt-Aware Multi-Objective Alignment for Large Language Models


62. Hybrid Quantum Transformer for Language Generation


63. Cognitively-Inspired Episodic Memory Architectures for Accurate and Efficient Character AI


64. Data Analysis and Performance Evaluation of Simulation Deduction Based on LLMs


65. Unsupervised Cycle Detection in Agentic Applications


66. Assessing the Capabilities of LLMs in Humor:A Multi-dimensional Analysis of Oogiri Generation and Evaluation