LLM 관련 주요 논문 - 2025-09-04

1. sam-llm: interpretable lane change trajectoryprediction via parametric finetuning


2. Situating AI Agents in their World: Aspective Agentic AI for Dynamic Partially Observable Information Systems


3. Language Models Do Not Follow Occam’s Razor: A Benchmark for Inductive and Abductive Reasoning


4. app.build: A Production Framework for Scaling Agentic Prompt-to-App Generation with Environment Scaffolding


5. Plan Verification for LLM-Based Embodied Task Completion Agents


6. Do LLM Modules Generalize? A Study on Motion Generation for Autonomous Driving


7. Deep Research is the New Analytics System: Towards Building the Runtime for AI-Driven Analytics


8. Planning with Reasoning using Vision Language World Model


9. Strefer: Empowering Video LLMs with Space-Time Referring and Reasoning via Synthetic Instruction Data


10. On Entropy Control in LLM-RL Algorithms


11. Fair Resource Allocation for Fleet Intelligence


12. epiGPTope: A machine learning-based epitope generator and classifier


13. Domain Adaptation of LLMs for Process Data


14. Adaptive KV-Cache Compression without Manually Setting Budget


15. From Evaluation to Defense: Constructing Persistent Edit-Based Fingerprints for Large Language Models


16. Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers


17. Binary Quantization For LLMs Through Dynamic Grouping


18. FlashRecovery: Fast and Low-Cost Recovery from Failures for Large-Scale Training of LLMs


19. Knowledge Integration for Physics-informed Symbolic Regression Using Pre-trained Large Language Models


20. Unveiling the Response of Large Vision-Language Models to Visually Absent Tokens


21. AR-KAN: Autoregressive-Weight-Enhanced Kolmogorov-Arnold Network for Time Series Forecasting


22. KEPT: Knowledge-Enhanced Prediction of Trajectories from Consecutive Driving Frames with Vision-Language Models


23. The Basic B*** Effect: The Use of LLM-based Agents Reduces the Distinctiveness and Diversity of People’s Choices


24. Cut Costs, Not Accuracy: LLM-Powered Data Processing with Guarantees


25. Grocery to General Merchandise: A Cross-Pollination Recommender using LLMs and Real-Time Cart Context


26. A-SEA3L-QA: A Fully Automated Self-Evolving, Adversarial Workflow for Arabic Long-Context Question-Answer Generation


27. Clustering Discourses: Racial Biases in Short Stories about Women Generated by Large Language Models


28. Optimizing Geometry Problem Sets for Skill Development


29. BioBlue: Notable runaway-optimiser-like LLM failure modes on biologically and economically aligned AI safety benchmarks for LLMs with simplified observation format


30. Radio Astronomy in the Era of Vision-Language Models: Prompt Sensitivity and Adaptation


31. Synthetic Founders: AI-Generated Social Simulations for Startup Validation Research in Computational Social Science


32. OpenAIs HealthBench in Action: Evaluating an LLM-Based Medical Assistant on Realistic Clinical Queries