전체 AI 논문 - 2025-08-22

1. Language-Guided Tuning: Enhancing Numeric Optimization with Textual Feedback


2. Response and Prompt Evaluation to Prevent Parasocial Relationships with Chatbots


3. Measuring the environmental impact of delivering AI at Google Scale


4. NiceWebRL: a Python library for human subject experiments with reinforcement learning environments


5. GRAFT: GRaPH and Table Reasoning for Textual Alignment – A Benchmark for Structured Instruction Following and Visual Reasoning


6. Futurity as Infrastructure: A Techno-Philosophical Interpretation of the AI Lifecycle


7. Understanding Action Effects through Instrumental Empowerment in Multi-Agent Reinforcement Learning


8. Adapting A Vector-Symbolic Memory for Lisp ACT-R


9. Transduction is All You Need for Structured Data Workflows


10. A Dynamical Systems Framework for Reinforcement Learning Safety and Robustness Verification


11. DeepThink3D: Enhancing Large Language Models with Programmatic Reasoning in Complex 3D Situated Reasoning Tasks


12. Super-additive Cooperation in Language Model Agents


13. Think in Blocks: Adaptive Reasoning from Direct Response to Deep Reasoning


14. From Bits to Boardrooms: A Cutting-Edge Multi-Agent LLM Framework for Business Excellence


15. GraSP: A Unified Graph-Based Framework for Scalable Generation, Quality Tagging, and Management of Synthetic Data for SFT and DPO


16. Planning with Minimal Disruption


17. DiagECG: An LLM-Driven Framework for Diagnostic Reasoning via Discretized ECG Tokenization


18. RETAIL: Towards Real-world Travel Planning for Large Language Models


19. Search-Based Credit Assignment for Offline Preference-Based Reinforcement Learning


20. Coarse-to-Fine Grounded Memory for LLM Agent Planning


21. Multiple Memory Systems for Enhancing the Long-term Memory of Agent


22. Computational Intelligence based Land-use Allocation Approaches for Mixed Use Areas


23. See it. Say it. Sorted: Agentic System for Compositional Diagram Generation


24. R-ConstraintBench: Evaluating LLMs on NP-Complete Scheduling


25. LLM4Sweat: A Trustworthy Large Language Model for Hyperhidrosis Support


26. PuzzleClone: An SMT-Powered Framework for Synthesizing Verifiable Data


27. Mobile-Agent-v3: Foundamental Agents for GUI Automation


28. SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass


29. Discovering Hidden Algebraic Structures via Transformers with Rank-Aware Beam GRPO


30. LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries


31. Neural Robot Dynamics


32. Dissecting Tool-Integrated Reasoning: An Empirical Study and Analysis


33. “Does the cafe entrance look accessible? Where is the door?” Towards Geospatial AI Agents for Visual Inquiries


34. End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning


35. Numerical models outperform AI weather forecasts of record-breaking extremes


36. EcomMMMU: Strategic Utilization of Visuals for Robust Multimodal E-Commerce Models


37. Tutorial on the Probabilistic Unification of Estimation Theory, Machine Learning, and Generative AI


38. StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding


39. Foundation Models for Cross-Domain EEG Analysis Application: A Survey


40. Row-Column Hybrid Grouping for Fault-Resilient Multi-Bit Weight Representation on IMC Arrays


41. Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation


42. Benchmarking Computer Science Survey Generation


43. Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance


44. Label Uncertainty for Ultrasound Segmentation


45. GRASPED: Graph Anomaly Detection using Autoencoder with Spectral Encoder and Decoder (Full Version)


46. Trained Miniatures: Low cost, High Efficacy SLMs for Sales & Marketing


47. Are Virtual DES Images a Valid Alternative to the Real Ones?


48. LoUQAL: Low-fidelity informed Uncertainty Quantification for Active Learning in the chemical configuration space


49. LLM-Driven Self-Refinement for Embodied Drone Task Planning


50. LGMSNet: Thinning a medical image segmentation model via dual-level multiscale fusion


51. Subjective Behaviors and Preferences in LLM: Language of Browsing


52. RadReason: Radiology Report Evaluation Metric with Reasons and Sub-Scores


53. A Solvable Molecular Switch Model for Stable Temporal Information Processing


54. Reliable Unlearning Harmful Information in LLMs with Metamorphosis Representation Projection


55. Mitigating Hallucinations in LM-Based TTS Models via Distribution Alignment Using GFlowNets


56. Test-time Corpus Feedback: From Retrieval to RAG


57. An Empirical Study of Knowledge Distillation for Code Understanding Tasks


58. LLaSO: A Foundational Framework for Reproducible Research in Large Language and Speech Model


59. Bridging Generalization and Personalization in Wearable Human Activity Recognition via On-Device Few-Shot Learning


60. When Audio and Text Disagree: Revealing Text Bias in Large Audio-Language Models


61. Hybrid Least Squares/Gradient Descent Methods for DeepONets


62. Bladder Cancer Diagnosis with Deep Learning: A Multi-Task Framework and Online Platform


63. EvoFormer: Learning Dynamic Graph-Level Representations with Structural and Temporal Bias Correction


64. Image-Conditioned 3D Gaussian Splat Quantization


65. Unveiling Trust in Multimodal Large Language Models: Evaluation, Analysis, and Mitigation


66. Predicting Road Crossing Behaviour using Pose Detection and Sequence Modelling


67. VideoEraser: Concept Erasure in Text-to-Video Diffusion Models


68. First RAG, Second SEG: A Training-Free Paradigm for Camouflaged Object Detection


69. IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents


70. DesignCLIP: Multimodal Learning with CLIP for Design Patent Understanding


71. Way to Build Native AI-driven 6G Air Interface: Principles, Roadmap, and Outlook


72. M-$LLM^3$REC: A Motivation-Aware User-Item Interaction Framework for Enhancing Recommendation Accuracy with LLMs


73. Conflict-Aware Soft Prompting for Retrieval-Augmented Generation


74. Explainable Knowledge Distillation for Efficient Medical Image Classification


75. Robust and Efficient Quantum Reservoir Computing with Discrete Time Crystal


76. VocabTailor: Dynamic Vocabulary Selection for Downstream Tasks in Small Language Models


77. GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design


78. Locally Pareto-Optimal Interpretations for Black-Box Machine Learning Models


79. SparK: Query-Aware Unstructured Sparsity with Recoverable KV Cache Channel Pruning


80. Survey of Vision-Language-Action Models for Embodied Manipulation


81. SemToken: Semantic-Aware Tokenization for Efficient Long-Context Language Modeling


82. SurgWound-Bench: A Benchmark for Surgical Wound Diagnosis