전체 AI 논문 - 2025-08-21

1. Privileged Self-Access Matters for Introspection in AI


2. Data-Driven Probabilistic Evaluation of Logic Properties with PAC-Confidence on Mealy Machines


3. MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers


4. Entropy-Constrained Strategy Optimization in Urban Floods: A Multi-Agent Framework with LLM and Knowledge Graph Integration


5. LeanGeo: Formalizing Competitional Geometry problems in Lean


6. Who Sees What? Structured Thought-Action Sequences for Epistemic Reasoning in LLMs


7. The Agent Behavior: Model, Governance and Challenges in the AI Digital Age


8. Automated Optimization Modeling through Expert-Guided Large Language Model Reasoning


9. Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs


10. Graph Structure Learning with Temporal Graph Information Bottleneck for Inductive Representation Learning


11. $TIME[t] \subseteq SPACE[O(\sqrt{t})]$ via Tree Height Compression


12. Long Chain-of-Thought Reasoning Across Languages


13. From Passive Tool to Socio-cognitive Teammate: A Conceptual Framework for Agentic AI in Human-AI Collaborative Learning


14. Evaluating Retrieval-Augmented Generation vs. Long-Context Input for Clinical Reasoning over EHRs


15. TransLight: Image-Guided Customized Lighting Control with Generative Decoupling


16. DINOv3 with Test-Time Training for Medical Image Registration


17. TransLLM: A Unified Multi-Task Foundation Framework for Urban Transportation via Learnable Prompting


18. PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning


19. Reliable generation of isomorphic physics problems using ChatGPT with prompt-chaining and tool use


20. Cross-Modality Controlled Molecule Generation with Diffusion Language Model


21. Evaluating Multilingual and Code-Switched Alignment in LLMs via Synthetic Natural Language Inference


22. AFABench: A Generic Framework for Benchmarking Active Feature Acquisition


23. Emerson-Lei and Manna-Pnueli Games for LTLf+ and PPLTL+ Synthesis


24. Transplant Then Regenerate: A New Paradigm for Text Data Augmentation


25. ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine


26. Learning in Repeated Multi-Objective Stackelberg Games with Payoff Manipulation


27. Foe for Fraud: Transferable Adversarial Attacks in Credit Card Fraud Detection


28. ECHO: Frequency-aware Hierarchical Encoding for Variable-length Signal


29. ELATE: Evolutionary Language model for Automated Time-series Engineering


30. OneLoc: Geo-Aware Generative Recommender Systems for Local Life Service


31. Can LLM Agents Solve Collaborative Tasks? A Study on Urgency-Aware Planning and Coordination


32. A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References


33. UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling


34. An Open-Source HW-SW Co-Development Framework Enabling Efficient Multi-Accelerator Systems


35. Mamba2 Meets Silence: Robust Vocal Source Separation for Sparse Regions


36. Towards LLM-generated explanations for Component-based Knowledge Graph Question Answering Systems


37. Adaptively Robust LLM Inference Optimization under Prediction Uncertainty


38. Post-hoc LLM-Supported Debugging of Distributed Processes


39. Beyond ReLU: Chebyshev-DQN for Enhanced Deep Q-Networks


40. EffiFusion-GAN: Efficient Fusion Generative Adversarial Network for Speech Enhancement


41. MISS: Multi-Modal Tree Indexing and Searching with Lifelong Sequential Behavior for Retrieval Recommendation


42. PB-IAD: Utilizing multimodal foundation models for semantic industrial anomaly detection in dynamic manufacturing environments


43. Exact Shapley Attributions in Quadratic-time for FANOVA Gaussian Processes


44. Synaptic bundle theory for spike-driven sensor-motor system: More than eight independent synaptic bundles collapse reward-STDP learning


45. In2x at WMT25 Translation Task


46. NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model


47. Detecting Reading-Induced Confusion Using EEG and Eye Tracking


48. Cognitive Surgery: The Awakening of Implicit Territorial Awareness in LLMs


49. NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding


50. DEPTH: Hallucination-Free Relation Extraction via Dependency-Aware Sentence Simplification and Two-tiered Hierarchical Refinement


51. Credence Calibration Game? Calibrating Large Language Models through Structured Play


52. Online Incident Response Planning under Model Misspecification through Bayesian Learning and Belief Quantization


53. ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students’ Cognitive Abilities


54. Computing-In-Memory Dataflow for Minimal Buffer Traffic


55. Learning Point Cloud Representations with Pose Continuity for Depth-Based Category-Level 6D Object Pose Estimation


56. Organ-Agents: Virtual Human Physiology Simulator via LLMs


57. Inter-Class Relational Loss for Small Object Detection: A Case Study on License Plates


58. Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation


59. A Comparative Evaluation of Teacher-Guided Reinforcement Learning Techniques for Autonomous Cyber Operations


60. Power Stabilization for AI Training Datacenters