All AI Papers - 2025-11-21

1. What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity


2. Exploring the use of AI authors and reviewers at Agents4Science


3. Know Your Intent: An Autonomous Multi-Perspective LLM Agent Framework for DeFi User Transaction Intent Mining


4. IPR-1: Interactive Physical Reasoner


5. Terra Nova: A Comprehensive Challenge Environment for Intelligent Agents


6. Octopus: Agentic Multimodal Reasoning with Six-Capability Orchestration


7. Realist and Pluralist Conceptions of Intelligence and Their Implications on AI Research


8. Efficiency Will Not Lead to Sustainable Reasoning AI


9. SOLID: a Framework of Synergizing Optimization and LLMs for Intelligent Decision-Making


10. As If We’ve Met Before: LLMs Exhibit Certainty in Recognizing Seen Files


11. HISE-KT: Synergizing Heterogeneous Information Networks and LLMs for Explainable Knowledge Tracing with Meta-Path Optimization


12. SafeRBench: A Comprehensive Benchmark for Safety Assessment in Large Reasoning Models


13. Knowledge-Informed Automatic Feature Extraction via Collaborative Large Language Model Agents


14. ProRAC: A Neuro-symbolic Method for Reasoning about Actions with LLM-based Progression


15. Beyond GeneGPT: A Multi-Agent Architecture with Open-Source LLMs for Enhanced Genomic Question Answering


16. Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization


17. Task Specific Sharpness Aware O-RAN Resource Management using Multi Agent Reinforcement Learning


18. Uncertainty-Aware Measurement of Scenario Suite Representativeness for Autonomous Systems


19. Project Rachel: Can an AI Become a Scholarly Author?


20. Subnational Geocoding of Global Disasters Using Large Language Models


21. Ask WhAI: Probing Belief Formation in Role-Primed LLM Agents


22. Learning Interestingness in Automated Mathematical Theory Formation


23. The Illusion of Procedural Reasoning: Measuring Long-Horizon FSM Execution in LLMs


24. In-N-On: Scaling Egocentric Manipulation with in-the-wild and on-task Data


25. Think Visually, Reason Textually: Vision-Language Synergy in ARC


26. Joint Semantic-Channel Coding and Modulation for Token Communications


27. Walrus: A Cross-Domain Foundation Model for Continuum Dynamics


28. MF-GCN: A Multi-Frequency Graph Convolutional Network for Tri-Modal Depression Detection Using Eye-Tracking, Facial, and Acoustic Features


29. DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models


30. VisPlay: Self-Evolving Vision-Language Models from Images


31. GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI


32. Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges


33. Sufficient Explanations in Databases and their Connections to Necessary Explanations and Repairs


34. The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification


35. Optimus-Q: Utilizing Federated Learning in Adaptive Robots for Intelligent Nuclear Power Plant Operations through Quantum Cryptography


36. CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud Tracking


37. HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning


38. B+ANN: A Fast Billion-Scale Disk-based Nearest-Neighbor Index


39. Multimodal Evaluation of Russian-language Architectures


40. Theoretical Closed-loop Stability Bounds for Dynamical System Coupled with Diffusion Policies


41. Evaluating Low-Light Image Enhancement Across Multiple Intensity Levels


42. RS-CA-HSICT: A Residual and Spatial Channel Augmented CNN Transformer Framework for Monkeypox Detection


43. Insights from the ICLR Peer Review and Rebuttal Process


44. TSFM in-context learning for time-series classification of bearing-health status


45. HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation


46. Small Language Models for Phishing Website Detection: Cost, Performance, and Privacy Trade-Offs


47. Towards Understanding Layer Contributions in Tabular In-Context Learning Models


48. Building Robust and Scalable Multilingual ASR for Indian Languages


49. RRT*former: Environment-Aware Sampling-Based Motion Planning using Transformer


50. NAMeGEn: Creative Name Generation via A Novel Agent-based Multiple Personalized Goal Enhancement Framework


51. DEPO: Dual-Efficiency Preference Optimization for LLM Agents



53. Parameter Importance-Driven Continual Learning for Foundation Models


54. The Empowerment of Science of Science by Large Language Models: New Tools and Methods


55. IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers


56. Reflexive Evidence-Based Multimodal Learning for Clean Energy Transitions: Causal Insights on Cooking Fuel Access, Urbanization, and Carbon Emissions


57. STREAM-VAE: Dual-Path Routing for Slow and Fast Dynamics in Vehicle Telemetry Anomaly Detection


58. Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models


59. Path Planning through Multi-Agent Reinforcement Learning in Dynamic Environments


60. Behavior Trees vs Executable Ontologies: a Comparative Analysis of Robot Control Paradigms


61. PresentCoach: Dual-Agent Presentation Coaching through Exemplars and Interactive Feedback


62. EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control


63. OEMA: Ontology-Enhanced Multi-Agent Collaboration Framework for Zero-Shot Clinical Named Entity Recognition


64. Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story


65. Physics-Based Benchmarking Metrics for Multimodal Synthetic Images


66. Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks


67. Eq.Bot: Enhance Robotic Manipulation Learning via Group Equivariant Canonicalization


68. Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning


69. SWR-Viz: AI-assisted Interactive Visual Analytics Framework for Ship Weather Routing


70. FaultDiffusion: Few-Shot Fault Time Series Generation with Diffusion Model


71. Finetuning LLMs for Automatic Form Interaction on Web-Browser in Selenium Testing Framework


72. Learning Depth from Past Selves: Self-Evolution Contrast for Robust Depth Estimation


73. Can MLLMs Detect Phishing? A Comprehensive Security Benchmark Suite Focusing on Dynamic Threats and Multimodal Evaluation in Academic Environments


74. Teaching According to Students’ Aptitude: Personalized Mathematics Tutoring via Persona-, Memory-, and Forgetting-Aware LLMs


75. Multimodal Wireless Foundation Models


76. Generating Natural-Language Surgical Feedback: From Structured Representation to Domain-Grounded Evaluation


77. DCL-SE: Dynamic Curriculum Learning for Spatiotemporal Encoding of Brain Imaging


78. ItemRAG: Item-Based Retrieval-Augmented Generation for LLM-Based Recommendation


79. CASPER: Cross-modal Alignment of Spatial and single-cell Profiles for Expression Recovery


80. From Solving to Verifying: A Unified Objective for Robust Reasoning in LLMs


81. Multi-Aspect Cross-modal Quantization for Generative Recommendation


82. Neural Networks Learn Generic Multi-Index Models Near Information-Theoretic Limit


83. Semiconductor Industry Trend Prediction with Event Intervention Based on LSTM Model in Sentiment-Enhanced Time Series Data


84. Eye Care You: Voice Guidance Application Using Social Robot for Visually Impaired People


85. Effective Code Membership Inference for Code Completion Models via Adversarial Prompts


86. MAIF: Enforcing AI Trust and Provenance with an Artifact-Centric Agentic Paradigm


87. BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer


88. GPU-Initiated Networking for NCCL


89. Deep Pathomic Learning Defines Prognostic Subtypes and Molecular Drivers in Colorectal Cancer


90. Reasoning via Video: The First Evaluation of Video Models’ Reasoning Abilities through Maze-Solving Tasks


91. UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space


92. Aligning Generative Music AI with Human Preferences: Methods and Challenges


93. Simulated Human Learning in a Dynamic, Partially-Observed, Time-Series Environment


94. Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference


95. Mathematical Analysis of Hallucination Dynamics in Large Language Models: Uncertainty Quantification, Advanced Decoding, and Principled Mitigation


96. Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation


97. Logit-Based Losses Limit the Effectiveness of Feature Knowledge Distillation


98. SVBRD-LLM: Self-Verifying Behavioral Rule Discovery for Autonomous Vehicle Identification


99. Harmful Traits of AI Companions


100. EGSA-PT: Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects


101. Quality-Controlled Multimodal Emotion Recognition in Conversations with Identity-Based Transfer Learning and MAMBA Fusion


102. MermaidSeqBench: An Evaluation Benchmark for LLM-to-Mermaid Sequence Diagram Generation



104. Artificial intelligence approaches for energy-efficient laser cutting machines


105. Fifty Shades of Greenwashing: The Political Economy of Climate Change Advertising on Social Media


106. On-Premise SLMs vs. Commercial LLMs: Prompt Engineering and Incident Classification in SOCs and CSIRTs


107. Skin-R1: Toward Trustworthy Clinical Reasoning for Dermatological Diagnosis


108. B-Rep Distance Functions (BR-DF): How to Represent a B-Rep Model by Volumetric Distance Functions?


109. When CNNs Outperform Transformers and Mambas: Revisiting Deep Architectures for Dental Caries Segmentation


110. PolyKAN: Efficient Fused GPU Operators for Polynomial Kolmogorov-Arnold Network Variants


111. Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization


112. Implicit Bias of the JKO Scheme


113. Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech


114. Transformer Injectivity & Geometric Robustness - Analytic Margins and Bi-Lipschitz Uniformity of Sequence-Level Hidden States


115. Fully Differentiable dMRI Streamline Propagation in PyTorch


116. MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging


117. Towards Continuous Assurance with Formal Verification and Assurance Cases


118. Scalable and Efficient Large-Scale Log Analysis with LLMs: An IT Software Support Case Study


119. Evaluating Generative AI for CS1 Code Grading: Direct vs Reverse Methods


120. Opinion Mining and Analysis Using Hybrid Deep Neural Networks


121. irace-evo: Automatic Algorithm Configuration Extended With LLM-Based Code Evolution


122. Application of Graph Based Vision Transformers Architectures for Accurate Temperature Prediction in Fiber Specklegram Sensors


123. Enabling Predictive Maintenance in District Heating Substations: A Labelled Dataset and Fault Detection Evaluation Framework based on Service Data


124. Quantifying the Role of OpenFold Components in Protein Structure Prediction


125. LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs


126. Test-time Scaling of LLMs: A Survey from A Subproblem Structure Perspective


127. ExplainRec: Towards Explainable Multi-Modal Zero-Shot Recommendation with Preference Attribution and Large Language Models


128. Cluster-based Adaptive Retrieval: Dynamic Context Selection for RAG Applications


129. Causally-Informed Reinforcement Learning for Adaptive Emotion-Aware Social Media Recommendation


130. An LLM-Powered Agent for Real-Time Analysis of the Vietnamese IT Job Market


131. Optimizing Agricultural Research: A RAG-Based Approach to Mycorrhizal Fungi Information



133. Membership Inference Attack against Large Language Model-based Recommendation Systems: A New Distillation-based Paradigm


134. TacEleven: generative tactic discovery for football open play


135. ESA: Energy-Based Shot Assembly Optimization for Automatic Video Editing