전체 AI 논문 - 2025-09-05

1. ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory


2. Psychologically Enhanced AI Agents


3. Improving Robustness of AlphaZero Algorithms to Test-Time Environment Changes


4. EvoEmo: Towards Evolved Emotional Policies for LLM Agents in Multi-Turn Negotiation


5. Evaluating Quality of Gaming Narratives Co-created with AI


6. Domain size asymptotics for Markov logic networks


7. Towards an Action-Centric Ontology for Cooking Procedures Using Temporal Graphs


8. The human biological advantage over AI


9. Analysis of Bluffing by DQN and CFR in Leduc Hold’em Poker


10. Hybrid Reinforcement Learning and Search for Flight Trajectory Planning


11. Intermediate Languages Matter: Formal Languages and LLMs affect Neurosymbolic Reasoning


12. Oruga: An Avatar of Representational Systems Theory


13. CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning


14. AutoPBO: LLM-powered Optimization for Local Search PBO Solvers


15. Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent


16. World Model Implanting for Test-time Adaptation of Embodied Agents


17. Handling Infinite Domain Parameters in Planning Through Best-First Search with Delayed Partial Expansions


18. A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning


19. FaMA: LLM-Empowered Agentic Assistant for Consumer-to-Consumer Marketplace


20. Expedition & Expansion: Leveraging Semantic Representations for Goal-Directed Exploration in Continuous Cellular Automata


21. Continuous Monitoring of Large-Scale Generative AI via Deterministic Knowledge Graph Structures


22. A Multidimensional AI-powered Framework for Analyzing Tourist Perception in Historic Urban Quarters: A Case Study in Shanghai


23. An Agentic Model Context Protocol Framework for Medical Concept Standardization


24. What Would an LLM Do? Evaluating Policymaking Capabilities of Large Language Models


25. Learning to Deliberate: Meta-policy Collaboration for Agentic LLMs with Multi-agent Reinforcement Learning


26. Leveraging LLM-Based Agents for Intelligent Supply Chain Planning


27. RAGuard: A Novel Approach for in-context Safe Retrieval Augmented Generation for LLMs


28. Are LLM Agents Behaviorally Coherent? Latent Profiles for Social Simulation


29. The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs


30. PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming


31. An Empirical Evaluation of Factors Affecting SHAP Explanation of Time Series Classification


32. Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning


33. Towards a Neurosymbolic Reasoning System Grounded in Schematic Representations


34. CausalARC: Abstract Reasoning with Causal World Models


35. Explainable Knowledge Graph Retrieval-Augmented Generation (KG-RAG) with KG-SMILE


36. Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents


37. Diffusion-RL Based Air Traffic Conflict Detection and Resolution Method


38. Multilinear and Linear Programs for Partially Identifiable Queries in Quasi-Markovian Structural Causal Models


39. PG-Agent: An Agent Powered by Page Graph


40. ChronoGraph: A Real-World Graph-Based Multivariate Time Series Dataset


41. Delta Activations: A Representation for Finetuned Large Language Models


42. DEXOP: A Device for Robotic Transfer of Dexterous Human Manipulation


43. Towards a Unified View of Large Language Model Post-Training


44. No Thoughts Just AI: Biased LLM Recommendations Limit Human Agency in Resume Screening


45. IPA: An Information-Preserving Input Projection Framework for Efficient Foundation Model Adaptation


46. SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer


47. Parking Availability Prediction via Fusing Multi-Source Data with A Self-Supervised Learning Enhanced Spatio-Temporal Inverted Transformer


48. PARCO: Phoneme-Augmented Robust Contextual ASR via Contrastive Entity Disambiguation


49. AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds


50. From Editor to Dense Geometry Estimator


51. Decoupled Entity Representation Learning for Pinterest Ads Ranking


52. Facts Fade Fast: Evaluating Memorization of Outdated Medical Knowledge in Large Language Models


53. HumAIne-Chatbot: Real-Time Personalized Conversational AI via Reinforcement Learning


54. Reinforcement Learning for Robust Ageing-Aware Control of Li-ion Battery Systems with Data-Driven Formal Verification


55. An Empirical Study of Vulnerabilities in Python Packages and Their Detection


56. How many patients could we save with LLM priors?


57. Learning Active Perception via Self-Evolving Preference Optimization for GUI Grounding


58. MAGneT: Coordinated Multi-Agent Generation of Synthetic Multi-Turn Mental Health Counseling Sessions


59. VisioFirm: Cross-Platform AI-assisted Annotation Tool for Computer Vision


60. Crossing the Species Divide: Transfer Learning from Speech to Animal Sounds


61. YOLO Ensemble for UAV-based Multispectral Defect Detection in Wind Turbine Components


62. Attention as an Adaptive Filter


63. TAGAL: Tabular Data Generation using Agentic LLM Methods


64. Enhancing Technical Documents Retrieval for RAG


65. Simplicity Lies in the Eye of the Beholder: A Strategic Perspective on Controllers in Reactive Synthesis


66. MEPG:Multi-Expert Planning and Generation for Compositionally-Rich Image Generation


67. EHVC: Efficient Hierarchical Reference and Quality Structure for Neural Video Coding


68. RepoDebug: Repository-Level Multi-Task and Multi-Language Debugging Evaluation of Large Language Models


69. Keypoint-based Diffusion for Robotic Motion Planning on the NICOL Robot


70. Neural Video Compression with In-Loop Contextual Filtering and Out-of-Loop Reconstruction Enhancement


71. On Robustness and Reliability of Benchmark-Based Evaluation of LLMs


72. NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings


73. Detecting Regional Spurious Correlations in Vision Transformers via Token Discarding


74. RTQA : Recursive Thinking for Complex Temporal Knowledge Graph Question Answering with Large Language Models


75. Promptception: How Sensitive Are Large Multimodal Models to Prompts?


76. NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models


77. SAC-MIL: Spatial-Aware Correlated Multiple Instance Learning for Histopathology Whole Slide Image Classification


78. Expanding Foundational Language Capabilities in Open-Source LLMs through a Korean Case Study


79. Multimodal Feature Fusion Network with Text Difference Enhancement for Remote Sensing Change Detection


80. CANDY: Benchmarking LLMs’ Limitations and Assistive Potential in Chinese Misinformation Fact-Checking


81. Chest X-ray Pneumothorax Segmentation Using EfficientNet-B4 Transfer Learning in a U-Net Architecture


82. VoxRole: A Comprehensive Benchmark for Evaluating Speech-Based Role-Playing Agents


83. SPFT-SQL: Enhancing Large Language Model for Text-to-SQL Parsing by Self-Play Fine-Tuning


84. SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment


85. MTQA:Matrix of Thought for Enhanced Reasoning in Complex Question Answering


86. Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series


87. Reactive In-Air Clothing Manipulation with Confidence-Aware Dense Correspondence and Visuotactile Affordance


88. Peptidomic-Based Prediction Model for Coronary Heart Disease Using a Multilayer Perceptron Neural Network


89. SalientFusion: Context-Aware Compositional Zero-Shot Food Recognition


90. A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models


91. MillGNN: Learning Multi-Scale Lead-Lag Dependencies for Multi-Variate Time Series Forecasting


92. Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables


93. INGRID: Intelligent Generative Robotic Design Using Large Language Models


94. From Leiden to Pleasure Island: The Constant Potts Model for Community Detection as a Hedonic Game


95. Gravity Well Echo Chamber Modeling With An LLM-Based Confirmation Bias Model


96. Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation


97. Measuring How (Not Just Whether) VLMs Build Common Ground


98. SAMVAD: A Multi-Agent System for Simulating Judicial Deliberation Dynamics in India


99. SiLVERScore: Semantically-Aware Embeddings for Sign Language Generation Evaluation


100. What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?


101. Natural Latents: Latent Variables Stable Across Ontologies


102. Learning an Adversarial World Model for Automated Curriculum Generation in MARL


103. ARDO: A Weak Formulation Deep Neural Network Method for Elliptic and Parabolic PDEs Based on Random Differences of Test Functions


104. STA-Net: A Decoupled Shape and Texture Attention Network for Lightweight Plant Disease Classification


105. Designing Gaze Analytics for ELA Instruction: A User-Centered Dashboard with Conversational AI Support


106. Sparse Autoencoder Neural Operators: Model Recovery in Function Spaces


107. Differentiable Entropy Regularization for Geometry and Neural Networks


108. MLSD: A Novel Few-Shot Learning Approach to Enhance Cross-Target and Cross-Domain Stance Detection


109. From Federated Learning to $\mathbb{X}$-Learning: Breaking the Barriers of Decentrality Through Random Walks


110. Hierarchical Federated Foundation Models over Wireless Networks for Multi-Modal Multi-Task Intelligence: Integration of Edge Learning with D2D/P2P-Enabled Fog Learning Architectures


111. LuxDiT: Lighting Estimation with Video Diffusion Transformer


112. Insights from Gradient Dynamics: Gradient Autoscaled Normalization


113. Efficient Virtuoso: A Latent Diffusion Transformer Model for Goal-Conditioned Trajectory Planning


114. Breaking the Mirror: Activation-Based Mitigation of Self-Preference in LLM Evaluators


115. CEHR-GPT: A Scalable Multi-Task Foundation Model for Electronic Health Records


116. treeX: Unsupervised Tree Instance Segmentation in Dense Forest Point Clouds


117. E-ARMOR: Edge case Assessment and Review of Multilingual Optical Character Recognition


118. The Optimiser Hidden in Plain Sight: Training with the Loss Landscape’s Induced Metric


119. A software security review on Uganda’s Mobile Money Services: Dr. Jim Spire’s tweets sentiment analysis


120. Improving Factuality in LLMs via Inference-Time Knowledge Graph Construction


121. AR$^2$: Adversarial Reinforcement Learning for Abstract Reasoning in Large Language Models


122. QuesGenie: Intelligent Multimodal Question Generation


123. Real-Time Detection of Hallucinated Entities in Long-Form Generation


124. Multimodal Proposal for an AI-Based Tool to Increase Cross-Assessment of Messages


125. Multilevel Analysis of Cryptocurrency News using RAG Approach with Fine-Tuned Mistral Large Language Model


126. Speech-Based Cognitive Screening: A Systematic Evaluation of LLM Adaptation Strategies


127. BiND: A Neural Discriminator-Decoder for Accurate Bimanual Trajectory Prediction in Brain-Computer Interfaces