전체 AI 논문 - 2025-10-28

1. A Knowledge-Graph Translation Layer for Mission-Aware Multi-Agent Path Planning in Spatiotemporal Dynamics


2. A Multimodal Benchmark for Framing of Oil & Gas Advertising and Potential Greenwashing Detection


3. CMOMgen: Complex Multi-Ontology Alignment via Pattern-Guided In-Context Learning


4. AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite


5. DeepAgent: A General Reasoning Agent with Scalable Toolsets


6. Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine


7. Learning Neural Control Barrier Functions from Expert Demonstrations using Inverse Constraint Learning


8. Co-Sight: Enhancing LLM-Based Agents via Conflict-Aware Meta-Verification and Trustworthy Reasoning with Structured Facts


9. EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law


10. Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP


11. AutoOpt: A Dataset and a Unified Framework for Automating Optimization Problem Solving


12. Advancing Symbolic Integration in Large Language Models: Beyond Conventional Neurosymbolic AI


13. Boosting Accuracy and Efficiency of Budget Forcing in LLMs via Reinforcement Learning for Mathematical Reasoning


14. Magellan: Guided MCTS for Latent Space Exploration and Novelty Generation


15. CXRAgent: Director-Orchestrated Multi-Stage Reasoning for Chest X-Ray Interpretation


16. Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning


17. Understanding AI Trustworthiness: A Scoping Review of AIES & FAccT Articles


18. When Models Outthink Their Safety: Mitigating Self-Jailbreak in Large Reasoning Models with Chain-of-Guardrails


19. Investigating Scale Independent UCT Exploration Factor Strategies


20. Out-of-Distribution Detection for Safety Assurance of AI and Autonomous Systems


21. OutboundEval: A Dual-Dimensional Benchmark for Expert-Level Intelligent Outbound Evaluation of Xbench’s Professional-Aligned Series


22. Shylock: Causal Discovery in Multivariate Time Series based on Hybrid Constraints


23. Memory-Free Continual Learning with Null Space Adaptation for Zero-Shot Vision-Language Models


24. String Seed of Thought: Prompting LLMs for Distribution-Faithful and Diverse Generation


25. How to Auto-optimize Prompts for Domain Tasks? Adaptive Prompting and Reasoning through Evolutionary Domain Knowledge Adaptation


26. NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge


27. PanicToCalm: A Proactive Counseling Agent for Panic Attacks


28. DAO-AI: Evaluating Collective Decision-Making through Agentic AI in Decentralized Governance


29. Confounding Robust Deep Reinforcement Learning: A Causal Approach


30. MedAlign: A Synergistic Framework of Multimodal Preference Optimization and Federated Meta-Cognitive Reasoning


31. From Questions to Queries: An AI-powered Multi-Agent Framework for Spatial Text-to-SQL


32. Epistemic Deference to AI


33. Customizing Open Source LLMs for Quantitative Medication Attribute Extraction across Heterogeneous EHR Systems


34. Fuzzy numbers revisited: operations on extensional fuzzy numbers


35. Cultural Alien Sampler: Open-ended art generation balancing originality and coherence


36. Sketch2BIM: A Multi-Agent Human-AI Collaborative Pipeline to Convert Hand-Drawn Floor Plans to 3D BIM


37. On Thin Ice: Towards Explainable Conservation Monitoring via Attribution and Perturbations


38. Group Inertial Poser: Multi-Person Pose and Global Translation from Sparse Inertial Sensors and Ultra-Wideband Ranging


39. A Dynamic Knowledge Distillation Method Based on the Gompertz Curve


40. DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection


41. Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations


42. The Universal Landscape of Human Reasoning


43. Generative Correlation Manifolds: Generating Synthetic Data with Preserved Higher-Order Correlations


44. Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation


45. From Polyester Girlfriends to Blind Mice: Creating the First Pragmatics Understanding Benchmarks for Slovene


46. Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos


47. Human and AI Trust: Trust Attitude Measurement Instrument


48. GranViT: A Fine-Grained Vision Model With Autoregressive Perception For MLLMs


49. Enhancing Social Robots through Resilient AI


50. PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis


51. REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language Models for Enhanced Remote Health Monitoring


52. Does Model Size Matter? A Comparison of Small and Large Language Models for Requirements Classification


53. Vision Language Models for Dynamic Human Activity Recognition in Healthcare Settings


54. DreamerV3-XP: Optimizing exploration through uncertainty estimation


55. Large Language Models as Model Organisms for Human Associative Learning


56. REvolution: An Evolutionary Framework for RTL Generation driven by Large Language Models


57. Assessing the Real-World Utility of Explainable AI for Arousal Diagnostics: An Application-Grounded User Study


58. Compressing Quaternion Convolutional Neural Networks for Audio Classification


59. HIKMA: Human-Inspired Knowledge by Machine Agents through a Multi-Agent Framework for Semi-Autonomous Scientific Conferences


60. Patient-specific AI for generation of 3D dosimetry imaging from two 2D-planar measurements


61. Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding


62. CT-CLIP: A Multi-modal Fusion Framework for Robust Apple Leaf Disease Recognition in Complex Environments


63. $α$-LoRA: Effective Fine-Tuning via Base Model Rescaling


64. World-POI: Global Point-of-Interest Data Enriched from Foursquare and OpenStreetMap as Tabular and Graph Data


65. CausalRec: A CausalBoost Attention Model for Sequential Recommendation


66. Weak-to-Strong Generalization under Distribution Shifts


67. TripTide: A Benchmark for Adaptive Travel Planning under Disruptions


68. Seemingly Redundant Modules Enhance Robust Odor Learning in Fruit Flies


69. A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization


70. Efficient semantic uncertainty quantification in language models via diversity-steered sampling


71. WhaleVAD-BPN: Improving Baleen Whale Call Detection with Boundary Proposal Networks and Post-processing Optimisation


72. Pctx: Tokenizing Personalized Context for Generative Recommendation


73. Sparser Block-Sparse Attention via Token Permutation


74. Correlation Dimension of Auto-Regressive Large Language Models


75. Physics-Informed Neural Networks for MIMO Beam Map and Environment Reconstruction


76. Securing AI Agent Execution


77. PLAN: Proactive Low-Rank Allocation for Continual Learning


78. Reducing the Probability of Undesirable Outputs in Language Models Using Probabilistic Inference


79. Towards Straggler-Resilient Split Federated Learning: An Unbalanced Update Approach


80. Uncertainty-Aware Multi-Objective Reinforcement Learning-Guided Diffusion Models for 3D De Novo Molecular Design


81. Hierarchical AI Multi-Agent Fundamental Investing: Evidence from China’s A-Share Market


82. Quantifying CBRN Risk in Frontier Models


83. Large Language Models Meet Text-Attributed Graphs: A Survey of Integration Frameworks and Applications


84. Enhanced Evolutionary Multi-Objective Deep Reinforcement Learning for Reliable and Efficient Wireless Rechargeable Sensor Networks


85. Generalizable Hierarchical Skill Learning via Object-Centric Representation


86. The Gray Zone of Faithfulness: Taming Ambiguity in Unfaithfulness Detection


87. Urban 3D Change Detection Using LiDAR Sensor for HD Map Maintenance and Smart Mobility


88. ESCORT: Efficient Stein-variational and Sliced Consistency-Optimized Temporal Belief Representation for POMDPs


89. Self-Rewarding PPO: Aligning Large Language Models with Demonstrations Only


90. M-GLC: Motif-Driven Global-Local Context Graphs for Few-shot Molecular Property Prediction


91. CDrugRed: A Chinese Drug Recommendation Dataset for Discharge Medications in Metabolic Diseases


92. Soppia: A Structured Prompting Framework for the Proportional Assessment of Non-Pecuniary Damages in Personal Injury Cases


93. Bridging Language Gaps with Adaptive RAG: Improving Indonesian Language Question Answering


94. Deep learning-based automated damage detection in concrete structures using images from earthquake events


95. On the Sample Complexity of Differentially Private Policy Optimization


96. Reasoning’s Razor: Reasoning Improves Accuracy but Can Hurt Recall at Critical Operating Points in Safety and Hallucination Detection


97. AgentArcEval: An Architecture Evaluation Method for Foundation Model based Agents


98. JSTprove: Pioneering Verifiable AI for a Trustless Future


99. Physically consistent and uncertainty-aware learning of spatiotemporal dynamics


100. Race and Gender in LLM-Generated Personas: A Large-Scale Audit of 41 Occupations


101. Exploring Spiking Neural Networks for Binary Classification in Multivariate Time Series at the Edge


102. VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models


103. GPU Memory Requirement Prediction for Deep Learning Task Based on Bidirectional Gated Recurrent Unit Optimization Transformer


104. Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression


105. Memory Constrained Dynamic Subnetwork Update for Transfer Learning


106. REx86: A Local Large Language Model for Assisting in x86 Assembly Reverse Engineering


107. 3DReasonKnee: Advancing Grounded Reasoning in Medical Vision Language Models


108. Meta-Learning for Cross-Task Generalization in Protein Mutation Property Prediction


109. Do LLMs Truly Understand When a Precedent Is Overruled?


110. Focal Modulation and Bidirectional Feature Fusion Network for Medical Image Segmentation


111. An Experimental Study of Trojan Vulnerabilities in UAV Autonomous Landing


112. Security Logs to ATT&CK Insights: Leveraging LLMs for High-Level Threat Understanding and Cognitive Trait Inference


113. Aircraft Collision Avoidance Systems: Technological Challenges and Solutions on the Path to Regulatory Acceptance


114. Code-enabled language models can outperform reasoning models on diverse tasks


115. Video-As-Prompt: Unified Semantic Control for Video Generation


116. Preventing Shortcuts in Adapter Training via Providing the Shortcuts


117. Shoot First, Ask Questions Later? Building Rational Agents that Explore and Act Like People


118. HA-RAG: Hotness-Aware RAG Acceleration via Mixed Precision and Data Placement


119. Multimodal Negative Learning


120. CC-GRMAS: A Multi-Agent Graph Neural System for Spatiotemporal Landslide Risk Assessment in High Mountain Asia


121. Crisis-Resilient Portfolio Management via Graph-based Spatio-Temporal Learning


122. Incentivizing Consistent, Effective and Scalable Reasoning Capability in Audio LLMs via Reasoning Process Rewards


123. Integrated representational signatures strengthen specificity in brains and models


124. This EEG Looks Like These EEGs: Interpretable Interictal Epileptiform Discharge Detection With ProtoEEG-kNN


125. Consciousness, natural and artificial: an evolutionary advantage for reasoning on reactive substrates


126. Image and Point-cloud Classification for Jet Analysis in High-Energy Physics: A survey