All AI Papers - 2025-07-29

1. A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence


2. GenoMAS: A Multi-Agent Framework for Scientific Discovery via Code-Driven Gene Expression Analysis


3. Smart Expansion Techniques for ASP-based Interactive Configuration


4. MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them


5. Core Safety Values for Provably Corrigible Agents


6. On the Limits of Hierarchically Embedded Logic in Classical Neural Networks



8. MMGraphRAG: Bridging Vision and Language with Interpretable Multimodal Knowledge Graphs


9. evalSmarT: An LLM-Based Framework for Evaluating Smart Contract Generated Comments


10. How Chain-of-Thought Works? Tracing Information Flow from Decoding, Projection, and Activation


11. Beyond Listenership: AI-Predicted Interventions Drive Improvements in Maternal Health Behaviours


12. Learning the Value Systems of Societies from Preferences


13. Algorithmic Fairness: A Runtime Perspective


14. A General Framework for Dynamic MAPF using Multi-Shot ASP and Tunnels


15. Adaptive Fuzzy Time Series Forecasting via Partially Asymmetric Convolution and Sub-Sliding Window Fusion


16. Complementarity-driven Representation Learning for Multi-modal Knowledge Graph Completion


17. Enhancing Large Multimodal Models with Adaptive Sparsity and KV Cache Compression


18. Unlearning of Knowledge Graph Embedding via Preference Optimization


19. MeLA: A Metacognitive LLM-Driven Architecture for Automatic Heuristic Design


20. Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition


21. STARN-GAT: A Multi-Modal Spatio-Temporal Graph Attention Network for Accident Severity Prediction


22. Enhancing QoS in Edge Computing through Federated Layering Techniques: A Pathway to Resilient AI Lifelong Learning Systems


23. Memorization in Fine-Tuned Large Language Models


24. Compositional Function Networks: A High-Performance Alternative to Deep Neural Networks with Built-in Interpretability


25. Modular Delta Merging with Orthogonal Constraints: A Scalable Framework for Continual and Reversible Model Composition


26. Security Tensors as a Cross-Modal Bridge: Extending Text-Aligned Safety to Vision in LVLM


27. Personalized Treatment Effect Estimation from Unstructured Data


28. JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1


29. SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment


30. From Entanglement to Alignment: Representation Space Decomposition for Unsupervised Time Series Domain Adaptation


31. Handoff Design in User-Centric Cell-Free Massive MIMO Networks Using DRL


32. Your AI, Not Your View: The Bias of LLMs in Investment Analysis


33. Mind the Gap: Conformative Decoding to Improve Output Diversity of Instruction-Tuned Large Language Models


34. Multivariate Conformal Prediction via Conformalized Gaussian Scoring


35. Dissecting Persona-Driven Reasoning in Language Models via Activation Patching


36. FRED: Financial Retrieval-Enhanced Detection and Editing of Hallucinations in Language Models


37. FHSTP@EXIST 2025 Benchmark: Sexism Detection with Transparent Speech Concept Bottleneck Models


38. Pareto-Grid-Guided Large Language Models for Fast and High-Quality Heuristics Design in Multi-Objective Combinatorial Optimization


39. Modeling User Behavior from Adaptive Surveys with Supplemental Context


40. MediQAl: A French Medical Question Answering Dataset for Knowledge and Reasoning Evaluation


41. HAMLET-FFD: Hierarchical Adaptive Multi-modal Learning Embeddings Transformation for Face Forgery Detection


42. SCORPION: Addressing Scanner-Induced Variability in Histopathology


43. Music Arena: Live Evaluation for Text-to-Music


44. JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment


45. Not Only Grey Matter: OmniBrain for Robust Multimodal Classification of Alzheimer’s Disease


46. Geometry of Neural Reinforcement Learning in Continuous State and Action Spaces


47. Free Energy-Inspired Cognitive Risk Integration for AV Navigation in Pedestrian-Rich Environments


48. First Hallucination Tokens Are Different from Conditional Ones


49. Why Flow Matching is Particle Swarm Optimization?


50. LanternNet: A Novel Hub-and-Spoke System to Seek and Suppress Spotted Lanternfly Populations


51. Aligning Large Language Model Agents with Rational and Moral Preferences: A Supervised Fine-Tuning Approach


52. Investigation of Accuracy and Bias in Face Recognition Trained with Synthetic Data


53. Learning to See Inside Opaque Liquid Containers using Speckle Vibrometry


54. Industry Insights from Comparing Deep Learning and GBDT Models for E-Commerce Learning-to-Rank


55. AR-LIF: Adaptive reset leaky-integrate and fire neuron for spiking neural networks


56. Regularizing Subspace Redundancy of Low-Rank Adaptation


57. Multi-Masked Querying Network for Robust Emotion Recognition from Incomplete Multi-Modal Physiological Signals


58. Prostate Cancer Classification Using Multimodal Feature Fusion and Explainable AI


59. Text2VLM: Adapting Text-Only Datasets to Evaluate Alignment Training in Visual Language Models


60. A Multimodal Architecture for Endpoint Position Prediction in Team-based Multiplayer Games


61. MIMII-Agent: Leveraging LLMs with Function Calling for Relative Evaluation of Anomalous Sound Detection


62. Hot-Swap MarkBoard: An Efficient Black-box Watermarking Approach for Large-scale Model Distribution


63. Ontology-Enhanced Knowledge Graph Completion using Large Language Models


64. TransPrune: Token Transition Pruning for Efficient Large Vision-Language Model


65. Controllable Video-to-Music Generation with Multiple Time-Varying Conditions


66. Lightweight Remote Sensing Scene Classification on Edge Devices via Knowledge Distillation and Early-exit


67. Beyond Interactions: Node-Level Graph Generation for Knowledge-Free Augmentation in Recommender Systems


68. Implicit Spatiotemporal Bandwidth Enhancement Filter by Sine-activated Deep Learning Model for Fast 3D Photoacoustic Tomography


69. DAG-AFL: Directed Acyclic Graph-based Asynchronous Federated Learning


70. Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation


71. MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization


72. Enhancing Hallucination Detection via Future Context


73. T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation


74. Kimi K2: Open Agentic Intelligence


75. Enhancing Spatial Reasoning through Visual and Textual Thinking


76. The Xeno Sutra: Can Meaning and Value be Ascribed to an AI-Generated “Sacred” Text?


77. AQUA: A Large Language Model for Aquaculture & Fisheries


78. LLMs-guided adaptive compensator: Bringing Adaptivity to Automatic Control Systems with Large Language Models


79. DmC: Nearest Neighbor Guidance Diffusion Model for Offline Cross-domain Reinforcement Learning


80. Speaking in Words, Thinking in Logic: A Dual-Process Framework in QA Systems


81. Shapley-Value-Based Graph Sparsification for GNN Inference