All AI Papers - 2025-08-28

1. Model Context Protocols in Adaptive Transport Systems: A Survey


2. StepWiser: Stepwise Generative Judges for Wiser Reasoning


3. The Subset Sum Matching Problem


4. The Ramon Llull’s Thinking Machine for Automated Ideation


5. MATRIX: Multi-Agent simulaTion fRamework for safe Interactions and conteXtual clinical conversational evaluation


6. Playstyle and Artificial Intelligence: An Initial Blueprint Through the Lens of Video Games


7. Algorithmic Collective Action with Multiple Collectives


8. Hybrid Deep Searcher: Integrating Parallel and Sequential Search Reasoning


9. Reasoning LLMs in the Medical Domain: A Literature Survey


10. Trustworthy Agents for Electronic Health Records through Confidence Estimation


11. Can Structured Templates Facilitate LLMs in Tackling Harder Tasks?: An Exploration of Scaling Laws by Difficulty


12. A Concurrent Modular Agent: Framework for Autonomous LLM Agents


13. Investigating Advanced Reasoning of Large Language Models via Black-Box Interaction


14. MAB Optimizer for Estimating Math Question Difficulty via Inverse CV without NLP


15. Sense of Self and Time in Borderline Personality. A Comparative Robustness Study with Generative AI


16. Building Self-Evolving Agents via Experience-Driven Lifelong Learning: A Framework and Benchmark


17. AI Models Exceed Individual Human Accuracy in Predicting Everyday Social Norms


18. Enabling MoE on the Edge via Importance-Driven Expert Scheduling


19. Novel Approaches to Artificial Intelligence Development Based on the Nearest Neighbor Method


20. VISION: Robust and Interpretable Code Vulnerability Detection Leveraging Counterfactual Augmentation


21. Who Is Lagging Behind: Profiling Student Behaviors with Graph-Level Encoding in Curriculum-Based Online Learning Systems


22. FormaRL: Enhancing Autoformalization with no Labeled Data


23. Interactive Evaluation of Large Language Models for Multi-Requirement Software Engineering Tasks



25. STARec: An Efficient Agent Framework for Recommender Systems via Autonomous Deliberate Reasoning


26. CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks


27. AniME: Adaptive Multi-Agent Planning for Long Animation Generation


28. Dynamic Collaboration of Multi-Language Models based on Minimal Complete Semantic Units


29. Answering the Unanswerable Is to Err Knowingly: Analyzing and Mitigating Abstention Failures in Large Reasoning Models


30. Stabilizing Open-Set Test-Time Adaptation via Primary-Auxiliary Filtering and Knowledge-Integrated Prediction


31. Reflection-Enhanced Meta-Optimization Integrating TextGrad-style Prompt Optimization with Memory-Driven Self-Evolution


32. CAC-CoT: Connector-Aware Compact Chain-of-Thought for Efficient Reasoning Data Synthesis Across Dual-System Cognitive Tasks


33. Bias Mitigation Agent: Optimizing Source Selection for Fair and Balanced Knowledge Retrieval


34. VistaWise: Building Cost-Effective Agent with Cross-Modal Knowledge Graph for Minecraft


35. AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance


36. MUA-RL: Multi-turn User-interacting Agent Reinforcement Learning for agentic tool use


37. Beyond Benchmark: LLMs Evaluation with an Anthropomorphic and Value-oriented Roadmap


38. RLMR: Reinforcement Learning with Mixed Rewards for Creative Writing


39. eSkinHealth: A Multimodal Dataset for Neglected Tropical Skin Diseases


40. SchemaCoder: Automatic Log Schema Extraction Coder with Residual Q-Tree Boosting


41. A Database-Driven Framework for 3D Level Generation with LLMs


42. Generic Guard AI in Stealth Game with Composite Potential Fields


43. Symmetry-Invariant Novelty Heuristics via Unsupervised Weisfeiler-Leman Features


44. Weisfeiler-Leman Features for Planning: A 1,000,000 Sample Size Hyperparameter Study


45. Language Models For Generalised PDDL Planning: Synthesising Sound and Programmatic Policies


46. The AI in the Mirror: LLM Self-Recognition in an Iterated Public Goods Game


47. PKG-DPO: Optimizing Domain-Specific AI systems with Physics Knowledge Graphs and Direct Preference Optimization


48. Information Templates: A New Paradigm for Intelligent Active Feature Acquisition


49. AI LLM Proof of Self-Consciousness and User-Specific Attractors


50. Generative Interfaces for Language Models


51. Interpolating Speaker Identities in Embedding Space for Data Expansion


52. VibeVoice Technical Report


53. LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding


54. Understanding Tool-Integrated Reasoning


55. Emotions as Ambiguity-aware Ordinal Representations


56. Real-Time Model Checking for Closed-Loop Robot Reactive Planning


57. From Tabula Rasa to Emergent Abilities: Discovering Robot Skills via Real-World Unsupervised Quality-Diversity


58. Few-Shot Connectivity-Aware Text Line Segmentation in Historical Documents


59. RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration


60. Uncertainty-Resilient Active Intention Recognition for Robotic Assistants


61. ZeST: an LLM-based Zero-Shot Traversability Navigation for Unknown Environments


62. SecureV2X: An Efficient and Privacy-Preserving System for Vehicle-to-Everything (V2X) Applications


63. APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration


64. HiPlan: Hierarchical Planning for LLM-Based Agents with Adaptive Global-Local Guidance


65. An LLM-powered Natural-to-Robotic Language Translation Framework with Correctness Guarantees


66. Attackers Strike Back? Not Anymore – An Ensemble of RL Defenders Awakens for APT Detection


67. Dynamic Triangulation-Based Graph Rewiring for Graph Neural Networks


68. Tackling Federated Unlearning as a Parameter Estimation Problem


69. No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes


70. Metric Matters: A Formal Evaluation of Similarity Measures in Active Learning for Cyber Threat Intelligence


71. STDiff: A State Transition Diffusion Framework for Time Series Imputation in Industrial Systems


72. RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation


73. GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging


74. Automatic Prompt Optimization with Prompt Distillation


75. Interpretable by AI Mother Tongue: Native Symbolic Reasoning in Neural Models


76. PAX-TS: Model-agnostic multi-granular explanations for time series forecasting via localized perturbations


77. The point is the mask: scaling coral reef segmentation with weak supervision


78. Diverse And Private Synthetic Datasets Generation for RAG evaluation: A multi-agent framework


79. HierCVAE: Hierarchical Attention-Driven Conditional Variational Autoencoders for Multi-Scale Temporal Modeling


80. HOTSPOT-YOLO: A Lightweight Deep Learning Attention-Driven Model for Detecting Thermal Anomalies in Drone-Based Solar Photovoltaic Inspections


81. Enhancing Model Privacy in Federated Learning with Random Masking and Quantization


82. SegReConcat: A Data Augmentation Method for Voice Anonymization Attack


83. Distance-informed Neural Processes


84. Interpretable Decision-Making for End-to-End Autonomous Driving


85. pyFAST: A Modular PyTorch Framework for Time Series Modeling with Multi-source and Sparse Data


86. HAEPO: History-Aggregated Exploratory Policy Optimization



88. ReflectivePrompt: Reflective evolution in autoprompting algorithms


89. ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive


90. ConfTuner: Training Large Language Models to Express Their Confidence Verbally


91. A Survey on Cloud-Edge-Terminal Collaborative Intelligence in AIoT Networks


92. EMind: A Foundation Model for Multi-task Electromagnetic Signals Understanding


93. Insights into User Interface Innovations from a Design Thinking Workshop at deRSE25


94. Long-Term Variability in Physiological-Arousal Relationships for Robust Emotion Estimation


95. Harnessing Rule-Based Reinforcement Learning for Enhanced Grammatical Error Correction


96. Text to Query Plans for Question Answering on Large Tables


97. M3HG: Multimodal, Multi-scale, and Multi-type Node Heterogeneous Graph for Emotion Cause Triplet Extraction in Conversations


98. FLAegis: A Two-Layer Defense Framework for Federated Learning Against Poisoning Attacks


99. SkyTrust: Blockchain-Enhanced UAV Security for NTNs with Dynamic Trust and Energy-Aware Consensus


100. Improving Noise Robust Audio-Visual Speech Recognition via Router-Gated Cross-Modal Feature Fusion


101. Cross-Learning Fine-Tuning Strategy for Dysarthric Speech Recognition Via CDSD database


102. Skill-Aligned Fairness in Multi-Agent Learning for Collaboration in Healthcare


103. AgriChrono: A Multi-modal Dataset Capturing Crop Growth and Lighting Variability with a Field Robot


104. FALCON: Autonomous Cyber Threat Intelligence Mining with LLMs for IDS Rule Generation


105. Tailored Teaching with Balanced Difficulty: Elevating Reasoning in Multimodal Chain-of-Thought via Prompt Curriculum


106. Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks


107. Auditing Approximate Machine Unlearning for Differentially Private Models


108. Membership Inference Attacks on LLM-based Recommender Systems


109. FFT-MoE: Efficient Federated Fine-Tuning for Foundation Models via Large-scale Sparse MoE under Heterogeneous Edge


110. The Sound of Risk: A Multimodal Physics-Informed Acoustic Model for Forecasting Market Volatility and Enhancing Market Interpretability


111. Breaking the Trade-Off Between Faithfulness and Expressiveness for Large Language Models


112. PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality


113. Clustering-based Feature Representation Learning for Oracle Bone Inscriptions Detection


114. LaQual: A Novel Framework for Automated Evaluation of LLM App Quality


115. ROSE: Remove Objects with Side Effects in Videos


116. Scaling Laws for Task-Stratified Knowledge in Post-Training Quantized Large Language Models


117. What do language models model? Transformers, automata, and the format of thought


118. A Case Study on the Effectiveness of LLMs in Verification with Proof Assistants


119. DrugReasoner: Interpretable Drug Approval Prediction with a Reasoning-augmented Language Model


120. The Quasi-Creature and the Uncanny Valley of Agency: A Synthesis of Theory and Evidence on User Interaction with Inconsistent Generative AI


121. Beyond prior knowledge: The predictive role of knowledge-building in Tutor Learning


122. SAT-SKYLINES: 3D Building Generation from Satellite Imagery and Coarse Geometric Priors


123. A Deep Learning Application for Psoriasis Detection


124. Analise de Desaprendizado de Maquina em Modelos de Classificacao de Imagens Medicas


125. Data Augmentation Improves Machine Unlearning


126. Collaborative Intelligence: Topic Modelling of Large Language Model use in Live Cybersecurity Operations


127. DRTA: Dynamic Reward Scaling for Reinforcement Learning in Time Series Anomaly Detection


128. Principled Detection of Hallucinations in Large Language Models via Multiple Testing


129. Vectorized Attention with Learnable Encoding for Quantum Transformer


130. VERIRL: Boosting the LLM-based Verilog Code Generation via Reinforcement Learning


131. How Reliable are LLMs for Reasoning on the Re-ranking task?


132. SwiftF0: Fast and Accurate Monophonic Pitch Detection


133. A Systematic Approach to Predict the Impact of Cybersecurity Vulnerabilities Using LLMs


134. CLARIFY: A Specialist-Generalist Framework for Accurate and Lightweight Dermatological Visual Question Answering


135. Low-Rank Tensor Decompositions for the Theory of Neural Networks


136. Can Out-of-Distribution Evaluations Uncover Reliance on Shortcuts? A Case Study in Question Answering


137. Toward Generalized Autonomous Agents: A Neuro-Symbolic AI Framework for Integrating Social and Technical Support in Education


138. Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning


139. Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning


140. Backprompting: Leveraging Synthetic Production Data for Health Advice Guardrails


141. EAI-Avatar: Emotion-Aware Interactive Talking Head Generation


142. Facilitating Matches on Allocation Platforms


143. Structures Meet Semantics: Multimodal Fusion via Graph Contrastive Learning


144. LLMs Can’t Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions


145. Does Calibration Affect Human Actions?


146. Evaluating Federated Learning for At-Risk Student Prediction: A Comparative Analysis of Model Complexity and Data Balancing


147. Automated Landfill Detection Using Deep Learning: A Comparative Study of Lightweight and Custom Architectures with the AerialWaste Dataset


148. ProtoEHR: Hierarchical Prototype Learning for EHR-based Healthcare Predictions


149. What Matters in Data for DPO?


150. CoPE: A Lightweight Complex Positional Encoding


151. SALMAN: Stability Analysis of Language Models Through the Maps Between Graph-based Manifolds


152. scI2CL: Effectively Integrating Single-cell Multi-omics by Intra- and Inter-omics Contrastive Learning



154. Murakkab: Resource-Efficient Agentic Workflow Orchestration in Cloud Platforms


155. Can VLMs Recall Factual Associations From Visual References?


156. Federative ischemic stroke segmentation as alternative to overcome domain-shift multi-institution challenges


157. H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems


158. MobileDenseAttn: A Dual-Stream Architecture for Accurate and Interpretable Brain Tumor Detection


159. Towards Training-Free Underwater 3D Object Detection from Sonar Point Clouds: A Comparison of Traditional and Deep Learning Approaches


160. Consensus Is All You Need: Gossip-Based Reasoning Among Large Language Models


161. Toward Responsible ASR for African American English Speakers: A Scoping Review of Bias and Equity in Speech Technology


162. Multi-Modal Drift Forecasting of Leeway Objects via Navier-Stokes-Guided CNN and Sequence-to-Sequence Attention-Based Models


163. Technology-assisted Personalized Yoga for Better Health – Challenges and Outlook