All AI Papers - 2025-09-02

1. Automated Clinical Problem Detection from SOAP Notes using a Collaborative Multi-Agent LLM Architecture


2. Tree-Guided Diffusion Planner


3. Orientability of Causal Relations in Time Series using Summary Causal Graphs and Faithful Distributions


4. Freeze and Conquer: Reusable Ansatz for Solving the Traveling Salesman Problem


5. PosterForest: Hierarchical Multi-Agent Collaboration for Scientific Poster Generation


6. Leveraging Imperfection with MEDLEY: A Multi-Model Approach Harnessing Bias in Medical AI


7. A-MHA*: Anytime Multi-Heuristic A*


8. Integrating Large Language Models with Network Optimization for Interactive and Explainable Supply Chain Planning: A Real-World Case Study


9. Scalable Solution Methods for Dec-POMDPs with Deterministic Dynamics


10. Revisiting Landmarks: Learning from Previous Plans to Generalize over Problem Instances


11. HealthProcessAI: A Technical Framework and Proof-of-Concept for LLM-Enhanced Healthcare Process Mining


12. Counterfactual Scenarios for Automated Planning


13. Modeling Wise Decision Making: A Z-Number Fuzzy Framework Inspired by Phronesis


14. MMSearch-Plus: A Simple Yet Challenging Benchmark for Multimodal Browsing Agents


15. Learning Lifted Action Models From Traces of Incomplete Actions and States


16. A General Framework of Epistemic Forgetting and its Instantiation by Ranking Functions


17. CARJAN: Agent-Based Generation and Simulation of Traffic Scenarios with AJAN



19. AHELM: A Holistic Evaluation of Audio-Language Models


20. Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models


21. Multi-Ontology Integration with Dual-Axis Propagation for Medical Concept Representation


22. MultiFluxAI Enhancing Platform Engineering with Advanced Agent-Orchestrated Retrieval Systems


23. Addressing accuracy and hallucination of LLMs in Alzheimer’s disease research through knowledge graphs


24. Fuzzy, Symbolic, and Contextual: Enhancing LLM Instruction via Cognitive Scaffolding


25. The Demon is in Ambiguity: Revisiting Situation Recognition with Single Positive Multi-Label Learning


26. DynaMark: A Reinforcement Learning Framework for Dynamic Watermarking in Industrial Machine Tool Controllers


27. TMUAD: Enhancing Logical Capabilities in Unified Anomaly Detection Models with a Text Memory Bank


28. MoE-Health: A Mixture of Experts Framework for Robust Multimodal Healthcare Prediction


29. Going over Fine Web with a Fine-Tooth Comb: Technical Report of Indexing Fine Web for Problematic Content Search and Retrieval


30. PiCSAR: Probabilistic Confidence Selection And Ranking


31. Benchmarking GPT-5 in Radiation Oncology: Measurable Gains, but Persistent Need for Expert Oversight


32. Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering


33. Reasoning-Intensive Regression


34. Neural Network Acceleration on MPSoC board: Integrating SLAC’s SNL, Rogue Software and Auto-SNL


35. Developer Insights into Designing AI-Based Computer Perception Tools


36. CAD2DMD-SET: Synthetic Generation Tool of Digital Measurement Device CAD Model Datasets for fine-tuning Large Vision-Language Models


37. OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization


38. Entropy-Based Non-Invasive Reliability Monitoring of Convolutional Neural Networks


39. Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR


40. Harnessing IoT and Generative AI for Weather-Adaptive Learning in Climate Resilience Education


41. QZhou-Embedding Technical Report


42. Physics-Informed Spectral Modeling for Hyperspectral Imaging


43. Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning



45. NSPDI-SNN: An efficient lightweight SNN based on nonlinear synaptic pruning and dendritic integration


46. Limitations of Physics-Informed Neural Networks: a Study on Smart Grid Surrogation


47. EZ-Sort: Efficient Pairwise Comparison via Zero-Shot CLIP-Based Pre-Ordering and Human-in-the-Loop Sorting


48. What Data is Really Necessary? A Feasibility Study of Inference Data Minimization for Recommender Systems


49. Complete Gaussian Splats from a Single Image with Denoising Diffusion Models


50. On the Hardness of Learning GNN-based SAT Solvers: The Role of Graph Ricci Curvature


51. ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding


52. Priors Matter: Addressing Misspecification in Bayesian Deep Q-Learning


53. HSFN: Hierarchical Selection for Fake News Detection building Heterogeneous Ensemble


54. Igniting Creative Writing in Small Language Models: LLM-as-a-Judge versus Multi-Agent Refined Rewards


55. Controllable 3D Molecular Generation for Structure-Based Drug Design Through Bayesian Flow Networks and Gradient Integration


56. Diffusion-based Multi-modal Synergy Interest Network for Click-through Rate Prediction


57. MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation


58. The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management


59. Med-RewardBench: Benchmarking Reward Models and Judges for Medical Multimodal Large Language Models


60. Benchmarking the State of Networks with a Low-Cost Method Based on Reservoir Computing


61. DRASP: A Dual-Resolution Attentive Statistics Pooling Framework for Automatic MOS Prediction


62. zkLoRA: Fine-Tuning Large Language Models with Verifiable Security via Zero-Knowledge Proofs


63. AllSummedUp: un framework open-source pour comparer les metriques d’evaluation de resume


64. Normality and the Turing Test


65. Iterative Inference in a Chess-Playing Neural Network


66. RoboInspector: Unveiling the Unreliability of Policy Code for LLM-enabled Robotic Manipulation


67. Challenges and Applications of Large Language Models: A Comparison of GPT and DeepSeek family of models


68. EconAgentic in DePIN Markets: A Large Language Model Approach to the Sharing Economy of Decentralized Physical Infrastructure


69. Adaptive Heavy-Tailed Stochastic Gradient Descent


70. DLGAN: Time Series Synthesis Based on Dual-Layer Generative Adversarial Networks


71. Stairway to Fairness: Connecting Group and Individual Fairness


72. Stage-Diff: Stage-wise Long-Term Time Series Generation Based on Diffusion Models


73. Locus: Agentic Predicate Synthesis for Directed Fuzzing


74. MyGO: Memory Yielding Generative Offline-consolidation for Lifelong Learning Systems


75. BLUEX Revisited: Enhancing Benchmark Coverage with Automatic Captioning


76. Efficient Code Embeddings from Code Generation Models


77. A Financial Brain Scan of the LLM


78. Deep Active Learning for Lung Disease Severity Classification from Chest X-rays: Learning with Less Data in the Presence of Class Imbalance


79. Breaking the Cold-Start Barrier: Reinforcement Learning with Double and Dueling DQNs


80. Reinforcement Learning for Optimizing Large Qubit Array based Quantum Sensor Circuits


81. Quantum Machine Learning for Optimizing Entanglement Distribution in Quantum Sensor Circuits


82. A Mixture of Experts Gating Network for Enhanced Surrogate Modeling in External Aerodynamics


83. Zero-Shot KWS for Children’s Speech using Layer-Wise Features from SSL Models


84. HCQA: Hybrid Classical-Quantum Agent for Generating Optimal Quantum Sensor Circuits


85. Full-Frequency Temporal Patching and Structured Masking for Enhanced Audio Classification


86. Decoding Memories: An Efficient Pipeline for Self-Consistency Hallucination Detection


87. Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children’s Speech?


88. Generalizable Object Re-Identification via Visual In-Context Prompting


89. Enhancing Robustness of Autoregressive Language Models against Orthographic Attacks via Pixel-based Approach


90. Improving Aviation Safety Analysis: Automated HFACS Classification Using Reinforcement Learning with Group Relative Policy Optimization


91. Manifold Trajectories in Next-Token Prediction: From Replicator Dynamics to Softmax Equilibrium


92. BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design


93. FUTURE: Flexible Unlearning for Tree Ensemble


94. Deep Residual Echo State Networks: exploring residual orthogonal connections in untrained Recurrent Neural Networks


95. Quantifying Label-Induced Bias in Large Language Model Self- and Cross-Evaluations


96. RadGS-Reg: Registering Spine CT with Biplanar X-rays via Joint 3D Radiative Gaussians Reconstruction and 3D/3D Registration


97. WaveLLDM: Design and Development of a Lightweight Latent Diffusion Model for Speech Enhancement and Restoration


98. A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers


99. HiddenObject: Modality-Agnostic Fusion for Multimodal Hidden Object Detection


100. R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning


101. EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control


102. Automating the Deep Space Network Data Systems; A Case Study in Adaptive Anomaly Detection through Agentic AI


103. An Explainable, Attention-Enhanced, Bidirectional Long Short-Term Memory Neural Network for Joint 48-Hour Forecasting of Temperature, Irradiance, and Relative Humidity


104. Learning to Generate Unit Test via Adversarial Reinforcement Learning


105. Dynamic Low-rank Approximation of Full-Matrix Preconditioner for Training Generalized Linear Models


106. PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning


107. Spatiotemporal EEG-Based Emotion Recognition Using SAM Ratings from Serious Games with Hybrid Deep Learning


108. Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI


109. Safe-Control: A Safety Patch for Mitigating Unsafe Content in Text-to-Image Generation Models


110. TrInk: Ink Generation with Transformer Network


111. Model-Driven Quantum Code Generation Using Large Language Models and Retrieval-Augmented Generation


112. CoBA: Counterbias Text Augmentation for Mitigating Various Spurious Correlations via Semantic Triples


113. Pep2Prob Benchmark: Predicting Fragment Ion Probability for MS$^2$-based Proteomics


114. QuadKAN: KAN-Enhanced Quadruped Motion Control via End-to-End Reinforcement Learning