전체 AI 논문 - 2025-11-20

1. Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration


2. SkillGen: Learning Domain Skills for In-Context Sequential Decision Making


3. AutoTool: Efficient Tool Selection for Large Language Model Agents


4. Rate-Distortion Guided Knowledge Graph Construction from Lecture Notes Using Gromov-Wasserstein Optimal Transport


5. A Neuro-Symbolic Framework for Reasoning under Perceptual Uncertainty: Bridging Continuous Perception and Discrete Symbolic Planning


6. Operationalizing Pluralistic Values in Large Language Model Alignment Reveals Trade-offs in Safety, Inclusivity, and Model Behavior


7. When Words Change the Model: Sensitivity of LLMs for Constraint Programming Modelling


8. DataSage: Multi-agent Collaboration for Insight Discovery with External Knowledge Retrieval, Multi-role Debating, and Multi-path Reasoning


9. PathMind: A Retrieve-Prioritize-Reason Framework for Knowledge Graph Reasoning with Large Language Models


10. Enhancing Regional Airbnb Trend Forecasting Using LLM-Based Embeddings of Accessibility and Human Mobility


11. DevPiolt: Operation Recommendation for IoT Devices at Xiaomi Home


12. Listen Like a Teacher: Mitigating Whisper Hallucinations using Adaptive Layer Attention and Knowledge Distillation


13. Do Large Language Models (LLMs) Understand Chronology?


14. HFL-FlowLLM: Large Language Models for Network Traffic Flow Classification in Heterogeneous Federated Learning


15. Beyond Accuracy: A Multi-Dimensional Framework for Evaluating Enterprise Agentic AI Systems


16. Run, Ruminate, and Regulate: A Dual-process Thinking System for Vision-and-Language Navigation


17. PRISM: Prompt-Refined In-Context System Modelling for Financial Retrieval


18. APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design


19. Collaborative QA using Interacting LLMs. Impact of Network Structure, Node Capability and Distributed Data


20. Making Evidence Actionable in Adaptive Learning


21. AISAC: An Integrated multi-agent System for Transparent, Retrieval-Grounded Scientific Assistance


22. Syn-STARTS: Synthesized START Triage Scenario Generation Framework for Scalable LLM Evaluation


23. ALEX:A Light Editing-knowledge Extractor


24. Artificial Intelligence Agents in Music Analysis: An Integrative Perspective Based on Two Use Cases


25. Scene Graph-Guided Generative AI Framework for Synthesizing and Evaluating Industrial Hazard Scenarios


26. CORGI: Efficient Pattern Matching With Quadratic Guarantees


27. Jailbreaking Large Vision Language Models in Intelligent Transportation Systems


28. Causal computations in Semi Markovian Structural Causal Models using divide and conquer


29. When AI Does Science: Evaluating the Autonomous AI Scientist KOSMOS in Radiation Biology


30. KANGURA: Kolmogorov-Arnold Network-Based Geometry-Aware Learning with Unified Representation Attention for 3D Modeling of Complex Structures


31. Imagine in Space: Exploring the Frontier of Spatial Intelligence and Reasoning Efficiency in Vision Language Models


32. ARC Is a Vision Problem!


33. Automated proving in planar geometry based on the complex number identity method and elimination


34. Zero-shot Synthetic Video Realism Enhancement via Structure-aware Denoising


35. \textit{FLARE}: Adaptive Multi-Dimensional Reputation for Robust Client Reliability in Federated Learning


36. Seeing Beyond the Image: ECG and Anatomical Knowledge-Guided Myocardial Scar Segmentation from Late Gadolinium-Enhanced Images


37. Near-Lossless Model Compression Enables Longer Context Inference in DNA Large Language Models


38. Attention via Synaptic Plasticity is All You Need: A Biologically Inspired Spiking Neuromorphic Transformer


39. Impact of Image Resolution on Age Estimation with DeepFace and InsightFace


40. Ground Truth Generation for Multilingual Historical NLP using LLMs


41. NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards


42. Improving segmentation of retinal arteries and veins using cardiac signal in doppler holograms


43. Adapformer: Adaptive Channel Management for Multivariate Time Series Forecasting


44. Enhancing Agentic Autonomous Scientific Discovery with Vision-Language Model Capabilities


45. Failure to Mix: Large language models struggle to answer according to desired probability distributions


46. Active Matter as a framework for living systems-inspired Robophysics


47. Expert-Guided POMDP Learning for Data-Efficient Modeling in Healthcare


48. A Method for Characterizing Disease Progression from Acute Kidney Injury to Chronic Kidney Disease


49. MRI Embeddings Complement Clinical Predictors for Cognitive Decline Modeling in Alzheimer’s Disease Cohorts


50. CCSD: Cross-Modal Compositional Self-Distillation for Robust Brain Tumor Segmentation with Missing Modalities


51. Is Your VLM for Autonomous Driving Safety-Ready? A Comprehensive Benchmark for Evaluating External and In-Cabin Risks


52. Biased Minds Meet Biased AI: How Class Imbalance Shapes Appropriate Reliance and Interacts with Human Base Rate Neglect


53. Deep Learning-Based Regional White Matter Hyperintensity Mapping as a Robust Biomarker for Alzheimer’s Disease


54. ReflexGrad: Three-Way Synergistic Architecture for Zero-Shot Generalization in LLM Agents


55. SweeperBot: Making 3D Browsing Accessible through View Analysis and Visual Question Answering


56. Examining the Metrics for Document-Level Claim Extraction in Czech and Slovak


57. Masked IRL: LLM-Guided Reward Disambiguation from Demonstrations and Language


58. Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models


59. DecNefLab: A Modular and Interpretable Simulation Framework for Decoded Neurofeedback


60. MissHDD: Hybrid Deterministic Diffusion for Hetrogeneous Incomplete Data Imputation


61. IMSE: Efficient U-Net-based Speech Enhancement using Inception Depthwise Convolution and Amplitude-Aware Linear Attention


62. Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching


63. Agentic AI Systems in Electrical Power Systems Engineering: Current State-of-the-Art and Challenges


64. nnterp: A Standardized Interface for Mechanistic Interpretability of Transformers



66. Analyzing the Impact of Participant Failures in Cross-Silo Federated Learning


67. Hybrid Modeling of Photoplethysmography for Non-invasive Monitoring of Cardiovascular Parameters


68. Agentic Video Intelligence: A Flexible Framework for Advanced Video Exploration and Understanding


69. Tell Me: An LLM-powered Mental Well-being Assistant with RAG, Synthetic Dialogue Generation, and Agentic Planning


70. Watchdogs and Oracles: Runtime Verification Meets Large Language Models for Autonomous Systems


71. Context-aware, Ante-hoc Explanations of Driving Behaviour


72. MiAD: Mirage Atom Diffusion for De Novo Crystal Generation


73. Sigil: Server-Enforced Watermarking in U-Shaped Split Federated Learning via Gradient Injection


74. Continuous Vision-Language-Action Co-Learning with Semantic-Physical Alignment for Behavioral Cloning


75. Cheating Stereo Matching in Full-scale: Physical Adversarial Attack against Binocular Depth Estimation in Autonomous Driving


76. The Tokenization Bottleneck: How Vocabulary Extension Improves Chemistry Representation Learning in Pretrained Language Models


77. Clinically-Validated Innovative Mobile Application for Assessing Blinking and Eyelid Movements


78. Going Places: Place Recognition in Artificial and Natural Systems


79. LSP-YOLO: A Lightweight Single-Stage Network for Sitting Posture Recognition on Embedded Devices


80. H-LDM: Hierarchical Latent Diffusion Models for Controllable and Interpretable PCG Synthesis from Clinical Metadata


81. SAM-Fed: SAM-Guided Federated Semi-Supervised Learning for Medical Image Segmentation


82. AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models


83. GEN3D: Generating Domain-Free 3D Scenes from a Single Image


84. Weight Variance Amplifier Improves Accuracy in High-Sparsity One-Shot Pruning


85. Comparing Task-Agnostic Embedding Models for Tabular Data


86. Object-Centric World Models for Causality-Aware Reinforcement Learning


87. ArbESC+: Arabic Enhanced Edit Selection System Combination for Grammatical Error Correction Resolving conflict and improving system combination in Arabic GEC


88. LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation


89. Parallelizing Tree Search with Twice Sequential Monte Carlo


90. Bridging the Gap Between Bayesian Deep Learning and Ensemble Weather Forecasts


91. Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution


92. Multi-Scale Correlation-Aware Transformer for Maritime Vessel Re-Identification


93. DiverseClaire: Simulating Students to Improve Introductory Programming Course Materials for All CS1 Learners


94. Few-Shot Precise Event Spotting via Unified Multi-Entity Graph and Distillation


95. Towards Deploying VLA without Fine-Tuning: Plug-and-Play Inference-Time VLA Policy Steering via Embodied Evolutionary Diffusion


96. SymLoc: Symbolic Localization of Hallucination across HaluEval and TruthfulQA


97. AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs


98. Certified Signed Graph Unlearning


99. Selective Weak-to-Strong Generalization


100. AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models


101. SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM


102. Fair-GNE : Generalized Nash Equilibrium-Seeking Fairness in Multiagent Healthcare Automation


103. Multi-view Phase-aware Pedestrian-Vehicle Incident Reasoning Framework with Vision-Language Models


104. Real-Time Mobile Video Analytics for Pre-arrival Emergency Medical Services


105. Soft-Label Training Preserves Epistemic Uncertainty


106. Synthetic Clinical Notes for Rare ICD Codes: A Data-Centric Framework for Long-Tail Medical Coding


107. CascadedViT: Cascaded Chunk-FeedForward and Cascaded Group Attention Vision Transformer


108. FAPE-IR: Frequency-Aware Planning and Execution Framework for All-in-One Image Restoration


109. NeuroPath: Neurobiology-Inspired Path Tracking and Reflection for Semantically Coherent Retrieval


110. GCA-ResUNet:Image segmentation in medical images using grouped coordinate attention


111. Error-Driven Scene Editing for 3D Grounding in Large Language Models


112. Automated glenoid bone loss measurement and segmentation in CT scans for pre-operative planning in shoulder instability


113. Zero-Training Task-Specific Model Synthesis for Few-Shot Medical Image Classification


114. CFG-EC: Error Correction Classifier-Free Guidance


115. CafeMed: Causal Attention Fusion Enhanced Medication Recommendation


116. A Machine Learning-Based Multimodal Framework for Wearable Sensor-Based Archery Action Recognition and Stress Estimation


117. Radial Compensation: Stable and Semantically Decoupled Generative Models on Riemannian Manifolds


118. GRPO Privacy Is at Risk: A Membership Inference Attack Against Reinforcement Learning With Verifiable Rewards


119. Training-free Detection of AI-generated images via Cropping Robustness


120. Keeping Code-Aware LLMs Fresh: Full Refresh, In-Context Deltas, and Incremental Fine-Tuning


121. MRI Plane Orientation Detection using a Context-Aware 2.5D Model


122. From Narrow Unlearning to Emergent Misalignment: Causes, Consequences, and Containment in LLMs


123. Developing a Grounded View of AI


124. Knowledge-Grounded Agentic Large Language Models for Multi-Hazard Understanding from Reconnaissance Reports


125. Can Artificial Intelligence Accelerate Technological Progress? Researchers’ Perspectives on AI in Manufacturing and Materials Science


126. FlakyGuard: Automatically Fixing Flaky Tests at Industry Scale


127. How to Marginalize in Causal Structure Learning?


128. LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering


129. Node-Level Uncertainty Estimation in LLM-Generated SQL


130. Data Whitening Improves Sparse Autoencoder Learning


131. Preference-Based Learning in Audio Applications: A Systematic Analysis


132. Compute-in-Memory Implementation of State Space Models for Event Sequence Processing


133. What Works for ‘Lost-in-the-Middle’ in LLMs? A Study on GM-Extract and Mitigations


134. Can QE-informed (Re)Translation lead to Error Correction?


135. Hybrid Convolution Neural Network Integrated with Pseudo-Newton Boosting for Lumbar Spine Degeneration Detection


136. H-CNN-ViT: A Hierarchical Gated Attention Multi-Branch Model for Bladder Cancer Recurrence Prediction


137. Randomized Controlled Trials for Conditional Access Optimization Agent


138. Randomized Controlled Trials for Phishing Triage Agent


139. ScoresActivation: A New Activation Function for Model Agnostic Global Explainability by Design


140. GAEA: Experiences and Lessons Learned from a Country-Scale Environmental Digital Twin


141. Passive Dementia Screening via Facial Temporal Micro-Dynamics Analysis of In-the-Wild Talking-Head Video


142. Synergizing Multigrid Algorithms with Vision Transformer: A Novel Approach to Enhance the Seismic Foundation Model


143. MAT-MPNN: A Mobility-Aware Transformer-MPNN Model for Dynamic Spatiotemporal Prediction of HIV Diagnoses in California, Florida, and New England


144. A Trajectory-free Crash Detection Framework with Generative Approach and Segment Map Diffusion


145. FusionFM: All-in-One Multi-Modal Image Fusion with Flow Matching


146. Modeling Fairness in Recruitment AI via Information Flow


147. XAI-Driven Deep Learning for Protein Sequence Functional Group Classification


148. GeoPl@ntNet: A Platform for Exploring Essential Biodiversity Variables


149. Uncovering and Aligning Anomalous Attention Heads to Defend Against NLP Backdoor Attacks


150. Scaling Patterns in Adversarial Alignment: Evidence from Multi-LLM Jailbreak Experiments


151. Quantifying Distribution Shift in Traffic Signal Control with Histogram-Based GEH Distance


152. Temporal Object-Aware Vision Transformer for Few-Shot Video Object Detection


153. Semantic Multiplexing


154. Known Meets Unknown: Mitigating Overconfidence in Open Set Recognition


155. Can LLMs Create Legally Relevant Summaries and Analyses of Videos?


156. ExplainableGuard: Interpretable Adversarial Defense for Large Language Models Using Chain-of-Thought Reasoning


157. Dynamic Temperature Scheduler for Knowledge Distillation


158. Credal Ensemble Distillation for Uncertainty Quantification


159. PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning


160. Gene Incremental Learning for Single-Cell Transcriptomics


161. What happens when nanochat meets DiLoCo?


162. MoETTA: Test-Time Adaptation Under Mixed Distribution Shifts with MoE-LayerNorm


163. Multi-Agent VLMs Guided Self-Training with PNU Loss for Low-Resource Offensive Content Detection


164. ChemFixer: Correcting Invalid Molecules to Unlock Previously Unseen Chemical Space


165. VitalBench: A Rigorous Multi-Center Benchmark for Long-Term Vital Sign Prediction in Intraoperative Care


166. Multi-Horizon Time Series Forecasting of non-parametric CDFs with Deep Lattice Networks


167. Adaptive Redundancy Regulation for Balanced Multimodal Information Refinement


168. Robustness of LLM-enabled vehicle trajectory prediction under data security threats


169. Motor Imagery Classification Using Feature Fusion of Spatially Weighted Electroencephalography


170. SCALEX: Scalable Concept and Latent Exploration for Diffusion Models


171. DeepDefense: Layer-Wise Gradient-Feature Alignment for Building Robust Neural Networks


172. Deep reinforcement learning-based spacecraft attitude control with pointing keep-out constraint


173. nuCarla: A nuScenes-Style Bird’s-Eye View Perception Dataset for CARLA Simulation


174. Review of Passenger Flow Modelling Approaches Based on a Bibliometric Analysis


175. Subject-Independent Imagined Speech Detection via Cross-Subject Generalization and Calibration


176. DualLaguerreNet: A Decoupled Spectral Filter GNN and the Uncovering of the Flexibility-Stability Trade-off


177. Refine Thought: A Test-Time Inference Method for Embedding Model Reasoning


178. AI Kill Switch for malicious web-based LLM agent


179. Preparation Meets Opportunity: Enhancing Data Preprocessing for ML Training With Seneca


180. Signature vs. Substance: Evaluating the Balance of Adversarial Resistance and Linguistic Quality in Watermarking Large Language Models


181. From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow