All AI Papers - 2025-11-27

1. Fighting AI with AI: Leveraging Foundation Models for Assuring AI-Enabled Safety-Critical Systems



3. Building a Foundation Model for Trajectory from Scratch


4. PaTAS: A Parallel System for Trust Propagation in Neural Networks Using Subjective Logic


5. Beyond Generation: Multi-Hop Reasoning for Factual Accuracy in Vision-Language Models


6. Assessing LLMs’ Performance: Insights from the Chinese Pharmacist Exam


7. FRAGMENTA: End-to-end Fragmentation-based Generative Model with Agentic Tuning for Drug Lead Optimization


8. Quantifying the Privacy Implications of High-Fidelity Synthetic Network Traffic


9. Universe of Thoughts: Enabling Creative Reasoning with Large Language Models


10. DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs


11. VibraVerse: A Large-Scale Geometry-Acoustics Alignment Dataset for Physically-Consistent Multimodal Learning


12. NNGPT: Rethinking AutoML with Large Language Models


13. Active Inference in Discrete State Spaces from First Principles


14. Data Augmentation Techniques to Reverse-Engineer Neural Network Weights from Input-Output Queries


15. Improving Language Agents through BREW


16. SMoG: Schema Matching on Graph


17. Actionable and diverse counterfactual explanations incorporating domain knowledge and causal constraints


18. CostNav: A Navigation Benchmark for Cost-Aware Evaluation of Embodied Agents


19. Interactive AI NPCs Powered by LLMs: Technical Report for the CPDC Challenge 2025


20. Towards Benign Memory Forgetting for Selective Multimodal Large Language Model Unlearning


21. From data to concepts via wiring diagrams


22. VICoT-Agent: A Vision-Interleaved Chain-of-Thought Framework for Interpretable Multimodal Reasoning and Scalable Remote Sensing Analysis


23. “Are We Done Yet?”: A Vision-Based Judge for Autonomous Task Completion of Computer Use Agents


24. Reducing Latency of LLM Search Agent via Speculation-based Algorithm-System Co-Design


25. M$^3$Prune: Hierarchical Communication Graph Pruning for Efficient Multi-Modal Multi-Agent Retrieval-Augmented Generation


26. A System-Level Taxonomy of Failure Modes in Large Language Model Applications


27. Semantic-KG: Using Knowledge Graphs to Construct Benchmarks for Measuring Semantic Similarity


28. RPM-MCTS: Knowledge-Retrieval as Process Reward Model with Monte Carlo Tree Search for Code Generation


29. Simulated Self-Assessment in Large Language Models: A Psychometric Approach to AI Self-Efficacy


30. Agentic AI-Empowered Conversational Embodied Intelligence Networks in 6G


31. MicroSims: A Framework for AI-Generated, Scalable Educational Simulations with Universal Embedding and Adaptive Learning Support


32. Reinforcement Learning with $ω$-Regular Objectives and Constraints


33. A Unified Evaluation-Instructed Framework for Query-Dependent Prompt Optimization


34. KOM: A Multi-Agent Artificial Intelligence System for Precision Management of Knee Osteoarthritis (KOA)


35. NOEM$^{3}$A: A Neuro-Symbolic Ontology-Enhanced Method for Multi-Intent Understanding in Mobile Agents


36. Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs


37. Scaling Item-to-Standard Alignment with Large Language Models: Accuracy, Limits, and Solutions


38. FISCAL: Financial Synthetic Claim-document Augmented Learning for Efficient Fact-Checking


39. HeaRT: A Hierarchical Circuit Reasoning Tree-Based Agentic Framework for AMS Design Optimization


40. Fara-7B: An Efficient Agentic Model for Computer Use


41. Using Wearable Devices to Improve Chronic Pain Treatment among Patients with Opioid Use Disorder


42. MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities


43. MotionV2V: Editing Motion in a Video


44. Latent Collaboration in Multi-Agent Systems


45. MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models


46. ROOT: Robust Orthogonalized Optimizer for Neural Network Training


47. DiFR: Inference Verification Despite Nondeterminism


48. Evaluating the Performance of Deep Learning Models in Whole-body Dynamic 3D Posture Prediction During Load-reaching Activities


49. Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning


50. On Evaluating LLM Alignment by Evaluating LLMs as Judges


51. The Driver-Blindness Phenomenon: Why Deep Sequence Models Default to Autocorrelation in Blood Glucose Forecasting


52. BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents


53. EnergyTwin: A Multi-Agent System for Simulating and Coordinating Energy Microgrids


54. Gated Uncertainty-Aware Runtime Dual Invariants for Neural Signal-Controlled Robotics


55. Time-Domain Linear Model-based Framework for Passive Acoustic Mapping of Cavitation Activity


56. Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning


57. New York Smells: A Large Multimodal Dataset for Olfaction


58. Automated Monitoring of Cultural Heritage Artifacts Using Semantic Segmentation


59. Proceedings Twentieth Conference on Theoretical Aspects of Rationality and Knowledge


60. MIMIC-MJX: Neuromechanical Emulation of Animal Behavior


61. DesignPref: Capturing Personal Preferences in Visual Design Generation


62. The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models


63. From One Attack Domain to Another: Contrastive Transfer Learning with Siamese Networks for APT Detection


64. MTBBench: A Multimodal Sequential Clinical Decision-Making Benchmark in Oncology


65. Ranking-Enhanced Anomaly Detection Using Active Learning-Assisted Attention Adversarial Dual AutoEncoders


66. Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model


67. Generation, Evaluation, and Explanation of Novelists’ Styles with Single-Token Prompts


68. Object-Centric Vision Token Pruning for Vision Language Models


69. Block Cascading: Training Free Acceleration of Block-Causal Video Models


70. StableTrack: Stabilizing Multi-Object Tracking on Low-Frequency Detections


71. Short-Range Oversquashing


72. LLMs for Automated Unit Test Generation and Assessment in Java: The AgoneTest Framework


73. BengaliFig: A Low-Resource Challenge for Figurative and Culturally Grounded Reasoning in Bengali


74. From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations


75. Soft Adaptive Policy Optimization


76. 3D Motion Perception of Binocular Vision Target with PID-CNN


77. Geometry of Decision Making in Language Models



79. Prompting Lipschitz-constrained network for multiple-in-one sparse-view CT reconstruction


80. Forgetting by Pruning: Data Deletion in Join Cardinality Estimation


81. Can LLMs Make (Personalized) Access Control Decisions?


82. HVAdam: A Full-Dimension Adaptive Optimizer


83. Beyond Components: Singular Vector-Based Interpretability of Transformer Circuits


84. Interpretable Air Pollution Forecasting by Physics-Guided Spatiotemporal Decoupling


85. XiCAD: Camera Activation Detection in the Da Vinci Xi User Interface


86. Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation


87. Leveraging weights signals - Predicting and improving generalizability in reinforcement learning


88. DUO-TOK: Dual-Track Semantic Music Tokenizer for Vocal-Accompaniment Generation


89. OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation


90. Human-computer interactions predict mental health


91. Beluga: A CXL-Based Memory Architecture for Scalable and Efficient LLM KVCache Management


92. On the Limits of Momentum in Decentralized and Federated Optimization


93. While recognizing actions, LMMs struggle to detect core interaction events


94. SEDA: A Self-Adapted Entity-Centric Data Augmentation for Boosting Grid-based Discontinuous NER Models


95. IDAP++: Advancing Divergence-Based Pruning via Filter-Level and Layer-Level Optimization


96. “When Data is Scarce, Prompt Smarter”… Approaches to Grammatical Error Correction in Low-Resource Settings


97. LungEvaty: A Scalable, Open-Source Transformer-based Deep Learning Model for Lung Cancer Risk Prediction in LDCT Screening


98. The Devil in the Details: Emergent Misalignment, Format and Coherence in Open-Weights LLMs


99. The Making of Digital Ghosts: Designing Ethical AI Afterlives


100. R3A: Reliable RTL Repair Framework with Multi-Agent Fault Localization and Stochastic Tree-of-Thoughts Patch Generation


101. Explainable Visual Anomaly Detection via Concept Bottleneck Models


102. MFM-point: Multi-scale Flow Matching for Point Cloud Generation


103. WaymoQA: A Multi-View Visual Question Answering Dataset for Safety-Critical Reasoning in Autonomous Driving


104. Energy Costs and Neural Complexity Evolution in Changing Environments


105. Multi-Context Fusion Transformer for Pedestrian Crossing Intention Prediction in Urban Environments


106. Pedestrian Crossing Intention Prediction Using Multimodal Fusion Network


107. BERT-APC: A Reference-free Framework for Automatic Pitch Correction via Musical Context Inference


108. Zero-Shot Transfer Capabilities of the Sundial Foundation Model for Leaf Area Index Forecasting


109. On the Feasibility of Hijacking MLLMs’ Decision Chain via One Perturbation


110. Popularity Bias Alignment Estimates


111. Directional Optimization Asymmetry in Transformers: A Synthetic Stress Test


112. On-Demand Multi-Task Sparsity for Efficient Large-Model Deployment on Edge Devices


113. EmoFeedback2: Reinforcement of Continuous Emotional Image Generation via LVLM-based Reward and Textual Feedback


114. MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing


115. AI/ML based Joint Source and Channel Coding for HARQ-ACK Payload


116. Optimize Flip Angle Schedules In MR Fingerprinting Using Reinforcement Learning


117. LLM-EDT: Large Language Model Enhanced Cross-domain Sequential Recommendation with Dual-phase Training


118. Zero-Knowledge Proof Based Verifiable Inference of Models


119. Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning


120. Distilling Cross-Modal Knowledge via Feature Disentanglement


121. MAPS: Preserving Vision-Language Representations via Module-Wise Proximity Scheduling for Better Vision-Language-Action Generalization


122. CodeFuse-CommitEval: Towards Benchmarking LLM’s Power on Commit Message and Code Change Inconsistency Detection


123. Cross-LLM Generalization of Behavioral Backdoor Detection in AI Agent Supply Chains


124. A Systematic Analysis of Large Language Models with RAG-enabled Dynamic Prompting for Medical Error Detection and Correction


125. Cisco Time Series Model Technical Report


126. GED-Consistent Disentanglement of Aligned and Unaligned Substructures for Graph Similarity Learning


127. Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video Generation


128. Beyond Relational: Semantic-Aware Multi-Modal Analytics with LLM-Native Query Optimization


129. Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models


130. CropVLM: Learning to Zoom for Fine-Grained Vision-Language Perception


131. Language-Independent Sentiment Labelling with Distant Supervision: A Case Study for English, Sepedi and Setswana


132. Learning to Clean: Reinforcement Learning for Noisy Label Correction


133. Terminal Velocity Matching


134. Prune-Then-Plan: Step-Level Calibration for Stable Frontier Exploration in Embodied Question Answering


135. Leveraging Foundation Models for Histological Grading in Cutaneous Squamous Cell Carcinoma using PathFMTools


136. Prompt Fencing: A Cryptographic Approach to Establishing Security Boundaries in Large Language Model Prompts


137. An Adaptive, Data-Integrated Agent-Based Modeling Framework for Explainable and Contestable Policy Design


138. CrypTorch: PyTorch-based Auto-tuning Compiler for Machine Learning with Multi-party Computation


139. The Alexander-Hirschowitz theorem for neurovarieties


140. A Layered Protocol Architecture for the Internet of Agents


141. TiCT: A Synthetically Pre-Trained Foundation Model for Time Series Classification


142. TREASURE: A Transformer-Based Foundation Model for High-Volume Transaction Understanding


143. IndEgo: A Dataset of Industrial Scenarios and Collaborative Work for Egocentric Assistants


144. Accuracy and Efficiency Trade-Offs in LLM-Based Malware Detection and Explanation: A Comparative Study of Parameter Tuning vs. Full Fine-Tuning


145. Synthetic Data: AI’s New Weapon Against Android Malware



147. Robot-Powered Data Flywheels: Deploying Robots in the Wild for Continual Data Collection and Foundation Model Adaptation


148. IRSDA: An Agent-Orchestrated Framework for Enterprise Intrusion Response


149. On the Utility of Foundation Models for Fast MRI: Vision-Language-Guided Image Reconstruction


150. Many Ways to be Right: Rashomon Sets for Concept-Based Neural Networks


151. Towards Synergistic Teacher-AI Interactions with Generative Artificial Intelligence


152. HunyuanOCR Technical Report


153. Deductive Systems for Logic Programs with Counting


154. Trust-Based Social Learning for Communication (TSLEC) Protocol Evolution in Multi-Agent Reinforcement Learning


155. Merging without Forgetting: Continual Fusion of Task-Specific Models via Optimal Transport


156. SPQR: A Standardized Benchmark for Modern Safety Alignment Methods in Text-to-Image Diffusion Models


157. Think First, Assign Next (ThiFAN-VQA): A Two-stage Chain-of-Thought Framework for Post-Disaster Damage Assessment


158. Online Sparse Feature Selection in Data Streams via Differential Evolution


159. The Semiotic Channel Principle: Measuring the Capacity for Meaning in LLM Communication


160. When Should Neural Data Inform Welfare? A Critical Framework for Policy Uses of Neuroeconomics


161. Cross-Domain Generalization of Multimodal LLMs for Global Photovoltaic Assessment


162. AttackPilot: Autonomous Inference Attacks Against ML Services With LLM-Based Agents


163. Discover, Learn, and Reinforce: Scaling Vision-Language-Action Pretraining with Diverse RL-Generated Trajectories


164. Towards Efficient VLMs: Information-Theoretic Driven Compression via Adaptive Structural Pruning


165. CycleChemist: A Dual-Pronged Machine Learning Framework for Organic Photovoltaic Discovery


166. Beyond Binary Classification: A Semi-supervised Approach to Generalized AI-generated Image Detection


167. Hierarchical Dual-Strategy Unlearning for Biomedical and Healthcare Intelligence Using Imperfect and Privacy-Sensitive Medical Data


168. PeriodNet: Boosting the Potential of Attention Mechanism for Time Series Forecasting


169. Xmodel-2.5: 1.3B Data-Efficient Reasoning SLM


170. A Systematic Study of Compression Ordering for Large Language Models


171. Forecasting AI Time Horizon Under Compute Slowdowns


172. Generative Model-Aided Continual Learning for CSI Feedback in FDD mMIMO-OFDM Systems


173. Evolution without an Oracle: Driving Effective Evolution with LLM Judges


174. Building Resilient Information Ecosystems: Large LLM-Generated Dataset of Persuasion Attacks


175. Efficient Inference Using Large Language Models with Limited Human Data: Fine-Tuning then Rectification


176. Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation


177. Human Experts’ Evaluation of Generative AI for Contextualizing STEAM Education in the Global South


178. Exploiting the Experts: Unauthorized Compression in MoE-LLMs


179. FAST: Topology-Aware Frequency-Domain Distribution Matching for Coreset Selection


180. Tracking and Segmenting Anything in Any Modality


181. Pistachio: Towards Synthetic, Balanced, and Long-Form Video Anomaly Benchmarks


182. WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning


183. PrefixGPT: Prefix Adder Optimization by a Generative Pre-trained Transformer


184. Not Quite Anything: Overcoming SAM's Limitations for 3D Medical Imaging


185. Quantifying Modality Contributions via Disentangling Multimodal Representations


186. SG-OIF: A Stability-Guided Online Influence Framework for Reliable Vision Data


187. Hidden markov model to predict tourists visited place


188. Temperature in SLMs: Impact on Incident Categorization in On-Premises Environments


189. Systemic approach for modeling a generic smart grid


190. Personalized Reward Modeling for Text-to-Image Generation


191. SparOA: Sparse and Operator-aware Hybrid Scheduling for Edge DNN Inference


192. BlockCert: Certified Blockwise Extraction of Transformer Mechanisms