전체 AI 논문 - 2026-05-05

1. Standing on the Shoulders of Giants: Stabilized Knowledge Distillation for Cross–Language Code Clone Detection


2. HAAS: A Policy-Aware Framework for Adaptive Task Allocation Between Humans and Artificial Intelligence Systems


3. Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces


4. First-Order Efficiency for Probabilistic Value Estimation via A Statistical Viewpoint


5. SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering


6. AIs and Humans with Agency


7. When Audio-Language Models Fail to Leverage Multimodal Context for Dysarthric Speech Recognition


8. Fine-Grained Graph Generation through Latent Mixture Scheduling


9. U-Define: Designing User Workflows for Hard and Soft Constraints in LLM-Based Planning


10. Mitigating Misalignment Contagion by Steering with Implicit Traits


11. Triple Spectral Fusion for Sensor-based Human Activity Recognition


12. Foundation Models to Unlock Real-World Evidence from Nationwide Medical Claims


13. AI and Open-data Driven Scalable Solar Power Profiling


14. Coherent Hierarchical Multi-Label Learning to Defer for Medical Imaging


15. ORPilot: A Production-Oriented Agentic LLM-for-OR Tool for Optimization Modeling


16. An Empirical Study of Agent Skills for Healthcare: Practice, Gaps, and Governance


17. Hybrid Inspection and Task-Based Access Control in Zero-Trust Agentic AI


18. The 2026 ACII Dyadic Conversations (DaiKon) Workshop & Challenge


19. An explainable hypothesis-driven approach to Drug-Induced Liver Injury with HADES


20. AcademiClaw: When Students Set Challenges for AI Agents


21. Deciphering Shortcut Learning from an Evolutionary Game Theory Perspective


22. Trustworthy AI Suffers from Invariance Conflicts and Causality is The Solution


23. SCGNN: Semantic Consistency enhanced Graph Neural Network Guided by Granular-ball Computing


24. Counterfactual Reasoning in Automated Planning


25. Foundation-Model-Based Agents in Industrial Automation: Purposes, Capabilities, and Open Challenges


26. Universal Smoothness via Bernstein Polynomials: A Constructive Approximation Approach for Activation Functions


27. On Training Large Language Models for Long-Horizon Tasks: An Empirical Study of Horizon Length


28. Double Rectified Linear Unit-based Modular Semantics for Quantitative Bipolar Argumentation Framework


29. Strategy-Aware Optimization Modeling with Reasoning LLMs


30. Improving Model Safety by Targeted Error Correction


31. DataClaw: A Process-Oriented Agent Benchmark for Exploratory Real-World Data Analysis


32. GRAIL: A Deep-Granularity Hybrid Resonance Framework for Real-Time Agent Discovery via SLM-Enhanced Indexing


33. Efficient Temporal Datalog Materialisation for Composite Event Recognition


34. Shadow-Loom: Causal Reasoning over Graphical World Model of Narratives


35. Position: How can Graphs Help Large Language Models?


36. Measuring AI Reasoning: A Guide for Researchers


37. The Model Knows, the Decoder Finds: Future Value Guided Particle Power Sampling


38. FitText: Evolving Agent Tool Ecologies via Memetic Retrieval


39. The Compliance Trap: How Structural Constraints Degrade Frontier AI Metacognition Under Adversarial Pressure


40. HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness


41. Controllable and Verifiable Process Data Synthesis for Process Reward Models


42. A Compound AI Agent for Conversational Grant Discovery


43. ANO: A Principled Approach to Robust Policy Optimization



45. Anon: Extrapolating Optimizer Adaptivity Across the Real Spectrum


46. Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding


47. EngiAgent: Fully Connected Coordination of LLM Agents for Solving Open-ended Engineering Problems with Feasible Solutions


48. Complexity Horizons of Compressed Models in Analog Circuit Analysis


49. Towards Understanding Specification Gaming in Reasoning Models


50. A Study of Belief Revision Postulates in Multi-Agent Systems (Extended Version)


51. Zero-Shot Confidence Estimation for Small LLMs: When Supervised Baselines Aren’t Worth Training


52. PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments


53. Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates


54. Bucketing the Good Apples: A Method for Diagnosing and Improving Causal Abstraction


55. CoVSpec: Efficient Device-Edge Co-Inference for Vision-Language Models via Speculative Decoding


56. Submodular Benchmark Selection


57. CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models


58. MEMAUDIT: An Exact Package-Oracle Evaluation Protocol for Budgeted Long-Term LLM Memory Writing


59. T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning


60. Intervention Complexity as a Canonical Reward and a Measure of Intelligence


61. Retrieval and Multi-Hop Reasoning in 1M-Token Context Windows: Evaluating LLMs on Classical Chinese Text


62. Planner Matters! An Efficient and Unbalanced Multi-agent Collaboration Framework for Long-horizon Planning


63. Reinforcement Learning Trained Observer Control for Bearings-Only Tracking


64. The Dynamic Gist-Based Memory Model (DGMM): A Memory-Centric Architecture for Artificial Intelligence


65. NORA: A Harness-Engineered Autonomous Research Agent for End-to-End Spatial Data Science


66. Model Spec Midtraining: Improving How Alignment Training Generalizes


67. Tenability and Weak Semantics: Modeling Non-uniform Defense – Extended Version


68. Reliable AI Needs to Externalize Implicit Knowledge: A Human-AI Collaboration Perspective


69. Personalized Digital Health Modeling with Adaptive Support Users


70. TumorXAI: Self-Supervised Deep Learning Framework for Explainable Brain MRI Tumor Classification


71. 12 Angry AI Agents: Evaluating Multi-Agent LLM Decision-Making Through Cinematic Jury Deliberation


72. Moira: Language-driven Hierarchical Reinforcement Learning for Pair Trading


73. A Language for Describing Agentic LLM Contexts


74. Disentangling Intent from Role: Adversarial Self-Play for Persona-Invariant Safety Alignment


75. CyberAId: AI-Driven Cybersecurity for Financial Service Providers


76. Sheaf-Theoretic Planning: A Categorical Foundation for Resilient Multi-Agent Autonomous Systems


77. NeuroState-Bench: A Human-Calibrated Benchmark for Commitment Integrity in LLM Agent Profiles


78. Neural Decision-Propagation for Answer Set Programming


79. DataEvolver: Let Your Data Build and Improve Itself via Goal-Driven Loop Agents


80. Runtime Evaluation of Procedural Content Generation in an Endless Runner Game Using Autonomous Agents


81. Catching the Infection Before It Spreads: Foresight-Guided Defense in Multi-Agent Systems


82. NH-CROP: Robust Pricing for Governed Language Data Assets under Cost Uncertainty


83. Are LLMs More Skeptical of Entertainment News?


84. Model Routing as a Trust Problem: Route Receipts for Adaptive AI Systems


85. Latent State Design for World Models under Sufficiency Constraints


86. CP-SynC: Multi-Agent Zero-Shot Constraint Modeling in MiniZinc with Synthesized Checkers


87. Evaluating Agentic AI in the Wild: Failure Modes, Drift Patterns, and a Production Evaluation Framework


88. Multi-Agent Reasoning Improves Compute Efficiency: Pareto-Optimal Test-Time Scaling


89. MILD: Mediator Agent System with Bidirectional Perception and Multi-Layered Alignment for Human-Vehicle Collaboration


90. SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning



92. Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization


93. CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making


94. Rethinking Explanations: Formalizing Contrast in Description Logics


95. SCALE-LoRA: Auditing Post-Retrieval LoRA Composition with Residual Merging and View Reliability


96. Artificial Jagged Intelligence as Uneven Optimization Energy Allocation Capability Concentration, Redistribution, and Optimization Governance


97. TimeTok: Granularity-Controllable Time-Series Generation via Hierarchical Tokenization


98. AI Safety as Control of Irreversibility: A Systems Framework for Decision-Energy and Sovereignty Boundaries


99. A Cellular Doctrine of Morality: Intrinsic Active Precision and the Mind-Reality Overload Dilemma


100. Structural Ranking of the Cognitive Plausibility of Computational Models of Analogy and Metaphors with the Minimal Cognitive Grid


101. DiagramNet: An End-to-End Recognition Framework and Dataset for Non-Standard System-Level Diagrams


102. Truth or Tribe: How In-group Favoritism Prioritize Facts in Persona Agents


103. Segment-Aligned Policy Optimization for Multi-Modal Reasoning


104. Lifting Traces to Logic: Programmatic Skill Induction with Neuro-Symbolic Learning for Long-Horizon Agentic Tasks


105. Valley3: Scaling Omni Foundation Models for E-commerce


106. Uncertainty-Aware Trip Purpose Inference from GPS Trajectories via POI Semantic Zones and Pareto Calibration


107. EO-Gym: A Multimodal, Interactive Environment for Earth Observation Agents


108. Zero-Shot Signal Temporal Logic Planning with Disjunctive Branch Selection in Dynamic Semantic Maps


109. Agentic AI Systems Should Be Designed as Marginal Token Allocators


110. Faithful Mobile GUI Agents with Guided Advantage Estimator


111. GR-Ben: A General Reasoning Benchmark for Evaluating Process Reward Models


112. NEURON: A Neuro-symbolic System for Grounded Clinical Explainability


113. LLMs Should Not Yet Be Credited with Decision Explanation


114. Arithmetic in the Wild: Llama uses Base-10 Addition to Reason About Cyclic Concepts


115. Position: Safety and Fairness in Agentic AI Depend on Interaction Topology, Not on Model Scale or Alignment


116. A Low-Latency Fraud Detection Layer for Detecting Adversarial Interaction Patterns in LLM-Powered Agents


117. To Use AI as Dice of Possibilities with Timing Computation


118. Iterative Finetuning is Mostly Idempotent


119. PERSA: Reinforcement Learning for Professor-Style Personalized Feedback with LLMs



121. Towards Multi-Agent Autonomous Reasoning in Hydrodynamics


122. Virtual Speech Therapist: A Clinician-in-the-Loop AI Speech Therapy Agent for Personalized and Supervised Therapy


123. A Knowledge-Driven LLM-Based Decision-Support System for Explainable Defect Analysis and Mitigation Guidance in Laser Powder Bed Fusion


124. Algebraic Semantics of Governed Execution: Monoidal Categories, Effect Algebras, and Coterminous Boundaries


125. Effect-Transparent Governance for AI Workflow Architectures: Semantic Preservation, Expressive Minimality, and Decidability Boundaries


126. Accelerating battery research with an AI interface between FINALES and Kadi4Mat


127. ClinicBot: A Guideline-Grounded Clinical Chatbot with Prioritized Evidence RAG and Verifiable Citations


128. Understanding Emergent Misalignment via Feature Superposition Geometry


129. AI Agents for Sustainable SMEs: A Green ESG Assessment Framework


130. 2026 Roadmap on Artificial Intelligence and Machine Learning for Smart Manufacturing


131. SpecKV: Adaptive Speculative Decoding with Compression-Aware Gamma Selection


132. Enhancing RL Generalizability in Robotics through SHAP Analysis of Algorithms and Hyperparameters


133. From Sensors to Insight: Rapid, Edge-to-Core Application Development for Sensor-Driven Applications


134. (POSTER) From Sensors to Insight: Rapid, Edge-to-Core Application Development for Sensor-Driven Applications


135. A second-order method on the Stiefel manifold via Newton$\unicode{x2013}$Schulz


136. IConFace: Identity-Structure Asymmetric Conditioning for Unified Reference-Aware Face Restoration


137. Static Analysis of Recursive SHACL


138. A decoupled diffusion planner that adapts to changing cost limits by using cost-conditioned generation for safety and reward gradients for performance


139. TOC-SR: Task-Optimal Compact diffusion for Image Super Resolution


140. Virtual Scanning for NSCLC Histology: Investigating the Discriminatory Power of Synthetic PET


141. Bolek: A Multimodal Language Model for Molecular Reasoning


142. AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development


143. Perceptual Flow Network for Visually Grounded Reasoning


144. OphMAE: Bridging Volumetric and Planar Imaging with a Foundation Model for Adaptive Ophthalmological Diagnosis


145. mdok-style at SemEval-2026 Task 10: Finetuning LLMs for Conspiracy Detection


146. SAIL: Structure-Aware Interpretable Learning for Anatomy-Aligned Post-hoc Explanations in OCT


147. ProPACT: A Proactive AI-Driven Adaptive Collaborative Tutor for Pair Programming


148. Learning Equivariant Neural-Augmented Object Dynamics From Few Interactions


149. mdok-style at SemEval-2026 Task 9: Finetuning LLMs for Multilingual Polarization Detection


150. Caliper-in-the-Loop: Black-Box Optimization for Hyperledger Fabric Performance Tuning


151. The Design and Composition of Structural Causal Decision Processes


152. Fuzzy Fingerprinting Encoder Pre-trained Language Models for Emotion Recognition in Conversations: Human Assessment and Validity Study


153. ViewSAM: Learning View-aware Cross-modal Semantics for Weakly Supervised Cross-view Referring Multi-Object Tracking


154. Validation of an AI-based end-to-end model for prostate pathology using long-term archived routine samples


155. Dependency Parsing Across the Resource Spectrum: Evaluating Architectures on High and Low-Resource Languages


156. CoRAL: Contact-Rich Adaptive LLM-based Control for Robotic Manipulation


157. Beyond State Machines: Executing Network Procedures with Agentic Tool-Calling Sequences


158. Hyp2Former: Hierarchy-Aware Hyperbolic Embeddings for Open-Set Panoptic Segmentation


159. Recurrent Deep Reinforcement Learning for Chemotherapy Control under Partial Observability


160. Orchestrating Spatial Semantics via a Zone-Graph Paradigm for Intricate Indoor Scene Generation


161. Set-Based Training of Neural Barrier Certificates for Safety Verification of Dynamical Systems


162. A Semantic Autonomy Framework for VLM-Integrated Indoor Mobile Robots: Hybrid Deterministic Reasoning and Cross-Robot Adaptive Memory


163. Benchmarking Retrieval Strategies for Biomedical Retrieval-Augmented Generation: A Controlled Empirical Study


164. A Novel Preprocessing-Driven Approach to Remaining Useful Life (RUL) Prediction Using Temporal Convolutional Networks (TCN)


165. Pretraining on Sleep Data Improves non-Sleep Biosignal Tasks


166. Efficient Preference Poisoning Attack on Offline RLHF


167. From Experimental Limits to Physical Insight: A Retrieval-Augmented Multi-Agent Framework for Interpreting Searches Beyond the Standard Model


168. Reference-Sampled Boltzmann Projection for KL-Regularized RLVR: Target-Matched Weighted SFT, Finite One-Shot Gaps, and Policy Mirror Descent


169. When Stress Becomes Signal: Detecting Antifragility-Compatible Regimes in Multi-Agent LLM Systems


170. LLM-Assisted Repository-Level Generation with Structured Spec-Driven Engineering


171. Causal Software Engineering: A Vision and Roadmap


172. PC-MNet: Dual-Level Congruity Modeling for Multimodal Sarcasm Detection via Polarity-Modulated Attention


173. Automatic Reflection Level Classification in Hungarian Student Essays


174. FEAT: Fashion Editing and Try-On from Any Design


175. Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval


176. Entanglement is Half the Story: Post-Selection vs. Partial Traces


177. Enhancing Multimodal In-Context Learning via Inductive-Deductive Reasoning


178. Privacy Preserving Machine Learning Workflow: from Anonymization to Personalized Differential Privacy Budgets in Federated Learning


179. When Correct Isn’t Usable: Improving Structured Output Reliability in Small Language Models


180. APIOT: Autonomous Vulnerability Management Across Bare-Metal Industrial OT Networks


181. LLM-enabled Social Agents


182. When Attention Collapses: Residual Evidence Modeling for Compositional Inference


183. Rethinking Electro-Optical Vision Foundation Models for Remote Sensing Retrieval: A Controlled Comparison with Generalist VFM


184. HELIX: Hybrid Encoding with Learnable Identity and Cross-dimensional Synthesis for Time Series Imputation


185. EdgeLPR: On the Deep Neural Network trade-off between Precision and Performance in LiDAR Place Recognition


186. Reliability-Oriented Multilingual Orthopedic Diagnosis: A Domain-Adaptive Modeling and a Conceptual Validation Framework


187. On the Privacy of LLMs: An Ablation Study


188. The Conversations Beneath the Code: Triadic Data for Long-Horizon Software Engineering Agents


189. MultiSense-Pneumo: A Multimodal Learning Framework for Pneumonia Screening in Resource-Constrained Settings


190. Trees and Graphs with Non Log-concave Dominating Set Sequence via AI Tools


191. When Alignment Isn’t Enough: Response-Path Attacks on LLM Agents


192. RAFNet: Region-Aware Fusion Network for Pansharpening


193. The Causal Description Gap: Information-Theoretic Separations Across Pearl’s Hierarchy


194. Manifold-Aligned Guided Integrated Gradients for Reliable Feature Attribution


195. DocSync: Agentic Documentation Maintenance via Critic-Guided Reflexion


196. Combining Trained Models in Reinforcement Learning


197. Cross-Polarization Fusion of VV AND VH SAR Observations for Improved Flood Mapping


198. On the Optimal Sample Complexity of Offline Multi-Armed Bandits with KL Regularization


199. FLoRA: Fusion-Latent for Optical Reconstruction and Flood Area Segmentation via Cross-Modal Multi-Task Distillation Network


200. Boundary Mass and the Soft-to-Hard Limit in Mixture-of-Experts


201. Context-Aware Wireless Token Communication via Joint Token Masking and Detection


202. STABLEVAL: Disagreement-Aware and Stable Evaluation of AI Systems


203. GETA-3DGS: Automatic Joint Structured Pruning and Quantization for 3D Gaussian Splatting


204. EditPropBench: Measuring Factual Edit Propagation in Scientific Manuscripts


205. Cripping AI: Reimagining AI Through Lived Disability Experiences


206. Pair2Score: Pairwise-to-Absolute Transfer for LLM-Based Essay Scoring


207. Coopetition-Gym v1: A Formally Grounded Platform for Mixed-Motive Multi-Agent Reinforcement Learning under Strategic Coopetition


208. Principles and Guidelines for Randomized Controlled Trials in AI Evaluation


209. Optimization of CV-QKD Under Practical Constraints


210. What Single-Prompt Accuracy Misses: A Multi-Variant Reliability Audit of Language Models


211. VILAS: A VLA-Integrated Low-cost Architecture with Soft Grasping for Robotic Manipulation


212. A Multimodal Dataset for Visually Grounded Ambiguity in Machine Translation


213. Conventional Commit Classification using Large Language Models and Prompt Engineering



215. RamanBench: A Large-Scale Benchmark for Machine Learning on Raman Spectroscopy


216. Trojan Hippo: Weaponizing Agent Memory for Data Exfiltration


217. TRAP: Tail-aware Ranking Attack for World-Model Planning


218. Phone2Act: A Low-Cost, Hardware-Agnostic Teleoperation System for Scalable VLA Data Collection


219. PepSpecBench: A Unified Evaluation Benchmark for Peptide Tandem Mass Spectrometry Prediction


220. RefusalGuard: Geometry-Preserving Fine-Tuning for Safety in LLMs


221. Stochastic Sparse Attention for Memory-Bound Inference


222. Behavior-Grounded Lane Representation Learning for Multi-Task Traffic Digital Twins


223. AFFormer: Adaptive Feature Fusion Transformer for V2X Cooperative Perception under Channel Impairments


224. Chart-FR1: Visual Focus-Driven Fine-Grained Reasoning on Dense Charts


225. BadmintonGRF: A Multimodal Dataset and Benchmark for Markerless Ground Reaction Force Estimation in Badminton


226. Leveraging Data Symmetries to Select an Optimal Subset of Training Data under Label Noise


227. ShiftLIF: Efficient Multi-Level Spiking Neurons with Power-of-Two Quantization


228. Quality-Aware Exploration Budget Allocation for Cooperative Multi-Agent Reinforcement Learning


229. Spatiotemporal Hidden-State Dynamics as a Signature of Internal Reasoning in Large Language Models


230. Disentangled Anatomy-Disease Diffusion (DADD) for Controllable Ulcerative Colitis Progression Synthesis


231. Repurposing and Evaluating the (In)Feasibility of Dataset Poisoning enabled Watermarking for Contrastive Learning


232. Remote Action Generation: Remote Control with Minimal Communication


233. RMGAP: Benchmarking the Generalization of Reward Models across Diverse Preferences


234. GeoSAE: Geometric Prior-Guided Layer-Wise Sparse Autoencoder Annotation of Brain MRI Foundation Models


235. Selector-Guided Autonomous Curriculum for One-Shot Reinforcement Learning from Verifiable Rewards


236. Federated Semi-Supervised Graph Neural Networks with Prototype-Guided Pseudo-Labeling for Privacy-Preserving Gestational Diabetes Mellitus Prediction


237. TMD-Bench: A Multi-Level Evaluation Paradigm for Music-Dance Co-Generation


238. Discover Fast Power Allocation Solution for Multi-Target Tracking via AlphaEvolve Evolution


239. Khala: Scaling Acoustic Token Language Models Toward High-Fidelity Music Generation


240. Data driven approach for Outdoor Channel Prediction in 5G and Beyond


241. The Compliance Gap: Why AI Systems Promise to Follow Process Instructions but Don’t


242. Talk is Cheap, Communication is Hard: Dynamic Grounding Failures and Repair in Multi-Agent Negotiation


243. Architectural Obsolescence of Unhardened Agentic-AI Runtimes


244. GEASS: Training-Free Caption Steering for Hallucination Mitigation in Vision-Language Models


245. FEDIN: Frequency-Enhanced Deep Interest Network for Click-Through Rate Prediction


246. Motion-Aware Caching for Efficient Autoregressive Video Generation


247. SignVerse-2M: A Two-Million-Clip Pose-Native Universe of 25+ Sign Languages


248. TCDA: Thread-Constrained Discourse-Aware Modeling for Conversational Sentiment Quadruple Analysis


249. SplitZip: Ultra Fast Lossless KV Compression for Disaggregated LLM Serving


250. Probe-Geometry Alignment: Erasing the Cross-Sequence Memorization Signature Below Chance


251. BIM Information Extraction Through LLM-based Adaptive Exploration


252. GRAVITY: Architecture-Agnostic Structured Anchoring for Long-Horizon Conversational Memory


253. Class-Aware Adaptive Differential Privacy in Deep Learning for Sensor-Based Fall Detection


254. Missingness-aware Data Imputation via AI-powered Bayesian Generative Modeling


255. IMPACT-Scribe: Interactive Temporal Action Segmentation with Boundary Scribbles and Query Planning


256. IMPACT-HOI: Supervisory Control for Onset-Anchored Partial HOI Event Construction


257. TRIMMER: A New Paradigm for Video Summarization through Self-Supervised Reinforcement Learning


258. From Cortical Synchronous Rhythm to Brain Inspired Learning Mechanism: An Oscillatory Spiking Neural Network with Time-Delayed Coordination


259. AI Alignment via Incentives and Correction


260. Prosa: Rubric-Based Evaluation of LLMs on Real User Chats in Brazilian Portuguese


261. From Packets to Patterns: Interpreting Encrypted Network Traffic as Longitudinal Behavioral Signals


262. The Case for ESM3 as a General-Purpose AI Model with Systemic Risk Under the EU AI Act


263. Less Interaction But More Explanation: A Communication Perspective on Agentic AI Interfaces


264. Concepts Whisper While Syntax Shouts: Spectral Anti-Concentration and the Dual Geometry of Transformer Representations


265. Where Do Prompt Perturbations Break Generation? A Segment-Level View of Robustness in LoRA-Tuned Language Models


266. KG-First, LLM-Fallback: A Hybrid Microservice for Grounded Skill Search and Explanation


267. Model Merging: Foundations and Algorithms


268. Neuro-Symbolic Agents for Hallucination-Free Requirements Reuse


269. Automated Interpretability and Feature Discovery in Language Models with Agents


270. 6G Needs Agents: Toward Agentic AI-Native Networks for Autonomous Intelligence


271. Mesh Based Simulations with Spatial and Temporal awareness


272. Protein-Conditioned Multi-Objective Reinforcement Learning for Full-Length mRNA Design


273. FT-RAG: A Fine-grained Retrieval-Augmented Generation Framework for Complex Table Reasoning


274. CGFformer: Cluster-Guidance Frequency Transformer for Pansharpening


275. Research on Vision-Language Question Answering Models for Industrial Robots


276. LIE: LiDAR-only HD Map Construction with Intensity Enhancement via Online Knowledge Distillation


277. Practical Limits of Autonomous Test Repair: A Multi-Agent Case Study with LLM-Driven Discovery and Self-Correction


278. Decision Boundary-aware Generation for Long-tailed Learning


279. SRGAN-CKAN: Expressive Super-Resolution with Nonlinear Functional Operators under Minimal Resources


280. VisInject: Disruption != Injection – A Dual-Dimension Evaluation of Universal Adversarial Attacks on Vision-Language Models


281. Quantifying Multimodal Capabilities: Formal Generalization Guarantees in Pairwise Metric Learning


282. HepScript: A Dual-Use DSL for Human-AI Collaborative Data Analysis Workflows in High-Energy Physics


283. Medmarks: A Comprehensive Open-Source LLM Benchmark Suite for Medical Tasks


284. AMSnet-q: Unsupervised Circuit Identification and Performance Labeling for AMS Circuits


285. AI Expert Twin: Capturing Expert Cognition for Human-Centred, Practice-Based Learning


286. Investigating the Effects of Different Levels of User Control in an Interactive Educational Recommender System


287. Verbal-R3: Verbal Reranker as the Missing Bridge between Retrieval and Reasoning


288. LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation


289. Using LLMs in Software Design: An Empirical Study of GitHub and A Practitioner Survey


290. Sparse Representation Learning for Vessels


291. Focus on the Core: Empowering Diffusion Large Language Models by Self-Contrast


292. MU-SHOT-Fi: Self-Supervised Multi-User Wi-Fi Sensing with Source-free Unsupervised Domain Adaptation


293. Model-Based Proactive Cost Generation for Learning Safe Policies Offline with Limited Violation Data


294. AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification


295. VUDA: Breaking CUDA-Vulkan Isolation for Spatial Sharing of Compute and Graphics on the Same GPU


296. MAD-OPD: Breaking the Ceiling in On-Policy Distillation via Multi-Agent Debate


297. Active Reasoning Vision-Language Models via Sequential Experimental Design


298. ABox Abduction for Inconsistent Knowledge Bases under Repair Semantics


299. Creating and Evaluating Figurative Language Dataset for Sindhi


300. GraphSculptor: Sculpting Pre-training Coreset for Graph Self-supervised Learning


301. Spectral- and Energy-efficient Multi-BS Multi-RIS Pinching-antenna Systems: A GNN-based Approach


302. Are we Doomed to an AI Race? Why Self-Interest Could Drive Countries Towards a Moratorium on Superintelligence


303. Autonomous Drift Learning in Data Streams: A Unified Perspective


304. Chain of Evidence: Pixel-Level Visual Attribution for Iterative Retrieval-Augmented Generation


305. Developing a Strong Pre-Trained Base Model for Plant Leaf Disease Classification


306. A Target-Free Harmonization Method for MRI


307. Position: LLM Serving Needs Mathematical Optimization and Algorithmic Foundations, Not Just Heuristics


308. CNN-based Multi-In-Multi-Out Model for Efficient Spatiotemporal Prediction


309. The Garden of Forking Paths: Narrative Arc-Conditioned Gameplay Planning


310. Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI


311. MindMelody: A Closed-Loop EEG-Driven System for Personalized Music Intervention


312. Minimizing Collateral Damage in Activation Steering


313. The Productivity-Reliability Paradox: Specification-Driven Governance for AI-Augmented Software Development


314. Multi-Perspective Transformers in ARC-AGI-2 Challenge


315. Semantic Context-aware mOdality fUsion Transformer (SCOUT): A Context-Aware Multimodal Transformer for Concept-Grounded Pathology Report Generation


316. Forager: a lightweight testbed for continual learning with partial observability in RL


317. When Less is Enough: Efficient Inference via Collaborative Reasoning


318. Component-Aware Self-Speculative Decoding in Hybrid Language Models


319. Interpretable Difficulty-Aware Knowledge Tracing in Tutor-Student Dialogues


320. Governing What the EU AI Act Excludes: Accountability for Autonomous AI Agents in Smart City Critical Infrastructure


321. A Sentence Relation-Based Approach to Sanitizing Malicious Instructions


322. LEAP: Layer-wise Exit-Aware Pretraining for Efficient Transformer Inference


323. SCION: Size-aware Policy Orchestration for Nonstationary Object Caches (Long Paper Version)


324. Value Functions for Temporal Logic: Optimal Policies and Safety Filters


325. LLM Ghostbusters: Surgical Hallucination Suppression via Adaptive Unlearning


326. Separation Assurance between Heterogeneous Fleets of Small Unmanned Aerial Systems via Multi-Agent Reinforcement Learning


327. Certified Purity for Cognitive Workflow Executors: From Static Analysis to Cryptographic Attestation


328. EmoMM: Benchmarking and Steering MLLM for Multimodal Emotion Recognition under Conflict and Missingness


329. CLEAR: Revealing How Noise and Ambiguity Degrade Reliability in LLMs for Medicine


330. Model Organisms Are Leaky: Perplexity Differencing Often Reveals Finetuning Objectives



332. Physiology-Aware Masked Cross-Modal Reconstruction for Biosignal Representation Learning


333. Toward a Scientific Discovery Engine for Weather and Climate Data: A Visual Analytics Workbench for Embedding-Based Exploration


334. MedMosaic: A Challenging Large Scale Benchmark of Diverse Medical Audio


335. Adaptive 3D-RoPE: Physics-Aligned Rotary Positional Encoding for Wireless Foundation Models


336. Seeking Information with RAG-Assistants: Does Model Size Matter in Human-AI Collaborations?


337. Ablation Study of Multimodal Perception, Language Grounding, and Control for Human-Robot Interaction in an Object Detection and Grasping Task


338. “I Don’t Know” – Towards Appropriate Trust with Certainty-Aware Retrieval Augmented Generation


339. E-MIA: Exam-Style Black-Box Membership Inference Attacks against RAG Systems


340. Graph Rewiring in GNNs to Mitigate Over-Squashing and Over-Smoothing: A Survey


341. Co-Generative De Novo Functional Protein Design


342. SCARV: Structure-Constrained Aggregation for Stable Sample Ranking in Redundant NLP Datasets


343. Interpretable experiential learning based on state history and global feedback


344. From Flat Facts to Sharp Hallucinations: Detecting Stubborn Errors via Gradient Sensitivity


345. Fusing Urban Structure and Semantics: A Conditional Diffusion Model for Cross-City OD Matrix Generation


346. EventADL: Open-Box Anomaly Detection and Localization Framework for Events in Cloud-Based Service Systems


347. CGM-JEPA: Learning Consistent Continuous Glucose Monitor Representations via Predictive Self-Supervised Pretraining


348. Code World Model Preparedness Report


349. CellxPert: Inference-Time MCMC Steering of a Multi-Omics Single-Cell Foundation Model for In-Silico Perturbation


350. PhaseNet++: Phase-Aware Frequency-Domain Anomaly Detection for Industrial Control Systems via Phase Coherence Graphs


351. StyleShield: Exposing the Fragility of AIGC Detectors through Continuous Controllable Style Transfer


352. To Vibe Research or Not to Vibe Research? Generative AI in Qualitative Research


353. Rethink MAE with Linear Time-Invariant Dynamics


354. The Cost of Consensus: Isolated Self-Correction Prevails Over Unguided Homogeneous Multi-Agent Debate


355. Leveraging Imperfect Medical Data: A Manifold-Consistent Spatio-Temporal Network for Sensor-based Human Activity Recognition


356. TRIP-Evaluate: An Open Multimodal Benchmark for Evaluating Large Models in Transportation


357. Generalized Category Discovery under Domain Shifts: From Vision to Vision-Language Models


358. DIAGRAMS: A Review Framework for Reasoning-Level Attribution in Diagram QA


359. RA-CMF: Region-Adaptive Conditional MeanFlow for CT Image Reconstruction


360. When Less Is More: Simplicity Beats Complexity for Physics-Constrained InSAR Phase Unwrapping


361. Transfer Learning for Tonal Noise Prediction in VRF Units Using Thermodynamic and Vibration Signals


362. Retrieval-Guided Generation for Safer Histopathology Image Captioning


363. X2SAM: Any Segmentation in Images and Videos


364. Skeleton-Based Posture Classification to Promote Safer Walker-Assisted Gait in Older Adults


365. Selective Correlation Based Knowledge Distillation for Ground Reaction Force Estimation


366. Towards High Fidelity Face Swapping: A Comprehensive Survey and New Benchmark


367. Adversarial Flow Matching for Imperceptible Attacks on End-to-End Autonomous Driving


368. OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models


369. Visual Chart Representations for Cryptocurrency Regime Prediction: A Systematic Deep Learning Study


370. Latent Space Probing for Adult Content Detection in Video Generative Models


371. BRITE: A Benchmark for Reliable and Interpretable T2V Evaluation on Implausible Scenarios


372. Multi-View Hierarchical Representation Learning of Fetal Hemodynamics for Maternal Hypertension Detection at the Edge


373. NAKUL-Med: Spectral-Graph State Space Models with Dynamics Kernels for Medical Signals


374. An Algorithm for On-Sensor Agnostic Detection of Changes in Human Activity for Ultra-Low-Power Applications


375. Voice Mapping of Text-to-Speech Systems: A Metric-Based Approach for Voice Quality Assessment


376. Foundation Model Guided Dual-Branch Co-Adaptation for Source-Free EEG Decoding


377. 1BT: One-Block Transformer for EEG-Based Cognitive Workload Assessment


378. Earth System Foundation Model (ESFM): A unified framework for heterogeneous data integration and forecasting


379. H-Probes: Extracting Hierarchical Structures From Latent Representations of Language Models


380. Graph Query Generation with Constraint-guided Large Language Agents


381. The Oracle’s Fingerprint: Correlated AI Forecasting Errors and the Limits of Bias Transmission


382. Generative-AI and the transformation of workforce. A job postings-driven analysis


383. Agentopic: A Generative AI Agent Workflow for Explainable Topic Modeling


384. GhostServe: A Lightweight Checkpointing System in the Shadow for Fault-Tolerant LLM Serving


385. Separating Intelligence from Execution: A Workflow Engine for the Model Context Protocol