전체 AI 논문 - 2026-05-19

1. Actionable World Representation


2. What Does the AI Doctor Value? Auditing Pluralism in the Clinical Ethics of Language Models


3. SkillGenBench: Benchmarking Skill Generation Pipelines for LLM Agents


4. Democratizing Large-Scale Re-Optimization with LLM-Guided Model Patches


5. Learning Quantifiable Visual Explanations Without Ground-Truth


6. Efficient Lookahead Encoding and Abstracted Width for Learning General Policies in Classical Planning


7. Position: A Three-Layer Probabilistic Assume-Guarantee Architecture Is Structurally Required for Safe LLM Agent Deployment


8. GIM: Evaluating models via tasks that integrate multiple cognitive domains


9. AI for Auto-Research: Roadmap & User Guide


10. SCICONVBENCH: Benchmarking LLMs on Multi-Turn Clarification for Task Formulation in Computational Science


11. Learning Lifted Action Models from Traces with Minimal Information About Actions and States


12. Latent Action Reparameterization for Efficient Agent Inference


13. When Outcome Looks Right But Discipline Fails: Trace-Based Evaluation Under Hidden Competitor State


14. Query-Conditioned Knowledge Alignment for Reliable Cross-System Medical Reasoning


15. VISAFF: Speaker-Centered Visual Affective Feature Learning for Emotion Recognition in Conversation


16. AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment


17. A Practical Noise2Noise Denoising Pipeline for High-Throughput Raman Spectroscopy


18. OCCAM: Open-set Causal Concept explAnation and Ontology induction for black-box vision Models


19. When Fireflies Cluster; Enhancing Automatic Clustering via Centroid-Guided Firefly Optimization


20. QSTRBench: a New Benchmark to Evaluate the Ability of Language Models to Reason with Qualitative Spatial and Temporal Calculi


21. Causely: A Causal Intelligence Layer for Enterprise AI A Benchmark Study on SRE and Reliability Workflows


22. SD-Search: On-Policy Hindsight Self-Distillation for Search-Augmented Reasoning


23. DARE-EEG: A Foundation Model for Mining Dual-Aligned Representation of EEG


24. Beyond the Cartesian Illusion: Testing Two-Stage Multi-Modal Theory of Mind under Perceptual Bottlenecks


25. Pairwise Preference Reward and Group-Based Diversity Enhancement for Superior Open-Ended Generation


26. Scalable Environments Drive Generalizable Agents


27. Visualizing the Invisible: Generative Visual Grounding Empowers Universal EEG Understanding in MLLMs


28. TRACE: Trajectory Correction from Cross-layer Evidence for Hallucination Reduction


29. Whispers in the Noise: Surrogate-Guided Concept Awakening via a Multi-Agent Framework


30. Evidence-Grounded Frontier Mapping and Agentic Hypothesis Generation in Nanomedicine


31. Generative AI and the Productivity Divide: Human-AI Complementarities in Education


32. POST: Prior-Observation Adversarial Learning of Spatio-Temporal Associations for Multivariate Time Series Anomaly Detection


33. TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning


34. Safety Geometry Collapse in Multimodal LLMs and Adaptive Drift Correction


35. Learning to Solve Compositional Geometry Routing Problems


36. LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning


37. DocOS: Towards Proactive Document-Guided Actions in GUI Agents


38. New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions


39. TeleCom-Bench: How Far Are Large Language Models from Industrial Telecommunication Applications?


40. Shared Backbone PPO for Multi-UAV Communication Coverage with Connection Preservation


41. Unleashing LLMs in Bayesian Optimization: Preference-Guided Framework for Scientific Discovery


42. Reconciling Contradictory Views on the Effectiveness of SFT in LLMs: An Interaction Perspective


43. SVFSearch: A Multimodal Knowledge-Intensive Benchmark for Short-Video Frame Search in the Gaming Vertical Domain


44. Ethical Hyper-Velocity (EHV): A Provably Deterministic Governance-Aware JIT Compiler Architecture for Agentic Systems


45. Agentic Chunking and Bayesian De-chunking of AI Generated Fuzzy Cognitive Maps: A Model of the Thucydides Trap


46. LAST-RAG: Literature-Anchored Stochastic Trajectory Retrieval-Augmented Generation for Knowledge-Conditioned Degradation Model Selection


47. DuIVRS-2: An LLM-based Interactive Voice Response System for Large-scale POI Attribute Acquisition


48. Evaluating Cognitive Age Alignment in Interactive AI Agents


49. PAIR: Prefix-Aware Internal Reward Model for Multi-Turn Agent Optimization


50. KISS - Knowledge Infrastructure for Scientific Simulation: A Scaffolding for Agentic Earth Science


51. Remembering More, Risking More: Longitudinal Safety Risks in Memory-Equipped LLM Agents


52. Interactive Evaluation Requires a Design Science


53. Going Headless? On the Boundaries of Vertical AI Firms


54. Accelerating AI-Powered Research: The PuppyChatter Framework for Usable and Flexible Tooling


55. STRIDE: A Self-Reflective Agent Framework for Reliable Automatic Equation Discovery


56. Entropy-Gradient Inversion: Moving Toward Internal Mechanism of Large Reasoning Models



58. Agents for Experiments, Experiments for Agents: A Design Grammar for AI-Enabled Experimental Science


59. Harnessing LLM Agents with Skill Programs


60. Divergence-Suppressing Couplings for Rectified Flow


61. EXG: Self-Evolving Agents with Experience Graphs


62. EGI: A Multimodal Emotional AI Framework for Enhancing Scrum Master Real-time Self-Awareness


63. Multimodal Cultural Heritage Knowledge Graph Extension with Language and Vision Models


64. SAPO: Step-Aligned Policy Optimization for Reasoning-Based Generative Recommendation


65. Causal Intervention-Based Memory Selection for Long-Horizon LLM Agents


66. WebGameBench: Requirement-to-Application Evaluation for Coding Agents via Browser-Native Games


67. Episodic-Semantic Memory Architecture for Long-Horizon Scientific Agents


68. Prediction of Challenging Behaviors Associated with Profound Autism in a Classroom Setting Using Wearable Sensors


69. GraphMind: From Operational Traces to Self-Evolving Workflow Automation


70. AutoRubric-T2I: Robust Rule-Based Reward Model for Text-to-Image Alignment


71. NeuSymMS: A Hybrid Neuro-Symbolic Memory System for Persistent, Self-Curating LLM Agents


72. ECG-WM: A Physiology-Informed ECG World Model for Clinical Intervention Simulation


73. Generalization or Memorization? Brittleness Testing for Chess-Trained Language Models


74. Evaluating Deep Research Agents on Expert Consulting Work: A Benchmark with Verifiers, Rubrics, and Cognitive Traps


75. Memory-Guided Tree Search with Cross-Branch Knowledge Transfer for LLM Solver Synthesis


76. Self-supervised Hierarchical Visual Reasoning with World Model


77. RAG-based EEG-to-Text Translation Using Deep Learning and LLMs


78. The Capability Paradox: How Smarter Auditors Make Multi-Agent Systems Less Secure


79. Multi-Party Multi-Objective Optimization as Consensus Search: Runtime Analysis of Cross-Party Recombination


80. Computational Challenges in Token Economics: Bridging Economic Theory and AI System Design


81. Heterogeneous Information-Bottleneck Coordination Graphs for Multi-Agent Reinforcement Learning


82. QQJ: Quantifying Qualitative Judgment for Scalable and Human-Aligned Evaluation of Generative AI


83. ADR: An Agentic Detection System for Enterprise Agentic AI Security


84. CBT-Audio: Evaluating Audio Language Models for Patient-Side Distress Intensity Estimation in CBT Session Recordings


85. HyperPersona: A Multi-Level Hypergraph Framework for Text-Based Automatic Personality Prediction


86. Reasoning Before Diagnosis: Physician-Inspired Structured Thinking for ECG Classification


87. CyberCorrect: A Cybernetic Framework for Closed-Loop Self-Correction in Large Language Models


88. MetaCogAgent: A Metacognitive Multi-Agent LLM Framework with Self-Aware Task Delegation


89. A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation


90. Is VLA Reasoning Faithful? Probing Safety of Chain-of-Causation


91. CAM-Bench: A Benchmark for Computational and Applied Mathematics in Lean


92. CatalyticMLLM: A Graph-Text Multimodal Large Language Model for Catalytic Materials


93. Towards Robust Argumentative Essay Understanding via TIDE: An Interactive Framework with Trial and Debate


94. ChemVA: Advancing Large Language Models on Chemical Reaction Diagrams Understanding


95. CAREBench: Evaluating LLMs’ Emotion Understanding by Assessing Cognitive Appraisal Reasoning


96. Responsible Agentic AI Requires Explicit Provenance


97. From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning


98. MADP: A Multi-Agent Pipeline for Sustainable Document Processing with Human-in-the-Loop


99. Dynamics of collective creativity in AI art competitions


100. Latent Heuristic Search: Continuous Optimization for Automated Algorithm Design


101. F2IND-IT! – Multimodal Fuzzy Fake Indian News Detection using Images and Text


102. Capturing LLM Capabilities via Evidence-Calibrated Query Clustering


103. Scientific Logicality Enriched Methodology for LLM Reasoning: A Practice in Physics


104. RAGA: Reading-And-Graph-building-Agent for Autonomous Knowledge Graph Construction and Retrieval-Augmented Generation


105. AnchorDiff: Topology-Aware Masked Diffusion with Confidence-based Rewriting for Radiology Report Generation


106. Towards Human-Level Book-Writing Capability


107. PersonaArena: Dynamic Simulation for Evaluating and Enhancing Persona-Level Role-Playing in Large Language Models


108. Evidential Information Fusion on Possibilistic Structure


109. Reliability and Effectiveness of Autonomous AI Agents in Supply Chain Management


110. A Conflict-aware Evidential Framework for Reliable Sleep Stage Classification


111. Brain Vascular Age Prediction Using Cerebral Blood Flow Velocity and Machine Learning Algorithms


112. Harnessing AI for Inverse Partial Differential Equation Problems: Past, Present, and Prospects


113. How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study


114. From Static Risk to Dynamic Trajectories: Toward World-Model-Inspired Clinical Prediction


115. TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents


116. NGM: A Plug-and-Play Training-Free Memory Module for LLMs


117. Virtual Nodes Guided Dynamic Graph Neural Network for Brain Tumor Segmentation with Missing Modalities


118. Reasoning Can Be Restored by Correcting a Few Decision Tokens


119. Learning to Learn from Multimodal Experience


120. Artificial Adaptive Intelligence: The Missing Stage Between Narrow and General Intelligence


121. Sketch Then Paint: Hierarchical Reinforcement Learning for Diffusion Multi-Modal Large Language Models


122. Voices in the Loop: Mapping Participatory AI


123. Multi-Paradigm Agent Interaction in Practice:A Systematic Analysis of Generator-Evaluator, ReAct Loop,and Adversarial Evaluation in the buddyMe Framework


124. NeuroMAS: Multi-Agent Systems as Neural Networks with Joint Reinforcement Learning


125. State Contamination in Memory-Augmented LLM Agents


126. Body-Grounded Perspective Formation and Conative Attunement in Artificial Agents


127. PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play


128. A Global-Local Graph Attention Network for Traffic Forecasting


129. Baba in Wonderland: Online Self-Supervised Dynamics Discovery for Executable World Models


130. GRID: Graph Representation of Intelligence Data for Security Text Knowledge Graph Construction


131. Recall Isn’t Enough: Bounding Commitments in Personalized Language Systems


132. Enhancing Metacognitive AI: Knowledge-Graph Population with Graph-Theoretic LLM Enrichment


133. LinAlg-Bench: A Forensic Benchmark Revealing Structural Failure Modes in LLM Mathematical Reasoning


134. Sustainable Intelligence for the Wild: Democratizing Ecological Monitoring via Knowledge-Adaptive Edge Expert Agents


135. TTE-Flash: Accelerating Reasoning-based Multimodal Representations via Think-Then-Embed Tokens


136. PRISMat: Policy-Driven, Permutation-Invariant Autoregressive Material Generation


137. Counterparty Modeling is Not Strategy: The Limits of LLM Negotiators


138. Scalable Uncertainty Reasoning in Knowledge Graphs


139. Skim: Speculative Execution for Fast and Efficient Web Agents


140. From Prompts to Protocols: An AI Agent for Laboratory Automation


141. ANNEAL: Adapting LLM Agents via Governed Symbolic Patch Learning


142. AgentWall: A Runtime Safety Layer for Local AI Agents


143. DashAttention: Differentiable and Adaptive Sparse Hierarchical Attention


144. Code as Agent Harness


145. ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop


146. Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation


147. Predictable Confabulations: Factual Recall by LLMs Scales with Model Size and Topic Frequency


148. DexHoldem: Playing Texas Hold’em with Dexterous Embodied System


149. Semantic Generative Tuning for Unified Multimodal Models


150. Distilling Tabular Foundation Models for Structured Health Data


151. PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications


152. Ensembling Tabular Foundation Models - A Diversity Ceiling And A Calibration Trap


153. Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents


154. Lance: Unified Multimodal Modeling by Multi-Task Synergy


155. COOPO: Cyclic Offline-Online Policy Optimization Algorithm


156. KairosHope: A Next-Generation Time-Series Foundation Model for Specialized Classification via Dual-Memory Architecture


157. Statistical Limits and Efficient Algorithms for Differentially Private Federated Learning


158. Pocket Foundation Models: Distilling TFMs into CPU-Ready Gradient-Boosted Trees


159. An Assessment of Human vs. Model Uncertainty in Soft-Label Learning and Calibration


160. Post-Trained MoE Can Skip Half Experts via Self-Distillation


161. Data Presentation Over Architecture: Resampling Strategies for Credit Risk Prediction with Tabular Foundation Models


162. Position: Weight Space Should Be a First-Class Generative AI Modality


163. CrossView Suite: Harnessing Cross-view Spatial Intelligence of MLLMs with Dataset, Model and Benchmark


164. Stochastic Penalty-Barrier Methods for Constrained Machine Learning


165. ManiSoft: Towards Vision-Language Manipulation for Soft Continuum Robotics


166. SAME: A Semantically-Aligned Music Autoencoder


167. CATA: Continual Machine Unlearning via Conflict-Averse Task Arithmetic


168. Not What You Asked For: Typographic Attacks in Household Robot Manipulation


169. AMARIS: A Memory-Augmented Rubric Improvement System for Rubric-Based Reinforcement Learning


170. Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation


171. Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks


172. LongMINT: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems


173. Estimating Item Difficulty with Large Language Models as Experts


174. Improving BM25 Code Retrieval Under Fixed Generic Tokenization: Adaptive q-Log Odds as a Drop-In BM25 Fix


175. Key-Gram: Extensible World Knowledge for Embodied Manipulation


176. StableHand: Quality-Aware Flow Matching for World-Space Dual-Hand Motion Estimation from Egocentric Video


177. STT-Arena: A More Realistic Environment for Tool-Using with Spatio-Temporal Dynamics


178. Probing for Representation Manifolds in Superposition


179. Continuous Diffusion Scales Competitively with Discrete Diffusion for Language


180. Beyond Morphology: Quantifying the Diagnostic Power of Color Features in Cancer Classification


181. DiPRL: Learning Discrete Programmatic Policies via Architecture Entropy Regularization


182. DBES: A Systematic Benchmark and Metric Suite for Evaluating Expert Specialization in Large-Scale MoEs


183. Modality vs. Morphology: A Framework for Time Series Classification for Biological Signals


184. AI4BayesCode: From Natural Language Descriptions to Validated Modular Stateful Bayesian Samplers


185. GAMMA: Global Bit Allocation for Mixed-Precision Models under Arbitrary Budgets


186. Prompt2Fingerprint: Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation


187. Flowing with Confidence


188. Scheduling That Speaks: An Interpretable Programmatic Reinforcement Learning Framework


189. Modelling Customer Trajectories with Reinforcement Learning for Practical Retail Insights


190. What is Holding Back Latent Visual Reasoning?


191. Building Reliable Arithmetic Multipliers Under NBTI Aging and Process Variations


192. EvoMemBench: Benchmarking Agent Memory from a Self-Evolving Perspective


193. Geometry-Aware Uncertainty Coresets for Robust Visual In-Context Learning in Histopathology


194. Prompts Don’t Protect: Architectural Enforcement via MCP Proxy for LLM Tool Access Control


195. Qumus: Realization of An Embodied AI Quantum Material Experimentalist


196. SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution


197. Diagnosing Korean-Language LLM Political Bias via Census-Grounded Agent Simulation


198. Graph Hierarchical Recurrence for Long-Range Generalization


199. Towards Ubiquitous Mapping and Localization for Dynamic Indoor Environments


200. Probing SMEFT Operators through $t\bar{t}t\bar{t}$ Production with Hyper-Graph Neural Networks at the LHC


201. Beyond Inference-Time Search: Reinforcement Learning Synthesizes Reusable Solvers


202. The Hidden Cost of Contextual Sycophancy: an AI Literacy Intervention in Human-AI Collaboration


203. Optimising CSRNet with parameter-free attention mechanisms for crowd counting in public transport


204. Focused Forcing: Content-Aware Per-Frame KV Selection for Efficient Autoregressive Video Diffusion


205. Same Signal, Different Semantics: A Cross-Framework Behavioral Analysis of Software Engineering Agents


206. Improved Baselines with Representation Autoencoders


207. ISEP: Implicit Support Expansion for Offline Reinforcement Learning via Stochastic Policy Optimization


208. Wasserstein Equilibrium Decoding for Reliable Medical Visual Question Answering


209. Alignment Dynamics in LLM Fine-Tuning


210. PH-Dreamer: A Physics-Driven World Model via Port-Hamiltonian Generative Dynamics


211. CommitDistill: A Lightweight Knowledge-Centric Memory Layer for Software Repositories


212. From Volume to Value: Preference-Aligned Memory Construction for On-Device RAG


213. CodeBind: Decoupled Representation Learning for Multimodal Alignment with Unified Compositional Codebook


214. Machine Unlearning for Masked Diffusion Language Models


215. Privacy Preserving Reinforcement Learning with One-Sided Feedback


216. Multilingual jailbreaking of LLMs using low-resource languages


217. SomaliWeb v1: A Quality-Filtered Somali Web Corpus with a Matched Tokenizer and a Public Language-Identification Benchmark


218. Are Sparse Autoencoder Benchmarks Reliable?


219. Context Memorization for Efficient Long Context Generation


220. A Simplex Witness Certificate for Constant Collapse in Variational Autoencoders



222. SPATIOROUTE: Dynamic Prompt Routing for Zero-Shot Spatial Reasoning


223. Concise and Logically Consistent Conformal Sets for Neuro-Symbolic Concept-Based Models


224. PIPER: Content-Based Table Search via profiling and LLM-Generated Pseudoqueries


225. RGB-only Active 3D Scene Graph Generation for Indoor Mobile Robots


226. Fixed External Cameras as Common Prior Maps for Active 3D Scene Graph Generation


227. MARS: Technical Report for the CASTLE Challenge at EgoVis 2026


228. Self-Evolving Spatial Reasoning in Vision Language Models via Geometric Logic Consistency


229. Vision Inference Former: Sustaining Visual Consistency in Multimodal Large Language Models


230. An Empirical Study of Privacy Leakage Chains via Prompt Injection in Black-Box Chatbot Environments


231. Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models


232. Symmetry-Compatible Principle for Optimizer Design: Embeddings, LM Heads, SwiGLU MLPs, and MoE Routers


233. SENSE: Satellite-based ENergy Synthesis for Sustainable Environment


234. Parameterized 4-Qubit EWL Quantum Game Circuits with Dirac-Solow-Swan Hamiltonian Integration for Quadruple Helix Disruptive Innovation Recommender Systems


235. A-ProS: Towards Reliable Autonomous Programming Through Multi-Model Feedback


236. Improving Spatio-Temporal Residual Error Propagation by Mitigating Over-Squashing


237. FLAG: Foundation model representation with Latent diffusion Alignment via Graph for spatial gene expression prediction


238. Confidence-Gated Robot Autonomy: When Does Uncertainty Actually Help?


239. Exploring Trust Calibration in XAI - The Impact of Exposing Model Limitations to Lay Users


240. PROTEA: Offline Evaluation and Iterative Refinement for Multi-Agent LLM Workflows


241. Quantum Sidecar Architectures for Hybrid AI Training and Inference: Stateful Protected Registers, Stateless Reset-and-Reprepare Circuits and Quantum Weight-State Outlook


242. FedSDR: Federated Self-Distillation with Rectification


243. Interaction-Breaking Adversarial Learning Framework for Robust Multi-Agent Reinforcement Learning


244. Unveiling Memorization-Generalization Coexistence: A Case Study on Arithmetic Tasks with Label Noise


245. See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding


246. TinySAM 2: Extreme Memory Compression for Efficient Track Anything Model


247. SAS: Semantic-aware Sampling for Generative Dataset Distillation


248. Spiker-LL: An Energy-Efficient FPGA Accelerator Enabling Adaptive Local Learning in Spiking Neural Networks


249. Verify-Gated Completion as Admission Control in a Governed Multi-Agent Runtime: A Bounded Architecture Case Study


250. MARR: Module-Adaptive Residual Reconstruction for Low-Bit Post-Training Quantization



252. Stable Audio 3


253. Predictive Prefetching for Retrieval-Augmented Generation


254. LivePI: More Realistic Benchmarking of Agents Against Indirect Prompt Injectio


255. SAFE-SVD: Sensitivity-Aware Fidelity-Enforcing SVD for Physics Foundation Models


256. Babel: Jailbreaking Safety Attention via Obfuscation Distribution Optimized Sampling


257. BLAgent: Agentic RAG for File-Level Bug Localization


258. A More Word-like Image Tokenization for MLLMs


259. Training data attribution in diffusion models via mirrored unlearning and noise-consistent skew


260. BacktestBench: Benchmarking Large Language Models for Automated Quantitative Strategy Backtesting


261. Prompt Compression in Diffusion Large Language Models: Evaluating LLMLingua-2 on LLaDA


262. AdaptiveLoad: Towards Efficient Video Diffusion Transformer Training


263. Domain Transfer Becomes Identifiable via a Single Alignment


264. One Model to Translate Them All: Universal Any-to-Any Translation for Heterogeneous Collaborative Perception


265. DCFold: Efficient Protein Structure Generation with Single Forward Pass


266. Attention Sinks and Outliers in Attention Residuals


267. Multi-agent AI systems outperform human teams in creativity


268. Guard: Scalable Straggler Detection and Node Health Management for Large-Scale Training


269. HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents


270. $\boldsymbol{f}$-OPD: Stabilizing Long-Horizon On-Policy Distillation with Freshness-Aware Control


271. PAREDA: A Multi-Accent Speech Dataset of Natural Language Processing Research Discussions


272. Generating Pretraining Tokens from Organic Data for Data-Bound Scaling


273. Balancing Knowledge Distillation for Imbalance Learning with Bilevel Optimization


274. Temporal Aware Pruning for Efficient Diffusion-based Video Generation


275. Efficient Bilevel Optimization for Meta Label Correction in Noisy Label Learning


276. Content-Style Identification via Differential Independence


277. CounterCount: A Diagnostic Framework for Counting Bias in Vision Language Models


278. Why We Look Where We Look: Emergent Human-like Fixations of a Foveated Visual Language Model Maximizing Scene Understanding


279. TierCheck: Tiered Checkpointing for Fault Tolerance in Large Language Model Training


280. Virtues of Ordered Chaos: Planning with Topple Actions in Tabletop Stack Rearrangement


281. One Model, Two Roles: Emergent Specialization in a Shared Recurrent Transformer


282. Curriculum Group Policy Optimization: Adaptive Sampling for Unleashing the Potential of Text-to-Image Generation


283. Optimal Knock-Pick Planning for Tightly Packed Tabletop Blocks With Parallel Grippers


284. SocialMemBench: Are AI Memory Systems Ready for Social Group Settings?


285. Systematic Evaluation of the Quality of Synthetic Clinical Notes Rephrased by LLMs at Million-Note Scale


286. OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization


287. Bridging the Version Gap: Multi-version Training Improves ICD Code Prediction, Especially for Rare Codes


288. L-Drive: Beyond a Single Mapping-Latent Context Drives Time Series Forecasting


289. Domain Incremental Learning for Pandemic-Resilient Chest X-Ray Analysis


290. Fine-tuning Pocket-Aware Diffusion Models via Denoising Policy Optimization


291. Validate Your Authority: Benchmarking LLMs on Multi-Label Precedent Treatment Classification


292. Attention-Guided Fusion of 1D and 2D CNNs for Robust ECG-Based Biometric Recognition


293. PULSE: Agentic Investigation with Passive Sensing for Proactive Intervention in Cancer Survivorship


294. PEIRA: Learning Predictive Encoders through Inter-View Regressor Alignment


295. Training Infinitely Deep and Wide Transformers


296. LLMForge: Multi-Backend Hardware-Aware Neural Architecture Search with Infinite-Head Attention for Edge Language Models


297. SparseSAM: Structured Sparsification of Activations in Segment Anything Models


298. Multi-task learning on partially labeled datasets via invariant/equivariant semi-supervised learning


299. SynVA: A Modular Toolkit for Vessel Generation and Aneurysm Editing


300. Bayesian-Monte Carlo Schedule Updating for Construction Digital Twins: A Probabilistic Framework for Dynamic Project Forecasting


301. UniAlign: A Model-Agnostic Framework for Robust Network Traffic Classification under Distribution Shifts


302. Beyond Accuracy: Robustness, Interpretability and Expressiveness of EEG Foundation Models


303. Automated Root-Cause Subclassification and No-Code Fix Generation for Invalid Bug Reports


304. Controlling False Discovery in Arbitrarily Structured Hypothesis Spaces via Reproducing Kernels


305. Visual Sculpting: Visually-Aligned Planning Representations for Long-Horizon Robot Clay Sculpting


306. Rethinking Code Review in the Age of AI: A Vision for Agentic Code Review


307. Few-Shot Network Intrusion Detection Using Online Triplet Mining


308. CasualSynth: Generating Structurally Sound Synthetic Data


309. SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering


310. BESplit: Bias-Compensated Split Federated Learning with Evidential Aggregation


311. A Distributional View for Visual Mechanistic Interpretability: KL-Minimal Soft-Constraint Principle


312. Beyond Linear Superposition: Discovering Climate Features in AI Weather Models with KAN-SAE


313. An Interpretable Closed-Loop Intelligent Tutoring System for Multimodal Affective Feedback in Asynchronous Presentation Training


314. Artificial Intelligence can Recognize Whether a Job Applicant is Selling and/or Lying According to Facial Expressions and Head Movements Much More Correctly Than Human Interviewers


315. GCE-MIL: Faithful and Recoverable Evidence for Multiple Instance Learning in Whole-Slide Imaging


316. ContraFix: Agentic Vulnerability Repair via Differential Runtime Evidence and Skill Reuse


317. Spatial Blindness in Whole-Slide Multiple Instance Learning


318. MemRepair: Hierarchical Memory for Agentic Repository-Level Vulnerability Repair


319. Beyond Catalogue Counts: the Dataset Visibility Asymmetry in Low-Resource Multilingual NLP


320. DiagEval: Trajectory-Conditioned Diagnosis for Reliable Software Evaluation with GUI Agents


321. MATE: Solving Contextual Markov Decision Processes with Memory of Accumulated Transition Embeddings


322. Progressive Generalization Augmentation with Deeply Coupled RND-PPO and Domain-Prioritized Noise Injection for Robust Crop Management Reinforcement Learning


323. Learning Displacement-Robust Representations for Landslide Early Warning under Rainfall Forecast Uncertainty


324. Benchmarking Mythos-Linked Bug Rediscovery


325. IVF-TQ: Streaming-Robust Approximate Nearest Neighbor Search via a Codebook-Free Residual Layer


326. Ablating Safety: Mechanisms for Removing Alignment in Language Models for Security Applications


327. Learning Faster with Better Tokens: Parameter-Efficient Vocabulary Adaptation for Specialized Text Summarization


328. FML-bench: A Controlled Study of AI Research Agent Strategies from the Perspective of Search Dynamics


329. \textsc{MasFACT}: Continual Multi-Agent Topology Learning via Geometry-Aware Posterior Transfer


330. Transitivity Meets Cyclicity: Explicit Preference Decomposition for Dynamic Large Language Model Alignment


331. Single-Sample Black-Box Membership Inference Attack against Vision-Language Models via Cross-modal Semantic Alignment


332. LPG: Balancing Efficiency and Policy Reasoning in Latent Policy Guardrails


333. Efficient Feature-Free Initialization for Monocular Visual-Inertial Systems Using a Feed-Forward 3D Model


334. ASPI: Seeking Ambiguity Clarification Amplifies Prompt Injection Vulnerability in LLM Agents


335. TClone: Low-Latency Forking of Live GUI Environments for Computer-Use Agents


336. Learning Higher-Order Structure from Incomplete Spatiotemporal Data: Multi-Scale Hypergraph Laplacians with Neural Refinement


337. Weak-to-Strong Elicitation via Mismatched Wrong Drafts


338. Attention Hijacking: Response Manipulation Across Queries in Vision-Language Models


339. StyleText: A Large-Scale Dataset and Benchmark for Stylized Scene Text Inpainting


340. Deep Reinforcement Learning Framework for Diversified Portfolio Management Across Global Equity Markets


341. ConflictRAG: Detecting and Resolving Knowledge Conflicts in Retrieval Augmented Generation


342. LEAP: Learnable End-to-End Adaptive Pruning of Large Language Models


343. When Efficiency Backfires: Cascading LLMs Trigger Cascade Failure under Adversarial Attack


344. UNR-Explainer: Counterfactual Explanations for Unsupervised Node Representation Learning Models


345. CLAP: Contrastive Latent-space Prompt Optimization for End-to-end Autonomous Driving


346. OProver: A Unified Framework for Agentic Formal Theorem Proving


347. ContractBench: Can LLM Agents Preserve Observation Contracts?


348. Rover: Context-aware Conflict Resolution with LLM


349. How Do Electrocardiogram Models Scale?


350. State-of-the-Art Claims Require State-of-the-Art Evidence


351. Latency-Aware Deep Learning Benchmark for Real-Time Cyber-Physical Attack and Fault Classification in Inverter-Dominated Power Grids


352. Fidelity Probes for Specification–Code Alignment


353. Drift Flow Matching


354. Systematic Evaluation of Vision Transformers for Automated Cervical Cancer Classification: Optimization, Statistical Validation, and Clinical Interpretability



356. Event-Grounded Sparse Autoencoders for Vision-Language-Action Policies


357. PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media


358. MusicSynth: An Automated Pipeline for Generating Violin Fingerboard Animations from Sheet Music Using Optical Music Recognition


359. Beyond Execution: Static-Analysis Rewards and Hint-Conditioned Diffusion RL for Code Generation


360. Why Do Safety Guardrails Degrade Across Languages?


361. OpenJarvis: Personal AI, On Personal Devices


362. Charon: A Unified and Fine-Grained Simulator for Large-Scale LLM Training and Inference


363. STRIDE-AI: A Threat Modeling Framework for Generative AI Security Assessment


364. When Bits Break Recourse: Counterfactual-Faithful Quantization


365. Evolutionary Extreme Learning Machine of ab-initio Energy Landscapes for Crystal Structure Prediction using Manta Ray Optimization with Levy Flight


366. Contrastive Conceptor Activation Steering (COAST): Unlocking Vision-Language-Action Models through Hidden States


367. UCSF-PDGM-VQA: Visual Question Answering dataset for brain tumor MRI interpretation


368. CAM-VFD: Cross-Attention Multimodal Video Forgery Detection


369. A Systematic Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation


370. New Wide-Net-Casting Jailbreak Attacks Risk Large Models


371. The Point of No Return: Counterfactual Localization of Deceptive Commitment in Language-Model Reasoning


372. DynMuon: A Dynamic Spectral Shaping View of Muon


373. SEMA-RAG: A Self-Evolving Multi-Agent Retrieval-Augmented Generation Framework for Medical Reasoning


374. Visual Timelines of Police Encounters in Body-Worn Camera Footage: Operational Context and Activity Cataloging for Training and Analysis in OpenBWC


375. Global Automation Atlas


376. Can LLMs Think Like Consumers? Benchmarking Crowd-Level Reaction Reconstruction with ConsumerSimBench


377. How to Instruct Your Robot: Dense Language Annotations Power Robot Policy Learning


378. S-Bus: Automatic Read-Set Reconstruction for Multi-Agent LLM State Coordination


379. 1GC-7RC: One Graphic Card – Seven Research Challenges! How Good Are AI Agents at Doing Your Job?


380. Agentic AI Translate: An Agentic Translator Prototype for Translation as Communication Design


381. D$^2$Evo: Dual Difficulty-Aware Self-Evolution for Data-Efficient Reinforcement Learning


382. Privacy Policy Enforcement Guardrails for Data-Sensitive Retrieval-Augmented Generation


383. Task Abstention for Large Language Models in Code Generation


384. PARALLAX: Separating Genuine Hallucination Detection from Benchmark Construction Artifacts


385. When Dynamics Shift, Robust Task Inference Wins: Offline Imitation Learning with Behavior Foundation Models Revisited


386. Algorithmic Cultivation: How Social Media Feeds Shape User Language


387. The IsalProgram Programming Language


388. Learning-Zone Energy: Online Data Selection for Efficient RL Post-Training


389. BoLT: A Benchmark to Democratize Black-box Optimization Research for Expensive LLM Tasks


390. Adversarial Fragility and Language Vulnerability in Clinical AI: A Systematic Audit of Diagnostic Collapse Under Imperceptible Perturbations and Cross-Lingual Drift in Low-Resource Healthcare Settings


391. Response-free item difficulty modelling for multiple-choice items with fine-tuned transformers: Component-wise representation and multi-task learning


392. Skills on the Fly: Test-Time Adaptive Skill Synthesis for LLM Agents


393. Extending Pretrained 10-Second ECG Foundation Models to Longer Horizons


394. WhiteTesseract: Reframing the Interpretation of Cultural Heritage through XR and Conversational AI


395. OmniVL-Guard Pro: A Tool-Augmented Agent for Omnibus Vision-Language Forensics


396. Latent Action Control for Reasoning-Guided Unified Image Generation


397. Effort as Ceiling, Not Dial: Reasoning Budget Does Not Modulate Cognitive Cost Alignment Between Humans and Large Reasoning Models


398. Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps


399. The Alpha Illusion: Reported Alpha from LLM Trading Agents Should Not Be Treated as Deployment Evidence


400. DriveSafe: A Framework for Risk Detection and Safety Suggestions in Driving Scenarios


401. Some[Body] Must Receive That Pain for Agent Accountability


402. Metric-Guided Feature Fusion of Visual Foundation Models for Segmentation Tasks


403. Plan First, Diffuse Later: Extrinsic Graph Guidance for Long-Horizon Diffusion Planning


404. Prefix-Adaptive Block Diffusion for Efficient Document Recognition


405. PhysioSeq2Seq: A Hybrid Physiological Digital Twin and Sequence-to-Sequence LSTM for Long-Horizon Glucose Forecasting in Type 1 Diabetes


406. VGGT-CD: Training-Free Robust Registration for 3D Change Detection


407. Pedestrian-Aware LLM-Driven Behavioral Planning for Autonomous Vehicles


408. Thinking with Patterns: Breaking the Perceptual Bottleneck in Visual Planning via Pattern Induction


409. Learning Relative Representations for Fine-Grained Multimodal Alignment with Limited Data


410. Prediction-Intervention Games and Invariant Sets


411. Decoupling KL and Trajectories: A Unified Perspective for SFT, DAgger, Offline RL, and OPD in LLM Distillation


412. Echoes in Filter Bubble: Diagnosing and Curing Popularity Bias in Generative Recommenders


413. AgentKernelArena: Generalization-Aware Benchmarking of GPU Kernel Optimization Agents


414. Observation-Aligned Mask Priors for Learning Physical Dynamics from Authentic Occlusions


415. Cross-modal Affinity-aligned Multimodal Learning Analytics for Predicting Student Collaboration Satisfaction in Game-Based Learning


416. Cross-Domain Molecular Relational Learning: Leveraging Chemical Structure-Activity Analysis


417. 3DPhysVideo: Consistency-Guided Flow SDE for Video Generation via 3D Scene Reconstruction and Physical Simulation


418. TIER: Trajectory-Invariant Execution Rewards for Multi-Step Tool Composition


419. Encoding Robust Topological Signatures for Hyperdimensional Computing


420. A Holistic Method for Superquadric Fitting Using Unsupervised Clustering Analysis


421. Distinguishable Deletion: Unifying Knowledge Erasure and Refusal for Large Language Model Unlearning


422. VolTA-3D: Self-Supervised Learning for Brain MRI using 3D Volumetric Token Alignment


423. CANSURF: An ASV-View Can Dataset and Benchmark for Detection and Tracking of Surface-Level Debris


424. Exploring Lightweight Large Language Models for Court View Generation


425. Learning Unbiased Permutations via Flow Matching


426. UniER: A Unified Benchmark for Item-level and Path-level Exercise Recommendation


427. Genflow Ad Studio: A Compound AI Architecture for Brand-Aligned, Self-Correcting Video Generation


428. EmoMind: Decoding Affective Captions from Human Brain fMRI


429. Universal Dynamics of Punctuated Progress


430. MAVEN A Multi-Agent Framework for Multicultural Text-to-Video Generation


431. GeoWorld-VLM: Geometry from World Models for Vision-Language Models


432. EfficientTDMPC: Improved MPC Objectives for Sample-Efficient Continuous Control


433. CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?


434. Multi-Object Tracking Consistently Improves Wildlife Inference


435. GraViti: Graph-Level Variational Autoencoders with Relaxed Permutation Invariance


436. A Scalable Tool for Measuring Manner and Result Verbs in Developmental Language Research


437. SKG-Eval: Stateful Evaluation of Multi-Turn Dialogue via Incremental Semantic Knowledge Graphs


438. Learning How to Cube


439. \textsc{PrivScope}: Task-scoped Disclosure Control for Hybrid Agentic Systems


440. SLEIGHT-Bench: A Benchmark of Evasion Attacks Against Agent Monitors


441. To Trust or Not to Trust: Authors’ Response to AI-based Reviews


442. PromptDecipher: Supporting AI Tutor Authoring Through Editable Simulated Interactions


443. Why Modeling Human Haptic Material Perception with AI Is Difficult


444. Where Pretraining writes and Alignment reads: the asymmetry of Transformer weight space


445. GRASP: Graph Agentic Search over Propositions for Multi-hop Question Answering


446. How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning


447. Voice ‘‘Cloning’’ is Style Transfer


448. Wavelet Flow Matching for Multi-Scale Physics Emulation


449. Isotonic Survival Regression: Calibrated Survival Distributions from Deep Cox Models


450. Automatic Unsupervised Ensemble Outlier Model Selection–Extended Version


451. Symphony for Speech-to-Text: Supporting Real-Time Medical Voice Interfaces


452. RAPT: Retrieval-Augmented Post-hoc Thresholding for Multi-Label Classification


453. Inventorship in AI-Assisted Inventions: Designing an Experiment to Shape Case Law


454. Hypergraph Pattern Machine: Compositional Tokenization for Higher-Order Interactions



456. Alignment Drift in Long-Term Human-LLM Interaction: A Mechanism-Oriented Framework


457. No Plan, Yet Human: A Reactive Robotics Model Predicts Human Planning Failures on a Clinical Task


458. The Scaling Laws of Skills in LLM Agent Systems


459. Visual Agentic Memory: Enabling Online Long Video Understanding via Online Indexing, Hierarchical Memory, and Agentic Retrieval


460. MoleCode unlocks structural intelligence in large language models



462. LERA: LLM-Enhanced RAG for Ad Auction in Generative Chatbots


463. Strategic Over-Parameterization for Generalizable Low-Rank Adaptation


464. Mechanistically Interpretable Neural Encoding Reveals Fine-Grained Functional Selectivity in Human Visual Cortex


465. MHMamba: Multi-Head Mamba for 3D Brain Tumor Segmentation


466. Asking Back: Interaction-Layer Antidistillation Watermarks


467. Conservative AI for Safety-Sensitive Medical Image Restoration: Residual-Bounded CT-CTA Enhancement for Intracranial Aneurysm-Relevant Signal Recovery


468. Identifiable Token Correspondence for World Models


469. Peak-Detector: Explainable Peak Detection via Instruction-Tuned Large Language Models in Physiological Sign


470. Physics-Guided Geometric Diffusion for Macro Placement Generation


471. PESD-TSF: A Period-Aware and Explicit Structured Decomposition Framework for Long-Term Time Series Forecasting


472. Nested Spatio-Temporal Time Series Forecasting


473. Avoiding Structural Failure Modes in Tabular Fair SSL: Online Primal-Dual Allocation under Confidence Gating


474. Membership Inference Attacks on Discrete Diffusion Language Models


475. Diffusion Attention Expert Model for Predicting and Semi-automatic Localizing STAS in Lung Cancer Histopathological Images


476. Two-Valued Symmetric Circulant Matrices: Applications in Deep Learning


477. Hierarchical Two-Stage Framework for Environment-Aware Long-Horizon Vessel Trajectory Prediction


478. DeepArrhythmia: Segment-Contextualized ECG Arrhythmia Classification via Selective Evidence Acquisition


479. Semantic Smoothing via Novel View Synthesis for Robust SAR Image Classification


480. KVCapsule: Efficient Sequential KV Cache Compression for Vision-Language Models with Asymmetric Redundancy


481. Byzantine-Resilient Federated Learning via QUBO-Based Client Selection on Quantum Annealers


482. The End of Trust: How Agentic AI Breaks Security Assumptions


483. GPU-Accelerated Deep Learning for Heatwave Prediction and Urban Heat Risk Assessment


484. Edge-AI-Driven Learning-to-Rank for Decentralized Task Allocation in Circular Smart Manufacturing


485. MR-SLAM: Immersive Spatial Supervision for Multi-Robot Mapping via Mixed Reality


486. A Theory of Training Profit-Optimal LLMs


487. QuantFPFlow: Quantum Amplitude Estimation for Fokker–Planck Policy Optimisation in Continuous Reinforcement Learning


488. The Impact of AI Search on the Online Content Ecosystem: Evidence from Google and Reddit


489. EAGT: Echocardiography Augmentation for Generalisability and Transferability


490. Orthologic for SAT Solving


491. Agentic Pipeline for Self-Synchronized Multiview Joint Angle Monitoring in Uncalibrated Environments


492. Neural Visual Decoding via Cognitive guided Adaptive Blurring and Information Constrained Alignment


493. CAVE: A Structured Credit Assignment Approach for Fragmented Visual Evidence Reasoning


494. Reducing Hallucination in Vision-Language Models via Stage-wise Preference Optimization under Distribution Shift


495. Support-Safe Variational Hybrid Filtering for Contact-Mode and Sparse-Law Recovery


496. Trajectory-Aware Adaptive Inference in Object Detection Models


497. Vision Transformer-Conditioned UNet for Domain-Adaptive Semantic Segmentation


498. Overcoming the Intrinsic Performance Limitations of MEMS IMU via Diffusion-Based Generative Learning


499. Haptic Rendering of Fractional-Order Viscoelasticity: Passivity and Rendering Fidelity


500. Stabilizing Temporal Inference Dynamics for Online Surgical Phase Recognition