All AI Papers - 2026-05-26

1. MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research


2. From Model Scaling to System Scaling: Scaling the Harness in Agentic AI


3. Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User’s Digital World


4. VeriTrace: Evolving Mental Models for Deep Research Agents


5. Retrying vs Resampling in AI Control


6. L2IR: Revealing Latent Intent in Graph Fraud Detection


7. CITYREP: A Unified Benchmark for Urban Representations Across Cities, Tasks, and Modalities


8. CausaLab: A Scalable Environment for Interactive Causal Discovery Toward AI Scientists


9. Neural Scalable Symbolic Search Framework for Complex Logical Queries with Multiple Free Variables


10. LECTOR: Joint Optimization of Scientific Reasoning Graphs and Introduction Generation


11. Explore Before You Solve: The Speed–Depth Trade-off in Epistemic Agents for ARC-AGI-3


12. $D^2$-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing


13. From Accounting to Coordination: A Virtual Water-Aware Electricity-Computation-Water Nexus Framework for Data Center Dispatch


14. MuCRASP: Multimodal Chain-of-thought Reasoning aware Structured Pruning


15. Behind EvoMap: Characterizing a Self-Evolving Agent-to-Agent Collaboration Network


16. When Can We Trust Early Warnings? Leakage-Excluded Early Outcome Prediction from LMS Interaction Logs


17. Agent-Centric Social Trajectory Prediction: A Free Energy Principle Perspective


18. A Deep Dive into Axiomatic Design – Part I: Problem Formulation


19. Learning to Search and Searching to Learn for Generalization in Planning


20. FLOATBench: A Dataset and Benchmark for Floating Offshore Wind Turbine Tower Fatigue


21. AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions


22. Insuring Every Action: An Authority Frontier Framework for Runtime Actuarial Control of Autonomous AI Agents


23. CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents


24. Back to Parsimonious Latents: Learning Task-Centric World Models from Visual Foundations


25. Detecting Unfaithful Chain-of-Thought via Circuit-Guided Internal-External Discrepancy


26. Uncertainty Reasoning with Large Language Models for Explainable Disease Diagnosis


27. Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching


28. PHGNet: Prototype-Guided Hypergraph Construction for Heterogeneous Spatiotemporal Forecasting


29. ADMFormer: An Adaptive-Decomposition Transformer with Time-Varying Masked Spatial Attention for Traffic Forecasting


30. Personalize-then-Store: Benchmarking and Learning Personalized Memory for Long-horizon Agents


31. StructBreak: Structural Cognitive Overload-Induced Safety Failures in MLLMs


32. What Gets Cited: Competitive GEO in AI Answer Engines


33. Credit Assignment with Resets in Language Model Reasoning


34. ATWL: A Formal Language for Representing, Comparing, and Reusing Visual Analytics Workflows


35. A Signal-Language Foundation Model for Broad-Spectrum Cardiovascular Assessment from Routine Electrocardiography


36. Security of OpenClaw Agents: Fundamentals, Attacks, and Countermeasures


37. CODESKILL: Learning Self-Evolving Skills for Coding Agents


38. Towards end-to-end LLM-based censoring-aware survival analysis


39. Second Guess: Detecting Uncertainty Through Abstention and Answer Stability in Small Language Models


40. Context-CoT: Enhancing Context Learning via High-Quality Reasoning Synthesis


41. AI Cartography: Mapping the Latent Landscape of AI Benchmark Ecosystems


42. Whose Alignment? Comparing LLM Process Alignment Across Diverse Organizational Decision Contexts


43. LipoAgent: Coordinating Fine-Tuned LLM Agents for Safer Lipid Design


44. FrontierOR: Benchmarking LLMs’ Capacity for Efficient Algorithm Design in Large-Scale Optimization


45. Meta-Agent: From Task Descriptions to Verified Multi-Agent Systems


46. Boosting Inference with Guided Reasoning: Stochastic Exploration for Recursive Models


47. DarkForest: Less Talk, Higher Accuracy for Multi-Agent LLMs


48. SpecAlign: A Semantic Alignment Framework for SystemVerilog Assertion Generation


49. SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking


50. Representation Without Control: Testing the Realization Effect in Language Models


51. Beyond the Frontier: Stochastic Backtracking for Efficient Test-Time Scaling


52. Trust but Verify: Prover-Verifier Deliberation for Selective LLM Prediction


53. RECTOR: Priority-Aware Rule-Based Reranking for Compliance-Aware Autonomous Driving Trajectory Selection


54. Evolutionary Enhanced Multi-Agent Reinforcement Learning for Cooperative Air Combat


55. AION: Next-Generation Tasks and Practical Harness for Time Series


56. Privacy-Preserving Local Language Models for Longitudinal Data Retrieval in Chronic Dermatologic Disease: Implementation in Pemphigus Patients


57. NeurIPS: Neuro-anatomical Inductive Priors for Sphere-based Brain Decoding


58. Mitigating Object Hallucinations in Vision-Language Models through Region-Aware Attention Recalibration


59. Towards Multi-Turn Dialog Systems for Industrial Asset Operations and Maintenance


60. Energy Shields for Fairness


61. Noise-Robust Financial Numerical Entity Attribute Tagging


62. ProActor: Timing-Aware Reinforcement Learning for Proactive Task Scheduling Agents


63. TaBIIC2: Interactive Building of Ontological Taxonomies using Weighted Self-Organizing Maps


64. Inverting the Shield: Systematically Generating Safety Tests from Policy Specifications


65. Clustering as Reasoning: A $k$-Means Interpretation of Chain-of-Thought Graph Learning


66. Solving Combinatorial Counting Problems with Weighted First-Order Model Counting


67. Geo-Expert: Towards Expert-Level Geological Reasoning via Parameter-Efficient Fine-Tuning


68. Test-Time Deep Thinking to Explore Implicit Rules


69. Agent Manufacturing: Foundation-Model Agents as First-Class Industrial Entities


70. CoRe-Code: Collaborative Reinforcement Learning for Code Generation


71. PANDO: Efficient Multimodal AI Agents via Online Skill Distillation


72. GRAIL: AI translation for scientists application workflow on satellite data


73. PRIMA: Operational Patterns for Resilient Multi-Agent Research with Verifiable Identity and Convergent Feedback


74. Uncertainty Decomposition via Cyclical SG-MCMC and Soft-label Learning for Subjective NLP


75. Proper Scoring Rules for Agentic Uncertainty Quantification



77. Hylos: Operability Contracts for Model-Native Spatial Intelligence


78. Fundamental Limitation in Explaining AI


79. MDIA: A Multi-Agent Diagnostic Intelligence Pipeline on HealthBench Professional


80. Emotional intelligence in large language models is fragmented across perception, cognition, and interaction


81. Exploration of Perceptual Speech Features for Clinical Decision-Support in Mental Health Care


82. When Mean CE Fails: Median CE Can Better Track Language Model Quality


83. Measuring Reasoning Quality in LLMs: A Multi-Dimensional Behavioral Framework


84. Beyond Inference-Only Deployment: Comparing Weight-Based Consolidation Against Cascading Compaction


85. AVBench: Human-Aligned and Automated Evaluation Benchmark for Audio-Video Generative Models


86. GlobalDentBench: A Multinational Benchmark for Evaluating LLM Clinical Reasoning in Dentistry with Expert Calibration


87. Lattice theory and algebraic models for deep convolutional learning based on mathematical morphology


88. Agent-as-Peer-Debriefer: A Multi-Agent Framework with Perspective-Based Refinement for Qualitative Analysis


89. Hera: Learning Long-Horizon Coordination for Device-Cloud Collaborative LLM Agents


90. Learning to Reason Efficiently with A* Post-Training


91. HeartBeatAI: An Interpretable and Robust Deep Learning Framework for Multi-Label ECG Arrhythmia Detection


92. Associations between echocardiographic traits and AI-ECG predictions of heart failure


93. Summoning the Oracle to Slay It: Mitigating Look-Ahead Bias in Financial Backtesting with Large Language Models


94. Jailbreak to Protect: Buffering and Reinforcing via Temporary Jailbreaking for Safe Fine-Tuning in Large Language Models


95. PALoRA: Projection-Adaptive LoRA for Preserving Reasoning in Large Language Models


96. Beyond Control-Flow: Integrating the Resource Perspective into Multi-Collaborative Process Modeling from Text


97. Emission-Aware Reinforcement Learning for Sustainable Electric Vehicle Charging and Carbon Dioxide Reduction Under Varying Renewable Penetration


98. DemoEvolve: Overcoming Sparse Feedback in Agentic Harness Evolution with Demonstrations


99. Hypothesis Generation and Inductive Inference in Children and Language Models


100. Reasoning as an Attack Surface: Adaptive Evolutionary CoT Jailbreaks for LLMs


101. Market Regime Council for Dynamic Credit Assignment in Multi-Agent LLM Decision Systems


102. TIGER: Text-Informed Generalized Enzyme-Reaction Retrieval


103. AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning


104. SPACE: Unifying Symmetric and Asymmetric Routing Problems for Generalist Neural Solver


105. SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent


106. Benchmarking the Limits of In-Context Reinforcement Learning for Ad-Hoc Teamwork


107. JT-SAFE-V2: Safety-by-Design Foundation Model with World-Context Data


108. The Model Is Not the Product: A Dual-Pillar Architecture for Local-First Psychological Coaching


109. Advancing Graph Few-Shot Learning via In-Context Learning


110. ConceptM$^3$oE: Concept-Guided Multimodal Mixture of Experts for Interpretable Computational Pathology


111. Understanding and Mitigating Premature Confidence for Better LLM Reasoning


112. A governance horizon for ethical-use constraints in open-weight AI models


113. Distilling Game Code World Model Generation into Lightweight Large Language Models


114. Partner-Aware Hierarchical Skill Discovery for Robust Human-AI Collaboration


115. Adaptive Human-AI Coordination via Hierarchical Action Disentanglement


116. When Does Synthetic Patent Data Help? Volume-Fidelity Trade-offs in Low-Resource Multi-Label Classification


117. Safety-Oriented Routing Analysis of Mixtral MoE Under Benign and Harmful Prompts


118. Toward Enactive Artificial Intelligence


119. How Well Do Models Follow Their Constitutions?


120. Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows


121. Identifying and Mitigating Systemic Measurement Bias in Production LLM Inference Benchmarks


122. When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs


123. A Sober Look at Agentic Misalignment in Automated Workflows


124. EPPC-OASIS: Ontology-Aware Adaptation and Structured Inference Refinement for Electronic Patient-Provider Communication Mining in Secure Messages


125. Inference Time Context Sparsity: Illusion or Opportunity?


126. Palette: A Modular, Controllable, and Efficient Framework for On-demand Authorized Safety Alignment Relaxation in LLMs


127. Neuro-Inspired Inverse Learning for Planning and Control


128. HyperGuide: Hyperbolic Guidance for Efficient Multi-Step Reasoning in Large Language Models


129. MAPLE: Multi-State Aggregated Policy Evaluation for AlphaZero in Imperfect-Information Games


130. SkillEvolBench: Benchmarking the Evolution from Episodic Experience to Procedural Skills


131. EvoCode-Bench: Evaluating Coding Agents in Multi-Turn Iterative Interactions


132. Breaking the Chains of Probability: Neutrosophic Logic as a New Framework for Epistemic Uncertainty in Large Language Models


133. EvoSci: A Bio-Inspired Multi-Agent Framework for the Evolution of Scientific Discovery


134. LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition


135. Reason–Imagine–Act: Closed-Loop LLM Decision Making with World Models for Autonomous Driving


136. Towards trustworthy agentic AI: a comprehensive survey of safety, robustness, privacy, and system security


137. Beyond Predefined Learning Objects: A Thinking-Learning Interaction Model for Up-to-Date Autonomous Robot Learning


138. Saturating Scaling Laws for Equational Discovery: A Phenomenology of Growth Dynamics in Three Toy Substrates with Two Real-World Replications


139. Why We Need World Models for AGI: Where LLMs Fail and How World Models May Outperform


140. LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs


141. Low-Cost Labels, Reliable Choices: Rollout-Calibrated Hyper-Heuristics for Job Shop Scheduling


142. QUIVER: A Formal Framework for Quantifying Perturbation Propagation and Bifurcation in Compound AI Systems


143. From Accuracy to Auditability: A Survey of Determinism in Financial AI Systems


144. Machine Psychometrics: A Mathematical Psychology of Artificial Intelligence


145. Methods for Formal Verification of Agent Skills: Three Layers Toward a Mechanically Checkable Capability-Containment Proof


146. Stop Comparing LLM Agents Without Disclosing the Harness


147. Accelerating Long-Tail Generation in Synchronous RLHF Training via Adaptive Tensor Parallelism



149. Spacetime Formation under Requirements: Contextual Realization and Form-Dependent Probability


150. A Dynamical Framework for Cognitive Processes Based on Transformations and Semantic Equivalence


151. MEMOR-E: In-Context and Fine-Tuned LLM Personalization for Alzheimer’s Assistive Robotics


152. Residual Drift Dominates Contradiction in Multi-Turn Constraint Reasoning


153. DRIVE: Modeling Skills at the Reasoning and Interaction Levels for Web Agents under Continual Learning


154. Authority Inversion in LLM-Mediated Ubiquitous Systems: When Models Trust Users Over Sensors


155. BoxLitE: A Faithful Knowledge Base Embedding Based on Convex Optimization


156. Fuzzy, Neutrosophic, and Uncertain Graph Theory: Properties and Applications


157. Operationalizing Reconstructive Authority: Runtime Construction, Dependency Resolution, and Execution Gating in Autonomous Agent Systems


158. Practical Quantum CIM Empowerment via All-Domestic-Core Agentic Large Model


159. When Correct Beliefs Collapse: Epistemic Resilience of LLMs under Clinical Pressure


160. BODHI: Precise OS Kernel Specification Inference


161. Quantum Frog: Emergent Cooperation and Difficulty Scaling in a Quantized-Time Cooperative Game


162. Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs


163. Context: Proactive Goal-Directed Intelligence via Composable Sandboxed Programs, Declarative Wiring, and Structured Interaction


164. How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning


165. Confidence Calibration in Large Language Models


166. In Search of the Ingredients of Open-Endedness: Replicating Picbreeder with Large Vision-Language Models


167. Squeezing Capacity from Multimodal Large Language Models for Subject-driven Generation


168. Beyond Summaries: Structure-Aware Labeling of Code Changes with Large Language Models


169. Language Models Need Sleep


170. OrpQuant: Geometric Orthogonal Residual Projection for Multiplier-Free Power-of-Two Transformer Quantization


171. Channel-wise Vector Quantization


172. StakeBench: Evaluating Language Understanding Grounded in Market Commitment


173. Rethinking Weak Supervision in Anomaly Detection: A Comprehensive Benchmark


174. Conditional KRR: Injecting Unpenalized Features into Kernel Methods with Applications to Kernel Thresholding


175. Neuronal Stochastic Attention Circuit (NSAC) for Probabilistic Representation Learning


176. When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges


177. Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals


178. DRScaffold: Boosting Dense-Scene Reasoning in Lightweight Vision Language Models


179. Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution


180. A Multimodal 3D Foundation Model for Light Sheet Fluorescence Microscopy Enables Few-Shot Segmentation, Classification, and Deblurring


181. Retrieval-Augmented Detection of Potentially Abusive Clauses in Chilean Terms of Service


182. AdvantageFlow: Advantage-Weighted Least Squares for RL in Flow Models


183. Learning in Low-Dimensional Subspaces: Orthogonal Bottlenecks for Reinforcement Learning


184. AI-Assisted Systematization for Evaluating GenAI Systems


185. SafeCtrl-RL: Inference-Time Adaptive Behaviour Control for LLM Dialogue via RL-Driven Prompt Optimisation


186. Creative Quality Alignment: Expert Tacit Knowledge Transfer via Chain-of-Thought Fine-Tuning


187. Continual Speaker Identity Unlearning with Minimal Interference


188. QUIET: A Multi-Blank Cascaded Story Cloze Benchmark for LLM Creative Generation Capability


189. Step-TP: A Grounded, Step-Level Dataset with Chain-of-Thought Reasoning for LLM-Guided Tensor Program Optimization


190. VEN-VL: A Visual Ensemble MoE Framework for Effective and Efficient Multi-Modal Understanding


191. Small Models, Strong Priors: Architectural Inductive Bias for Parameter-Efficient Neural PDE Solvers


192. EchoPilot: Training-Free Ultrasound Video Segmentation via Scale-Space Semantic Prompting and Reliability-Gated Memory


193. From Latent Space to Training Data: Explainable Specialization in Minimal MLPs


194. Quantitative Evaluation of the Severity of Posttraumatic Stress Disorder through Transfer Learning from Specific Phobia Data



196. Causal Tongue-Tie: LLMs Can Encode Causal Direction, But Their Yes/No Outputs Fail to Express


197. MuNet: A Mutualistic Network for Joint 3D Human Mesh Recovery and 3D Clothed Human Reconstruction from Single Images


198. Explaining Too Much? Understanding How Large Language Model Reasoning Traces Influence Performance and Metacognition


199. TIAR: Trajectory-Informed Advantage Reweighting for LLM Abstention Learning


200. Geometric Evolution Maps: Extracting Stable Concept Probes from Transformer Residual Streams


201. TTPrint: Evidence-Grounded TTP Extraction via Diverge-then-Converge Verification


202. Context-Instrumental Data Distillation for Kubernetes Manifest Generation: Method and Experimental Evaluation


203. When Search Becomes Memory: Turning Robot Design Trials into Transferable Skills


204. Clarify, Abstain or Answer? Strategising in Conversation with Belief-Augmented Generation


205. OASIS: Observation-Action Space Alignment via SE(3) Trajectory Prediction for Robotic Manipulation


206. Fine-Tuning Over Architectural Complexity: Broad-Coverage PII Detection on PIIBench with DeBERTa


207. Adaptive Graph Refinement and Label Propagation with LLMs for Cost-Effective Entity Resolution


208. SAMark: A Self-Anchored Text Watermarking with Paragraph-Level Paraphrase Robustness


209. On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits


210. NPSolver: Neural Poisson Solver with Iterative Physics Supervision


211. Efficient Benchmarking Is Just Feature Selection and Multiple Regression


212. MDGMIX: Boundary-Aware Subgraph Mixing for Multi-Domain Graph Pre-Training


213. Concept Unlearning via Cross-Attention Activation Projection for Diffusion Models


214. Benchmarking Pathology Foundation Models for Spatial Domain Understanding


215. DeGRe: Dense-supervised Generative Reranking for Recommendation


216. Multi-Agent Coordination Adaptation via Structure-Guided Orchestration


217. How Should LLMs Consume High-Quality Data? Optimal Data Scheduling via Quality-Aware Functional Scaling Laws


218. Profiling-Driven Adaptive Distributed Transformer Inference on Embedded Edge Deployment


219. Don’t Retrain, Just Reuse: Recovering Dual-Target Molecules from Single-Target Diffusion Models


220. Simulating Human Memory with Language Models


221. Referential Security as a New Paradigm for AI Evaluations


222. Meta-Engineering Harnesses for AI-Native Software Production: A Contract-Driven Adversarial Verification Architecture with Early Deployment Report


223. Posture Clip: Sit properly or I won't let you work


224. AutoSG: LLM-Driven Solver Generation Solely from Task Prompts for Expensive Optimization


225. Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines


226. Towards the Connection between Activation Sparsity and Flat Minima


227. Toward a Benchmark for Controllable Simulation of Imperfect Students with Large Language Models


228. Acting on the Unseen: Communication-Free Collaborative Filtering for Decentralized Multi-Robot Task Allocation


229. Extreme Region Policy Distillation


230. Geometric Flow Matching for Molecular Conformation Generation via Manifold Decomposition


231. Mosaic: Compositional Multi-Concept Erasure via Vector Field Blending


232. PennySynth: RAG-Driven Data Synthesis for Automated Quantum Code Generation


233. Keep the Proof State Live: Snapshotting for Efficient Tactic Search in Lean 4


234. BC Protocol: Structured Dual-Expert Dialogue for Eliciting High-Quality Chain-of-Thought Post-Training Data


235. ‘Si’multaneous ‘S’patial-‘T’emporal Message Passing for Dynamic Graph Representation Learning


236. TopoAlign: Topology-Aware Visual Representation Alignment



238. Cross-Stage Attention Multi-Expert Network for Radiologist-Inspired Breast Ultrasound Diagnosis


239. Generative AI impacts on intra-urban inequality and skill premium in Beijing


240. A Controlled Synthetic Benchmark for Educational Aspect-Based Sentiment Analysis


241. Test-Time Self-Adaptive Conditioning for Stable Audio-Driven Talking-Head Generation


242. EXPO-FT: Sample-Efficient Reinforcement Learning Finetuning for Vision-Language-Action Models


243. IndexMem: Learned KV-Cache Eviction with Latent Memory for Long-Context LLM Inference


244. From Simulation to Enaction: Post-trained language models recognize and react to their own generations


245. AI Content Moderation in Therapy Conversations


246. A Multi-Agent LLM Framework for Rating the Quality of Surgical Feedback


247. Binding Visual Features Point by Point


248. SeqRoute: Global Budget-Aware Sequential LLM Routing via Offline Reinforcement Learning


249. A Token/KV-Cache Communication Media Selection and Resource Allocation Strategy for Multi-Agent Collaboration


250. SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models


251. Autoregression-Free Neural Operators for Time-Dependent PDEs


252. Anatomy-Anchored Self-Supervision: Distilling Vision Foundation Models for Invariant Ultrasound Representation


253. Subspace-Guided Semantic and Topological Invariant Registration for Annotation-Free Ultrasound Plane Quality Control


254. Evo-Attacker: Memory-Augmented Reinforcement Learning for Long-Horizon Tool Attacks on LLM-MAS


255. Weakly Supervised Camouflaged Object Detection Based on the SAM Model and Mask Guidance


256. CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation


257. Adversarial Orthogonal Disentanglement for LVLM Hallucination Mitigation


258. KYA: A Framework-Agnostic Trust Layer for Autonomous Systems with Verifiable Provenance and Hierarchical Policy Composition


259. AI-Associated Lexical Shifts Across 34 Languages: Cross-Lingual Convergence and Diachronic Uptake in News Writing


260. Certified Robustness from Approximate Gaussian Mixture Structures in Pretrained Latent Spaces


261. Parameter-Efficient CT Reconstruction via Deep Graph Laplacian Regularization


262. Parallel Differentiable Reachability for Learning and Planning with Certified Neural Dynamics and Controllers


263. A general tensor-structured compression scheme for efficient large language models


264. CausalFlow: Causal Attribution and Counterfactual Repair for LLM Agent Failures


265. UWM-JEPA: Predictive World Models That Imagine in Belief Space


266. Eureka: Intelligent Feature Engineering for Enterprise AI Cloud Resource Demand Prediction


267. Neuromorphic LiDAR-based Bird’s Eye View Object Detection using Energy-efficient Spiking Neural Networks


268. READER: Reasoning-Enhanced AI-Generated Text Detection


269. Positivity in classical enumerative geometry: a case study in synchronized AI-assisted mathematics


270. Latent Q-Barrier Shielding for Safe In-Context Reinforcement Learning


271. Mimir: Large-scale Multilingual Concept Modeling


272. First, do no harm: Breaking suicidogenic echo chambers in media recommendation


273. Guess the Unified Model: How Much Can We Recover from Generated Images?


274. Quantifying Empirical Compute-Supervision Tradeoffs in RLVR


275. JudgmentBench: Comparing Rubric and Preference Evaluation for Quality Assessment


276. Constraint-Anchored Attribution: Feasibility-Certified Counterfactuals and Bonferroni-PAC Sufficient Subsets for Neural CO Policies


277. On the Epistemic Uncertainty of Overparametrized Neural Networks


278. Specification-Based Code-Text-Code Reengineering for LLM-Mediated Software Evolution


279. Continuous-Depth Field Theory for Transformer Patching and Mechanistic Interpretability


280. Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning


281. Influence-Inspired Spectral Rotations for Extreme Low-Bit LLM Quantization


282. Hide to Guide: Learning via Semantic Masking


283. Beyond Killer Robots: General AI Attitudes and Public Support for Military AI in Nine Countries


284. By Their Fruits You Will Know Them: Comparing Formalizations of Law by the Decisions They Encode


285. Knowledge Graph-Driven Expert-Level Reasoning for Neuroscience


286. Grow-Prune-Freeze Networks: Adaptive & Continual Learning Technique for Olfactory Navigation


287. Methodology for Creating a Clinically Verified Dermoscopic Image Dataset


288. AME-TS: Anchored Mixture-of-Experts for Time Series Forecasting


289. K-U-KAN: Koopman-Enhanced U-KAN for 3D Dental Reconstruction from a Single Panoramic X-ray Radiograph


290. STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media


291. Abduction-Deduction Entanglement: Domain Generalization via Representation Transplants


292. LLM Agent Based Renewable Energy Forecasting Using Edge and IoT Data A Review of Solar Wind Weather and Grid Aware Decision Support


293. ASTRO: Adaptive Spatio-Temporal Reinforcement Optimization for GNN Powered Anomaly Detection in Cyber Physical Systems


294. Theoretical Analysis of Sparse Optimization with Reparameterization, Weight Decay, and Adaptive Learning Rate


295. Inference-Time Alignment of Diffusion Models via Trust-Region Iterative Twisted Sequential Monte Carlo


296. Evidence-Linked Radiology Reporting: A Human-Supervised Reference Architecture for Structured Imaging Intelligence


297. Trust-Aware Joint Feature-Prediction Discrepancy for Robust Domain Adaptation


298. Courant: a State-Adaptive Perceiver-Based Neural Surrogate with Local Support and Interpretable Field Decomposition


299. Uncertainty-DTW for Sequences and Visual Tokens


300. Leveraging Gauge Freedom for Learning Non-Gradient Population Dynamics of Stochastic Systems


301. Multi-Agent Specification-based Metamorphic Testing of FMU-Based Simulations


302. Polynomial Context-Truncation Sensitivity in Autoregressive Language Models: Sequential Wyner-Ziv Bounds for KV Cache Compression


303. Security in the Fine-Tuning Lifecycle of Large Language Models: Threats, Defenses, Evaluation, and Future Directions


304. Cultivating Machine Intelligence: The OMEGA Shift from Top-Down Optimization to Autopoietic Cognitive Ecologies


305. GL-LFGNN: A Global-Local Dual-branch Causal Graph Neural Network Based on Liang-Kleeman Information Flow for EEG Emotion Recognition


306. Intent Signal Theory: A Computational Framework for Intent-State Control in Human-AI Interaction


307. Scale When Needed: Adaptive Neuron-level Mixed Precision Quantization Aware Training


308. TinyFormer: Preserving Tiny Objects in YOLO-DETR Hybrid Real-time Detectors


309. Language Bias in LVLMs: From In-Depth Analysis to Simple and Effective Mitigation


310. D3S2: Diffusion-Guided Dataset Distillation for Semantic Segmentation


311. Performance Comparison of Classical and Neural Sampling Algorithms for Robotic Navigation


312. Metropolis-Scale Resilient and Trustworthy Traffic Flow Inference Using Multi-Source Data


313. Interpretation, Learning, and Empathy as One Constraint: A Residual-Adequacy Architecture with Accountable Abstention


314. Scaling up Energy-Aware Multi-Agent Reinforcement Learning for Mission-Oriented Drone Networks with Individual Reward


315. Selective Test-Time Compute Scaling for Click-Through Rate Prediction via Uncertainty-Triggered Feature Path Exploration


316. Bridging the Gap: Enabling Soft Actor Critic for High Performance Legged Locomotion


317. MinerU-Popo: Universal Post-Processing Model for Structured Document Parsing


318. TGFormer: Towards Temporal Graph Transformer with Auto-Correlation Mechanism


319. OSDTW: Optimal Shared Depth and Task Weighting for Long-Tailed Recognition


320. Cross-Domain Generalization Limits of Vision Foundation Models in Facial Deepfake Detection


321. Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization


322. SEP-Attack: A Simple and Effective Paradigm for Transfer-Based Textual Adversarial Attack


323. APT-Agent: Automated Penetration Testing using Large Language Models


324. RealBench: Benchmarking Data-Driven Numerical Weather Forecasting Under Operational Conditions and Extreme Event Challenges


325. Riemannian-Manifold Steering: Geometry-Aware Generative Autoencoders for Label-Free Steering


326. Your Embedding Model is SMARTer Than You Think


327. HumanEgo: Zero-Shot Robot Learning from Minutes of Human Egocentric Videos


328. Quaternion Self-Attention with Shared Scores


329. Explainable Multi-Task Retinal Imaging Reveals Microvascular Signals for Systemic Risk Stratification in Type 2 Diabetes: A Pilot Study


330. Explainable Retinal Imaging for Prediction of Multi-Organ Dysfunction in Type 2 Diabetes


331. Factorize to Generalize: Retrieval-Guided Invariant-Dynamic Decomposition for Time Series Forecasting


332. On the Impact of Class Imbalance on the Learning Dynamics of Deep Neural Networks: An Intuitive Insight


333. When Reasoning Hurts: Source-Aware Evaluation of Frontier LLMs for Clinical SOAP Note Generation


334. Towards a Universal Causal Reasoner


335. DBPnet: Damper Characteristics-Based Bayesian Physics-Informed Neural Network for Wheel Load Estimation


336. The Concept Allocation Zone: Tracking How Concepts Form Across Transformer Depth


337. Tiny Brains, Giant Impact: Uncovering the Keystone Neurons of LLM with Just a Few Prompts


338. Adversarial Error Correction for Visual Autoregressive Generation


339. Reflect-Guard: Enhancing LLM Safeguards against Adversarial Prompts via Logical Self-Reflection


340. Multiscale Real-Time Object Detection in the NMS-Free Era: A Comparative Performance Evaluation of YOLOv8 and YOLO26


341. Cross-Domain Energy-Guided Diffusion Generation for Off-Dynamics Reinforcement Learning


342. Disentangled Double Machine Learning for Accurate Causal Effect Estimation


343. Zero-Shot Parkinson’s Disease Detection from Speech: Comparing Large Audio and Language Models


344. Divide-and-Conquer Inference for Large-Scale Visual Recognition with Multimodal Large Language Models


345. Parameter-Efficient VLMs for Gastrointestinal Endoscopy: Medical Image Generation and Clinical Visual Question Answering


346. CONF-KV: Confidence-Aware KV Cache Eviction with Mixed-Precision Storage for Long-Horizon LLM


347. Complement Submodular Information Measures for Balanced and Robust Data Selection


348. From Theory to Decision Rule: Calibrating the Noisy-Label Crossover for Vision-Language Model Weak Supervision Across Three Medical-Imaging Benchmarks


349. Leveraging pretrained RGB denoisers for hyperspectral image restoration


350. Spectral Retrieval: Multi-Scale Sinc Convolution over Token Embeddings for Localized Retrieval in LLM Multi-Agent Systems


351. Motion-Compensated Weight Compression


352. Bilevel Optimization of Synthetic Trajectories for Multi-Turn LLM Fine-Tuning


353. Who judges the judges? Governance from metrics: a runtime framework for continuous LLM compliance monitoring


354. World-State Transformations for Neuro-symbolic Interactive Storytelling


355. TS-Skill: A Benchmark for Evaluating Analytical Skills in Time-Series Question Answering


356. The Path Matters: Learning a Token-Commitment Policy for Diffusion Language Models


357. HoloFair: Unified T2I Fairness Evaluation and Fair-GRPO Debiasing


358. Beyond the Aggregation Dilemma: Prior-Retaining Decoupled Learning for Multimodal Graphs


359. Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs


360. VaaWIT: Visual-Aware Adaptation of Large Language Models for Multilingual Web Image Translation


361. CyBOKClaw: Human-in-the-Loop CyBOK Mapping for Cybersecurity Curriculum


362. How Many Tools Should an LLM Agent See? A Chance-Corrected Answer


363. On the Stability and Realizability of Recurrent Polynomial Surrogate Ternary Logic Gate Networks


364. DisDop: Distillation with Domain Priors for Open-Vocabulary Aerial Object Detection


365. Demystifying the Mythos or Disrupting Bugonomics? From Zero-Day Asymmetry to Defender Remediation Throughput


366. Beyond Generative Priors: Minority Sampling with JEPA-Guided Diffusion


367. Phase-Aware Wavelet-Based-Scattering Encoder-Decoder for Dense Predictions


368. Measuring the Depth of LLM Unlearning via Activation Patching


369. Guarded Repair for Harm-Aware Post-hoc Replacement of LLM Mathematical Reasoning


370. Catching MRI outliers: unsupervised detection and localization of MRI artefacts and clinical anomalies using deep learning


371. Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory


372. LAPLEX: The FFT of Learnable Laplace Kernels


373. Polymorphism Is Rotation: Operational Mechanistic Interpretability from a Two-Layer Transformer to Pythia-70m


374. PILOT: Policy-Informed Learned Optimization for Adaptive Deep Network Training


375. PEDESTRIANQA: A Benchmark for Vision-Language Models on Pedestrian Intention and Trajectory Prediction


376. Rethinking Federated Unlearning via the Lens of Memorization


377. AI-Driven Adaptive Adversaries and the Erosion of Cryptographic Trust in Public Key Systems


378. SemanticZip: A Pilot Framework for Lossy Text Compression with LLMs as Semantic Decompressors


379. Is Decentralized AI Governable? From Regulative Policy to Constitutive Protocol


380. TRAFA: Anticipating User Actions to Reduce Errors in Procedural Tasks with Predictive Feedback


381. Grammatically-Guided Sparse Attention for Efficient and Interpretable Transformers


382. Adaptive Punishment for Cooperation in Mixed-Motive Games


383. Φ-Noise: Training-Free Temporal Video Conditioning via Phase-Based Noise Manipulation


384. FoodMonitor: Benchmarking MLLMs for Explainable Compliance Analysis


385. Robust Fuzzy Multi-view Learning under View Conflict


386. Coarse-to-Fine Domain Incremental Learning with Attentive Distillation for Mining Footprint Segmentation in Multispectral Imagery


387. Balancing Fairness, Privacy, and Accuracy: A Multitask Adversarial Framework for Centralized Data-Driven Systems


388. Code2UML: Agentic LLMs with context engineering for scalable software visualization



390. Momentum Streams for Optimizer-Inspired Transformers


391. Batch Normalization Amplifies Memorization and Privacy Risks


392. Generative OOD-regularized Model-based Policy Optimization


393. VectorArk: Learning Practical Image Vectorization with Rounded Polygon Representation


394. MX-SAFE: Versatile Inference- and Training-Proof Microscaling Format with On-the-Fly Exponent and Mantissa Bit Allocation


395. Side-by-side Comparison Amplifies Dialect Bias in Language Models


396. Assessing the Operational Viability of Foundation Models for Time Series Forecasting


397. Treatment Effect Estimation with Differentiated Networked Effect on Graph Data


398. ScaleAcross Explorer: Exploring Communication Optimization for Scale-Across AI Model Training


399. ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale


400. ArtSplat: Feed-Forward Articulated 3D Gaussian Splatting from Sparse Multi-State Uncalibrated Views


401. Enhancing Reliability in LLM-Based Secure Code Generation


402. An Empirical Evaluation of LLM-Generated Code Security Across Prompting Methods


403. Benchmarking Patent Embeddings: A Multi-Task Evaluation of 22 Models Across Retrieval, Classification, and Clustering


404. Concept Drift Adaptation Using Self-Supervised and Reinforcement Learning In Android Malware Detection


405. An Interactive Paradigm for Deep Research


406. CRISP – Clustering-Based Redundancy-Reduced Instance Sampling for Pathology Case Representation and Retrieval


407. Attested Tool-Server Admission: A Security Extension to the Model Context Protocol


408. Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation


409. GIBLy: Improving 3D Semantic Segmentation through an Architecture-Agnostic Lightweight Geometric Inductive Bias Layer


410. Unlocking Apple’s Private Cloud Compute: An Analysis of Privacy-Preserving Artificial Intelligence


411. Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning


412. Towards Evaluation Engineering: An Empirical Study of ML Evaluation Harnesses in the Wild


413. Distributionally Robust Transfer Learning with Structurally Missing Covariates, with Application to Cross-National Cardiac Arrest Prediction


414. Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation


415. Filtered Posterior Mean Collections: A Unified Framework for Analytical Models of Diffusion Generalization


416. AvalancheBench: Evaluating Enterprise Data Agents Through Latent World Recovery


417. Human-AI Collaboration in Science at Scale: A Global Large-scale Randomized Field Experiment


418. Extracting Training Data from Diffusion Language Models via Infilling


419. PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection


420. Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis


421. An Interpretable CF-RL-TOPSIS Fusion Model for Skills-Aware Talent Recommendation


422. Understanding Conversational Patterns in Multi-agent Programming: A Case Study on Fibonacci Game Development


423. Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries


424. MASt3R-Nav: WayPixel Navigation in Relative 3D Maps


425. Overcoming “Physics Shock” in Earth Observation: A Heteroscedastic Uncertainty Framework for PINN-based Flood Inference


426. The Time is Here for Just-in-Time Systems: Challenges and Opportunities


427. Verified SHAP: Provable Bounds for Exact Shapley Values of Neural Networks


428. TRACER: A Semantic-Aware Framework for Fine-Grained Contamination Detection in Code LLMs


429. Not All Transitions Matter: Evidence from PPO


430. When the Manual Lies: A Realistic Benchmark to Evaluate MCP Poisoning Attacks for LLM Agents


431. Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion


432. Federated Learning over Human-Body Communication for On-Body Edge Intelligence: A Survey, Taxonomy, and BODYFED-HBC Scheduling Vignette


433. Spectral Probe-Circuits: A Three-Step Recipe for Identifying Attention-Head Circuits in Pretrained Transformers


434. Signs Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuning


435. Feature Lottery? A Bifurcation Theory of Concept Emergence


436. Cascade-KDE: Robust Time-Series Restoration under Out-of-Distribution Impulse Corruptions


437. Truthful Online Preference Aggregation for LLM Fine-Tuning in Mobile Crowdsourcing


438. More Skills, Worse Agents? Skill Shadowing Degrades Performance When Expanding Skill Libraries


439. Mixture of Complementary Agents for Robust LLM Ensemble


440. A Large-Scale Dataset and Benchmark: Do Protein-Ligand Models Learn Binding Sites or Just Binding Likelihood?


441. LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs


442. Hidden-State Privacy Has an Empty Middle


443. Iterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigation


444. Mode-as-Sequence: Translating Multimodal Motion Prediction into Unified Sequential Mode Modeling


445. WTKO-CNN: Deep Learning Reveals Sequence Motifs Distinguishing Wild-Type and Knockout ATAC-seq Peaks


446. Machine Intelligence that Understands Visual and Linguistic Information and Interacts with Humans and Environments


447. SA-Kura: An Energy-Efficient Systolic Array Accelerator for Locally-Coupled Kuramoto Drift in Diffusion Sampling


448. ActQuant: Sub-4-bit Action-Guided Quantization for Vision-Language-Action Models


449. Remote sensing data imputation using deep learning for multispectral imagery


450. Harnessing AtomisticSkills for Agentic Atomistic Research


451. Diff-Instruct with Diffused Reward: Towards Principled One-step Generator RL


452. IVR-R1: Refining Trajectories through Iterative Visual-Grounded Reasoning in Reinforcement Learning


453. Task-Aligned Self-Supervised Learning for Medical Image Analysis: A Systematic Review and Practical Design Guidelines


454. RAW: Robust Avatar Watermarking – Benchmarking and Baseline


455. Nano World Models: A Minimalist Implementation of Future Video Prediction


456. A World Model of Radiologist Reading for Medical Image Representation Learning


457. MemForest: An Efficient Agent Memory System with Hierarchical Temporal Indexing


458. Parameter Efficient Multi-Class Intelligent Scheduling for Multimodal Online Distributed Industrial Anomaly Detection


459. Metacognition Should Be the Scientific Framework for Bounded and Effective Self-Governance in Generative AI


460. Sensing Intelligence as a Trainable Metamaterial Property


461. TriVAL: A Tri-Validation Framework for Faithful Automatic Optimization Modeling


462. Multi-market value-stacking: Battery control for combined imbalance participation and non-uniform FCR bidding


463. Multimodal Alignment and Preference Optimization for Zero-Shot Conditional RNA Generation


464. AI in the Enterprise: How People Use M365 Copilot Chat


465. EchoDistill: Alignment Noisy-to-Clean Self-Distillation for Robust Audio LLMs


466. SODE: Analyzing Social Dynamics in LLM Agents


467. AI-Driven Controlled Environment Agriculture as Resilient Infrastructure for U.S. Fresh-Produce Supply Chains


468. KT4EQG: Personalized Exercise Question Generation via Knowledge Tracing


469. Catching The Correct Answer Trap: Characterising AI Tutor Blind Spots When Analysing Student Reasoning


470. High-Risk AI Systems and the Problem of Identity in the European AI Act


471. Authority Signals in Claude AI Health Citations: A Descriptive Analysis Using the Authority Signals Framework


472. Artificial Effort


473. Agent-Facing Information Design in LLM Tool Registries


474. VineLM: Trie-Based Fine-Grained Control for Agentic Workflows


475. Raon-Speech Technical Report


476. Document Classification Pattern Recognition via Information Fusion: A Systematic Review of Multimodal and Multiview Representation Approaches


477. AI-Driven Alpha Decay: Algorithmic Homogenization, Reflexive Signal Erosion, and the Paradox of Intelligent Markets


478. Check Your LLM’s Secret Dictionary! Five Lines of Code Reveal What Your LLM Learned (Including What It Shouldn’t Have)



480. LETS Forecast: Learning Embedology for Time Series Forecasting