전체 AI 논문 - 2026-04-21

1. MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval


2. Agentic Forecasting using Sequential Bayesian Updating of Linguistic Beliefs


3. Benchmarking System Dynamics AI Assistants: Cloud Versus Local LLMs on CLD Extraction and Discussion


4. ClawEnvKit: Automatic Environment Generation for Claw-Like Agents


5. OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning


6. LLM Safety From Within: Detecting Harmful Content with Internal Representations


7. WorldDB: A Vector Graph-of-Worlds Memory Engine with Ontology-Aware Write-Time Reconciliation


8. A Generalized Synthetic Control Method for Baseline Estimation in Demand Response Services


9. Using large language models for embodied planning introduces systematic safety risks


10. Six Llamas: Comparative Religious Ethics Through LoRA-Adapted Language Models


11. Learning from Less: Measuring the Effectiveness of RLVR in Low Data and Compute Regimes


12. The implicated scientist: on the role of AI researchers in the development of weapons systems


13. Training and Agentic Inference Strategies for LLM-based Manim Animation Generation


14. One Pass for All: A Discrete Diffusion Model for Knowledge Graph Triple Set Prediction


15. PARM: Pipeline-Adapted Reward Model


16. Toward Zero-Egress Psychiatric AI: On-Device LLM Deployment for Privacy-Preserving Mental Health Decision Support


17. Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence


18. Enhancing Tabular Anomaly Detection via Pseudo-Label-Guided Generation


19. LeGo-Code: Can Modular Curriculum Learning Advance Complex Code Generation? Insights from Text-to-SQL


20. AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation


21. TacticGen: Grounding Adaptable and Scalable Generation of Football Tactics


22. A Control Architecture for Training-Free Memory Use


23. QuantumQA: Enhancing Scientific Reasoning via Physics-Consistent Dataset and Verification-Aware Reinforcement Learning


24. State Transfer Reveals Reuse in Controlled Routing


25. Multi-Agent Systems: From Classical Paradigms to Large Foundation Model-Enabled Futures


26. Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration


27. Stability Implies Redundancy: Delta Attention Selective Halting for Efficient Long-Context Prefilling


28. DSAINet: An Efficient Dual-Scale Attentive Interaction Network for General EEG Decoding


29. Architectural Design Decisions in AI Agent Harnesses


30. Understanding Human Actions through the Lens of Executable Models


31. The Topological Dual of a Dataset: A Logic-to-Topology Encoding for AlphaGeometry-Style Data


32. SELF-EMO: Emotional Self-Evolution from Recognition to Consistent Expression


33. AIT Academy: Cultivating the Complete Agent with a Confucian Three-Domain Curriculum


34. From Fallback to Frontline: When Can LLMs be Superior Annotators of Human Perspectives?


35. A Sugeno Integral View of Binarized Neural Network Inference


36. TPS-CalcBench: A Benchmark and Diagnostic Evaluation Framework for LLM Analytical Calculation Competence in Hypersonic Thermal Protection System Engineering


37. CADMAS-CTX: Contextual Capability Calibration for Multi-Agent Delegation


38. ContraPrompt: Contrastive Prompt Optimization via Dyadic Reasoning Trace Analysis


39. LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent


40. Physics-Informed Causal MDPs for Sequential Constraint Repair in Engineering Simulation Pipelines


41. SPREG: Structured Plan Repair with Entropy-Guided Test-Time Intervention for Large Language Model Reasoning


42. On the Reliability of Computer Use Agents


43. Polysemantic Experts, Monosemantic Paths: Routing as Control in MoEs


44. WebUncertainty: Dual-Level Uncertainty Driven Planning and Reasoning For Autonomous Web Agent


45. Adversarial Arena: Crowdsourcing Data Generation through Interactive Competition


46. Prompt Optimization Enables Stable Algorithmic Collusion in LLM Agents


47. When Vision-Language Models Judge Without Seeing: Exposing Informativeness Bias


48. Contrastive Attribution in the Wild: An Interpretability Analysis of LLM Failures on Realistic Benchmarks


49. Evolutionary Negative Module Pruning for Better LoRA Merging


50. Co-evolving Agent Architectures and Interpretable Reasoning for Automated Optimization


51. Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play


52. Semantic Entanglement in Vector-Based Retrieval: A Formal Framework and Context-Conditioned Disentanglement Pipeline for Agentic RAG Systems


53. Poly-EPO: Training Exploratory Reasoning Models


54. PV-SQL: Synergizing Database Probing and Rule-based Verification for Text-to-SQL Agents


55. Toward Reusability of AI Models Using Dynamic Updates of AI Documentation


56. KnowledgeBerg: Evaluating Systematic Knowledge Coverage and Compositional Reasoning in Large Language Models


57. Characterizing Model-Native Skills


58. DIRCR: Dual-Inference Rule-Contrastive Reasoning for Solving RAVENs


59. Beyond Static Snapshots: A Grounded Evaluation Framework for Language Models at the Agentic Frontier


60. SafeAgent: A Runtime Protection Architecture for Agentic Systems



62. From Admission to Invariants: Measuring Deviation in Delegated Agent Systems


63. SkillGraph: Self-Evolving Multi-Agent Collaboration with Multimodal Graph Topology


64. Towards Shutdownable Agents: Generalizing Stochastic Choice in RL Agents and LLMs


65. Waking Up Blind: Cold-Start Optimization of Supervision-Free Agentic Trajectories for Grounded Visual Perception


66. Language models recognize dropout and Gaussian noise applied to their activations


67. EHRAG: Bridging Semantic Gaps in Lightweight GraphRAG via Hybrid Hypergraph Construction and Retrieval


68. TrafficClaw: Generalizable Urban Traffic Control via Unified Physical Environment Modeling


69. Compiling Deterministic Structure into SLM Harnesses


70. EvoMaster: A Foundational Agent Framework for Building Evolving Autonomous Scientific Agents at Scale


71. STRIDE: Strategic Iterative Decision-Making for Retrieval-Augmented Multi-Hop Question Answering


72. Phase-Scheduled Multi-Agent Systems for Token-Efficient Coordination


73. Beyond Meta-Reasoning: Metacognitive Consolidation for Self-Improving LLM Reasoning


74. LLM-Guided Strategy Synthesis for Scalable Equality Saturation


75. T-DuMpRa: Teacher-guided Dual-path Multi-prototype Retrieval Augmented framework for fine-grained medical image classification


76. Hive: A Multi-Agent Infrastructure for Algorithm- and Task-Level Scaling


77. SOCIA-EVO: Automated Simulator Construction via Dual-Anchored Bi-Level Optimization


78. Formal Foundations of Agentic Business Process Management


79. AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning


80. Knows: Agent-Native Structured Research Representations


81. SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents


82. Efficient Test-Time Scaling via Temporal Reasoning Aggregation


83. LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics


84. HalluClear: Diagnosing, Evaluating and Mitigating Hallucinations in GUI Agents


85. The Continuity Layer: Why Intelligence Needs an Architecture for What It Carries Forward


86. Rectification Difficulty and Optimal Sample Allocation in LLM-Augmented Surveys


87. Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI


88. Yanasse: Finding New Proofs from Deep Vision’s Analogies, Part 1


89. Beyond the Basics: Leveraging Large Language Model for Fine-Grained Medical Entity Recognition


90. Graph-of-Agents: A Graph-based Framework for Multi-Agent LLM Collaboration


91. Local Inconsistency Resolution: The Interplay between Attention and Control in Probabilistic Models


92. If Only My CGM Could Speak: A Privacy-Preserving Agent for Question Answering over Continuous Glucose Data


93. Complementing Self-Consistency with Cross-Model Disagreement for Uncertainty Quantification


94. Understanding and Enforcing Weight Disentanglement in Task Arithmetic


95. Harness as an Asset: Enforcing Determinism via the Convergent AI Agent Framework (CAAF)


96. Mini-BEHAVIOR-Gran: Revealing U-Shaped Effects of Instruction Granularity on Language-Guided Embodied Agents


97. Small Model as Master Orchestrator: Learning Unified Agent-Tool Orchestration with Parallel Subtask Decomposition


98. Rule-VLN: Bridging Perception and Compliance via Semantic Reasoning and Geometric Rectification


99. A phenotype-driven and evidence-governed framework for knowledge graph enrichment and hypotheses discovery in population data


100. MCPO: Mastery-Consolidated Policy Optimization for Large Reasoning Models


101. AutoPKG: An Automated Framework for Dynamic E-commerce Product-Attribute Knowledge Graph Construction


102. LLMs can persuade only psychologically susceptible humans on societal issues, via trust in AI and emotional appeals, amid logical fallacies


103. Playing Psychic: Using Thought Trees to Predict Reasoning Models Accuracy on Coding Tasks


104. Alignment Imprint: Zero-Shot AI-Generated Text Detection via Provable Preference Discrepancy


105. ClimAgent: LLM as Agents for Autonomous Open-ended Climate Science Analysis


106. The Cognitive Penalty: Ablating System 1 and System 2 Reasoning in Edge-Native SLMs for Decentralized Consensus


107. Skilldex: A Package Manager and Registry for Agent Skill Packages with Hierarchical Scope-Based Distribution


108. Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models


109. Step-GRPO: Internalizing Dynamic Early Exit for Efficient Reasoning


110. GRAIL: Autonomous Concept Grounding for Neuro-Symbolic Reinforcement Learning


111. GAMMA-Net: Adaptive Long-Horizon Traffic Spatio-Temporal Forecasting Model based on Interleaved Graph Attention and Multi-Axis Mamba


112. The CTLNet for Shanghai Composite Index Prediction


113. PersonalHomeBench: Evaluating Agents in Personalized Smart Homes


114. Introspection Adapters: Training LLMs to Report Their Learned Behaviors


115. SAVE: A Generalizable Framework for Multi-Condition Single-Cell Generation with Gene Block Attention


116. Machine individuality: Separating genuine idiosyncrasy from response bias in large language models


117. Know When to Trust the Skill: Delayed Appraisal and Epistemic Vigilance for Single-Agent LLMs


118. Don’t Start What You Can’t Finish: A Counterfactual Audit of Support-State Triage in LLM Agents


119. Why Training-Free Token Reduction Collapses: The Inherent Instability of Pairwise Scoring Signals


120. CT Open: An Open-Access, Uncontaminated, Live Platform for the Open Challenge of Clinical Trial Outcome Prediction


121. When Agents Go Quiet: Output Generation Capacity and Format-Cost Separation for LLM Document Synthesis


122. Debate as Reward: A Multi-Agent Reward System for Scientific Ideation via RL Post-Training


123. Evaluating Tool-Using Language Agents: Judge Reliability, Propagation Cascades, and Runtime Mitigation in AgentProp-Bench


124. RankGuide: Tensor-Rank-Guided Routing and Steering for Efficient Reasoning


125. The Query Channel: Information-Theoretic Limits of Masking-Based Explanations


126. Agentic Risk-Aware Set-Based Engineering Design


127. From Subsumption to Satisfiability: LLM-Assisted Active Learning for OWL Ontologies


128. Agentic Frameworks for Reasoning Tasks: An Empirical Study


129. Healthcare AI for Automation or Allocation? A Transaction Cost Economics Framework


130. Support Sufficiency as Consequence-Sensitive Compression in Belief Arbitration


131. Heterogeneous Self-Play for Realistic Highway Traffic Simulation


132. Computational Hermeneutics: Evaluating generative AI as a cultural technology


133. Semantic Consensus: Process-Aware Conflict Detection and Resolution for Enterprise Multi-Agent LLM Systems


134. Governing the Agentic Enterprise: A Governance Maturity Model for Managing AI Agent Sprawl in Business Operations


135. Sessa: Selective State Space Attention


136. Bounded Ratio Reinforcement Learning


137. When Can LLMs Learn to Reason with Weak Supervision?


138. Back into Plato’s Cave: Examining Cross-modal Representational Convergence at Scale


139. A multimodal and temporal foundation model for virtual patient representations at healthcare system scale


140. Latent Phase-Shift Rollback: Inference-Time Error Correction via Residual Stream Monitoring and KV-Cache Steering


141. Transition-Matrix Regularization for Next Dialogue Act Prediction in Counselling Conversations


142. Symbolic Synthesis for LTLf+ Obligations


143. IDOBE: Infectious Disease Outbreak forecasting Benchmark Ecosystem


144. Different Paths to Harmful Compliance: Behavioral Side Effects and Mechanistic Divergence Across LLM Jailbreaks


145. Document-as-Image Representations Fall Short for Scientific Retrieval


146. Learning the Riccati solution operator for time-varying LQR via Deep Operator Networks


147. Faster by Design: Interactive Aerodynamics via Neural Surrogates Trained on Expert-Validated CFD


148. LQM: Linguistically Motivated Multidimensional Quality Metrics for Machine Translation


149. Adversarial Humanities Benchmark: Results on Stylistic Robustness in Frontier Model Safety


150. Asset Harvester: Extracting 3D Assets from Autonomous Driving Logs for Simulation


151. An Integrated Deep-Learning Framework for Peptide-Protein Interaction Prediction and Target-Conditioned Peptide Generation with ConGA-PePPI and TC-PepGen


152. Progressive Online Video Understanding with Evidence-Aligned Timing and Transparent Decisions


153. ProtoCLIP: Prototype-Aligned Latent Refinement for Robust Zero-Shot Chest X-Ray Classification


154. Revisiting Change VQA in Remote Sensing with Structured and Native Multimodal Qwen Models


155. AlphaContext: An Evolutionary Tree-based Psychometric Context Generator for Creativity Assessment


156. Randomly Initialized Networks Can Learn from Peer-to-Peer Consensus


157. IceBreaker for Conversational Agents: Breaking the First-Message Barrier with Personalized Starters


158. Dissecting AI Trading: Behavioral Finance and Market Bubbles


159. Tight Auditing of Differential Privacy in MST and AIM


160. AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation


161. Multilingual Training and Evaluation Resources for Vision-Language Models


162. EVE: Verifiable Self-Evolution of MLLMs via Executable Visual Transformations


163. On the Importance and Evaluation of Narrativity in Natural Language AI Explanations


164. Long-Text-to-Image Generation via Compositional Prompt Decomposition


165. DocQAC: Adaptive Trie-Guided Decoding for Effective In-Document Query Auto-Completion


166. Style-Based Neural Architectures for Real-Time Weather Classification


167. Towards Disentangled Preference Optimization Dynamics Beyond Likelihood Displacement


168. Semantic-based Distributed Learning for Diverse and Discriminative Representations



170. Evaluating Multi-Hop Reasoning in RAG Systems: A Comparison of LLM-Based Retriever Evaluation Strategies


171. Aether: Network Validation Using Agentic AI and Digital Twin


172. Is SAM3 ready for pathology segmentation?


173. WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models


174. Scalable Neighborhood-Based Multi-Agent Actor-Critic


175. Committed SAE-Feature Traces for Audited-Session Substitution Detection in Hosted LLMs


176. STaD: Scaffolded Task Design for Identifying Compositional Skill Gaps in LLMs


177. Copy-as-Decode: Grammar-Constrained Parallel Prefill for LLM Editing


178. Beyond Reproduction: A Paired-Task Framework for Assessing LLM Comprehension and Creativity in Literary Translation


179. MM-JudgeBias: A Benchmark for Evaluating Compositional Biases in MLLM-as-a-Judge


180. Does “Do Differentiable Simulators Give Better Policy Gradients?’’ Give Better Policy Gradients?


181. Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations


182. Region-Grounded Report Generation for 3D Medical Imaging: A Fine-Grained Dataset and Graph-Enhanced Framework


183. AQPIM: Breaking the PIM Capacity Wall for LLMs with In-Memory Activation Quantization


184. Soft Label Pruning and Quantization for Large-Scale Dataset Distillation


185. Depth Registers Unlock W4A4 on SwiGLU: A Reader/Generator Decomposition


186. TLoRA: Task-aware Low Rank Adaptation of Large Language Models


187. The Collaboration Gap in Human-AI Work


188. Autonomous Unmanned Aircraft Systems for Enhanced Search and Rescue of Drowning Swimmers: Image-Based Localization and Mission Simulation


189. Mix and Match: Context Pairing for Scalable Topic-Controlled Educational Summarisation


190. Implicit neural representations as a coordinate-based framework for continuous environmental field reconstruction from sparse ecological observations


191. Class-specific diffusion models improve military object detection in a low-data domain


192. ExAI5G: A Logic-Based Explainable AI Framework for Intrusion Detection in 5G Networks


193. First, Do No Harm (With LLMs): Mitigating Racial Bias via Agentic Workflows


194. RASP-Tuner: Retrieval-Augmented Soft Prompts for Context-Aware Black-Box Optimization in Non-Stationary Environments


195. Diversity Collapse in Multi-Agent LLM Systems: Structural Coupling and Collective Failure in Open-Ended Idea Generation


196. Latent Fourier Transform


197. RAVEN: Retrieval-Augmented Vulnerability Exploration Network for Memory Corruption Analysis in User Code and Binary Programs


198. How Much Cache Does Reasoning Need? Depth-Cache Tradeoffs in KV-Compressed Transformers


199. Heterogeneity in Formal Linguistic Competence of Language Models: Is Data the Real Bottleneck?


200. HEALing Entropy Collapse: Enhancing Exploration in Few-Shot RLVR via Hybrid-Domain Entropy Dynamics Alignment


201. Brain-Inspired Capture: Evidence-Driven Neuromimetic Perceptual Simulation for Visual Decoding


202. Prompting Foundation Models for Zero-Shot Ship Instance Segmentation in SAR Imagery


203. Learning to Correct: Calibrated Reinforcement Learning for Multi-Attempt Chain-of-Thought


204. Bayesian Active Learning with Gaussian Processes Guided by LLM Relevance Scoring for Dense Passage Retrieval


205. LoReC: Rethinking Large Language Models for Graph Data Analysis


206. Can Explicit Physical Feasibility Benefit VLA Learning? An Empirical Study


207. LEPO: \underline{L}atent R\underline{e}asoning \underline{P}olicy \underline{O}ptimization for Large Language~Models


208. Latent Preference Modeling for Cross-Session Personalized Tool Calling


209. Latent Abstraction for Retrieval-Augmented Generation


210. Periodic Steady-State Control of a Handkerchief-Spinning Task Using a Parallel Anti-Parallelogram Tendon-driven Wrist


211. On the Emergence of Syntax by Means of Local Interaction


212. AI Approach for MRI-only Full-Spine Vertebral Segmentation and 3D Reconstruction in Paediatric Scoliosis


213. Learning from AVA: Early Lessons from a Curated and Trustworthy Generative AI for Policy and Development Research


214. A novel LSTM music generator based on the fractional time-frequency feature extraction


215. PDDL-Mind: Large Language Models are Capable on Belief Reasoning with Reliable State Tracking


216. Do LLMs Need to See Everything? A Benchmark and Study of Failures in LLM-driven Smartphone Automation using Screentext vs. Screenshots


217. Understanding Secret Leakage Risks in Code LLMs: A Tokenization Perspective


218. Party Autonomy in Determining the Law Applicable to Non-contractual Obligations concerning Cross-Border Data Transfers


219. Ranking Abuse via Strategic Pairwise Data Perturbations


220. Bridging the Reasoning Gap in Vietnamese with Small Language Models via Test-Time Scaling


221. DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantization


222. AnchorRefine: Synergy-Manipulation Based on Trajectory Anchor and Residual Refinement for Vision-Language-Action Models


223. Forget What Matters, Keep the Rest: Selective Unlearning of Informative Tokens


224. SPENCE: A Syntactic Probe for Detecting Contamination in NL2SQL Benchmarks


225. Reverse Constitutional AI: A Framework for Controllable Toxic Data Generation via Probability-Clamped RLAIF


226. Community-Led AI Integration for Wildfire Risk Assessment: A Participatory AI Literacy and Explainability Integration (PALEI) Framework in Los Angeles, CA


227. MHSafeEval: Role-Aware Interaction-Level Evaluation of Mental Health Safety in Large Language Models


228. Voronoi-guided Bilateral 2D Gaussian Splatting for Arbitrary-Scale Hyperspectral Image Super-Resolution


229. RePrompT: Recurrent Prompt Tuning for Integrating Structured EHR Encoders with Large Language Models


230. GeGS-PCR: Effective and Robust 3D Point Cloud Registration with Two-Stage Color-Enhanced Geometric-3DGS Fusion


231. Concurrent Criterion Validation of a Validity Screen for LLM Confidence Signals via Selective Prediction


232. Screen Before You Interpret: A Portable Validity Protocol for Benchmark-Based LLM Confidence Signals


233. Before You Interpret the Profile: Validity Scaling for LLM Metacognitive Self-Report


234. WISV: Wireless-Informed Semantic Verification for Distributed Speculative Decoding in Device-Edge LLM Inference


235. CAPO: Counterfactual Credit Assignment in Sequential Cooperative Teams


236. SafeAnchor: Preventing Cumulative Safety Erosion in Continual Domain Adaptation of Large Language Models



238. ATLAS: Constitution-Conditioned Latent Geometry and Redistribution Across Language Models and Neural Perturbation Data


239. Semantic Density Effect (SDE): Maximizing Information Per Token Improves LLM Accuracy


240. Video-Robin: Autoregressive Diffusion Planning for Intent-Grounded Video-to-Music Generation


241. On The Mathematics of the Natural Physics of Optimization


242. Provable Coordination for LLM Agents via Message Sequence Charts


243. STEP-PD: Stage-Aware and Explainable Parkinson’s Disease Severity Classification Using Multimodal Clinical Assessments


244. Polarization and Integration in Global AI Research


245. Terminal Wrench: A Dataset of 331 Reward-Hackable Environments and 3,632 Exploit Trajectories


246. AIRA: AI-Induced Risk Audit: A Structured Inspection Framework for AI-Generated Code


247. DGSSM: Diffusion guided state-space models for multimodal salient object detection


248. How Much Data is Enough? The Zeta Law of Discoverability in Biomedical Data, featuring the enigmatic Riemann zeta function


249. PBSBench: A Multi-Level Vision-Language Framework and Benchmark for Hematopathology Whole Slide Image Interpretation


250. Causal-Temporal Event Graphs: A Formal Model for Recursive Agent Execution Traces


251. SVL: Goal-Conditioned Reinforcement Learning as Survival Learning


252. OPSDL: On-Policy Self-Distillation for Long-Context Language Models


253. Atomic Decision Boundaries: A Structural Requirement for Guaranteeing Execution-Time Admissibility in Autonomous Systems


254. Learning Unanimously Acceptable Lotteries via Queries


255. RS-HyRe-R1: A Hybrid Reward Mechanism to Overcome Perceptual Inertia for Remote Sensing Images Understanding


256. Generative AI Technologies, Techniques & Tensions: A Primer


257. A Probabilistic Consensus-Driven Approach for Robust Counterfactual Explanations


258. Dual-Anchoring: Addressing State Drift in Vision-Language Navigation


259. Project Prometheus: Bridging the Intent Gap in Agentic Program Repair via Reverse-Engineered Executable Specifications


260. Agentic Education: Using Claude Code to Teach Claude Code


261. Beyond the Bellman Fixed Point: Geometry and Fast Policy Identification in Value Iteration


262. MoVE: Translating Laughter and Tears via Mixture of Vocalization Experts in Speech-to-Speech Translation


263. Self-Consistency from Only Two Samples: CoT-PoT Ensembling for Efficient LLM Reasoning


264. Jupiter-N Technical Report


265. Long-CODE: Isolating Pure Long-Context as an Orthogonal Dimension in Video Evaluation


266. TransXion: A High-Fidelity Graph Benchmark for Realistic Anti-Money Laundering


267. Project resilience as network robustness


268. Reward Score Matching: Unifying Reward-based Fine-tuning for Flow and Diffusion Models


269. The Open-Weight Paradox: Why Restricting Access to AI Models May Undermine the Safety It Seeks to Protect


270. DuConTE: Dual-Granularity Text Encoder with Topology-Constrained Attention for Text-attributed Graphs


271. Speculative Decoding for Autoregressive Video Generation


272. MESA: A Training-Free Multi-Exemplar Deep Framework for Restoring Ancient Inscription Textures


273. Study and Improvement of Search Algorithms in Multi-Player Perfect-Information Games


274. Towards Generalizable Deepfake Image Detection with Vision Transformers


275. When Text Hijacks Vision: Benchmarking and Mitigating Text Overlay-Induced Hallucination in Vision Language Models


276. ArgBench: Benchmarking LLMs on Computational Argumentation Tasks


277. PsychBench: Auditing Epidemiological Fidelity in Large Language Model Mental Health Simulations


278. Still Between Us? Evaluating and Improving Voice Assistant Robustness to Third-Party Interruptions


279. Robust Diabetic Retinopathy Grading Using Dual-Resolution Attention-Based Deep Learning with Ordinal Regression


280. Rethinking the Comparison Unit in Sequence-Level Reinforcement Learning: An Equal-Length Paired Training Framework from Loss Correction to Sample Construction


281. Signal or Noise in Multi-Agent LLM-based Stock Recommendations?


282. SigGate-GT: Taming Over-Smoothing in Graph Transformers via Sigmoid-Gated Attention


283. Calibrated? Not for Everyone: How Sexual Orientation and Religious Markers Distort LLM Accuracy and Confidence in Medical QA


284. A Survey of Reinforcement Learning for Large Language Models under Data Scarcity: Challenges and Solutions


285. RoTRAG: Rule of Thumb Reasoning for Conversation Harm Detection with Retrieval-Augmented Generation


286. Chaos-Enhanced Prototypical Networks for Few-Shot Medical Image Classification


287. Cat-DPO: Category-Adaptive Safety Alignment


288. Probabilistic Programs of Thought


289. Clover: A Neural-Symbolic Agentic Harness with Stochastic Tree-of-Thoughts for Verified RTL Repair


290. HorizonBench: Long-Horizon Personalization with Evolving Preferences


291. Fully Analog Resonant Recurrent Neural Network via Metacircuit


292. Instinct vs. Reflection: Unifying Token and Verbalized Confidence in Multimodal Large Models


293. What Security and Privacy Transparency Users Need from Consumer-Facing Generative AI


294. Fractal Characterization of Low-Correlation Signals in AI-Generated Image Detection


295. HORIZON: A Benchmark for In-the-wild User Behaviour Modeling


296. REZE: Representation Regularization for Domain-adaptive Text Embedding Pre-finetuning


297. Seeing Isn’t Believing: Mitigating Belief Inertia via Active Intervention in Embodied Agents


298. DORA Explorer: Improving the Exploration Ability of LLMs Without Training


299. HeadRank: Decoding-Free Passage Reranking via Preference-Aligned Attention Heads


300. Enhancing Zero-shot Personalized Image Aesthetics Assessment with Profile-aware Multimodal LLM


301. Region-Affinity Attention for Whole-Slide Breast Cancer Classification in Deep Ultraviolet Imaging


302. Dynamics of Cognitive Heterogeneity: Investigating Behavioral Biases in Multi-Stage Supply Chains with LLM-Based Simulation


303. Cross-Modal Attention Analysis and Optimization in Vision-Language Models: A Study on Visual Reliability


304. DREAM: Dynamic Retinal Enhancement with Adaptive Multi-modal Fusion for Expert Precision Medical Report Generation


305. CDSA-Net:Collaborative Decoupling of Vascular Structure and Background for High-Fidelity Coronary Digital Subtraction Angiography


306. Demystifying the unreasonable effectiveness of online alignment methods


307. Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization


308. Persona-Based Requirements Engineering for Explainable Multi-Agent Educational Systems: A Scenario Simulator for Clinical Reasoning Training


309. Layer-wise MoE Routing Locality under Shared-Prefix Code Generation: Token-Identity Decomposition and Compile-Equivalent Fork Redundancy


310. Decentralised Trust and Security Mechanisms for IoT Networks at the Edge: A Comprehensive Review


311. Intent-aligned Autonomous Spacecraft Guidance via Reasoning Models


312. RosettaSearch: Multi-Objective Inference-Time Search for Protein Sequence Design


313. CCCL: In-GPU Compression-Coupled Collective Communication


314. Systematic Capability Benchmarking of Frontier Large Language Models for Offensive Cyber Tasks



316. The Consensus Trap: Rescuing Multi-Agent LLMs from Adversarial Majorities via Token-Level Collaboration


317. CASCADE: A Cascaded Hybrid Defense Architecture for Prompt Injection Detection in MCP-Based Systems


318. The Topological Trouble With Transformers


319. A Two-Stage Deep Learning Framework for Segmentation of Ten Gastrointestinal Organs from Coronal MR Enterography


320. HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads


321. Beyond Word Boundaries: A Hebrew Coreference Benchmark and an Evaluation Protocol for Morphologically Complex Text


322. TensorHub: Rethinking AI Model Hub with Tensor-Centric Compression


323. Configuration Over Selection: Hyperparameter Sensitivity Exceeds Model Differences in Open-Source LLMs for RTL Generation


324. Comparing Human and Large Language Model Interpretation of Implicit Information


325. Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL


326. RLM-on-KG: Heuristics First, LLMs When Needed: Adaptive Retrieval Control over Mention Graphs for Scattered Evidence


327. mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval


328. Efficient Task Adaptation in Large Language Models via Selective Parameter Optimization


329. Where is the Mind? Persona Vectors and LLM Individuation


330. The Instrumental Dissolution of Typing: Why AI Challenges the Keyboard Era in Knowledge Work


331. Beyond Black-Box Labels: Interpretable Criteria for Diagnosing SubjectiveNLP Tasks


332. Beyond Static Benchmarks: Synthesizing Harmful Content via Persona-based Simulation for Robust Evaluation


333. Improving LLM Code Reasoning via Semantic Equivalence Self-Play with Formal Verification


334. MobileAgeNet: Lightweight Facial Age Estimation for Mobile Deployment


335. Inductive Convolution Nuclear Norm Minimization for Tensor Completion with Arbitrary Sampling


336. Bolzano: Case Studies in LLM-Assisted Mathematical Research


337. In-Context Learning Under Regime Change


338. Light-Adapted Electroretinogram and Oscillatory Potentials (LEOPs) Dataset for Autism Spectrum Disorder and Typically Developing Individuals


339. Evaluating Multimodal LLMs for Inpatient Diagnosis: Real-World Performance, Safety, and Cost Across Ten Frontier Models


340. NaviFormer: A Deep Reinforcement Learning Transformer-like Model to Holistically Solve the Navigation Problem


341. Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning


342. Multi-stage Planning for Multi-target Surveillance using Aircrafts Equipped with Synthetic Aperture Radars Aware of Target Visibility


343. Training-inference input alignment outweighs framework choice in longitudinal retinal image prediction


344. Hybrid Quantum Neural Networks for Enhanced Breast Cancer Thermographic Classification: A Novel Quantum-Classical Integration Approach


345. MEMRES: A Memory-Augmented Resolver with Confidence Cascade for Agentic Python Dependency Resolution


346. D-QRELO: Training- and Data-Free Delta Compression for Large Language Models via Quantization and Residual Low-Rank Approximation


347. Adaptive receptive field-based spatial-frequency feature reconstruction network for few-shot fine-grained image classification


348. CoGR-MoE: Concept-Guided Expert Routing with Consistent Selection and Flexible Reasoning for Visual Question Answering


349. Test-Time Adaptation for EEG Foundation Models: A Systematic Study under Real-World Distribution Shifts


350. Noise-Adaptive Diffusion Sampling for Inverse Problems Without Task-Specific Tuning


351. PRISM: Probing Reasoning, Instruction, and Source Memory in LLM Hallucinations


352. ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design


353. Physics-Informed Tracking (PIT)


354. SinkRouter: Sink-Aware Routing for Efficient Long-Context Decoding in Large Language and Multimodal Models


355. Incentivizing Parametric Knowledge via Reinforcement Learning with Verifiable Rewards for Cross-Cultural Entity Translation


356. Governed MCP: Kernel-Level Tool Governance for AI Agents via Logit-Based Safety Primitives


357. Applications of deep generative models to DNA reaction kinetics and to cryogenic electron microscopy


358. Refinement of Accelerated Demonstrations via Incremental Iterative Reference Learning Control for Fast Contact-Rich Imitation Learning


359. TowerDataset: A Heterogeneous Benchmark for Transmission Corridor Segmentation with a Global-Local Fusion Framework


360. enclawed: A Configurable, Sector-Neutral Hardening Framework for Single-User AI Assistant Gateways


361. Lorentz Framework for Semantic Segmentation


362. The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation


363. SafeDream: Safety World Model for Proactive Early Jailbreak Detection


364. Hierarchical Vision Transformer Enhanced by Graph Convolutional Network for Image Classification


365. Self-Reinforcing Controllable Synthesis of Rare Relational Data via Bayesian Calibration


366. AutoOR: Scalably Post-training LLMs to Autoformalize Operations Research Problems


367. Bias in the Loop: Auditing LLM-as-a-Judge for Software Engineering


368. When Informal Text Breaks NLI: Tokenization Failure, Distribution Shift, and Targeted Mitigations


369. Bridging Coarse and Fine Recognition: A Hybrid Approach for Open-Ended Multi-Granularity Object Recognition in Interactive Educational Games


370. FairNVT: Improving Fairness via Noise Injection in Vision Transformers


371. Federation over Text: Insight Sharing for Multi-Agent Reasoning


372. Representation Before Training: A Fixed-Budget Benchmark for Generative Medical Event Models


373. StageMem: Lifecycle-Managed Memory for Language Models


374. The Reliance Negotiation Framework: A Dynamic Process Model of Student LLM Engagement in Academic Writing


375. CapSeal: Capability-Sealed Secret Mediation for Secure Agent Execution


376. Frozen Vision Transformers for Dense Prediction on Small Datasets: A Case Study in Arrow Localization


377. Mitigating Prompt-Induced Cognitive Biases in General-Purpose AI for Software Engineering


378. TriTS: Time Series Forecasting from a Multimodal Perspective


379. Evaluating Adaptive Personalization of Educational Readings with Simulated Learners


380. Reducing Peak Memory Usage for Modern Multimodal Large Language Model Pipelines


381. Agentic Large Language Models for Training-Free Neuro-Radiological Image Analysis


382. Late Fusion Neural Operators for Extrapolation Across Parameter Space in Partial Differential Equations


383. Scalable and Adaptive Parallel Training of Graph Transformer on Large Graphs


384. The impact of postediting on AI generative translation in Yemeni context: Translating literary prose by ChatGPT


385. LOD-Net: Locality-Aware 3D Object Detection Using Multi-Scale Transformer Network


386. No-Worse Context-Aware Decoding: Preventing Neutral Regression in Context-Conditioned Generation


387. Graph Transformer-Based Pathway Embedding for Cancer Prognosis


388. Rewind-IL: Online Failure Detection and State Respawning for Imitation Learning


389. KAIROS: Stateful, Context-Aware Power-Efficient Agentic Inference Serving


390. ReconVLA: An Uncertainty-Guided and Failure-Aware Vision-Language-Action Framework for Robotic Control


391. Cross-Modal Bayesian Low-Rank Adaptation for Uncertainty-Aware Multimodal Learning


392. A Two-Stage Multi-Modal MRI Framework for Lifespan Brain Age Prediction


393. AdaExplore: Failure-Driven Adaptation and Diversity-Preserving Search for Efficient Kernel Generation


394. Aligning Backchannel and Dialogue Context Representations via Contrastive LLM Fine-Tuning


395. Beyond Feature Fusion: Contextual Bayesian PEFT for Multimodal Uncertainty Estimation


396. Spotlights and Blindspots: Evaluation Machine-Generated Text Detection


397. Human Cognition in Machines: A Unified Perspective of World Models


398. Randomized Antipodal Search Done Right for Data Pareto Improvement of LLM Unlearning


399. Global Attention with Linear Complexity for Exascale Generative Data Assimilation in Earth System Prediction


400. Hybrid Spectro-Temporal Fusion Framework for Structural Health Monitoring


401. MambaKick: Early Penalty Direction Prediction from HAR Embeddings


402. Real-Time Visual Attribution Streaming in Thinking Model


403. A Systematic Survey and Benchmark of Deep Learning for Molecular Property Prediction in the Foundation Model Era


404. The Global Neural World Model: Spatially Grounded Discrete Topologies for Action-Conditioned Planning


405. Certified Program Synthesis with a Multi-Modal Verifier


406. POLAR: Online Learning for LoRA Adapter Caching and Routing in Edge LLM Serving


407. Camo-M3FD: A New Benchmark Dataset for Cross-Spectral Camouflaged Pedestrian Detection


408. NCO4CVRP: Neural Combinatorial Optimization for the Capacitated Vehicle Routing Problem


409. Continuous ageing trajectory representations for knee-aware lifetime prediction of lithium-ion batteries across heterogeneous dataset


410. Towards Trustworthy Depression Estimation via Disentangled Evidential Learning


411. Multilevel neural networks with dual-stage feature fusion for human activity recognition


412. Evaluating Temporal and Structural Anomaly Detection Paradigms for DDoS Traffic


413. FedOBP: Federated Optimal Brain Personalization through Cloud-Edge Element-wise Decoupling


414. In Search of Lost DNA Sequence Pretraining


415. Reasoning on the Manifold: Bidirectional Consistency for Self-Verification in Diffusion Language Models


416. Classification of systolic murmurs in heart sounds using multiresolution complex Gabor dictionary and vision transformer


417. See Through the Noise: Improving Domain Generalization in Gaze Estimation


418. SpecPylot: Python Specification Generation using Large Language Models



420. PA-TCNet: Pathology-Aware Temporal Calibration with Physiology-Guided Target Refinement for Cross-Subject Motor Imagery EEG Decoding in Stroke Patients


421. Co-generation of Layout and Shape from Text via Autoregressive 3D Diffusion


422. An Interpretable Framework Applying Protein Words to Predict Protein-Small Molecule Complementary Pairing Rules


423. A Survey on the Security of Long-Term Memory in LLM Agents: Toward Mnemonic Sovereignty


424. Conjunctive Prompt Attacks in Multi-Agent LLM Systems


425. PoInit-of-View: Poisoning Initialization of Views Transfers Across Multiple 3D Reconstruction Systems


426. Understanding Tool-Augmented Agents for Lean Formalization: A Factorial Analysis


427. Robustifying and Selecting Cohort-Appropriate Prognostic Models under Distributional Shifts


428. Towards Reliable Testing of Machine Unlearning


429. SCATR: Simple Calibrated Test-Time Ranking


430. Public and private blockchain for decentralized digital building twins and building automation system


431. G-PARC: Graph-Physics Aware Recurrent Convolutional Neural Networks for Spatiotemporal Dynamics on Unstructured Meshes


432. Beyond Attack Success Rate: A Multi-Metric Evaluation of Adversarial Transferability in Medical Imaging Models


433. Scaling Test-Time Compute for Agentic Coding


434. Expert-Annotated Embryo Image Dataset with Natural Language Descriptions for Evidence-Based Patient Communication in IVF


435. CAMP: Cumulative Agentic Masking and Pruning for Privacy Protection in Multi-Turn LLM Conversations


436. On-Orbit Space AI: Federated, Multi-Agent, and Collaborative Algorithms for Satellite Constellations


437. Predicting Blastocyst Formation in IVF: Integrating DINOv2 and Attention-Based LSTM on Time-Lapse Embryo Images


438. Motif-Video 2B: Technical Report


439. HQA-VLAttack: Towards High Quality Adversarial Attack on Vision-Language Pre-Trained Models


440. Forge-UGC: FX optimization and register-graph engine for universal graph compiler


441. Gradient-Free Continual Learning in Spiking Neural Networks via Inter-Spike Interval Regularization


442. NL2SQLBench: A Modular Benchmarking Framework for LLM-Enabled NL2SQL Solutions


443. LayerCache: Exploiting Layer-wise Velocity Heterogeneity for Efficient Flow Matching Inference


444. A Lightweight Transformer for Pain Recognition from Brain Activity


445. An Uncertainty-Aware Loss Function Incorporating Fuzzy Logic: Application to MRI Brain Image Segmentation


446. Geometry-Aware CLIP Retrieval via Local Cross-Modal Alignment and Steering


447. Saccade Attention Networks: Using Transfer Learning of Attention to Reduce Network Sizes


448. DexWorldModel: Causal Latent World Modeling towards Automated Learning of Embodied Tasks


449. Dynamic Eraser for Guided Concept Erasure in Diffusion Models


450. Erasing Thousands of Concepts: Towards Scalable and Practical Concept Erasure for Text-to-Image Diffusion Models


451. Latent-Compressed Variational Autoencoder for Video Diffusion Models


452. Spike-driven Large Language Model


453. Full Feature Spiking Neural Network Simulation on Micro-Controllers for Neuromorphic Applications at the Edge


454. Training Language Models for Bilateral Trade with Private Information


455. Semantic Channel Theory: Deductive Compression and Structural Fidelity for Multi-Agent Communication


456. B-PASTE: Beam-Aware Pattern-Guided Speculative Execution for Resource-Constrained LLM Agents


457. MLE-Toolbox: An Open-Source Toolbox for Comprehensive EEG and MEG Data Analysis


458. From Inheritance to Saturation: Disentangling the Evolution of Visual Redundancy for Architecture-Aware MLLM Inference Acceleration


459. Deep Hierarchical Knowledge Loss for Fault Intensity Diagnosis


460. EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions


461. Sampling for Quality: Training-Free Reward-Guided LLM Decoding via Sequential Monte Carlo


462. SAND: The Challenge on Speech Analysis for Neurodegenerative Disease Assessment


463. The Breakthrough of Sleep: A Contactless Approach for Accurate Sleep Stage Detection Using the Sleepal AI Lamp


464. iPhoneme: Brain-to-Text Communication for ALS Using ConformerXL Decoding


465. LatentMimic: Terrain-Adaptive Locomotion via Latent Space Imitation


466. Sampling Matters: The Effect of ECG Frequency on Deep Learning-Based Atrial Fibrillation Detection


467. Quantifying how AI Panels improve precision


468. Dimensional Criticality at Grokking Across MLPs and Transformers


469. HalluSAE: Detecting Hallucinations in Large Language Models via Sparse Auto-Encoders


470. (Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models


471. Non-Stationarity in the Embedding Space of Time Series Foundation Models


472. Safety, Security, and Cognitive Risks in State-Space Models: A Systematic Threat Analysis with Spectral, Stateful, and Capacity Attacks


473. Shifting the Gradient: Understanding How Defensive Training Methods Protect Language Model Integrity


474. Injecting Structured Biomedical Knowledge into Language Models: Continual Pretraining vs. GraphRAG


475. Measuring Representation Robustness in Large Language Models for Geometry


476. Breaking Validity-Induced Boundaries to Expand Algorithm Search Space: A Two-Stage AST-Based Operator for LLM-Driven Automated Heuristic Evolution


477. Modeling User Exploration Saturation: When Recommender Systems Should Stop Pushing Novelty


478. What Is Actually Being Annotated? Inter-Prompt Reliability as a Measurement Problem in LLM-Based Social Science Labeling


479. How unique are hallucinated citations offered by generative Artificial Intelligence models?


480. ICAT: Incident-Case-Grounded Adaptive Testing for Physical-Risk Prediction in Embodied World Models


481. GraphRAG-Router: Learning Cost-Efficient Routing over GraphRAGs and LLMs with Reinforcement Learning


482. CoLLM: A Unified Framework for Co-execution of LLMs Federated Fine-tuning and Inference


483. IACDM: Interactive Adversarial Convergence Development Methodology – A Structured Framework for AI-Assisted Software Development


484. A Framework for Human-AI Q-Matrix Refinement: A NeuralCDM Evaluation


485. Instructor-Created Custom GPTs as Pedagogical Partners Fostering Immersion in Online Higher Education: Two Case Studies


486. Stream2LLM: Overlap Context Streaming and Prefill for Reduced TTFT



488. RoMathExam: A Longitudinal Dataset of Romanian Math Exams (1895-2025) with a Seven-Decade Core (1957-2025)


489. Large language models for post-publication research evaluation: Evidence from expert recommendations and citation indicators


490. DAOnt: A Formal Ontology for EU Data Act Compliance


491. StressWeb: A Diagnostic Benchmark for Web Agent Robustness under Realistic Interaction Variability


492. Same Verdict, Different Reasons: LLM-as-a-Judge and Clinician Disagreement on Medical Chatbot Completeness


493. CFMS: Towards Explainable and Fine-Grained Chinese Multimodal Sarcasm Detection Benchmark


494. Brain-CLIPLM: Decoding Compressed Semantic Representations in EEG for Language Reconstruction


495. Why AI Readiness Is an Organizational Learning Problem, Not a Technology Purchase


496. Talk, Walk, and Market Response: Multimodal Measurement of AI Washing and Its Capital Market Consequences in China


497. Clinical Note Bloat Reduction for Efficient LLM Use


498. CSF: Black-box Fingerprinting via Compositional Semantics for Text-to-Image Models


499. SetFlow: Generating Structured Sets of Representations for Multiple Instance Learning


500. Mapping Recent Shifts in Digital Art via Conference Discourse: AI, XR, the Metaverse, and Blockchain/NFTs (2021-2025)