All AI Papers - 2026-04-14

1. Detecting Safety Violations Across Many Agent Traces


2. GenTac: Generative Modeling and Forecasting of Soccer Tactics


3. Retrieval Is Not Enough: Why Organizational AI Needs Epistemic Infrastructure


4. Collaborative Multi-Agent Scripts Generation for Enhancing Imperfect-Information Reasoning in Murder Mystery Games


5. SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context


6. A Mamba-Based Multimodal Network for Multiscale Blast-Induced Rapid Structural Damage Assessment


7. Agentic Driving Coach: Robustness and Determinism of Agentic AI-Powered Human-in-the-Loop Cyber-Physical Systems


8. DreamKG: A KG-Augmented Conversational System for People Experiencing Homelessness


9. Why Do Large Language Models Generate Harmful Content?


10. RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time


11. Context Kubernetes: Declarative Orchestration of Enterprise Knowledge for Agentic AI Systems


12. Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models


13. UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents


14. SemaClaw: A Step Towards General-Purpose Personal AI Agents through Harness Engineering


15. A collaborative agent with two lightweight synergistic models for autonomous crystal materials research


16. Problem Reductions at Scale: Agentic Integration of Computationally Hard Problems


17. Limited Perfect Monotonical Surrogates constructed using low-cost recursive linkage discovery with guaranteed output


18. PAC-BENCH: Evaluating Multi-Agent Collaboration under Privacy Constraints


19. Lectures on AI for Mathematics


20. Anthropogenic Regional Adaptation in Multimodal Vision-Language Model


21. On the Complexity of the Discussion-based Semantics in Abstraction Argumentation


22. OOM-RL: Out-of-Money Reinforcement Learning Market-Driven Alignment for LLM-Based Multi-Agent Systems


23. From Attribution to Action: A Human-Centered Application of Activation Steering


24. Three Roles, One Model: Role Orchestration at Inference Time to Close the Performance Gap Between Small and Large Agents


25. Escaping the Context Bottleneck: Active Context Curation for LLM Agents via Reinforcement Learning


26. Beyond RAG for Cyber Threat Intelligence: A Systematic Evaluation of Graph-Based and Agentic Retrieval


27. From Agent Loops to Structured Graphs: A Scheduler-Theoretic Framework for LLM Agent Execution


28. Learning from Contrasts: Synthesizing Reasoning Paths from Diverse Search Trajectories


29. The Missing Knowledge Layer in Cognitive Architectures for AI Agents


30. CoRe-ECG: Advancing Self-Supervised Representation Learning for 12-Lead ECG via Contrastive and Reconstructive Synergy


31. Dynamic Summary Generation for Interpretable Multimodal Depression Detection


32. Select Smarter, Not More: Prompt-Aware Evaluation Scheduling with Submodular Guarantees


33. PaperScope: A Multi-Modal Multi-Document Benchmark for Agentic Deep Research Across Massive Scientific Papers


34. BankerToolBench: Evaluating AI Agents in End-to-End Investment Banking Workflows


35. Consistency of AI-Generated Exercise Prescriptions: A Repeated Generation Study Using a Large Language Model


36. Inspectable AI for Science: A Research Object Approach to Generative AI Governance


37. Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization


38. Measuring the Authority Stack of AI Systems: Empirical Analysis of 366,120 Forced-Choice Responses Across 8 AI Models


39. Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model


40. From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning


41. MADQRL: Distributed Quantum Reinforcement Learning Framework for Multi-Agent Environments


42. A Proposed Biomedical Data Policy Framework to Reduce Fragmentation, Improve Quality, and Incentivize Sharing in Indian Healthcare in the era of Artificial Intelligence and Digital Health


43. Persona Non Grata: Single-Method Safety Evaluation Is Incomplete for Persona-Imbued LLMs


44. Frugal Knowledge Graph Construction with Local LLMs: A Zero-Shot Pipeline, Self-Consistency and Wisdom of Artificial Crowds


45. Do Agent Rules Shape or Distort? Guardrails Beat Guidance in Coding Agents


46. Towards Proactive Information Probing: Customer Service Chatbots Harvesting Value from Conversation


47. Hodoscope: Unsupervised Monitoring for AI Misbehaviors


48. PRISM Risk Signal Framework: Hierarchy-Based Red Lines for AI Behavioral Risk


49. AI Integrity: A New Paradigm for Verifiable AI Governance


50. EmergentBridge: Improving Zero-Shot Cross-Modal Transfer in Unified Multimodal Embedding Models


51. From Topology to Trajectory: LLM-Driven World Models For Supply Chain Resilience


52. Intelligent Approval of Access Control Flow in Office Automation Systems via Relational Modeling


53. Introspective Diffusion Language Models


54. Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics


55. Diffusion-CAM: Faithful Visual Explanations for dMLLMs


56. Sanity Checks for Agentic Data Science


57. MAFIG: Multi-agent Driven Formal Instruction Generation Framework


58. WebForge: Breaking the Realism-Reproducibility-Scalability Trilemma in Browser Agent Benchmark


59. Back to the Barn with LLAMAs: Evolving Pretrained LLM Backbones in Finetuning Vision Language Models


60. ATANT v1.1: Positioning Continuity Evaluation Against Memory, Long-Context, and Agentic-Memory Benchmarks


61. CFMS: A Coarse-to-Fine Multimodal Synthesis Framework for Enhanced Tabular Reasoning


62. Delving Aleatoric Uncertainty in Medical Image Segmentation via Vision Foundation Models


63. RAG-KT: Cross-platform Explainable Knowledge Tracing with Multi-view Fusion Retrieval Generation


64. CSPO: Alleviating Reward Ambiguity for Structured Table-to-LaTeX Generation


65. EvoNash-MARL: A Closed-Loop Multi-Agent Reinforcement Learning Framework for Medium-Horizon Equity Allocation


66. Reasoning as Data: Representation-Computation Unity and Its Implementation in a Domain-Algebraic Inference Engine


67. CASK: Core-Aware Selective KV Compression for Reasoning Traces


68. ZoomR: Memory Efficient Reasoning through Multi-Granularity Key Value Retrieval


69. A Quantitative Definition of Intelligence


70. Beyond Statistical Co-occurrence: Unlocking Intrinsic Semantics for Tabular Data Clustering


71. A Benchmark for Gap and Overlap Analysis as a Test of KG Task Readiness


72. Your Model Diversity, Not Method, Determines Reasoning Strategy


73. CheeseBench: Evaluating Large Language Models on Rodent Behavioral Neuroscience Paradigms


74. TorchUMM: A Unified Multimodal Model Codebase for Evaluation, Analysis, and Post-training


75. Learning Preference-Based Objectives from Clinical Narratives for Sequential Treatment Decision-Making


76. When More Thinking Hurts: Overthinking in LLM Test-Time Compute Scaling


77. Teaching Language Models How to Code Like Learners: Conversational Serialization for Student Simulation


78. SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?


79. Camyla: Scaling Autonomous Research in Medical Image Segmentation


80. FACT-E: Causality-Inspired Evaluation for Trustworthy Chain-of-Thought Reasoning


81. Do LLMs Build Spatial World Models? Evidence from Grid-World Maze Tasks


82. FedRio: Personalized Federated Social Bot Detection via Cooperative Reinforced Contrastive Adversarial Distillation


83. Principles Do Not Apply Themselves: A Hermeneutic Perspective on AI Alignment


84. Preference-Agile Multi-Objective Optimization for Real-time Vehicle Dispatching


85. Governed Reasoning for Institutional AI


86. Enhancing Cross-Problem Vehicle Routing via Federated Learning


87. Working Paper: Towards Schema-based Learning from a Category-Theoretic Perspective


88. Failure Ontology: A Lifelong Learning Framework for Blind Spot Detection and Resilience Design


89. Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?


90. From Perception to Planning: Evolving Ego-Centric Task-Oriented Spatiotemporal Reasoning via Curriculum Learning


91. Agent Mentor: Framing Agent Knowledge through Semantic Trajectory Analysis


92. Thinking Fast, Thinking Wrong: Intuitiveness Modulates LLM Counterfactual Reasoning in Policy Evaluation


93. Beyond Compliance: A Resistance-Informed Motivation Reasoning Framework for Challenging Psychological Client Simulation


94. A Progressive Training Strategy for Vision-Language Models to Counteract Spatio-Temporal Hallucinations in Embodied Reasoning


95. Cooperation in Human and Machine Agents: Promise Theory Considerations


96. CARO: Chain-of-Analogy Reasoning Optimization for Robust Content Moderation


97. CHAIRO: Contextual Hierarchical Analogical Induction and Reasoning Optimization for LLMs


98. Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs


99. PEMANT: Persona-Enriched Multi-Agent Negotiation for Travel


100. VeriSim: A Configurable Framework for Evaluating Medical AI Under Realistic Patient Noise


101. Safety Guarantees in Zero-Shot Reinforcement Learning for Cascade Dynamical Systems


102. CWCD: Category-Wise Contrastive Decoding for Structured Medical Report Generation


103. TrajOnco: a multi-agent framework for temporal reasoning over longitudinal EHR for multi-cancer early detection


104. Beyond Monologue: Interactive Talking-Listening Avatar Generation with Conversational Audio Context-Aware Kernels


105. ClawVM: Harness-Managed Virtual Memory for Stateful Tool-Using LLM Agents


106. VeriTrans: Fine-Tuned LLM-Assisted NL-to-PL Translation via a Deterministic Neuro-Symbolic Pipeline


107. Zero-shot World Models Are Developmentally Efficient Learners


108. From GPT-3 to GPT-5: Mapping their capabilities, scope, limitations, and consequences


109. Gypscie: A Cross-Platform AI Artifact Management System


110. TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale


111. AI Organizations are More Effective but Less Aligned than Individual Agents


112. Dead Cognitions: A Census of Misattributed Insights


113. STARS: Skill-Triggered Audit for Request-Conditioned Invocation Safety in Agent Systems


114. The Amazing Agent Race: Strong Tool Users, Weak Navigators


115. A Dual-Positive Monotone Parameterization for Multi-Segment Bids and a Validity Assessment Framework for Reinforcement Learning Agent-based Simulation of Electricity Markets


116. SVSR: A Self-Verification and Self-Rectification Paradigm for Multimodal Reasoning


117. Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models


118. Edu-MMBias: A Three-Tier Multimodal Benchmark for Auditing Social Bias in Vision-Language Models under Educational Contexts


119. Credit-Budgeted ICPC-Style Coding: When Agents Must Pay for Every Decision


120. PoreDiT: A Scalable Generative Model for Large-Scale Digital Rock Reconstruction


121. MAVEN-T: Multi-Agent enVironment-aware Enhanced Neural Trajectory predictor with Reinforcement Learning


122. Inductive Reasoning for Temporal Knowledge Graphs with Emerging Entities


123. SpecMoE: A Fast and Efficient Mixture-of-Experts Inference via Self-Assisted Speculative Decoding


124. Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration


125. Trust Your Memory: Verifiable Control of Smart Homes through Reinforcement Learning with Multi-dimensional Rewards


126. Ontological Trajectory Forecasting via Finite Semigroup Iteration and Lie Algebra Approximation in Geopolitical Knowledge Graphs


127. Learning Hierarchical and Geometry-Aware Graph Representations for Text-to-CAD


128. LoopGuard: Breaking Self-Reinforcing Attention Loops via Dynamic KV Cache Intervention


129. AI Achieves a Perfect LSAT Score


130. FinTrace: Holistic Trajectory-Level Evaluation of LLM Tool Calling for Long-Horizon Financial Tasks


131. New Hybrid Fine-Tuning Paradigm for LLMs: Algorithm Design and Convergence Analysis Framework


132. HealthAdminBench: Evaluating Computer-Use Agents on Healthcare Administration Tasks


133. GLEaN: A Text-to-image Bias Detection Approach for Public Comprehension


134. In-situ process monitoring for defect detection in wire-arc additive manufacturing: an agentic AI approach


135. What do your logits know? (The answer may surprise you!)


136. Evolutionary Token-Level Prompt Optimization for Diffusion Models


137. Instructing LLMs to Negotiate using Reinforcement Learning with Verifiable Rewards


138. MEMENTO: Teaching LLMs to Manage Their Own Context


139. Steered LLM Activations are Non-Surjective


140. COMPOSITE-Stem


141. EE-MCP: Self-Evolving MCP-GUI Agents via Automated Environment Generation and Experience Learning


142. Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning


143. Pioneer Agent: Continual Improvement of Small Language Models in Production


144. The Myth of Expert Specialization in MoEs: Why Routing Reflects Geometry, Not Necessarily Domain Expertise


145. Tipiano: Cascaded Piano Hand Motion Synthesis via Fingertip Priors


146. Belief-Aware VLM Model for Human-like Reasoning


147. How LLMs Might Think


148. Competing with AI Scientists: Agent-Driven Approach to Astrophysics Research


149. AdaQE-CG: Adaptive Query Expansion for Web-Scale Generative AI Model and Data Card Generation


150. The Geometry of Knowing: From Possibilistic Ignorance to Probabilistic Certainty – A Measure-Theoretic Framework for Epistemic Convergence


151. Beyond Theory of Mind in Robotics


152. General-purpose LLMs as Models of Human Driver Behavior: The Case of Simplified Merging


153. Unifying Ontology Construction and Semantic Alignment for Deterministic Enterprise Reasoning at Scale


154. Evaluating Reliability Gaps in Large Language Model Safety via Repeated Prompt Sampling


155. LLMs for Text-Based Exploration and Navigation Under Partial Observability


156. From Scalars to Tensors: Declared Losses Recover Epistemic Distinctions That Neutrosophic Scalars Cannot Express


157. Hubble: An LLM-Driven Agentic Framework for Safe and Automated Alpha Factor Discovery


158. CID-TKG: Collaborative Historical Invariance and Evolutionary Dynamics Learning for Temporal Knowledge Graph Reasoning


159. DERM-3R: A Resource-Efficient Multimodal Agents Framework for Dermatologic Diagnosis and Treatment in Real-World Clinical Settings


160. Spatial Competence Benchmark


161. DeepReviewer 2.0: A Traceable Agentic System for Auditable Scientific Peer Review


162. Persistent Identity in AI Agents: A Multi-Anchor Architecture for Resilient Memory and Continuity


163. MobiFlow: Real-World Mobile Agent Benchmarking through Trajectory Fusion


164. Agentic Exploration of PDE Spaces using Latent Foundation Models for Parameterized Simulations


165. Factorizing formal contexts from closures of necessity operators


166. OpeFlo: Automated UX Evaluation via Simulated Human Web Interaction with GUI Grounding


167. OOWM: Structuring Embodied Reasoning and Planning via Object-Oriented Programmatic World Modeling


168. Help Without Being Asked: A Deployed Proactive Agent System for On-Call Support with Continuous Self-Improvement


169. Explainable Planning for Hybrid Systems


170. AHC: Meta-Learned Adaptive Compression for Continual Object Detection on Memory-Constrained Microcontrollers


171. Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization


172. Seven simple steps for log analysis in AI systems


173. Linear Programming for Multi-Criteria Assessment with Cardinal and Ordinal Data: A Pessimistic Virtual Gap Analysis


174. LABBench2: An Improved Benchmark for AI Systems Performing Biology Research


175. Physics-Informed State Space Models for Reliable Solar Irradiance Forecasting in Off-Grid Systems


176. Solving Physics Olympiad via Reinforcement Learning on Physics Simulators


177. Budget-Aware Uncertainty for Radiotherapy Segmentation QA Using nnU-Net


178. C-ReD: A Comprehensive Chinese Benchmark for AI-Generated Text Detection Derived from Real-World Prompts


179. A Mechanistic Analysis of Looped Reasoning Language Models


180. ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection


181. ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents


182. General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks


183. Efficient KernelSHAP Explanations for Patch-based 3D Medical Image Segmentation


184. StarVLA-$α$: Reducing Complexity in Vision-Language-Action Systems


185. Grounded World Model for Semantically Generalizable Planning


186. Discourse Diversity in Multi-Turn Empathic Dialogue


187. Multi-ORFT: Stable Online Reinforcement Fine-Tuning for Multi-Agent Diffusion Planning in Cooperative Driving


188. Endogenous Information in Routing Games: Memory-Constrained Equilibria, Recall Braess Paradoxes, and Memory Design


189. Evaluating Cooperation in LLM Social Groups through Elected Leadership


190. On the Robustness of Watermarking for Autoregressive Image Generation


191. Fairness is Not Flat: Geometric Phase Transitions Against Shortcut Learning



193. AffordSim: A Scalable Data Generator and Benchmark for Affordance-Aware Robotic Manipulation


194. NetworkNet: A Deep Neural Network Approach for Random Networks with Sparse Nodal Attributes and Complex Nodal Heterogeneity


195. Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind


196. Beyond LLMs, Sparse Distributed Memory, and Neuromorphics <A Hyper-Dimensional SRAM-CAM “VaCoAl” for Ultra-High Speed, Ultra-Low Power, and Low Cost>


197. Towards Autonomous Mechanistic Reasoning in Virtual Cells


198. RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents


199. CodeTracer: Towards Traceable Agent States


200. SCNO: Spiking Compositional Neural Operator – Towards a Neuromorphic Foundation Model for Nuclear PDE Solving


201. CUTEv2: Unified and Configurable Matrix Extension for Diverse CPU Architectures with Minimal Design Overhead


202. Layerwise Dynamics for In-Context Classification in Transformers


203. A Triadic Suffix Tokenization Scheme for Numerical Reasoning


204. Minimizing classical resources in variational measurement-based quantum computation for generative modeling


205. Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo


206. bacpipe: a Python package to make bioacoustic deep learning models accessible


207. FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning


208. Time is Not a Label: Continuous Phase Rotation for Temporal Knowledge Graphs and Agentic Memory


209. NovBench: Evaluating Large Language Models on Academic Paper Novelty Assessment


210. CLAY: Conditional Visual Similarity Modulation in Vision-Language Embedding Space


211. SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models


212. From Translation to Superset: Benchmark-Driven Evolution of a Production AI Agent from Rust to Python


213. EdgeCIM: A Hardware-Software Co-Design for CIM-Based Acceleration of Small Language Models


214. Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization


215. Not All Forgetting Is Equal: Architecture-Dependent Retention Dynamics in Fine-Tuned Image Classifiers


216. Deep Learning for Sequential Decision Making under Uncertainty: Foundations, Frameworks, and Frontiers


217. METER: Evaluating Multi-Level Contextual Causal Reasoning in Large Language Models


218. Quantization Dominates Rank Reduction for KV-Cache Compression


219. ADD for Multi-Bit Image Watermarking


220. SLALOM: Simulation Lifecycle Analysis via Longitudinal Observation Metrics for Social Simulation


221. Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration


222. Think Before you Write: QA-Guided Reasoning for Character Descriptions in Books


223. Hardening x402: PII-Safe Agentic Payments via Pre-Execution Metadata Filtering


224. METRO: Towards Strategy Induction from Expert Dialogue Transcripts for Non-collaborative Dialogues


225. Emulating Non-Differentiable Metrics via Knowledge-Guided Learning: Introducing the Minkowski Image Loss


226. Efficient Emotion-Aware Iconic Gesture Prediction for Robot Co-Speech


227. Retrieval as Generation: A Unified Framework with Self-Triggered Information Planning


228. One Scale at a Time: Scale-Autoregressive Modeling for Fluid Flow Distributions


229. From Redaction to Restoration: Deep Learning for Medical Image Anonymization and Reconstruction


230. Minimal Embodiment Enables Efficient Learning of Number Concepts in Robot


231. Governance by Design: A Parsonian Institutional Architecture for Internet-Wide Agent Societies


232. A Compact and Efficient 1.251 Million Parameter Machine Learning CNN Model PD36-C for Plant Disease Detection: A Case Study


233. Do LLMs Know Tool Irrelevance? Demystifying Structural Alignment Bias in Tool Invocations


234. S$^3$: Structured Sparsity Specification


235. Network Effects and Agreement Drift in LLM Debates


236. The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems


237. Learning to Forget – Hierarchical Episodic Memory for Lifelong Robot Deployment


238. 3D-Anchored Lookahead Planning for Persistent Robotic Scene Memory via World-Model-Based MCTS


239. Enhancing Multimodal Large Language Models for Ancient Chinese Character Evolution Analysis via Glyph-Driven Fine-Tuning


240. The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping


241. THEIA: Learning Complete Kleene Three-Valued Logic in a Pure-Neural Modular Architecture


242. AbLWR: A Context-Aware Listwise Ranking Framework for Antibody-Antigen Binding Affinity Prediction via Positive-Unlabeled Learning


243. Evolving Many Worlds: Towards Open-Ended Discovery in Petri Dish NCA via Population-Based Training


244. RECIPER: A Dual-View Retrieval Pipeline for Procedure-Oriented Materials Question Answering


245. Regional Explanations: Bridging Local and Global Variable Importance


246. Exploring Knowledge Conflicts for Faithful LLM Reasoning: Benchmark and Method


247. Designing Adaptive Digital Nudging Systems with LLM-Driven Reasoning


248. CocoaBench: Evaluating Unified Digital Agents in the Wild


249. ShapShift: Explaining Model Prediction Shifts with Subgroup Conditional Shapley Values


250. Towards Adaptive Open-Set Object Detection via Category-Level Collaboration Knowledge Mining


251. MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis


252. Taking a Pulse on How Generative AI is Reshaping the Software Engineering Research Landscape


253. EmbodiedGovBench: A Benchmark for Governance, Recovery, and Upgrade Safety in Embodied Agent Systems


254. Cost-optimal Sequential Testing via Doubly Robust Q-learning


255. BoxTuning: Directly Injecting the Object Box for Multimodal Model Fine-Tuning


256. Semantic-Geometric Dual Compression: Training-Free Visual Token Reduction for Ultra-High-Resolution Remote Sensing Understanding


257. Use of AI Tools: Guidelines to Maintain Academic Integrity in Computing Colleges



259. ActorMind: Emulating Human Actor Reasoning for Speech Role-Playing


260. Efficient Training for Cross-lingual Speech Language Models


261. Bottleneck Tokens for Unified Multimodal Retrieval


262. E2E-REME: Towards End-to-End Microservices Auto-Remediation via Experience-Simulation Reinforcement Fine-Tuning


263. FlowCoMotion: Text-to-Motion Generation via Token-Latent Flow Modeling


264. ReSpinQuant: Efficient Layer-Wise LLM Quantization via Subspace Residual Rotation Approximation


265. Lightweight Low-Light Image Enhancement via Distribution-Normalizing Preprocessing and Depthwise U-Net


266. Pando: Do Interpretability Methods Work When Models Won’t Explain Themselves?


267. Rethinking Token-Level Credit Assignment in RLVR: A Polarity-Entropy Analysis


268. Shared Emotion Geometry Across Small Language Models: A Cross-Architecture Study of Representation, Behavior, and Methodological Confounds


269. A Systematic Analysis of the Impact of Persona Steering on LLM Capabilities


270. RTMC: Step-Level Credit Assignment via Rollout Trees


271. Uncertainty-Aware Web-Conditioned Scientific Fact-Checking


272. Federated Single-Agent Robotics: Multi-Robot Coordination Without Intra-Robot Multi-Agent Fragmentation


273. Optimal Stability of KL Divergence under Gaussian Perturbations


274. Brief2Design: A Multi-phased, Compositional Approach to Prompt-based Graphic Design


275. NimbusGuard: A Novel Framework for Proactive Kubernetes Autoscaling Using Deep Q-Networks


276. Panoptic Pairwise Distortion Graph


277. When Valid Signals Fail: Regime Boundaries Between LLM Features and RL Trading Policies


278. Examining EAP Students’ AI Disclosure Intention: A Cognition-Affect-Conation Perspective


279. When Verification Fails: How Compositionally Infeasible Claims Escape Rejection


280. Enabling and Inhibitory Pathways of Students’ AI Use Concealment Intention in Higher Education: Evidence from SEM and fsQCA


281. MMR-AD: A Large-Scale Multimodal Dataset for Benchmarking General Anomaly Detection with Multimodal Large Language Models


282. Towards Automated Solar Panel Integrity: Hybrid Deep Feature Extraction for Advanced Surface Defect Identification


283. You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass


284. Continuous-time Online Learning via Mean-Field Neural Networks: Regret Analysis in Diffusion Environments


285. A molecular clock for writing systems reveals the quantitative impact of imperial power on cultural evolution


286. Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models


287. QShield: Securing Neural Networks Against Adversarial Attacks using Quantum Circuits


288. Mem$^2$Evolve: Towards Self-Evolving Agents via Co-Evolutionary Capability Expansion and Experience Distillation


289. ReXSonoVQA: A Video QA Benchmark for Procedure-Centric Ultrasound Understanding


290. Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music


291. Evaluating the Impact of Medical Image Reconstruction on Downstream AI Fairness and Performance


292. Beyond A Fixed Seal: Adaptive Stealing Watermark in Large Language Models


293. Product Review Based on Optimized Facial Expression Detection


294. Ambiguity Detection and Elimination in Automated Executable Process Modeling


295. DIB-OD: Preserving the Invariant Core for Robust Heterogeneous Graph Adaptation via Decoupled Information Bottleneck and Online Distillation


296. Compliant But Unsatisfactory: The Gap Between Auditing Standards and Practices for Probabilistic Genotyping Software


297. AOP-Smart: A RAG-Enhanced Large Language Model Framework for Adverse Outcome Pathway Analysis


298. Query Lower Bounds for Diffusion Sampling


299. BridgeSim: Unveiling the OL-CL Gap in End-to-End Autonomous Driving


300. Task2vec Readiness: Diagnostics for Federated Learning from Pre-Training Embeddings


301. Retinal Cyst Detection from Optical Coherence Tomography Images


302. Resilient Write: A Six-Layer Durable Write Surface for LLM Coding Agents


303. Harnessing Photonics for Machine Intelligence


304. LLMs for Qualitative Data Analysis Fail on Security-specific Comments in Human Experiments


305. Speaking to No One: Ontological Dissonance and the Double Bind of Conversational AI


306. MeloTune: On-Device Arousal Learning and Peer-to-Peer Mood Coupling for Proactive Music Curation


307. Verify Before You Fix: Agentic Execution Grounding for Trustworthy Cross-Language Code Analysis


308. Advancing Polish Language Modeling through Tokenizer Optimization in the Bielik v3 7B and 11B Series


309. TInR: Exploring Tool-Internalized Reasoning in Large Language Models


310. Do BERT Embeddings Encode Narrative Dimensions? A Token-Level Probing Analysis of Time, Space, Causality, and Character in Fiction


311. Lung Cancer Detection Using Deep Learning


312. Prosociality by Coupling, Not Mere Observation: Homeostatic Sharing in an Inspectable Recurrent Artificial Life Agent


313. Generating Multiple-Choice Knowledge Questions with Interpretable Difficulty Estimation using Knowledge Graphs and Large Language Models


314. Deep-Reporter: Deep Research for Grounded Multimodal Long-Form Generation


315. Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models


316. Perceived Importance of Cognitive Skills Among Computing Students in the Era of AI


317. Tail-Aware Information-Theoretic Generalization for RLHF and SGLD


318. Turning Generators into Retrievers: Unlocking MLLMs for Natural Language-Guided Geo-Localization


319. Detecting RAG Extraction Attack via Dual-Path Runtime Integrity Game


320. Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing


321. Architecture-Agnostic Modality-Isolated Gated Fusion for Robust Multi-Modal Prostate MRI Segmentation


322. Bringing Value Models Back: Generative Critics for Value Modeling in LLM Reinforcement Learning


323. SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting


324. Critical-CoT: A Robust Defense Framework against Reasoning-Level Backdoor Attacks in Large Language Models


325. Skill-SD: Skill-Conditioned Self-Distillation for Multi-turn LLM Agents


326. Learning and Enforcing Context-Sensitive Control for LLMs


327. DynamicsLLM: a Dynamic Analysis-based Tool for Generating Intelligent Execution Traces Using LLMs to Detect Android Behavioural Code Smells


328. Efficient Process Reward Modeling via Contrastive Mutual Information


329. LoViF 2026 The First Challenge on Weather Removal in Videos


330. Vibe-driven model-based engineering


331. Computational Lesions in Multilingual Language Models Separate Shared and Language-specific Brain Alignment


332. NSFL: A Post-Training Neuro-Symbolic Fuzzy Logic Framework for Boolean Operators in Neural Embeddings


333. MoEITS: A Green AI approach for simplifying MoE-LLMs


334. COREY: A Prototype Study of Entropy-Guided Operator Fusion with Hadamard Reparameterization for Selective State Space Models


335. GeoMeld: Toward Semantically Grounded Foundation Models for Remote Sensing


336. Bridging Linguistic Gaps: Cross-Lingual Mapping in Pre-Training and Dataset for Enhanced Multilingual LLM Performance


337. Calibration Collapse Under Sycophancy Fine-Tuning: How Reward Hacking Breaks Uncertainty Quantification in LLMs


338. AffordGen: Generating Diverse Demonstrations for Generalizable Object Manipulation with Afford Correspondence


339. The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents


340. Universal statistical signatures of evolution in artificial intelligence architectures


341. Early Decisions Matter: Proximity Bias and Initial Trajectory Shaping in Non-Autoregressive Diffusion Language Models


342. LLMs Should Incorporate Explicit Mechanisms for Human Empathy


343. WaveMoE: A Wavelet-Enhanced Mixture-of-Experts Foundation Model for Time Series Forecasting


344. VidAudio-Bench: Benchmarking V2A and VT2A Generation across Four Audio Categories


345. IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs


346. Machine Learning-Based Detection of MCP Attacks


347. PepBenchmark: A Standardized Benchmark for Peptide Machine Learning


348. Towards an Appropriate Level of Reliance on AI: A Preliminary Reliance-Control Framework for AI in Software Engineering


349. AI Patents in the United States and China: Measurement, Organization, and Knowledge Flows


350. STORM: End-to-End Referring Multi-Object Tracking in Videos


351. ReFEree: Reference-Free and Fine-Grained Method for Evaluating Factual Consistency in Real-World Code Summarization


352. Data-Efficient Surgical Phase Segmentation in Small-Incision Cataract Surgery: A Controlled Study of Vision Foundation Models


353. How Many Tries Does It Take? Iterative Self-Repair in LLM Code Generation Across Model Scales and Benchmarks


354. Cross-Cultural Bias in Mel-Scale Representations: Evidence and Alternatives from Speech and Music


355. UDAPose: Unsupervised Domain Adaptation for Low-Light Human Pose Estimation



357. Rethinking the Diffusion Model from a Langevin Perspective


358. Toward Accountable AI-Generated Content on Social Platforms: Steganographic Attribution and Multimodal Harm Detection


359. Towards Green Wearable Computing: A Physics-Aware Spiking Neural Network for Energy-Efficient IMU-based Human Activity Recognition


360. A Queueing-Theoretic Framework for Dynamic Attack Surfaces: Data-Integrated Risk Analysis and Adaptive Defense


361. CodaRAG: Connecting the Dots with Associativity Inspired by Complementary Learning


362. IMPACT: A Dataset for Multi-Granularity Human Procedural Action Understanding in Industrial Assembly


363. Rethinking Video Human-Object Interaction: Set Prediction over Time for Unified Detection and Anticipation


364. Intent-aligned Formal Specification Synthesis via Traceable Refinement


365. FishRoPE: Projective Rotary Position Embeddings for Omnidirectional Visual Perception


366. Multinex: Lightweight Low-light Image Enhancement via Multi-prior Retinex


367. A Diffusion-Contrastive Graph Neural Network with Virtual Nodes for Wind Nowcasting in Unobserved Regions


368. Jailbreaking the Matrix: Nullspace Steering for Controlled Model Subversion


369. Class-Adaptive Cooperative Perception for Multi-Class LiDAR-based 3D Object Detection in V2X Systems


370. From Helpful to Trustworthy: LLM Agents for Pair Programming


371. FashionMV: Product-Level Composed Image Retrieval with Multi-View Fashion Data


372. Adapting 2D Multi-Modal Large Language Model for 3D CT Image Analysis


373. Exploring the impact of fairness-aware criteria in AutoML


374. Wolkowicz-Styan Upper Bound on the Hessian Eigenspectrum for Cross-Entropy Loss in Nonlinear Smooth Neural Networks


375. Virtual Smart Metering in District Heating Networks via Heterogeneous Spatial-Temporal Graph Neural Networks


376. A Temporally Augmented Graph Attention Network for Affordance Classification


377. MOSAIC: Multi-Domain Orthogonal Session Adaptive Intent Capture for Prescient Recommendations


378. Think in Sentences: Explicit Sentence Boundaries Enhance Language Model’s Capabilities


379. Semantic Manipulation Localization


380. VGA-Bench: A Unified Benchmark and Multi-Model Framework for Video Aesthetics and Generation Quality Evaluation


381. MR-Coupler: Automated Metamorphic Test Generation via Functional Coupling Analysis


382. A Dual Cross-Attention Graph Learning Framework For Multimodal MRI-Based Major Depressive Disorder Detection


383. CircuitSynth: Reliable Synthetic Data Generation


384. Degradation-Consistent Paired Training for Robust AI-Generated Image Detection


385. MatRes: Zero-Shot Test-Time Model Adaptation for Simultaneous Matching and Restoration


386. Graph-RHO: Critical-path-aware Heterogeneous Graph Network for Long-Horizon Flexible Job-Shop Scheduling


387. ASPIRin: Action Space Projection for Interactivity-Optimized Reinforcement Learning in Full-Duplex Speech Language Models


388. Computational Implementation of a Model of Category-Theoretic Metaphor Comprehension


389. Closed-Form Concept Erasure via Double Projections


390. CoSToM: Causal-oriented Steering for Intrinsic Theory-of-Mind Alignment in Large Language Models


391. LVSum: A Benchmark for Timestamp-Aware Long Video Summarization


392. FREE-Switch: Frequency-based Dynamic LoRA Switch for Style Transfer


393. Demographic and Linguistic Bias Evaluation in Omnimodal Language Models


394. Like a Hammer, It Can Build, It Can Break: Large Language Model Uses, Perceptions, and Adoption in Cybersecurity Operations on Reddit


395. Agentic Application in Power Grid Static Analysis: Automatic Code Generation and Error Correction


396. FlowPalm: Optical Flow Driven Non-Rigid Deformation for Geometrically Diverse Palmprint Generation


397. A Minimal Model of Representation Collapse: Frustration, Stop-Gradient, and Dynamics


398. Muon$^2$: Boosting Muon via Adaptive Second-Moment Preconditioning


399. Rebooting Microreboot: Architectural Support for Safe, Parallel Recovery in Microservice Systems


400. Cross-Cultural Value Awareness in Large Vision-Language Models


401. I Walk the Line: Examining the Role of Gestalt Continuity in Object Binding for Vision Transformers


402. A Hybrid Intelligent Framework for Uncertainty-Aware Condition Monitoring of Industrial Systems


403. The Rise and Fall of $G$ in AGI


404. From UAV Imagery to Agronomic Reasoning: A Multimodal LLM Benchmark for Plant Phenotyping


405. Diffusion Denoiser Achievable Analysis for Finite Blocklength Unsourced Random Access


406. Should We be Pedantic About Reasoning Errors in Machine Translation?


407. Not Your Stereo-Typical Estimator: Combining Vision and Language for Volume Perception


408. DINO_4D: Semantic-Aware 4D Reconstruction


409. Efficient Personalization of Generative User Interfaces


410. Relational Preference Encoding in Looped Transformer Internal States


411. Exploring Structural Complexity in Normative RAG with Graph-based approaches: A case study on the ETSI Standards


412. Automating Structural Analysis Across Multiple Software Platforms Using Large Language Models


413. PAS: Estimating the target accuracy before domain adaptation


414. RoboLab: A High-Fidelity Simulation Benchmark for Analysis of Task Generalist Policies


415. Is There Knowledge Left to Extract? Evidence of Fragility in Medically Fine-Tuned Vision-Language Models


416. F3G-Avatar: Face Focused Full-body Gaussian Avatar


417. ACCIDENT: A Benchmark Dataset for Vehicle Accident Detection from Traffic Surveillance Videos


418. Explainable Human Activity Recognition: A Unified Review of Concepts and Mechanisms


419. GIANTS: Generative Insight Anticipation from Scientific Literature


420. MedLVR: Latent Visual Reasoning for Reliable Medical Visual Question Answering


421. A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs


422. Conflicts Make Large Reasoning Models Vulnerable to Attacks


423. Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward


424. ADAM: A Systematic Data Extraction Attack on Agent Memory via Adaptive Querying


425. CONSCIENTIA: Can LLM Agents Learn to Strategize? Emergent Deception and Trust in a Multi-Agent NYC Simulation


426. MPAC: A Multi-Principal Agent Coordination Protocol for Interoperable Multi-Agent Collaboration


427. ExecTune: Effective Steering of Black-Box LLMs with Guide Models


428. STaR-DRO: Stateful Tsallis Reweighting for Group-Robust Structured Prediction


429. Multi-Frequency Local Plasticity for Visual Representation Learning


430. SMART: When is it Actually Worth Expanding a Speculative Tree?


431. LOLGORITHM: Funny Comment Generation Agent For Short Videos


432. ConfigSpec: Profiling-Based Configuration Selection for Distributed Edge–Cloud Speculative LLM Serving


433. Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach


434. LAST: Leveraging Tools as Hints to Enhance Spatial Reasoning for Multimodal Large Language Models


435. Orthogonal Quadratic Complements for Vision Transformer Feed-Forward Networks


436. The Deployment Gap in AI Media Detection: Platform-Aware and Visually Constrained Adversarial Evaluation


437. Identity-Aware U-Net: Fine-grained Cell Segmentation via Identity-Aware Representation Learning


438. Attention-Guided Flow-Matching for Sparse 3D Geological Generation


439. Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation


440. I Can’t Believe TTA Is Not Better: When Test-Time Augmentation Hurts Medical Image Classification


441. Assessing Privacy Preservation and Utility in Online Vision-Language Models


442. TaFall: Balance-Informed Fall Detection via Passive Thermal Sensing


443. CAGE: Bridging the Accuracy-Aesthetics Gap in Educational Diagrams via Code-Anchored Generative Enhancement


444. Face Density as a Proxy for Data Complexity: Quantifying the Hardness of Instance Count


445. Grid2Matrix: Revealing Digital Agnosia in Vision-Language Models


446. Decision-Theoretic Safety Assessment of Persona-Driven Multi-Agent Systems in O-RAN


447. Heterogeneous Consensus-Progressive Reasoning for Efficient Multi-Agent Debate


448. NetAgentBench: A State-Centric Benchmark for Evaluating Agentic Network Configuration


449. A Comparative Theoretical Analysis of Entropy Control Methods in Reinforcement Learning


450. Real-Time Voicemail Detection in Telephony Audio Using Temporal Speech Activity Features


451. Active Inference with a Self-Prior in the Mirror-Mark Task


452. Human-like Working Memory Interference in Large Language Models


453. Digital hybridity and relics in cultural heritage: using corpus linguistics to inform design in emerging technologies from AI to VR


454. Do We Still Need GraphRAG? Benchmarking RAG and GraphRAG for Agentic Search Systems


455. Deliberative Alignment is Deep, but Uncertainty Remains: Inference time safety improvement in reasoning via attribution of unsafe behavior to base model


456. Learning noisy phase transition dynamics from stochastic partial differential equations


457. Fairboard: a quantitative framework for equity assessment of healthcare models


458. NeuroPath: Practically Adopting Motor Imagery Decoding through EEG Signals


459. Diffusion-Based Generative Priors for Efficient Beam Alignment in Directional Networks


460. Dynamic Forecasting and Temporal Feature Evolution of Stock Repurchases in Listed Companies Using Attention-Based Deep Temporal Networks


461. WearBCI Dataset: Understanding and Benchmarking Real-World Wearable Brain-Computer Interfaces Signals


462. Efficient Disruption of Criminal Networks through Multi-Objective Genetic Algorithms


463. Generating High Quality Synthetic Data for Dutch Medical Conversations


464. Detecting Corporate AI-Washing via Cross-Modal Semantic Inconsistency Learning


465. Leveraging Machine Learning Techniques to Investigate Media and Information Literacy Competence in Tackling Disinformation


466. From Understanding to Creation: A Prerequisite-Free AI Literacy Course with Technical Depth Across Majors


467. Agentic AI in Engineering and Manufacturing: Industry Perspectives on Utility, Adoption, Challenges, and Opportunities


468. Hardware Utilization and Inference Performance of Edge Object Detection Under Fault Injection


469. Adoption and Effectiveness of AI-Based Anomaly Detection for Cross Provider Health Data Exchange


470. Assessing Model-Agnostic XAI Methods against EU AI Act Explainability Requirements


471. Explainability and Certification of AI-Generated Educational Assessments


472. LLM Nepotism in Organizational Governance


473. Assessing the Pedagogical Readiness of Large Language Models as AI Tutors in Low-Resource Contexts: A Case Study of Nepal’s K-10 Curriculum


474. HearthNet: Edge Multi-Agent Orchestration for Smart Homes


475. Token-Budget-Aware Pool Routing for Cost-Efficient LLM Inference


476. Characterizing Performance-Energy Trade-offs of Large Language Models in Multi-Request Workflows


477. Human-AI Interaction Traces as Blackout Poetry: Reframing AI-Supported Writing as Found-Text Creativity


478. ECHO: Elastic Speculative Decoding with Sparse Gating for High-Concurrency Scenarios


479. Duration-Informed Workload Scheduler


480. From Theory to Protocol: Executable Frameworks for Creative Emergence and Strategic Foresight


481. Why Smaller Is Slower? Dimensional Misalignment in Compressed LLMs


482. Evaluating Visual Prompts with Eye-Tracking Data for MLLM-Based Human Activity Recognition


483. Generative UI: LLMs are Effective UI Generators


484. ACE-TA: An Agentic Teaching Assistant for Grounded Q&A, Quiz Generation, and Code Tutoring


485. Tuning Qwen2.5-VL to Improve Its Web Interaction Skills


486. Neuro-Symbolic Strong-AI Robots with Closed Knowledge Assumption: Learning and Deductions


487. LETGAMES: An LLM-Powered Gamified Approach to Cognitive Training for Patients with Cognitive Impairment


488. AEG: A Baremetal Framework for AI Acceleration via Direct Hardware Access in Heterogeneous Accelerators


489. ACE-Bench: A Lightweight Benchmark for Evaluating Azure SDK Usage Correctness


490. StreamServe: Adaptive Speculative Flows for Low-Latency Disaggregated LLM Serving


491. Emergent Social Structures in Autonomous AI Agent Networks: A Metadata Analysis of 626 Agents on the Pilot Protocol


492. SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding


493. Para-B&B: Load-Balanced Deterministic Parallelization of Solving MIP


494. SRBench: A Comprehensive Benchmark for Sequential Recommendation with Large Language Models


495. MCERF: Advancing Multimodal LLM Evaluation of Engineering Documentation with Enhanced Retrieval


496. SemaCDR: LLM-Powered Transferable Semantics for Cross-Domain Sequential Recommendation


497. Beyond Offline A/B Testing: Context-Aware Agent Simulation for Recommender System Evaluation


498. Retrieval-Augmented Large Language Models for Evidence-Informed Guidance on Cannabidiol Use in Older Adults


499. The Paradox of Professional Input: How Expert Collaboration with AI Systems Shapes Their Future Value