All AI Papers - 2026-03-24

1. MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management


2. SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models


3. GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning


4. A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP


5. Future-Interactions-Aware Trajectory Prediction via Braid Theory


6. Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support


7. Tacit Knowledge Management with Generative AI: Proposal of the GenAI SECI Model


8. Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models


9. Agentic Personas for Adaptive Scientific Explanations with Knowledge Graphs


10. The Presupposition Problem in Representation Genesis


11. The Reasoning Error About Reasoning: Why Different Types of Reasoning Require Different Representational Structures


12. EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning


13. CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning


14. Compensating Visual Insufficiency with Stratified Language Guidance for Long-Tail Class Incremental Learning


15. A Blueprint for Self-Evolving Coding Agents in Vehicle Aerodynamic Drag Prediction


16. MIND: Multi-agent inference for negotiation dialogue in travel planning


17. Deterministic Hallucination Detection in Medical VQA via Confidence-Evidence Bayesian Gain


18. Reasoning Provenance for Autonomous AI Agents: Structured Behavioral Analytics Beyond State Checkpoints and Execution Traces


19. AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design


20. Mirage: The Illusion of Visual Understanding


21. Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks


22. EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises


23. INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation


24. A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment


25. Mind over Space: Can Multimodal Large Language Models Mentally Navigate?


26. Adaptive Robust Estimator for Multi-Agent Reinforcement Learning


27. Counterfactual Credit Policy Optimization for Multi-Agent Collaboration


28. Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment


29. Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems


30. Beyond Correlation: Refutation-Validated Aspect-Based Sentiment Analysis for Explainable Energy Market Returns


31. Safety as Computation: Certified Answer Reuse via Capability Closure in Task-Oriented Dialogue


32. Behavioural feasible set: Value alignment constraints on AI decision support


33. DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation


34. Is the future of AI green? What can innovation diffusion models say about generative AI’s environmental impact?


35. Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures


36. The Myhill-Nerode Theorem for Bounded Interaction: Canonical Abstractions via Agent-Bounded Indistinguishability


37. Persona Vectors in Games: Measuring and Steering Strategies via Activation Vectors


38. PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost


39. A transformer architecture alteration to incentivise externalised reasoning


40. AdaRubric: Task-Adaptive Rubrics for LLM Agent Evaluation


41. AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling


42. The AI Scientific Community: Agentic Virtual Lab Swarms


43. RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models


44. ARYA: A Physics-Constrained Composable & Deterministic World Model Architecture


45. Improving Coherence and Persistence in Agentic AI for System Optimization


46. The Library Theorem: How External Organization Governs Agentic Reasoning Capacity


47. Graph of States: Solving Abductive Tasks with Large Language Models


48. ConsRoute: Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models


49. Does AI Homogenize Student Thinking? A Multi-Dimensional Analysis of Structural Convergence in AI-Augmented Essays


50. Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning


51. Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs


52. ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation


53. LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning


54. KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph


55. Knowledge Boundary Discovery for Large Language Models


56. A Framework for Low-Latency, LLM-driven Multimodal Interaction on the Pepper Robot


57. The Intelligent Disobedience Game: Formulating Disobedience in Stackelberg Games and Markov Decision Processes


58. Can we automatize scientific discovery in the cognitive sciences?


59. AutoMOOSE: An Agentic AI for Autonomous Phase-Field Simulation


60. gUFO: A Gentle Foundational Ontology for Semantic Web Knowledge Graphs


61. Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions


62. Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descriptive Norms, and Popularity Cues


63. ReLaMix: Residual Latency-Aware Mixing for Delay-Robust Financial Time-Series Forecasting


64. Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems


65. GMPilot: An Expert AI Agent For FDA cGMP Compliance


66. Modeling Epistemic Uncertainty in Social Perception via Rashomon Set Agents


67. Multi-RF Fusion with Multi-GNN Blending for Molecular Property Prediction


68. AI-Driven Multi-Agent Simulation of Stratified Polyamory Systems: A Computational Framework for Optimizing Social Reproductive Efficiency


69. Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models


70. Attention in Space: Functional Roles of VLM Heads for Spatial Reasoning


71. From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via Shadow-RAG


72. Agentic AI and the next intelligence explosion


73. Seed1.8 Model Card: Towards Generalized Real-World Agency


74. Reasoning Traces Shape Outputs but Models Won’t Say So


75. Where can AI be used? Insights from a deep ontology of work activities


76. Position: Multi-Agent Algorithmic Care Systems Demand Contestability for Trustworthy AI


77. Context Cartography: Toward Structured Governance of Contextual Space in Large Language Model Systems


78. LLM-Driven Heuristic Synthesis for Industrial Process Control: Lessons from Hot Steel Rolling


79. Grounded Chess Reasoning in Language Models via Master Distillation


80. Efficient Counterfactual Reasoning in ProbLog via Single World Intervention Programs


81. DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation


82. Deep reflective reasoning in interdependence constrained structured data extraction from clinical notes for digital health


83. Leveraging Natural Language Processing and Machine Learning for Evidence-Based Food Security Policy Decision-Making in Data-Scarce Making


84. Compression is all you need: Modeling Mathematics


85. LLM-Enhanced Energy Contrastive Learning for Out-of-Distribution Detection in Text-Attributed Graphs


86. AgentComm-Bench: Stress-Testing Cooperative Embodied AI Under Latency, Packet Loss, and Bandwidth Collapse


87. Me, Myself, and $π$: Evaluating and Explaining LLM Introspection


88. FactorSmith: Agentic Simulation Generation via Markov Decision Process Decomposition with Planner-Designer-Critic Refinement


89. Domain-Specialized Tree of Thought through Plug-and-Play Predictors


90. ProMAS: Proactive Error Forecasting for Multi-Agent Systems Using Markov Transition Dynamics


91. AgenticGEO: A Self-Evolving Agentic System for Generative Engine Optimization


92. WorldCache: Content-Aware Caching for Accelerated Video World Models


93. End-to-End Training for Unified Tokenization and Latent Denoising


94. UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation


95. ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model


96. 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing


97. TiCo: Time-Controllable Training for Spoken Dialogue Models


98. Confidence-Based Decoding is Provably Efficient for Diffusion Language Models


99. One Model, Two Markets: Bid-Aware Generative Recommendation


100. SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation


101. Dyadic: A Scalable Platform for Human-Human and Human-AI Conversation Research


102. Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models


103. SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection


104. CayleyPy-4: AI-Holography. Towards analogs of holographic string dualities for AI tasks


105. Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement


106. Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation


107. Calibeating Made Simple


108. Multimodal Survival Analysis with Locally Deployable Large Language Models


109. Beyond Matching to Tiles: Bridging Unaligned Aerial and Satellite Views for Vision-Only UAV Navigation


110. More Isn’t Always Better: Balancing Decision Accuracy and Conformity Pressures in Multi-AI Advice


111. Mamba-VMR: Multimodal Query Augmentation via Generated Videos for Precise Temporal Grounding


112. On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation


113. On the Failure of Topic-Matched Contrast Baselines in Multi-Directional Refusal Abliteration


114. Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models


115. ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention


116. SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation


117. λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks


118. TREX: Trajectory Explanations for Multi-Objective Reinforcement Learning


119. LRC-WeatherNet: LiDAR, RADAR, and Camera Fusion Network for Real-time Weather-type Classification in Autonomous Driving


120. SecureBreak – A dataset towards safe and secure models


121. Parameter-Efficient Fine-Tuning for Medical Text Summarization: A Comparative Study of LoRA, Prompt Tuning, and Full Fine-Tuning


122. Suiren-1.0 Technical Report: A Family of Molecular Foundation Models


123. Chronological Contrastive Learning: Few-Shot Progression Assessment in Irreversible Diseases


124. Camera-Agnostic Pruning of 3D Gaussian Splats via Descriptor-Based Beta Evidence


125. Deep Reinforcement Learning and The Tale of Two Temporal Difference Errors


126. SHAPE: Structure-aware Hierarchical Unsupervised Domain Adaptation with Plausibility Evaluation for Medical Image Segmentation


127. Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation


128. SmaAT-QMix-UNet: A Parameter-Efficient Vector-Quantized UNet for Precipitation Nowcasting


129. P^2O: Joint Policy and Prompt Optimization


130. Manifold-Aware Exploration for Reinforcement Learning in Video Generation


131. Adversarial Camouflage


132. Adaptive Video Distillation: Mitigating Oversaturation and Temporal Collapse in Few-Step Generation


133. Sim-to-Real of Humanoid Locomotion Policies via Joint Torque Space Perturbation Injection


134. On the Number of Conditional Independence Tests in Constraint-based Causal Discovery


135. Select, Label, Evaluate: Active Testing in NLP


136. Instruction Set and Language for Symbolic Regression


137. CoRA: Boosting Time Series Foundation Models for Multivariate Forecasting through Correlation-aware Adapter


138. BadminSense: Enabling Fine-Grained Badminton Stroke Evaluation on a Single Smartwatch


139. SteelDefectX: A Coarse-to-Fine Vision-Language Dataset and Benchmark for Generalizable Steel Surface Defect Detection


140. Ctrl-A: Control-Driven Online Data Augmentation


141. Extending Precipitation Nowcasting Horizons via Spectral Fusion of Radar Observations and Foundation Model Priors


142. Cycle Inverse-Consistent TransMorph: A Balanced Deep Learning Framework for Brain MRI Registration


143. Let’s Think with Images Efficiently! An Interleaved-Modal Chain-of-Thought Reasoning Framework with Dynamic and Precise Visual Thoughts


144. Cognitive Agency Surrender: Defending Epistemic Sovereignty via Scaffolded AI Friction


145. FISformer: Replacing Self-Attention with a Fuzzy Inference System in Transformer Models for Time Series Forecasting


146. SemEval-2026 Task 12: Abductive Event Reasoning: Towards Real-World Event Causal Inference for Large Language Models


147. When Exploration Comes for Free with Mixture-Greedy: Do we need UCB in Diversity-Aware Multi-Armed Bandits?


148. Rethinking Token Reduction for Large Vision-Language Models


149. Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models


150. Thinking Deeper, Not Longer: Depth-Recurrent Transformers for Compositional Generalization


151. Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-Stage Pseudo-Rain Synthesis


152. Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks


153. Efficient Zero-Shot AI-Generated Image Detection


154. AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents


155. Rule-State Inference (RSI): A Bayesian Framework for Compliance Monitoring in Rule-Governed Domains


156. DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers


157. mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT


158. Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Graph Intelligence


159. Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limited Communications


160. PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection


161. CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Ground-Truth Annotation


162. Rethinking SAR ATR: A Target-Aware Frequency-Spatial Enhancement Framework with Noise-Resilient Knowledge Guidance


163. Toward a Theory of Hierarchical Memory for Language Agents


164. What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators


165. Evolutionary Biparty Multiobjective UAV Path Planning: Problems and Empirical Comparisons


166. Sharper Generalization Bounds for Transformer



168. BOxCrete: A Bayesian Optimization Open-Source AI Model for Concrete Strength Forecasting and Mix Optimization


169. CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs


170. SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems


171. Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation


172. Optimizing Feature Extraction for On-device Model Inference with User Behavior Sequences


173. Quotient Geometry, Effective Curvature, and Implicit Bias in Simple Shallow Neural Networks


174. A Framework for Closed-Loop Robotic Assembly, Alignment and Self-Recovery of Precision Optical Systems


175. RuntimeSlicer: Towards Generalizable Unified Runtime State Representation for Failure Management


176. Effective Strategies for Asynchronous Software Engineering Agents


177. DSPA: Dynamic SAE Steering for Data-Efficient Preference Alignment


178. When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models


179. KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning


180. LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study


181. HyReach: Vision-Guided Hybrid Manipulator Reaching in Unseen Cluttered Environments


182. Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs


183. Fingerprinting Deep Neural Networks for Ownership Protection: An Analytical Approach


184. An InSAR Phase Unwrapping Framework for Large-scale and Complex Events


185. Benchmarking Bengali Dialectal Bias: A Multi-Stage Framework Integrating RAG-Based Translation and Human-Augmented RLAIF


186. Generalized Discrete Diffusion from Snapshots


187. COINBench: Moving Beyond Individual Perspectives to Collective Intent Understanding


188. B-jet Tagging Using a Hybrid Edge Convolution and Transformer Architecture


189. Enhancing Reasoning Accuracy in Large Language Models During Inference Time


190. More Than Sum of Its Parts: Deciphering Intent Shifts in Multimodal Hate Speech Detection


191. DeepXplain: XAI-Guided Autonomous Defense Against Multi-Stage APT Campaigns


192. When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning


193. Sonny: Breaking the Compute Wall in Medium-Range Weather Forecasting


194. Fusing Memory and Attention: A study on LSTM, Transformer and Hybrid Architectures for Symbolic Music Generation


195. WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making


196. Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations


197. Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity


198. Domain Elastic Transform: Bayesian Function Registration for High-Dimensional Scientific Data


199. QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression


200. When Convenience Becomes Risk: A Semantic View of Under-Specification in Host-Acting Agents


201. Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis


202. Is Monitoring Enough? Strategic Agent Selection For Stealthy Attack in Multi-Agent Discussions


203. Context Selection for Hypothesis and Statistical Evidence Extraction from Full-Text Scientific Articles


204. LLM-based Automated Architecture View Generation: Where Are We Now?


205. Prompt replay: speeding up GRPO with on-policy reuse of high-signal prompts


206. Reward Sharpness-Aware Fine-Tuning for Diffusion Models


207. Rethinking Plasticity in Deep Reinforcement Learning


208. TRACE: A Multi-Agent System for Autonomous Physical Reasoning in Seismological Science


209. Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains


210. NeSy-Edge: Neuro-Symbolic Trustworthy Self-Healing in the Computing Continuum


211. One Pool Is Not Enough: Multi-Cluster Memory for Practical Test-Time Adaptation


212. DMMRL: Disentangled Multi-Modal Representation Learning via Variational Autoencoders for Molecular Property Prediction


213. Learning Progressive Adaptation for Multi-Modal Tracking


214. Mixture of Chapters: Scaling Learnt Memory in Transformers


215. Representation-Level Adversarial Regularization for Clinically Aligned Multitask Thyroid Ultrasound Assessment


216. ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks


217. Assessing the Ability of Neural TTS Systems to Model Consonant-Induced F0 Perturbation


218. CTFS: Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels


219. Harmful Visual Content Manipulation Matters in Misinformation Detection Under Multimedia Scenarios


220. A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors


221. SpatialFly: Geometry-Guided Representation Alignment for UAV Vision-and-Language Navigation in Urban Environments


222. LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction


223. DSL-R1: From SQL to DSL for Training Retrieval Agents across Structured and Unstructured Data with Reinforcement Learning


224. Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO


225. ALL-FEM: Agentic Large Language models Fine-tuned for Finite Element Methods


226. How AI Systems Think About Education: Analyzing Latent Preference Patterns in Large Language Models


227. Long-Term Outlier Prediction Through Outlier Score Modeling


228. Structural Sensitivity in Compressed Transformers: Error Propagation, Lyapunov Stability, and Formally Verified Bounds


229. ECI: Effective Contrastive Information to Evaluate Hard-Negatives


230. Cyber Deception for Mission Surveillance via Hypergame-Theoretic Deep Reinforcement Learning


231. From Causal Discovery to Dynamic Causal Inference in Neural Time Series


232. Detection of adversarial intent in Human-AI teams using LLMs


233. Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification


234. Alignment Whack-a-Mole: Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models


235. Beyond Expression Similarity: Contrastive Learning Recovers Functional Gene Associations from Protein Interaction Structure


236. Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents


237. User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction


238. AC4A: Access Control for Agents


239. Causally-Guided Diffusion for Stable Feature Selection



241. Enhancing LIME using Neural Decision Trees


242. Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach


243. Natural Gradient Descent for Online Continual Learning


244. The data heat island effect: quantifying the impact of AI data centers in a warming world


245. Beyond the Birkhoff Polytope: Spectral-Sphere-Constrained Hyper-Connections


246. Characterizing the onset and offset of motor imagery during passive arm movements induced by an upper-body exoskeleton


247. RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation


248. Semantic Sections: An Atlas-Native Feature Ontology for Obstructed Representation Spaces


249. Restoring Neural Network Plasticity for Faster Transfer Learning


250. SozKZ: Training Efficient Small Language Models for Kazakh from Scratch


251. Can ChatGPT Really Understand Modern Chinese Poetry?


252. HiCI: Hierarchical Construction-Integration for Long-Context Attention


253. Dodgersort: Uncertainty-Aware VLM-Guided Human-in-the-Loop Pairwise Ranking


254. MERIT: Multi-domain Efficient RAW Image Translation


255. Compass: Optimizing Compound AI Workflows for Dynamic Adaptation


256. PlanaReLoc: Camera Relocalization in 3D Planar Primitives via Region-Based Structure Matching


257. OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation


258. Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping


259. Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks


260. Weakly supervised multimodal segmentation of acoustic borehole images with depth-aware cross-attention


261. Decoupling Numerical and Structural Parameters: An Empirical Study on Adaptive Genetic Algorithms via Deep Reinforcement Learning for the Large-Scale TSP


262. Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models


263. SWE-Next: Scalable Real-World Software Engineering Tasks for Agents


264. Artificial Intelligence in Experimental Approaches: Growth Hacking, Lean Startup, Design Thinking, and Agile


265. SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection


266. Centrality-Based Pruning for Efficient Echo State Networks


267. PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs


268. REVERE: Reflective Evolving Research Engineer for Scientific Workflows


269. Sinkhorn Based Associative Memory Retrieval Using Spherical Hellinger Kantorovich Dynamics


270. Modernizing Amdahl’s Law: How AI Scaling Laws Shape Computer Architecture


271. A Multihead Continual Learning Framework for Fine-Grained Fashion Image Retrieval with Contrastive Learning and Exponential Moving Average Distillation


272. Weber’s Law in Transformer Magnitude Representations: Efficient Coding, Representational Geometry, and Psychophysical Laws in Language Models


273. AEGIS: From Clues to Verdicts – Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing


274. CFNN: Continued Fraction Neural Network


275. Interpretable Operator Learning for Inverse Problems via Adaptive Spectral Filtering: Convergence and Discretization Invariance


276. Graph-based data-driven discovery of interpretable laws governing corona-induced noise and radio interference for high-voltage transmission lines


277. MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning


278. Permutation-Consensus Listwise Judging for Robust Factuality Evaluation


279. An Industrial-Scale Retrieval-Augmented Generation Framework for Requirements Engineering: Empirical Evaluation with Automotive Manufacturing Data


280. Revenue-Sharing as Infrastructure: A Distributed Business Model for Generative AI Platforms


281. Epistemic Observability in Language Models


282. Does This Gradient Spark Joy?


283. Delightful Distributed Policy Gradient


284. Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study


285. ReBOL: Retrieval via Bayesian Optimization with Batched LLM Relevance Observations and Query Reformulation


286. Measuring Reasoning Trace Legibility: Can Those Who Understand Teach?


287. Meeting in the Middle: A Co-Design Paradigm for FHE and AI Inference


288. Profiling learners’ affective engagement: Emotion AI, intercultural pragmatics, and language learning


289. Diffutron: A Masked Diffusion Language Model for Turkish Language


290. Shift-Invariant Feature Attribution in the Application of Wireless Electrocardiograms


291. Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable


292. Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents


293. Detecting Neurovascular Instability from Multimodal Physiological Signals Using Wearable-Compatible Edge AI: A Responsible Computational Framework


294. ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models’ In-Context Learning Ability


295. Coding Agents are Effective Long-Context Processors


296. PEARL: Personalized Streaming Video Understanding Model


297. Meta-Learning for Repeated Bayesian Persuasion


298. Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation


299. KV Cache Optimization Strategies for Scalable and Efficient LLM Inference


300. SymCircuit: Bayesian Structure Inference for Tractable Probabilistic Circuits via Entropy-Regularized Reinforcement Learning


301. CAMA: Exploring Collusive Adversarial Attacks in c-MARL


302. The production of meaning in the processing of natural language


303. ALARA for Agents: Least-Privilege Context Engineering Through Portable Composable Multi-Agent Teams


304. WebNavigator: Global Web Navigation via Interaction Graph Retrieval


305. Comprehensive Description of Uncertainty in Measurement for Representation and Propagation with Scalable Precision


306. Memory poisoning and secure multi-agent systems


307. Leum-VL Technical Report


308. MANA: Towards Efficient Mobile Ad Detection via Multimodal Agentic UI Navigation


309. ContractSkill: Repairable Contract-Based Skills for Multimodal Web Agents


310. Low-pass Personalized Subgraph Federated Recommendation


311. GEM: A Native Graph-based Index for Multi-Vector Retrieval


312. Procedural Refinement by LLM-driven Algorithmic Debugging for ARC-AGI-2


313. Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms


314. Probing the Latent World: Emergent Discrete Symbols and Physical Structure in Latent Representations


315. When Agents Disagree: The Selection Bottleneck in Multi-Agent LLM Pipelines


316. GIP-RAG: An Evidence-Grounded Retrieval-Augmented Framework for Interpretable Gene Interaction and Pathway Impact Analysis


317. The Causal Impact of Tool Affordance on Safety Alignment in LLM Agents


318. Bypassing Document Ingestion: An MCP Approach to Financial Q&A


319. Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection


320. kRAIG: A Natural Language-Driven Agent for Automated DataOps Pipeline Generation


321. Reason-to-Transmit: Deliberative Adaptive Communication for Cooperative Perception


322. EARTalking: End-to-end GPT-style Autoregressive Talking Head Synthesis with Frame-wise Control


323. InjectFlow: Weak Guides Strong via Orthogonal Injection for Flow Matching


324. Voice Privacy from an Attribute-based Perspective


325. From Human Interfaces to Agent Interfaces: Rethinking Software Design in the Age of AI-Native Systems


326. HCAG: Hierarchical Abstraction and Retrieval-Augmented Generation on Theoretical Repositories with LLMs


327. Transformer-Based Predictive Maintenance for Risk-Aware Instrument Calibration


328. Collaborative Adaptive Curriculum for Progressive Knowledge Distillation


329. MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery


330. HSI Image Enhancement Classification Based on Knowledge Distillation: A Study on Forgetting


331. On the Fragility of AI Agent Collusion


332. Learning Communication Between Heterogeneous Agents in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence


333. OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis


334. Understanding Pruning Regimes in Vision-Language Models Through Domain-Aware Layer Selection


335. Efficient AI-Driven Multi-Section Whole Slide Image Analysis for Biochemical Recurrence Prediction in Prostate Cancer


336. JointFM-0.1: A Foundation Model for Multi-Target Joint Distributional Prediction


337. JCAS-MARL: Joint Communication and Sensing UAV Networks via Resource-Constrained Multi-Agent Reinforcement Learning


338. Deciphering Scientific Reasoning Steps from Outcome Data for Molecule Optimization



340. SciNav: A General Agent Framework for Scientific Coding Tasks


341. AI Detectors Fail Diverse Student Populations: A Mathematical Framing of Structural Detection Limits


342. Developing Machine Learning-Based Watch-to-Warning Severe Weather Guidance from the Warn-on-Forecast System


343. Stability of AI Governance Systems: A Coupled Dynamics Model of Public Trust and Social Disruptions


344. Decoding the decoder: Contextual sequence-to-sequence modeling for intracortical speech decoding


345. Writing literature reviews with AI: principles, hurdles and some lessons learned


346. Emergency Lane-Change Simulation: A Behavioral Guidance Approach for Risky Scenario Generation


347. Fusing Driver Perceived and Physical Risk for Safety Critical Scenario Screening in Autonomous Driving


348. Email in the Era of LLMs


349. Beyond Scalar Rewards: Distributional Reinforcement Learning with Preordered Objectives for Safe and Reliable Autonomous Driving


350. Characterizing the ability of LLMs to recapitulate Americans’ distributional responses to public opinion polling questions across political issues


351. The Arrival of AGI? When Expert Personas Exceed Expert Benchmarks


352. Inference Energy and Latency in AI-Mediated Education: A Learning-per-Watt Analysis of Edge and Cloud Models


353. Locally Coherent Parallel Decoding in Diffusion Language Models


354. Beyond Detection: Governing GenAI in Academic Peer Review as a Sociotechnical Challenge


355. Exploring Teacher-Chatbot Interaction and Affect in Block-Based Programming


356. CRoCoDiL: Continuous and Robust Conditioned Diffusion for Language


357. Children’s Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs


358. RedacBench: Can AI Erase Your Secrets?


359. Enhancing Safety of Large Language Models via Embedding Space Separation


360. Measuring Research Convergence in Interdisciplinary Teams Using Large Language Models and Graph Analytics


361. Your Robot Will Feel You Now: Empathy in Robots and Embodied Agents


362. REMI: Reconstructing Episodic Memory During Internally Driven Path Planning