All AI Papers - 2026-03-31

1. Dynamic Dual-Granularity Skill Bank for Agentic RL


2. Not Search, But Scan: Benchmarking MLLMs on Scan-Oriented Academic Paper Reasoning


3. The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AIGENIE from its Bottle


4. Seeing with You: Perception-Reasoning Coevolution for Multimodal Reasoning


5. MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models


6. Towards a Medical AI Scientist


7. T-Norm Operators for EU AI Act Compliance Classification: An Empirical Comparison of Lukasiewicz, Product, and Gödel Semantics in a Neuro-Symbolic Reasoning System


8. Entropic Claim Resolution: Uncertainty-Driven Evidence Selection for RAG


9. MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome


10. The Scaffold Effect: How Prompt Framing Drives Apparent Multimodal Gains in Clinical VLM Evaluation


11. COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments via Two-Player Zero-Sum Game


12. Deep Research of Deep Research: From Transformer to Agent, From AI to AI for Science


13. CoE: Collaborative Entropy for Uncertainty Quantification in Agentic Multi-LLM Systems


14. A Multi-Agent Rhizomatic Pipeline for Non-Linear Literature Analysis


15. Evaluating LLMs for Answering Student Questions in Introductory Programming Courses


16. Reasoning as Energy Minimization over Structured Latent Trajectories


17. Differentiable Power-Flow Optimization


18. EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling


19. PReD: An LLM-based Foundation Multimodal Model for Electromagnetic Perception, Recognition, and Decision


20. CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning


21. Reward Hacking as Equilibrium under Finite Evaluation


22. SLOW: Strategic Logical-inference Open Workspace for Cognitive Adaptation in AI Tutoring


23. Meta-Harness: End-to-End Optimization of Model Harnesses


24. Dogfight Search: A Swarm-Based Optimization Algorithm for Complex Engineering Optimization and Mountainous Terrain Path Planning


25. Beyond the Answer: Decoding the Behavior of LLMs as Scientific Reasoners


26. When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA


27. What an Autonomous Agent Discovers About Molecular Transformer Design: Does It Transfer?


28. HeteroHub: An Applicable Data Management Framework for Heterogeneous Multi-Embodied Agent System


29. SARL: Label-Free Reinforcement Learning by Rewarding Reasoning Topology


30. CARV: A Diagnostic Benchmark for Compositional Analogical Reasoning in Multimodal LLMs


31. GEAKG: Generative Executable Algorithm Knowledge Graphs


32. GAAMA: Graph Augmented Associative Memory for Agents


33. CARGO: Carbon-Aware Gossip Orchestration in Smart Shipping


34. Let the Agent Steer: Closed-Loop Ranking Optimization via Influence Exchange


35. SkyNet: Belief-Aware Planning for Partially-Observable Stochastic Games


36. TianJi: An autonomous AI meteorologist for discovering physical mechanisms in atmospheric science


37. DSevolve: Enabling Real-Time Adaptive Scheduling on Dynamic Shop Floor with LLM-Evolved Heuristic Portfolios


38. What does a system modify when it modifies itself?


39. From indicators to biology: the calibration problem in artificial consciousness


40. Dual-Stage LLM Framework for Scenario-Centric Semantic Interpretation in Driving Assistance


41. PeopleSearchBench: A Multi-Dimensional Benchmark for Evaluating AI-Powered People Search Platforms


42. The Novelty Bottleneck: A Framework for Understanding Human Effort Scaling in AI-Assisted Work


43. AstraAI: LLMs, Retrieval, and AST-Guided Assistance for HPC Codebases


44. Greedy Is a Strong Default: Agents as Iterative Optimizers


45. On the Relationship between Bayesian Networks and Probabilistic Structural Causal Models


46. Heterogeneous Debate Engine: Identity-Grounded Cognitive Architecture for Resilient LLM-Based Ethical Tutoring


47. Defend: Automated Rebuttals for Peer Review with Minimal Author Guidance


48. LLM Readiness Harness: Evaluation, Observability, and CI Gates for LLM/RAG Applications


49. Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance


50. A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI


51. CounterMoral: Editing Morals in Language Models


52. TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba


53. EpochX: Building the Infrastructure for an Emergent Agent Civilization


54. Self-evolving AI agents for protein discovery and directed evolution


55. Quantification of Credal Uncertainty: A Distance-Based Approach


56. AutoMS: Multi-Agent Evolutionary Search for Cross-Physics Inverse Microstructure Design


57. Aligning LLMs with Graph Neural Solvers for Combinatorial Optimization


58. daVinci-LLM: Towards the Science of Pretraining


59. MediHive: A Decentralized Agent Collective for Medical Reasoning


60. The Price of Meaning: Why Every Semantic Memory System Forgets


61. When Verification Hurts: Asymmetric Effects of Multi-Agent Feedback in Logic Proof Tutoring


62. FormalProofBench: Can Models Write Graduate Level Math Proofs That Are Formally Verified?


63. Transparency as Architecture: Structural Compliance Gaps in EU AI Act Article 50 II


64. Compliance-Aware Predictive Process Monitoring: A Neuro-Symbolic Approach


65. Neuro-Symbolic Learning for Predictive Process Monitoring via Two-Stage Logic Tensor Networks with Rule Pruning


66. Concerning Uncertainty – A Systematic Survey of Uncertainty-Aware XAI


67. Multiverse: Language-Conditioned Multi-Game Level Blending via Shared Representation


68. Bitboard version of Tetris AI


69. Geometry-aware similarity metrics for neural representations on Riemannian and statistical manifolds


70. On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers


71. ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining


72. RAD-AI: Rethinking Architecture Documentation for AI-Augmented Ecosystems


73. SAGAI-MID: A Generative AI-Driven Middleware for Dynamic Runtime Interoperability


74. Stepwise Credit Assignment for GRPO on Flow-Matching Models


75. A Convex Route to Thermomechanics: Learning Internal Energy and Dissipation


76. AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding


77. Why Aggregate Accuracy is Inadequate for Evaluating Fairness in Law Enforcement Facial Recognition Systems


78. AMIGO: Agentic Multi-Image Grounding Oracle Benchmark


79. Information-Theoretic Limits of Safety Verification for Self-Improving Systems


80. Dynamic Lookahead Distance via Reinforcement Learning-Based Pure Pursuit for Autonomous Racing


81. Trust-Aware Routing for Distributed Generative AI Inference at the Edge


82. TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark


83. ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning


84. Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection


85. Detection of Adversarial Attacks in Robotic Perception


86. Navigating the Mirage: A Dual-Path Agentic Framework for Robust Misleading Chart Question Answering


87. ChemCLIP: Bridging Organic and Inorganic Anticancer Compounds Through Contrastive Learning


88. Learning Partial Action Replacement in Offline MARL


89. CirrusBench: Evaluating LLM-based Agents Beyond Correctness in Real-World Cloud Service Environments


90. Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems


91. Domain-Invariant Prompt Learning for Vision-Language Models


92. Hydra: Unifying Document Retrieval and Generation in a Single Vision-Language Model


93. Detecting low left ventricular ejection fraction from ECG using an interpretable and scalable predictor-driven framework


94. RAD-LAD: Rule and Language Grounded Autonomous Driving in Real-Time


95. The Unreasonable Effectiveness of Scaling Laws in AI


96. Next-Token Prediction and Regret Minimization


97. MRI-to-CT synthesis using drifting models


98. Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification


99. CiQi-Agent: Aligning Vision, Tools and Aesthetics in Multimodal Agent for Cultural Reasoning on Chinese Porcelains


100. HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention


101. FeDMRA: Federated Incremental Learning with Dynamic Memory Replay Allocation


102. GeoHCC: Local Geometry-Aware Hierarchical Context Compression for 3D Gaussian Splatting


103. AceleradorSNN: A Neuromorphic Cognitive System Integrating Spiking Neural Networks and Dynamic Image Signal Processing on FPGA


104. Learning unified control of internal spin squeezing in atomic qudits for magnetometry


105. Spectral Higher-Order Neural Networks


106. KGroups: A Versatile Univariate Max-Relevance Min-Redundancy Feature Selection Algorithm for High-dimensional Biological Data


107. Evolutionary Discovery of Reinforcement Learning Algorithms via Large Language Models


108. EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation


109. From Simulation to Deep Learning: Survey on Network Performance Modeling Approaches


110. Critic-Free Deep Reinforcement Learning for Maritime Coverage Path Planning on Irregular Hexagonal Grids


111. Membership Inference Attacks against Large Audio Language Models


112. Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design


113. Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure


114. Crossing the NL/PL Divide: Information Flow Analysis Across the NL/PL Boundary in LLM-Integrated Code


115. Integrating Multimodal Large Language Model Knowledge into Amodal Completion


116. Building evidence-based knowledge graphs from full-text literature for disease-specific biomedical reasoning


117. Mapping data literacy trajectories in K-12 education


118. Self++: Co-Determined Agency for Human–AI Symbiosis in Extended Reality


119. NeiGAD: Augmenting Graph Anomaly Detection via Spectral Neighbor Information


120. FI-KAN: Fractal Interpolation Kolmogorov-Arnold Networks


121. Pre-Deployment Complexity Estimation for Federated Perception Systems


122. Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights


123. Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries


124. MR-ImagenTime: Multi-Resolution Time Series Generation through Dual Image Representations


125. DiffAttn: Diffusion-Based Drivers’ Visual Attention Prediction with LLM-Enhanced Semantic Reasoning


126. TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation


127. An Optimal Battery-Free Approach for Emission Reduction by Storing Solar Surplus in Building Thermal Mass


128. ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models


129. Designing AI for Real Users – Accessibility Gaps in Retail AI Front-End


130. Skillful Kilometer-Scale Regional Weather Forecasting via Global and Regional Coupling


131. Evaluating Privilege Usage of Agents on Real-World Tools


132. RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Semantic Segmentation


133. MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios


134. Does Claude’s Constitution Have a Culture?


135. Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data


136. Quid est VERITAS? A Modular Framework for Archival Document Analysis


137. Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models


138. MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions


139. MolmoPoint: Better Pointing for VLMs with Grounding Tokens


140. Synonymix: Unified Group Personas for Generative Simulations


141. Bit-Identical Medical Deep Learning via Structured Orthogonal Initialization


142. CARLA-Air: Fly Drones Inside a CARLA World – A Unified Infrastructure for Air-Ground Embodied Intelligence


143. Kill-Chain Canaries: Stage-Level Tracking of Prompt Injection Across Attack Surfaces and Model Safety Tiers


144. ViviDoc: Generating Interactive Documents through Human-Agent Collaboration


145. Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment


146. FedFG: Privacy-Preserving and Robust Federated Learning via Flow-Matching Generation


147. CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Vision-Language Models


148. JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding


149. Physics-Guided Transformer (PGT): Physics-Aware Attention Mechanism for PINNs


150. Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey


151. ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing


152. AI-ready design of realistic 2D materials and interfaces with Mat3ra-2D


153. Kernel Dynamics under Path Entropy Maximization


154. A Revealed Preference Framework for AI Alignment


155. ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks


156. KVSculpt: KV Cache Compression as Distillation


157. Towards Context-Aware Image Anonymization with Multi-Agent Reasoning


158. Towards Emotion Recognition with 3D Pointclouds Obtained from Facial Expression Images


159. What-If Explanations Over Time: Counterfactuals for Time Series Classification


160. Heracles: Bridging Precise Tracking and Generative Synthesis for General Humanoid Control


161. AI-Powered Facial Mask Removal Is Not Suitable For Biometric Identification


162. Needle in the Repo: A Benchmark for Maintainability in AI-Generated Repository Edits


163. Robust Smart Contract Vulnerability Detection via Contrastive Learning-Enhanced Granular-ball Training


164. Suppression of $^{14}\mathrm{C}$ photon hits in large liquid scintillator detectors via spatiotemporal deep learning


165. The role of neuromorphic principles in the future of biomedicine and healthcare


166. RAP: Retrieve, Adapt, and Prompt-Fit for Training-Free Few-Shot Medical Image Segmentation


167. LVRPO: Language-Visual Alignment with GRPO for Multimodal Understanding and Generation


168. ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation


169. EvA: An Evidence-First Audio Understanding Paradigm for LALMs


170. ContraMap: Contrastive Uncertainty Mapping for Robot Environment Representation


171. Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents


172. Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling


173. STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding


174. InnerPond: Fostering Inter-Self Dialogue with a Multi-Agent Approach for Introspection


175. A General Model for Deepfake Speech Detection: Diverse Bonafide Resources or Diverse AI-Based Generators


176. Drag or Traction: Understanding How Designers Appropriate Friction in AI Ideation Outputs


177. A Novel Immune Algorithm for Multiparty Multiobjective Optimization


178. Toward Reliable Evaluation of LLM-Based Financial Multi-Agent Systems: Taxonomy, Coordination Primacy, and Cost Awareness


179. Demo-Pose: Depth-Monocular Modality Fusion For Object Pose Estimation


180. Cross-attentive Cohesive Subgraph Embedding to Mitigate Oversquashing in GNNs


181. Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs


182. A Systematic Taxonomy of Security Vulnerabilities in the OpenClaw AI Agent Framework


183. Understanding Semantic Perturbations on In-Processing Generative Image Watermarks


184. Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs


185. Copilot-Assisted Second-Thought Framework for Brain-to-Robot Hand Motion Decoding


186. AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents


187. Difference Feedback: Generating Multimodal Process-Level Supervision for VLM Reinforcement Learning


188. On Token’s Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models


189. KV Cache Quantization for Self-Forcing Video Generation: A 33-Method Empirical Study


190. TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization


191. Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development


192. Multi-Agent Dialectical Refinement for Enhanced Argument Classification


193. GIFT: Bootstrapping Image-to-CAD Program Synthesis via Geometric Feedback


194. Evaluating Large and Lightweight Vision Models for Irregular Component Segmentation in E-Waste Disassembly


195. Improving Attributed Long-form Question Answering with Intent Awareness


196. CarbonEdge: Carbon-Aware Deep Learning Inference Framework for Sustainable Edge Computing


197. Agent-Driven Autonomous Reinforcement Learning Research: Iterative Policy Improvement for Quadruped Locomotion


198. Multiple-Prediction-Powered Inference


199. The Hidden Costs of AI-Mediated Political Outreach: Persuasion and AI Penalties in the US and UK


200. The Geometry of Harmful Intent: Training-Free Anomaly Detection via Angular Deviation in LLM Residual Streams


201. Grounding Social Perception in Intuitive Physics


202. Conditional Factuality Controlled LLMs with Generalization Certificates via Conformal Sampling


203. Diagnosing Non-Markovian Observations in Reinforcement Learning via Prediction-Based Violation Scoring


204. Where Does AI Leave a Footprint? Children’s Reasoning About AI’s Environmental Costs


205. Guided Lensless Polarization Imaging


206. Culturally Adaptive Explainable LLM Assessment for Multilingual Information Disorder: A Human-in-the-Loop Approach


207. D-SPEAR: Dual-Stream Prioritized Experience Adaptive Replay for Stable Reinforcement Learning in Robotic Manipulation


208. ComBench: A Repo-level Real-world Benchmark for Compilation Error Repair


209. Improving Automated Wound Assessment Using Joint Boundary Segmentation and Multi-Class Classification Models


210. Multimodal Forecasting for Commodity Prices Using Spectrogram-Based and Time Series Representations


211. GUIDE: Guided Updates for In-context Decision Evolution in LLM-Driven Spacecraft Operations


212. A Multi-agent AI System for Deep Learning Model Migration from TensorFlow to JAX


213. Beyond Descriptions: A Generative Scene2Audio Framework for Blind and Low-Vision Users to Experience Vista Landscapes


214. Codebase-Memory: Tree-Sitter-Based Knowledge Graphs for LLM Code Exploration via MCP


215. Robust Global-Local Behavior Arbitration via Continuous Command Fusion Under LiDAR Errors


216. From Foundation ECG Models to NISQ Learners: Distilling ECGFounder into a VQC Student


217. Amalgam: Hybrid LLM-PGM Synthesis Algorithm for Accuracy and Realism


218. Zero-shot Vision-Language Reranking for Cross-View Geolocalization


219. Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection


220. Can pre-trained Deep Learning models predict groove ratings?


221. EuraGovExam: A Multilingual Multimodal Benchmark from Real-World Civil Service Exams


222. Unsupervised Evaluation of Deep Audio Embeddings for Music Structure Analysis


223. Multi-AUV Ad-hoc Networks-Based Multi-Target Tracking Based on Scene-Adaptive Embodied Intelligence


224. An End-to-end Flight Control Network for High-speed UAV Obstacle Avoidance based on Event-Depth Fusion


225. GSR-GNN: Training Acceleration and Memory-Saving Framework of Deep GNNs on Circuit Graph


226. A Tight Expressivity Hierarchy for GNN-Based Entity Resolution in Master Data Management


227. SafetyDrift: Predicting When AI Agents Cross the Line Before They Actually Do


228. Bayes-MICE: A Bayesian Approach to Multiple Imputation for Time Series Data


229. Bayesian-Symbolic Integration for Uncertainty-Aware Parking Prediction


230. Gender-Based Heterogeneity in Youth Privacy-Protective Behavior for Smart Voice Assistants: Evidence from Multigroup PLS-SEM


231. Autonomous Agent-Orchestrated Digital Twins (AADT): Leveraging the OpenClaw Framework for State Synchronization in Rare Genetic Disorders


232. Sovereign Context Protocol: An Open Attribution Layer for Human-Generated Content in the Age of Large Language Models


233. RDEx-MOP: Indicator-Guided Reconstructed Differential Evolution for Fixed-Budget Multiobjective Optimization


234. RDEx-CSOP: Feasibility-Aware Reconstructed Differential Evolution with Adaptive epsilon-Constraint Ranking


235. RDEx-SOP: Exploitation-Biased Reconstructed Differential Evolution for Fixed-Budget Bound-Constrained Single-Objective Optimization


236. Voice-based debate with an AI adversary is associated with increased divergent ideation


237. Dynamic resource matching in manufacturing using deep reinforcement learning


238. ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding


239. Debiasing Large Language Models toward Social Factors in Online Behavior Analytics through Prompt Knowledge Tuning


240. Persona-Based Simulation of Human Opinion at Population Scale


241. Multi-Level Barriers to Generative AI Adoption Across Disciplines and Professional Roles in Higher Education


242. Unsupervised Behavioral Compression: Learning Low-Dimensional Policy Manifolds through State-Occupancy Matching


243. TAPS: Task Aware Proposal Distributions for Speculative Sampling


244. Generative Shape Reconstruction with Geometry-Guided Langevin Dynamics


245. UMI-Underwater: Learning Underwater Manipulation without Underwater Teleoperation


246. AutoSiMP: Autonomous Topology Optimization from Natural Language via LLM-Driven Problem Configuration and Adaptive Solver Control


247. Online Statistical Inference of Constant Sample-averaged Q-Learning


248. ASTER – Agentic Science Toolkit for Exoplanet Research


249. Mimetic Alignment with ASPECT: Evaluation of AI-inferred Personal Profiles


250. Are LLMs Good For Quantum Software, Architecture, and System Design?


251. Magic Words or Methodical Work? Challenging Conventional Wisdom in LLM-Based Political Text Annotation


252. Strategic Candidacy in Generative AI Arenas


253. LACON: Training Text-to-Image Model from Uncurated Data


254. A federated architecture for sector-led AI governance: lessons from India


255. EZASP – Facilitating the usage of ASP


256. Beyond Textual Knowledge: Leveraging Multimodal Knowledge Bases for Enhancing Vision-and-Language Navigation


257. AFSS: Artifact-Focused Self-Synthesis for Mitigating Bias in Audio Deepfake Detection


258. Stable Reasoning, Unstable Responses: Mitigating LLM Deception via Stability Asymmetry


259. GISclaw: An Open-Source LLM-Powered Agent System for Full-Stack Geospatial Analysis


260. Uncertainty-Aware Mapping from 3D Keypoints to Anatomical Landmarks for Markerless Biomechanics


261. VAN-AD: Visual Masked Autoencoder with Normalizing Flow For Time Series Anomaly Detection


262. FatigueFormer: Static-Temporal Feature Fusion for Robust sEMG-Based Muscle Fatigue Recognition


263. Dual-branch Graph Domain Adaptation for Cross-scenario Multi-modal Emotion Recognition


264. SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation


265. Hybrid Diffusion Model for Breast Ultrasound Image Augmentation


266. Envisioning global urban development with satellite imagery and generative AI


267. A Regression Framework for Understanding Prompt Component Impact on LLM Performance


268. Squish and Release: Exposing Hidden Hallucinations by Making Them Surface as Safety Signals


269. Central-to-Local Adaptive Generative Diffusion Framework for Improving Gene Expression Prediction in Data-Limited Spatial Transcriptomics


270. Throughput Optimization as a Strategic Lever in Large-Scale AI Systems: Evidence from Dataloader and Memory Profiling Innovations


271. Epileptic Seizure Prediction Using Patient-Adaptive Transformer Networks


272. PiCSRL: Physics-Informed Contextual Spectral Reinforcement Learning


273. Resolving the Robustness-Precision Trade-off in Financial RAG through Hybrid Document-Routed Retrieval


274. Implicit neural representations for larval zebrafish brain microscopy: a reproducible benchmark on the MapZebrain atlas


275. GroupRAG: Cognitively Inspired Group-Aware Retrieval and Reasoning via Knowledge-Driven Problem Structuring


276. The Language of Touch: Translating Vibrations into Text with Dual-Branch Learning


277. Sparse-by-Design Cross-Modality Prediction: L0-Gated Representations for Reliable and Efficient Learning


278. DSO: Dual-Scale Neural Operators for Stable Long-term Fluid Dynamics Forecasting


279. Explaining, Verifying, and Aligning Semantic Hierarchies in Vision-Language Model Embeddings


280. Robust Batch-Level Query Routing for Large Language Models under Cost and Capacity Constraints


281. HASS: Hierarchical Simulation of Logopenic Aphasic Speech for Scalable PPA Detection


282. PhyDCM: A Reproducible Open-Source Framework for AI-Assisted Brain Tumor Classification from Multi-Sequence MRI


283. A Firefly Algorithm for Mixed-Variable Optimization Based on Hybrid Distance Modeling


284. CRISP: Characterizing Relative Impact of Scholarly Publications


285. A Step Toward Federated Pretraining of Multimodal Large Language Models


286. Can We Change the Stroke Size for Easier Diffusion?


287. Limits of Imagery Reasoning in Frontier LLM Models


288. TED: Training-Free Experience Distillation for Multimodal Reasoning


289. Learning to Select Visual In-Context Demonstrations


290. From Content to Audience: A Multimodal Annotation Framework for Broadcast Television Analytics


291. Edge Reliability Gap in Vision-Language Models: Quantifying Failure Modes of Compressed VLMs Under Visual Corruption


292. Aesthetic Assessment of Chinese Handwritings Based on Vision Language Models


293. Tiny-ViT: A Compact Vision Transformer for Efficient and Explainable Potato Leaf Disease Classification


294. Generating Synthetic Wildlife Health Data from Camera Trap Imagery: A Pipeline for Alopecia and Body Condition Training Data


295. Training-Free Diffusion-Driven Modeling of Pareto Set Evolution for Dynamic Multiobjective Optimization


296. LARD 2.0: Enhanced Datasets and Benchmarking for Autonomous Landing Systems


297. Steering Sparse Autoencoder Latents to Control Dynamic Head Pruning in Vision Transformers (Student Abstract)


298. Language-Conditioned World Modeling for Visual Navigation


299. Quantum Fuzzy Sets Revisited: Density Matrices, Decoherence, and the Q-Matrix Framework


300. SleepVLM: Explainable and Rule-Grounded Sleep Staging via a Vision-Language Model


301. Beyond Static Visual Tokens: Structured Sequential Visual Chain-of-Thought Reasoning


302. Ordinal Semantic Segmentation Applied to Medical and Odontological Images


303. Distilled Large Language Model-Driven Dynamic Sparse Expert Activation Mechanism


304. Contextual inference from single objects in Vision-Language models


305. Multi-view Graph Convolutional Network with Fully Leveraging Consistency via Granular-ball-based Topology Construction, Feature Enhancement and Interactive Fusion


306. SEAR: Schema-Based Evaluation and Routing for LLM Gateways


307. The Nonverbal Gap: Toward Affective Computer Vision for Safer and More Equitable Online Dating


308. A Multimodal Deep Learning Framework for Edema Classification Using HCT and Clinical Data


309. Capability Safety as Datalog: A Foundational Equivalence


310. Brain-inspired AI for Edge Intelligence: a systematic review


311. Stress Classification from ECG Signals Using Vision Transformer


312. SutureAgent: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space


313. Toward Evaluation Frameworks for Multi-Agent Scientific AI Systems


314. On the Carbon Footprint of Economic Research in the Age of Generative AI


315. Agentic AI for Human Resources: LLM-Driven Candidate Assessment


316. The Cognitive Divergence: AI Context Windows, Human Attention Decline, and the Delegation Feedback Loop


317. PI-Mamba: Linear-Time Protein Backbone Generation via Spectrally Initialized Flow Matching


318. Deep Learning Multi-Horizon Irradiance Nowcasting: A Comparative Evaluation of Three Methods for Leveraging Sky Images


319. Physicochemical-Neural Fusion for Semi-Closed-Circuit Respiratory Autonomy in Extreme Environments


320. Complementarity-Preserving Generative Theory for Multimodal ECG Synthesis: A Quantum-Inspired Approach


321. Degrees, Levels, and Profiles of Contextuality


322. SpatialPoint: Spatial-aware Point Prediction for Embodied Localization


323. Learning Energy-Efficient Air–Ground Actuation for Hybrid Robots on Stair-Like Terrain


324. Contextual Graph Representations for Task-Driven 3D Perception and Planning


325. LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval


326. Operationalizing Perceptions of Agent Gender: Foundations and Guidelines


327. AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Alignment


328. AI Meets Mathematics Education: A Case Study on Supporting an Instructor in a Large Mathematics Class with Context-Aware AI


329. Power Couple? AI Growth and Renewable Energy Investment


330. Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift


331. Can AI be a Teaching Partner? Evaluating ChatGPT, Gemini, and DeepSeek across Three Teaching Strategies


332. ReCQR: Incorporating conversational query rewriting to improve Multimodal Image Retrieval


333. Bridge-RAG: An Abstract Bridge Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter


334. M-RAG: Making RAG Faster, Stronger, and More Efficient


335. Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells


336. SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs


337. Exploring Cultural Variations in Moral Judgments with Large Language Models