전체 AI 논문 - 2026-04-20

1. ASMR-Bench: Auditing for Sabotage in ML Research


2. Using Large Language Models and Knowledge Graphs to Improve the Interpretability of Machine Learning Models in Manufacturing


3. Learning to Reason with Insight for Informal Theorem Proving


4. Characterising LLM-Generated Competency Questions: a Cross-Domain Empirical Study using Open and Closed Models


5. MARCH: Multi-Agent Radiology Clinical Hierarchy for CT Report Generation


6. SocialGrid: A Benchmark for Planning and Social Reasoning in Embodied Multi-Agent Systems


7. MEDLEY-BENCH: Scale Buys Evaluation but Not Control in AI Metacognition


8. ReactBench: A Benchmark for Topological Reasoning in MLLMs on Chemical Reaction Diagrams



10. Integrating Graphs, Large Language Models, and Agents: Reasoning and Retrieval


11. Towards Rigorous Explainability by Feature Attribution


12. Experience Compression Spectrum: Unifying Memory, Skills, and Rules in LLM Agents


13. Discover and Prove: An Open-source Agentic Framework for Hard Mode Automated Theorem Proving in Lean 4


14. Stein Variational Black-Box Combinatorial Optimization


15. KWBench: Measuring Unprompted Problem Recognition in Knowledge Work


16. Structured Abductive-Deductive-Inductive Reasoning for LLMs via Algebraic Invariants


17. LLM Reasoning Is Latent, Not the Chain of Thought


18. The World Leaks the Future: Harness Evolution for Future Prediction Agents



20. Subliminal Transfer of Unsafe Behaviors in AI Agent Distillation


21. Preregistered Belief Revision Contracts


22. LACE: Lattice Attention for Cross-thread Exploration


23. Bureaucratic Silences: What the Canadian AI Register Reveals, Omits, and Obscures


24. GIST: Multimodal Knowledge Extraction and Spatial Grounding via Intelligent Semantic Topology


25. DeepER-Med: Advancing Deep Evidence-Based Research in Medicine Through Agentic AI


26. VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects



28. Beyond Distribution Sharpening: The Importance of Task Rewards


29. Joint-Centric Dual Contrastive Alignment with Structure-Preserving and Information-Balanced Regularization


30. BAGEL: Benchmarking Animal Knowledge Expertise in Language Models


31. A Two-Stage, Object-Centric Deep Learning Framework for Robust Exam Cheating Detection


32. Neuro-Symbolic ODE Discovery with Latent Grammar Flow


33. “Taking Stock at FAccT”: Using Participatory Design to Co-Create a Vision for the Fairness, Accountability and Transparency Community


34. Beyond Surface Statistics: Robust Conformal Prediction for LLMs via Internal Representations


35. AIFIND: Artifact-Aware Interpreting Fine-Grained Alignment for Incremental Face Forgery Detection


36. ChemGraph-XANES: An Agentic Framework for XANES Simulation and Analysis


37. Synthetic data in cryptocurrencies using generative models


38. JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models


39. AtManRL: Towards Faithful Reasoning via Differentiable Attention Saliency


40. SWNet: A Cross-Spectral Network for Camouflaged Weed Detection


41. Training Time Prediction for Mixed Precision-based Distributed Training


42. Can LLMs Understand the Impact of Trauma? Costs and Benefits of LLMs Coding the Interviews of Firearm Violence Survivors


43. SCRIPT: Implementing an Intelligent Tutoring System for Programming in a German University Context


44. The Relic Condition: When Published Scholarship Becomes Material for Its Own Replacement


45. Reckoning with the Political Economy of AI: Avoiding Decoys in Pursuit of Accountability


46. Dual-Modal Lung Cancer AI: Interpretable Radiology and Microscopy with Clinical Risk Integration


47. Robust Synchronisation for Federated Learning in The Face of Correlated Device Failure


48. Stylistic-STORM (ST-STORM) : Perceiving the Semantic Nature of Appearance


49. Unveiling Stochasticity: Universal Multi-modal Probabilistic Modeling for Traffic Forecasting


50. Early Detection of Acute Myeloid Leukemia (AML) Using YOLOv12 Deep Learning Model


51. Prototype-Grounded Concept Models for Verifiable Concept Alignment


52. Chain-of-Thought Degrades Visual Spatial Reasoning Capabilities of Multimodal LLMs


53. AST: Adaptive, Seamless, and Training-Free Precise Speech Editing


54. Mind’s Eye: A Benchmark of Visual Abstraction, Transformation and Composition for Multimodal LLMs


55. Towards Intrinsic Interpretability of Large Language Models:A Survey of Design Principles and Architectures


56. Safe Deep Reinforcement Learning for Building Heating Control and Demand-side Flexibility


57. Where does output diversity collapse in post-training?


58. Neurosymbolic Repo-level Code Localization


59. AgentV-RL: Scaling Reward Modeling with Agentic Verifier


60. From Vulnerable Data Subjects to Vulnerabilizing Data Practices: Navigating the Protection Paradox in AI-Based Analyses of Platformized Lives


61. Polarization by Default: Auditing Recommendation Bias in LLM-Based Content Curation


62. UniEditBench: A Unified and Cost-Effective Benchmark for Image and Video Editing via Distilled MLLMs


63. DiZiNER: Disagreement-guided Instruction Refinement via Pilot Annotation Simulation for Zero-shot Named Entity Recognition


64. QuantSightBench: Evaluating LLM Quantitative Forecasting with Prediction Intervals


65. Robust Multispectral Semantic Segmentation under Missing or Full Modalities via Structured Latent Projection


66. DPrivBench: Benchmarking LLMs’ Reasoning for Differential Privacy


67. ECG-Lens: Benchmarking ML & DL Models on PTB-XL Dataset


68. Beyond a Single Frame: Multi-Frame Spatially Grounded Reasoning Across Volumetric MRI


69. From Seeing to Simulating: Generative High-Fidelity Simulation with Digital Cousins for Generalizable Robot Learning and Evaluation


70. From Intention to Text: AI-Supported Goal Setting in Academic Writing


71. Self-Distillation as a Performance Recovery Mechanism for LLMs: Counteracting Compression and Catastrophic Forgetting


72. EVIL: Evolving Interpretable Algorithms for Zero-Shot Inference on Event Sequences and Time Series with LLMs


73. SegMix:Shuffle-based Feedback Learning for Semantic Segmentation of Pathology Images


74. PIIBench: A Unified Multi-Source Benchmark Corpus for Personally Identifiable Information Detection


75. Phase Transitions as the Breakdown of Statistical Indistinguishability


76. Closing the Theory-Practice Gap in Spiking Transformers via Effective Dimension


77. cuNNQS-SCI: A Fully GPU-Accelerated Framework for High-Performance Configuration Interaction Selection withNeural Network QQantum States


78. When Do Early-Exit Networks Generalize? A PAC-Bayesian Theory of Adaptive Depth


79. DepCap: Adaptive Block-Wise Parallel Decoding for Efficient Diffusion LM Inference


80. Learning Uncertainty from Sequential Internal Dispersion in Large Language Models


81. Sketch and Text Synergy: Fusing Structural Contours and Descriptive Attributes for Fine-Grained Image Retrieval


82. MambaBack: Bridging Local Features and Global Contexts in Whole Slide Image Analysis


83. Privacy-Preserving LLMs Routing


84. Reasoning-targeted Jailbreak Attacks on Large Reasoning Models via Semantic Triggers and Psychological Framing


85. Diffusion Autoencoder for Unsupervised Artifact Restoration in Handheld Fundus Images


86. NeuroLip: An Event-driven Spatiotemporal Learning Framework for Cross-Scene Lip-Motion-based Visual Speaker Recognition


87. GTA-2: Benchmarking General Tool Agents from Atomic Tool-Use to Open-Ended Workflows


88. Just Type It in Isabelle! AI Agents Drafting, Mechanizing, and Generalizing from Human Hints


89. SSMamba: A Self-Supervised Hybrid State Space Model for Pathological Image Classification


90. The Price of Paranoia: Robust Risk-Sensitive Cooperation in Non-Stationary Multi-Agent Reinforcement Learning


91. Hierarchical Active Inference using Successor Representations


92. CodeMMR: Bridging Natural Language, Code, and Image for Unified Retrieval


93. HYPERHEURIST: A Simulated Annealing-Based Control Framework for LLM-Driven Code Generation in Optimized Hardware Design


94. Rethinking the Necessity of Adaptive Retrieval-Augmented Generation through the Lens of Adaptive Listwise Ranking


95. VoodooNet: Achieving Analytic Ground States via High-Dimensional Random Projections


96. CLIMB: Controllable Longitudinal Brain Image Generation using Mamba-based Latent Diffusion Model and Gaussian-aligned Autoencoder


97. Imperfectly Cooperative Human-AI Interactions: Comparing the Impacts of Human and AI Attributes in Simulated and User Studies


98. DataCenterGym: A Physics-Grounded Simulator for Multi-Objective Data Center Scheduling


99. DALM: A Domain-Algebraic Language Model via Three-Phase Structured Generation


100. BioHiCL: Hierarchical Multi-Label Contrastive Learning for Biomedical Retrieval with MeSH Labels


101. CSLE: A Reinforcement Learning Platform for Autonomous Security Management


102. LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance


103. “Excuse me, may I say something…” CoLabScience, A Proactive AI Assistant for Biomedical Discovery and LLM-Expert Collaborations


104. PAWN: Piece Value Analysis with Neural Networks


105. Symbolic Guardrails for Domain-Specific Agents: Stronger Safety and Security Guarantees Without Sacrificing Utility


106. Reward Weighted Classifier-Free Guidance as Policy Improvement in Autoregressive Models


107. Why Fine-Tuning Encourages Hallucinations and How to Fix It


108. Natural gradient descent with momentum


109. Consistency Analysis of Sentiment Predictions using Syntactic & Semantic Context Assessment Summarization (SSAS)


110. LLMbench: A Comparative Close Reading Workbench for Large Language Models


111. PolicyBank: Evolving Policy Understanding for LLM Agents


112. SecureRouter: Encrypted Routing for Efficient Secure Inference


113. A Q-learning-based QoS-aware multipath routing protocol in IoMT-based wireless body area network


114. FineSteer: A Unified Framework for Fine-Grained Inference-Time Steering in Large Language Models


115. Harmonizing Multi-Objective LLM Unlearning via Unified Domain Representation and Bidirectional Logit Distillation


116. The Semi-Executable Stack: Agentic Software Engineering and the Expanding Scope of SE


117. Ragged Paged Attention: A High-Performance and Flexible LLM Inference Kernel for TPU


118. The Crutch or the Ceiling? How Different Generations of LLMs Shape EFL Student Writings


119. RelativeFlow: Taming Medical Image Denoising Learning with Noisy Reference



121. Transfer Learning from Foundational Optimization Embeddings to Unsupervised SAT Representations


122. StoSignSGD: Unbiased Structural Stochasticity Fixes SignSGD for Training Large Language Models


123. HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?


124. Beyond Single-Model Optimization: Preserving Plasticity in Continual Reinforcement Learning


125. PRL-Bench: A Comprehensive Benchmark Evaluating LLMs’ Capabilities in Frontier Physics Research


126. The Illusion of Equivalence: Systematic FP16 Divergence in KV-Cached Autoregressive Inference


127. Dispatch-Aware Ragged Attention for Pruned Vision Transformers


128. Hallucination as Trajectory Commitment: Causal Evidence for Asymmetric Attractor Dynamics in Transformer Generation


129. Lightweight Geometric Adaptation for Training Physics-Informed Neural Networks


130. Analyzing Chain of Thought (CoT) Approaches in Control Flow Code Deobfuscation Tasks


131. Exploring LLM-based Verilog Code Generation with Data-Efficient Fine-Tuning and Testbench Automation


132. LinuxArena: A Control Setting for AI Agents in Live Production Software Environments


133. Temporal Contrastive Decoding: A Training-Free Method for Large Audio-Language Models


134. Exascale Multi-Task Graph Foundation Models for Imbalanced, Multi-Fidelity Atomistic Data


135. Zoom Consistency: A Free Confidence Signal in Multi-Step Visual Grounding Pipelines


136. VeriCWEty: Embedding enabled Line-Level CWE Detection in Verilog


137. Seeing the imagined: a latent functional alignment in visual imagery decoding from fMRI data


138. InfoChess: A Game of Adversarial Inference and a Laboratory for Quantifiable Information Control


139. The Synthetic Media Shift: Tracking the Rise, Virality, and Detectability of AI-Generated Multimodal Misinformation


140. Applied Explainability for Large Language Models: A Comparative Study


141. Taming Asynchronous CPU-GPU Coupling for Frequency-aware Latency Estimation on Mobile Edge


142. Sequential KV Cache Compression via Probabilistic Language Tries: Beyond the Per-Vector Shannon Limit


143. SocialWise: LLM-Agentic Conversation Therapy for Individuals with Autism Spectrum Disorder to Enhance Communication Skills


144. To LLM, or Not to LLM: How Designers and Developers Navigate LLMs as Tools or Teammates


145. When the Loop Closes: Architectural Limits of In-Context Isolation, Metacognitive Co-option, and the Two-Target Design Problem in Human-LLM Systems


146. MRGEN: A Conceptual Framework for LLM-Powered Mixed Reality Authoring Tools for Education


147. Uncertainty, Vagueness, and Ambiguity in Human-Robot Interaction: Why Conceptualization Matters


148. Facial-Expression-Aware Prompting for Empathetic LLM Tutoring


149. A Comparative Study on the Impact of Traditional Learning and Interactive Learning on Students’ Academic Performance and Emotional Well-Being


150. Beyond Passive Viewing: A Pilot Study of a Hybrid Learning Platform Augmenting Video Lectures with Conversational AI


151. Technically Love: The Evolution of Human-AI Romance Discourse on Reddit


152. Automating Crash Diagram Generation Using Vision-Language Models: A Case Study on Multi-Lane Roundabouts


153. How people use Copilot for Health


154. Evaluating LLMs as Human Surrogates in Controlled Experiments


155. Eco-Bee: A Personalised Multi-Modal Agent for Advancing Student Climate Awareness and Sustainable Behaviour in Campus Ecosystems


156. Struggle Premium : How Human Effort and Imperfection Drive Perceived Value in the Age of AI


157. Explainable Iterative Data Visualisation Refinement via an LLM Agent


158. Anthropomorphism and Trust in Human-Large Language Model interactions


159. Modeling of ASD/TD Children’s Behaviors in Interaction with a Virtual Social Robot During a Music Education Program Using Deep Neural Networks


160. Seeing the Intangible: Survey of Image Classification into High-Level and Abstract Categories