전체 AI 논문 - 2026-04-15

1. PAL: Personal Adaptive Learner


2. Bilevel Late Acceptance Hill Climbing for the Electric Capacitated Vehicle Routing Problem


3. Cycle-Consistent Search: Question Reconstructability as a Proxy Reward for Search Agent Training


4. Modeling Co-Pilots for Text-to-Model Translation


5. Drawing on Memory: Dual-Trace Encoding Improves Cross-Session Recall in LLM Agents


6. BEAM: Bi-level Memory-adaptive Algorithmic Evolution for LLM-Powered Heuristic Design


7. AISafetyBenchExplorer: A Metric-Aware Catalogue of AI Safety Benchmarks Reveals Fragmented Measurement and Weak Benchmark Governance


8. LIFE – an energy efficient advanced continual learning agentic AI framework for frontier systems


9. QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence


10. From edges to meaning: Semantic line sketches as a cognitive scaffold for ancient pictograph invention


11. Artificial Intelligence for Modeling and Simulation of Mixed Automated and Human Traffic


12. RePAIR: Interactive Machine Unlearning through Prompt-Aware Model Repair


13. DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding


14. Can AI Tools Transform Low-Demand Math Tasks? An Evaluation of Task Modification Capabilities


15. Transferable Expertise for Autonomous Agents via Real-World Case-Based Learning


16. MISID: A Multimodal Multi-turn Dataset for Complex Intent Recognition in Strategic Deception Games


17. A hierarchical spatial-aware algorithm with efficient reinforcement learning for human-robot task planning and allocation in production


18. Safe reinforcement learning with online filtering for fatigue-predictive human-robot task planning and allocation in production


19. Human-Centric Topic Modeling with Goal-Prompted Contrastive Learning and Optimal Transport


20. Broadening the Applicability of Conditional Syntax Splitting for Reasoning from Conditional Belief Bases


21. RPRA: Predicting an LLM-Judge for Efficient but Performant Inference


22. KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance


23. Every Picture Tells a Dangerous Story: Memory-Augmented Multi-Agent Jailbreak Attacks on VLMs


24. DeepTest Tool Competition 2026: Benchmarking an LLM-Based Automotive Assistant


25. IDEA: An Interpretable and Editable Decision-Making Framework for LLMs via Verbal-to-Numeric Calibration


26. Cross-Cultural Simulation of Citizen Emotional Responses to Bureaucratic Red Tape Using LLM Agents


27. A Two-Stage LLM Framework for Accessible and Verified XAI Explanations


28. Technical Report – A Context-Sensitive Multi-Level Similarity Framework for First-Order Logic Arguments: An Axiomatic Study


29. Intelligent ROI-Based Vehicle Counting Framework for Automated Traffic Monitoring


30. CIA: Inferring the Communication Topology from LLM-based Multi-Agent Systems


31. Enhancing Clustering: An Explainable Approach via Filtered Patterns


32. Operationalising the Right to be Forgotten in LLMs: A Lightweight Sequential Unlearning Framework for Privacy-Aligned Deployment in Politically Sensitive Environments


33. Heuristic Classification of Thoughts Prompting (HCoT): Integrating Expert System Heuristics for Structured Reasoning into Large Language Models


34. Preventing Safety Drift in Large Language Models via Coupled Weight and Activation Constraints


35. ReflectCAP: Detailed Image Captioning with Reflective Memory


36. MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents


37. Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization


38. GAM: Hierarchical Graph-based Agentic Memory for LLM Agents


39. A Scoping Review of Large Language Model-Based Pedagogical Agents


40. How memory can affect collective and cooperative behaviors in an LLM-Based Social Particle Swarm


41. HintMR: Eliciting Stronger Mathematical Reasoning in Small Language Models


42. Designing Reliable LLM-Assisted Rubric Scoring for Constructed Responses: Evidence from Physics Exams


43. Modality-Native Routing in Agent-to-Agent Networks: A Multimodal A2A Protocol Extension


44. Beyond Prompt: Fine-grained Simulation of Cognitively Impaired Standardized Patients via Stochastic Steering


45. Latent patterns of urban mixing in mobility analysis across five global cities


46. Beyond Scores: Diagnostic LLM Evaluation via Fine-Grained Abilities


47. TRUST Agents: A Collaborative Multi-Agent Framework for Fake News Detection, Explainable Verification, and Logic-Aware Claim Reasoning


48. Policy-Invisible Violations in LLM-Based Agents


49. Evaluating Relational Reasoning in LLMs with REL


50. EMBER: Autonomous Cognitive Behaviour from Learned Spiking Neural Network Dynamics in a Hybrid LLM Architecture


51. Development, Evaluation, and Deployment of a Multi-Agent System for Thoracic Tumor Board


52. Beyond Factual Grounding: The Case for Opinion-Aware Retrieval-Augmented Generation


53. Towards Platonic Representation for Table Reasoning: A Foundation for Permutation-Invariant Retrieval


54. Aethon: A Reference-Based Replication Primitive for Constant-Time Instantiation of Stateful AI Agents


55. Long-Horizon Plan Execution in Large Tool Spaces through Entropy-Guided Branching


56. The A-R Behavioral Space: Execution-Level Profiling of Tool-Using Language Model Agents in Organizational Deployment


57. Spatial Atlas: Compute-Grounded Reasoning for Spatial-Aware Research Agent Benchmarks


58. LLM-HYPER: Generative CTR Modeling for Cold-Start Ad Personalization via LLM-Based Hypernetworks


59. Human-Inspired Context-Selective Multimodal Memory for Social Robots


60. Mathematics Teachers Interactions with a Multi-Agent System for Personalized Problem Generation


61. Memory as Metabolism: A Design for Companion Knowledge Systems


62. WiseOWL: A Methodology for Evaluating Ontological Descriptiveness and Semantic Correctness for Ontology Reuse and Ontology Recommendations


63. A longitudinal health agent framework


64. Identity as Attractor: Geometric Evidence for Persistent Agent Architecture in LLM Activation Space


65. When to Forget: A Memory Governance Primitive


66. The Long-Horizon Task Mirage? Diagnosing Where and Why Agentic Systems Break


67. Narrative-Driven Paper-to-Slide Generation via ArcDeck


68. GoodPoint: Learning Constructive Scientific Paper Feedback from Author Responses


69. Self-Monitoring Benefits from Structural Integration: Lessons from Metacognition in Continuous-Time Multi-Timescale Agents


70. The Non-Optimality of Scientific Knowledge: Path Dependence, Lock-In, and The Local Minimum Trap


71. Visual Preference Optimization with Rubric Rewards


72. Representation geometry shapes task performance in vision-language modeling for CT enterography


73. Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe


74. Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation


75. One Token Away from Collapse: The Fragility of Instruction-Tuned Helpfulness


76. LogicEval: A Systematic Framework for Evaluating Automated Repair Techniques for Logical Vulnerabilities in Real-World Software


77. ROSE: An Intent-Centered Evaluation Metric for NL2SQL


78. Parallax: Why AI Agents That Think Must Never Act


79. Distorted or Fabricated? A Survey on Hallucination in Video LLMs


80. CoDe-R: Refining Decompiler Output with LLMs via Rationale Guidance and Adaptive Inference


81. Round-Trip Translation Reveals What Frontier Multilingual Benchmarks Miss



83. FastGrasp: Learning-based Whole-body Control method for Fast Dexterous Grasping with Mobile Manipulators


84. Detecting and refurbishing ground truth errors during training of deep learning-based echocardiography segmentation models


85. Loop Corrections to the Training and Generalization Errors of Random Feature Models


86. Algorithmic Analysis of Dense Associative Memory: Finite-Size Guarantees and Adversarial Robustness


87. Rethinking Satellite Image Restoration for Onboard AI: A Lightweight Learning-Based Approach


88. Efficiency of Proportional Mechanisms in Online Auto-Bidding Advertising


89. VFA: Relieving Vector Operations in Flash Attention with Global Maximum Pre-computation


90. OSC: Hardware Efficient W4A4 Quantization via Outlier Separation in Channel Dimension


91. Efficient Adversarial Training via Criticality-Aware Fine-Tuning


92. DoseRAD2026 Challenge dataset: AI accelerated photon and proton dose calculation for radiotherapy


93. Cognition-Inspired Dual-Stream Semantic Enhancement for Vision-Based Dynamic Emotion Modeling


94. CLASP: Class-Adaptive Layer Fusion and Dual-Stage Pruning for Multimodal Large Language Models



96. GF-Score: Certified Class-Conditional Robustness Evaluation with Fairness Guarantees


97. LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety


98. Information-Theoretic Optimization for Task-Adapted Compressed Sensing Magnetic Resonance Imaging


99. BID-LoRA: A Parameter-Efficient Framework for Continual Learning and Unlearning


100. PromptEcho: Annotation-Free Reward from Vision-Language Models for Text-to-Image Reinforcement Learning


101. Learning Chain Of Thoughts Prompts for Predicting Entities, Relations, and even Literals on Knowledge Graphs


102. TimeSAF: Towards LLM-Guided Semantic Asynchronous Fusion for Time Series Forecasting


103. Contextual Multi-Task Reinforcement Learning for Autonomous Reef Monitoring


104. Calibration-Aware Policy Optimization for Reasoning LLMs


105. Neural Dynamic GI: Random-Access Neural Compression for Temporal Lightmaps in Dynamic Lighting Environments


106. Efficient Semantic Image Communication for Traffic Monitoring at the Edge


107. SOAR: Self-Correction for Optimal Alignment and Refinement in Diffusion Models


108. LLM-Guided Prompt Evolution for Password Guessing


109. KumoRFM-2: Scaling Foundation Models for Relational Learning


110. When Does Data Augmentation Help? Evaluating LLM and Back-Translation Methods for Hausa and Fongbe NLP


111. MODIX: A Training-Free Multimodal Information-Driven Positional Index Scaling for Vision-Language Models


112. Orthogonal Subspace Projection for Continual Machine Unlearning via SVD-Based LoRA


113. NTIRE 2026 The 3rd Restore Any Image Model (RAIM) Challenge: Professional Image Quality Assessment (Track 1)


114. Topology-Aware Reasoning over Incomplete Knowledge Graph with Graph-Based Soft Prompting


115. SEATrack: Simple, Efficient, and Adaptive Multimodal Tracker


116. Lit2Vec: A Reproducible Workflow for Building a Legally Screened Chemistry Corpus from S2ORC for Downstream Retrieval and Text Mining


117. Latent Planning Emerges with Scale


118. Deepfakes at Face Value: Image and Authority


119. KG-Reasoner: A Reinforced Model for End-to-End Multi-Hop Knowledge Graph Reasoning


120. Elastic Net Regularization and Gabor Dictionary for Classification of Heart Sound Signals using Deep Learning


121. Social Learning Strategies for Evolved Virtual Soft Robots


122. Audio Source Separation in Reverberant Environments using $β$-divergence based Nonnegative Factorization


123. Mining Large Language Models for Low-Resource Language Data: Comparing Elicitation Strategies for Hausa and Fongbe


124. From Kinematics to Dynamics: Learning to Refine Hybrid Plans for Physically Feasible Execution


125. Euler-inspired Decoupling Neural Operator for Efficient Pansharpening


126. X-VC: Zero-shot Streaming Voice Conversion in Codec Space


127. IAD-Unify: A Region-Grounded Unified Model for Industrial Anomaly Segmentation, Understanding, and Generation


128. Decoding by Perturbation: Mitigating MLLM Hallucinations via Dynamic Textual Perturbation


129. RACF: A Resilient Autonomous Car Framework with Object Distance Correction


130. Security and Resilience in Autonomous Vehicles: A Proactive Design Approach


131. Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models


132. Beyond Output Correctness: Benchmarking and Evaluating Large Language Model Reasoning in Coding Tasks


133. SCRIPT: A Subcharacter Compositional Representation Injection Module for Korean Pre-Trained Language Models


134. Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations


135. Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning


136. Scaffold-Conditioned Preference Triplets for Controllable Molecular Optimization with Large Language Models


137. FRTSearch: Unified Detection and Parameter Inference of Fast Radio Transients using Instance Segmentation


138. GeM-EA: A Generative and Meta-learning Enhanced Evolutionary Algorithm for Streaming Data-Driven Optimization


139. Black-Box Optimization From Small Offline Datasets via Meta Learning with Synthetic Tasks


140. EgoEsportsQA: An Egocentric Video Benchmark for Perception and Reasoning in Esports


141. Is Vibe Coding the Future? An Empirical Assessment of LLM Generated Codes for Construction Safety


142. GCA Framework: A Gulf-Grounded Dataset and Agentic Pipeline for Climate Decision Support


143. Local-Splitter: A Measurement Study of Seven Tactics for Reducing Cloud LLM Token Usage on Coding-Agent Workloads


144. MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer


145. CascadeDebate: Multi-Agent Deliberation for Cost-Aware LLM Cascades


146. Coding-Free and Privacy-Preserving MCP Framework for Clinical Agentic Research Intelligence System


147. ARGen: Affect-Reinforced Generative Augmentation towards Vision-based Dynamic Emotion Perception


148. SpanKey: Dynamic Key Space Conditioning for Neural Network Access Control


149. SpecBound: Adaptive Bounded Self-Speculation with Layer-wise Confidence Calibration


150. Socrates Loss: Unifying Confidence Calibration and Classification by Leveraging the Unknown


151. Continuous Knowledge Metabolism: Generating Scientific Hypotheses from Evolving Literature


152. MolMem: Memory-Augmented Agentic Reinforcement Learning for Sample-Efficient Molecular Optimization


153. TEMPLATEFUZZ: Fine-Grained Chat Template Fuzzing for Jailbreaking and Red Teaming LLMs


154. LLM-Guided Semantic Bootstrapping for Interpretable Text Classification with Tsetlin Machines


155. Ride the Wave: Precision-Allocated Sparse Attention for Smooth Video Generation


156. Unveiling the Surprising Efficacy of Navigation Understanding in End-to-End Autonomous Driving


157. Towards grounded autonomous research: an end-to-end LLM mini research loop on published computational physics


158. Characterizing Resource Sharing Practices on Underground Internet Forum Synthetic Non-Consensual Intimate Image Content Creation Communities


159. Clustering-Enhanced Domain Adaptation for Cross-Domain Intrusion Detection in Industrial Control Systems


160. CycloneMAE: A Scalable Multi-Task Learning Model for Global Tropical Cyclone Probabilistic Forecasting


161. Fully Homomorphic Encryption on Llama 3 model for privacy preserving LLM inference


162. Domain-Specific Latent Representations Improve the Fidelity of Diffusion-Based Medical Image Super-Resolution


163. From Plan to Action: How Well Do Agents Follow the Plan?


164. Observing the unobserved confounding through its effects: toward randomized trial-like estimates from real-world survival data


165. PR-MaGIC: Prompt Refinement Via Mask Decoder Gradient Flow For In-Context Segmentation


166. LLM-Based Automated Diagnosis Of Integration Test Failures At Google


167. Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models


168. OpenTME: An Open Dataset of AI-powered H&E Tumor Microenvironment Profiles from TCGA


169. Robust Explanations for User Trust in Enterprise NLP Systems


170. Interpretable DNA Sequence Classification via Dynamic Feature Generation in Decision Trees


171. Leveraging Weighted Syntactic and Semantic Context Assessment Summary (wSSAS) Towards Text Categorization Using LLMs


172. VISTA: Validation-Informed Trajectory Adaptation via Self-Distillation


173. SIR-Bench: Evaluating Investigation Depth in Security Incident Response Agents


174. Benchmarking Deflection and Hallucination in Large Vision-Language Models


175. Curvelet-Based Frequency-Aware Feature Enhancement for Deepfake Detection


176. LLMs Struggle with Abstract Meaning Comprehension More Than Expected


177. BayMOTH: Bayesian optiMizatiOn with meTa-lookahead – a simple approacH


178. The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results


179. Filtered Reasoning Score: Evaluating Reasoning Quality on a Model’s Most-Confident Traces


180. INDOTABVQA: A Benchmark for Cross-Lingual Table Understanding in Bahasa Indonesia Documents


181. AnyPoC: Universal Proof-of-Concept Test Generation for Scalable LLM-Based Bug Detection


182. ResBM: Residual Bottleneck Models for Low-Bandwidth Pipeline Parallelism


183. AutoSurrogate: An LLM-Driven Multi-Agent Framework for Autonomous Construction of Deep Learning Surrogate Models in Subsurface Flow


184. Can AI Detect Life? Lessons from Artificial Life


185. How Transformers Learn to Plan via Multi-Token Prediction


186. Thermodynamic Liquid Manifold Networks: Physics-Bounded Deep Learning for Solar Forecasting in Autonomous Off-Grid Microgrids


187. Disposition Distillation at Small Scale: A Three-Arc Negative Result


188. MVAdapt: Zero-Shot Multi-Vehicle Adaptation for End-to-End Autonomous Driving


189. Evaluating the Limitations of Protein Sequence Representations for Parkinson’s Disease Classification


190. DBGL: Decay-aware Bipartite Graph Learning for Irregular Medical Time Series Classification


191. Polynomial Expansion Rank Adaptation: Enhancing Low-Rank Fine-Tuning with High-Order Interactions


192. Beyond Static Sandboxing: Learned Capability Governance for Autonomous AI Agents


193. A Layer-wise Analysis of Supervised Fine-Tuning


194. Schema-Adaptive Tabular Representation Learning with LLMs for Generalizable Multimodal Clinical Reasoning


195. M$^\star$: Every Task Deserves Its Own Memory Harness


196. GRACE: A Dynamic Coreset Selection Framework for Large Language Model Optimization


197. Back to Basics: Let Conversational Agents Remember with Just Retrieval and Generation


198. Should There be a Teacher In-the-Loop? A Study of Generative AI Personalized Tasks Middle School


199. ART-VITON: Measurement-Guided Latent Diffusion for Artifact-Free Virtual Try-On