전체 AI 논문 - 2026-03-09

1. Boosting deep Reinforcement Learning using pretraining with Logical Options


2. Talk Freely, Execute Strictly: Schema-Gated Agentic AI for Flexible and Reproducible Scientific Workflows


3. SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement


4. The EpisTwin: A Knowledge Graph-Grounded Neuro-Symbolic Architecture for Personal AI


5. Artificial Intelligence for Climate Adaptation: Reinforcement Learning for Climate Change-Resilient Transport


6. Conversational Demand Response: Bidirectional Aggregator-Prosumer Coordination through Agentic AI


7. Offline Materials Optimization with CliqueFlowmer


8. Aggregative Semantics for Quantitative Bipolar Argumentation Frameworks


9. Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation


10. An Interactive Multi-Agent System for Evaluation of New Product Concepts


11. DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality


12. The World Won’t Stay Still: Programmable Evolution for Agent Benchmarks


13. Evolving Medical Imaging Agents via Experience-driven Self-skill Discovery


14. Reasoning Models Struggle to Control their Chains of Thought


15. Real-Time AI Service Economy: A Framework for Agentic Computing Across the Continuum


16. RoboLayout: Differentiable 3D Scene Generation for Embodied Agents


17. BEVLM: Distilling Semantic Knowledge from LLMs into Bird’s-Eye View Representations


18. Fly360: Omnidirectional Obstacle Avoidance within Drone View


19. SUREON: A Benchmark and Vision-Language-Model for Surgical Reasoning


20. LiveSense: A Real-Time Wi-Fi Sensing Platform for Range-Doppler on COTS Laptop


21. RAMoEA-QA: Hierarchical Specialization for Robust Respiratory Audio Question Answering


22. Artificial Intelligence for Detecting Fetal Orofacial Clefts and Advancing Medical Education


23. COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics


24. NOBLE: Accelerating Transformers with Nonlinear Low-Rank Branches


25. PONTE: Personalized Orchestration for Natural Language Trustworthy Explanations


26. Do Foundation Models Know Geometry? Probing Frozen Features for Continuous Physical Measurement


27. Prosodic Boundary-Aware Streaming Generation for LLM-Based TTS with Streaming Text Input


28. Abductive Reasoning with Syllogistic Forms in Large Language Models


29. CLoPA: Continual Low Parameter Adaptation of Interactive Segmentation for Medical Image Annotation


30. A Reference Architecture of Reinforcement Learning Frameworks


31. Physical Simulator In-the-Loop Video Generation


32. Prompt Group-Aware Training for Robust Text-Guided Nuclei Segmentation


33. Kinetic-based regularization: Learning spatial derivatives and PDE applications


34. ESAA-Security: An Event-Sourced, Verifiable Architecture for Agent-Assisted Security Audits of AI-Generated Code


35. CLAIRE: Compressed Latent Autoencoder for Industrial Representation and Evaluation – A Deep Learning Framework for Smart Manufacturing


36. Dynamic Chunking Diffusion Transformer


37. MoEless: Efficient MoE LLM Serving via Serverless Computing


38. K-MaT: Knowledge-Anchored Manifold Transport for Cross-Modal Prompt Learning in Medical Imaging


39. AI End-to-End Radiation Treatment Planning Under One Second


40. Structured Exploration vs. Generative Flexibility: A Field Study Comparing Bandit and LLM Architectures for Personalised Health Behaviour Interventions


41. From Entropy to Calibrated Uncertainty: Training Language Models to Reason About Uncertainty


42. DEX-AR: A Dynamic Explainability Method for Autoregressive Vision-Language Models


43. Learning Where the Physics Is: Probabilistic Adaptive Sampling for Stiff PDEs


44. Stem: Rethinking Causal Information Flow in Sparse Attention


45. Looking Through Glass Box


46. Agentic retrieval-augmented reasoning reshapes collective reliability under model variability in radiology question answering


47. HiPP-Prune: Hierarchical Preference-Conditioned Structured Pruning for Vision-Language Models


48. Learning to Solve Orienteering Problem with Time Windows and Variable Profits


49. GazeMoE: Perception of Gaze Target with Mixture-of-Experts


50. TaPD: Temporal-adaptive Progressive Distillation for Observation-Adaptive Trajectory Forecasting in Autonomous Driving


51. Cut to the Chase: Training-free Multimodal Summarization via Chain-of-Events


52. FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling


53. MAPO: Mixed Advantage Policy Optimization for Long-Horizon Multi-Turn Dialogue


54. Whisper-CD: Accurate Long-Form Speech Recognition using Multi-Negative Contrastive Decoding


55. CRIMSON: A Clinically-Grounded LLM-Based Metric for Generative Radiology Report Evaluation


56. Contrastive-to-Self-Supervised: A Two-Stage Framework for Script Similarity Learning


57. Reflective Flow Sampling Enhancement


58. Do Compact SSL Backbones Matter for Audio Deepfake Detection? A Controlled Study with RAPTOR


59. Ensemble Graph Neural Networks for Probabilistic Sea Surface Temperature Forecasting via Input Perturbations


60. VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models


61. Predictive Coding Graphs are a Superset of Feedforward Neural Networks


62. Place-it-R1: Unlocking Environment-aware Reasoning Potential of MLLM for Video Object Insertion


63. Partial Policy Gradients for RL in LLMs


64. A Causal Graph Approach to Oppositional Narrative Analysis


65. A Hazard-Informed Data Pipeline for Robotics Physical Safety


66. Making Implicit Premises Explicit in Logical Understanding of Enthymemes


67. Experiences Build Characters: The Linguistic Origins and Functional Impact of LLM Personality


68. StreamVoiceAnon+: Emotion-Preserving Streaming Speaker Anonymization via Frame-Level Acoustic Distillation


69. Lifelong Embodied Navigation Learning


70. Text-Driven Emotionally Continuous Talking Face Generation


71. Evaluating Austrian A-Level German Essays with Large Language Models for Automated Essay Scoring


72. TempoSyncDiff: Distilled Temporally-Consistent Diffusion for Low-Latency Audio-Driven Talking Head Generation


73. Probing Visual Concepts in Lightweight Vision-Language Models for Automated Driving


74. Sensitivity-Aware Retrieval-Augmented Intent Clarification


75. MASFactory: A Graph-centric Framework for Orchestrating LLM-Based Multi-Agent Systems with Vibe Graphing


76. Demystifying KAN for Vision Tasks: The RepKAN Approach


77. Restoring Linguistic Grounding in VLA Models via Train-Free Attention Recalibration


78. MM-ISTS: Cooperating Irregularly Sampled Time Series Forecasting with Multimodal Vision-Text LLMs


79. TADPO: Reinforcement Learning Goes Off-road


80. Technical Report: Automated Optical Inspection of Surgical Instruments


81. Imagine How To Change: Explicit Procedure Modeling for Change Captioning


82. Skeleton-to-Image Encoding: Enabling Skeleton Representation Learning via Vision-Pretrained Models


83. Domain-Adaptive Model Merging across Disconnected Modes


84. Who We Are, Where We Are: Mental Health at the Intersection of Person, Situation, and Large Language Models


85. Energy-Driven Adaptive Visual Token Pruning for Efficient Vision-Language Models


86. XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable Insights


87. Facial Expression Recognition Using Residual Masking Network


88. Addressing the Ecological Fallacy in Larger LMs with Human Context


89. RAC: Rectified Flow Auto Coder


90. BlackMirror: Black-Box Backdoor Detection for Text-to-Image Models via Instruction-Response Deviation


91. Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment Analysis


92. CORE-Seg: Reasoning-Driven Segmentation for Complex Lesions via Reinforcement Learning


93. LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis


94. Reference-guided Policy Optimization for Molecular Optimization via LLM Reasoning


95. Lost in Stories: Consistency Bugs in Long Story Generation by LLMs


96. Reconstruct! Don’t Encode: Self-Supervised Representation Reconstruction Loss for High-Intelligibility and Low-Latency Streaming Neural Audio Codec


97. Computational Pathology in the Era of Emerging Foundation and Agentic AI – International Expert Perspectives on Clinical Integration and Translational Readiness


98. Remote Sensing Image Classification Using Deep Ensemble Learning


99. Evaluating LLM Alignment With Human Trust Models


100. Lexara: A User-Centered Toolkit for Evaluating Large Language Models for Conversational Visual Analytics


101. Margin and Consistency Supervision for Calibrated and Robust Vision Models


102. Ambiguity Collapse by LLMs: A Taxonomy of Epistemic Risks


103. StreamWise: Serving Multi-Modal Generation in Real-Time at Scale


104. Proof-of-Guardrail in AI Agents and What (Not) to Trust from It


105. Visual Words Meet BM25: Sparse Auto-Encoder Visual Word Scoring for Image Retrieval


106. Balancing Domestic and Global Perspectives: Evaluating Dual-Calibration and LLM-Generated Nudges for Diverse News Recommendation


107. PVminerLLM: Structured Extraction of Patient Voice from Patient-Generated Text using Large Language Models


108. Knowing without Acting: The Disentangled Geometry of Safety Mechanisms in Large Language Models


109. Depth Charge: Jailbreak Large Language Models from Deep Safety Attention Heads


110. Bridging Domains through Subspace-Aware Model Merging


111. TML-Bench: Benchmark for Data Science Agents on Tabular ML Tasks


112. Revisiting the (Sub)Optimality of Best-of-N for Inference-Time Alignment


113. LTLGuard: Formalizing LTL Specifications with Compact Language Models and Lightweight Symbolic Reasoning


114. Cultural Perspectives and Expectations for Generative AI: A Global Survey Approach


115. The Rise of AI in Weather and Climate Information and its Impact on Global Inequality


116. Autonomous Algorithm Discovery for Ptychography via Evolutionary LLM Reasoning


117. Longitudinal Lesion Inpainting in Brain MRI via 3D Region Aware Diffusion


118. SecureRAG-RTL: A Retrieval-Augmented, Multi-Agent, Zero-Shot LLM-Driven Framework for Hardware Vulnerability Detection


119. When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual Try-On


120. The DSA’s Blind Spot: Algorithmic Audit of Advertising and Minor Profiling on TikTok


121. The Fragility Of Moral Judgment In Large Language Models



123. Post Fusion Bird’s Eye View Feature Stabilization for Robust Multimodal 3D Detection


124. Adversarial Batch Representation Augmentation for Batch Correction in High-Content Cellular Screening


125. RACAS: Controlling Diverse Robots With a Single Agentic System


126. DreamCAD: Scaling Multi-modal CAD Generation using Differentiable Parametric Surfaces


127. On the Value of Tokeniser Pretraining in Physics Foundation Models


128. Spatiotemporal Heterogeneity of AI-Driven Traffic Flow Patterns and Land Use Interaction: A GeoAI-Based Analysis of Multimodal Urban Mobility


129. Tool-Genesis: A Task-Driven Tool Creation Benchmark for Self-Evolving Language Agent


130. PRISM: Personalized Refinement of Imitation Skills for Manipulation via Human Instructions


131. CBR-to-SQL: Rethinking Retrieval-based Text-to-SQL using Case-based Reasoning in the Healthcare Domain


132. When AI Levels the Playing Field: Skill Homogenization, Asset Concentration, and Two Regimes of Inequality


133. Model Change for Description Logic Concepts


134. Towards Efficient and Stable Ocean State Forecasting: A Continuous-Time Koopman Approach


135. EigenData: A Self-Evolving Multi-Agent Platform for Function-Calling Data Synthesis, Auditing, and Repair


136. Human-Data Interaction, Exploration, and Visualization in the AI Era: Challenges and Opportunities


137. VDCook:DIY video data cook your MLLMs


138. JAWS: Enhancing Long-term Rollout of Neural Operators via Spatially-Adaptive Jacobian Regularization


139. On the Reliability of AI Methods in Drug Discovery: Evaluation of Boltz-2 for Structure and Binding Affinity Prediction


140. Towards Neural Graph Data Management


141. Omni-C: Compressing Heterogeneous Modalities into a Single Dense Encoder


142. Molecular Representations for AI in Chemistry and Materials Science: An NLP Perspective


143. Traversal-as-Policy: Log-Distilled Gated Behavior Trees as Externalized, Verifiable Policies for Safe, Robust, and Efficient Agents


144. From Toil to Thought: Designing for Strategic Exploration and Responsible AI in Systematic Literature Reviews


145. An Embodied Companion for Visual Storytelling


146. Exploring Human-in-the-Loop Themes in AI Application Development: An Empirical Thematic Analysis


147. Can LLM Aid in Solving Constraints with Inductive Definitions?