전체 AI 논문 - 2026-01-22

1. BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries


2. Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning


3. How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework


4. Vehicle Routing with Finite Time Horizon using Deep Reinforcement Learning with Improved Network Embedding


5. The Plausibility Trap: Using Probabilistic Engines for Deterministic Tasks


6. Emerging from Ground: Addressing Intent Deviation in Tool-Using Agents via Deriving Real Calls into Virtual Trajectories


7. The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution


8. The Responsibility Vacuum: Organizational Failure in Scaled Agent Systems


9. Emergent, not Immanent: A Baradian Reading of Explainable AI


10. Multi-Behavior Sequential Modeling with Transition-Aware Graph Attention Network for E-Commerce Recommendation


11. Just aware enough: Evaluating awareness across artificial systems


12. To Neuro-Symbolic Classification and Beyond by Compiling Description Logic Ontologies to Probabilistic Circuits


13. Implementing Knowledge Representation and Reasoning with Object Oriented Design


14. Measuring and Aligning Abstraction in Vision-Language Models with Medical Taxonomies


15. CI4A: Semantic Component Interfaces for Agents Empowering Web Automation


16. Towards Bound Consistency for the No-Overlap Constraint Using MDDs


17. Semantic-Guided Unsupervised Video Summarization


18. An XAI View on Explainable ASP: Methods, Systems, and Perspectives


19. DARA: Few-shot Budget Allocation in Online Advertising via In-Context Decision Making with RL-Finetuned LLMs


20. AutoDriDM: An Explainable Benchmark for Decision-Making of Vision-Language Models in Autonomous Driving


21. Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation


22. IB-GRPO: Aligning LLM-based Learning Path Recommendation with Educational Objectives via Indicator-Based Group Relative Policy Optimization


23. Local Language Models for Context-Aware Adaptive Anonymization of Sensitive Text


24. Query-Efficient Agentic Graph Extraction Attacks on GraphRAG Systems


25. MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks


26. Large Language Model-Powered Evolutionary Code Optimization on a Phylogenetic Tree


27. “Just in Time” World Modeling Supports Human Planning and Reasoning


28. Scalable Knee-Point Guided Activity Group Selection in Multi-Tree Genetic Programming for Dynamic Multi-Mode Project Scheduling


29. On the Generalization Gap in LLM Planning: Tests and Verifier-Reward RL


30. VisTIRA: Closing the Image-Text Modality Gap in Visual Math Reasoning via Structured Tool Integration


31. Epistemic Constitutionalism Or: how to avoid coherence bias


32. The Ontological Neutrality Theorem: Why Neutral Ontological Substrates Must Be Pre-Causal and Pre-Normative


33. Iterative Refinement Improves Compositional Image Generation


34. Rethinking Video Generation Model for the Embodied World


35. MolecularIQ: Characterizing Chemical Reasoning Capabilities Through Symbolic Verification on Molecular Graphs



37. Many Experiments, Few Repetitions, Unpaired Data, and Sparse Effects: Is Causal Inference Possible?


38. Recommending Best Paper Awards for ML/AI Conferences via the Isotonic Mechanism


39. Feasibility Preservation under Monotone Retrieval Truncation


40. Tracing 3D Anatomy in 2D Strokes: A Multi-Stage Projection Driven Approach to Cervical Spine Fracture Identification


41. Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface


42. Where Do AI Coding Agents Fail? An Empirical Study of Failed Agentic Pull Requests in GitHub


43. Benchmarking Large Language Models for ABAP Code Generation: An Empirical Study on Iterative Improvement by Compiler Feedback


44. Dynamic Management of a Deep Learning-Based Anomaly Detection System for 5G Networks


45. The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models


46. V-CAGE: Context-Aware Generation and Verification for Scalable Long-Horizon Embodied Tasks


47. Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems


48. Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data


49. Overcoming In-Memory Bottlenecks in Graph Foundation Models via Retrieval-Augmented Generation


50. BREPS: Bounding-Box Robustness Evaluation of Promptable Segmentation


51. Auditing Language Model Unlearning via Information Decomposition


52. An Agentic Operationalization of DISARM for FIMI Investigation on Social Media


53. Memory Retention Is Not Enough to Master Memory Tasks in Reinforcement Learning


54. Multi-Agent Constraint Factorization Reveals Latent Invariant Solution Structure


55. Incentive-Tuning: Understanding and Designing Incentives for Empirical Human-AI Decision-Making Studies


56. Differential Privacy Image Generation with Reconstruction Loss and Noise Injection Using an Error Feedback SGD


57. Federated Transformer-GNN for Privacy-Preserving Brain Tumor Localization with Modality-Level Explainability


58. A Curriculum-Based Deep Reinforcement Learning Framework for the Electric Vehicle Routing Problem


59. Knowledge Restoration-driven Prompt Optimization: Unlocking LLM Potential for Open-Domain Relational Triplet Extraction


60. Visual and Cognitive Demands of a Large Language Model-Powered In-vehicle Conversational Agent


61. Obscuring Data Contamination Through Translation: Evidence from Arabic Corpora


62. Interoperable Architecture for Digital Identity Delegation for AI Agents with Blockchain Integration


63. HumanDiffusion: A Vision-Based Diffusion Trajectory Planner with Human-Conditioned Goals for Search and Rescue UAV


64. InstructTime++: Time Series Classification with Multimodal Language Modeling via Implicit Feature Enhancement


65. A Comprehensive Benchmark of Language Models on Unicode and Romanized Sinhala


66. CorpusQA: A 10 Million Token Benchmark for Corpus-Level Analysis and Reasoning


67. TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models


68. TIDAL: Temporally Interleaved Diffusion and Action Loop for High-Frequency VLA Control


69. Generative Artificial Intelligence, Musical Heritage and the Construction of Peace Narratives: A Case Study in Mali


70. Fast-ULCNet: A fast and ultra low complexity network for single-channel speech enhancement


71. Vision-Language Models on the Edge for Real-Time Robotic Perception


72. Tailoring Adverse Event Prediction in Type 1 Diabetes with Patient-Specific Deep Learning Models


73. SpatialMem: Unified 3D Memory with Metric Anchoring and Fast Retrieval


74. What Makes Low-Bit Quantization-Aware Training Work for Reasoning LLMs? A Systematic Study


75. GAT-NeRF: Geometry-Aware-Transformer Enhanced Neural Radiance Fields for High-Fidelity 4D Facial Avatars


76. From Observation to Prediction: LSTM for Vehicle Lane Change Forecasting on Highway On/Off-Ramps


77. CAG-Avatar: Cross-Attention Guided Gaussian Avatars for High-Fidelity Head Reconstruction


78. Multimodal system for skin cancer detection


79. Training-Efficient Text-to-Music Generation with State-Space Modeling


80. RECAP: Resistance Capture in Text-based Mental Health Counseling with Large Language Models


81. FunCineForge: A Unified Dataset Toolkit and Model for Zero-Shot Movie Dubbing in Diverse Cinematic Scenes


82. Anytime Optimal Decision Tree Learning with Continuous Features


83. Mechanism Shift During Post-training from Autoregressive to Masked Diffusion Language Models


84. FSX: Message Flow Sensitivity Enhanced Structural Explainer for Graph Neural Networks


85. AQAScore: Evaluating Semantic Alignment in Text-to-Audio Generation via Audio Question Answering


86. HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding


87. PCL-Reasoner-V1.5: Advancing Math Reasoning with Offline Reinforcement Learning


88. Adaptive Fidelity Estimation for Quantum Programs with Graph-Guided Noise Awareness


89. Case-Guided Sequential Assay Planning in Drug Discovery


90. Proximal Policy Optimization with Evolutionary Mutations


91. When Text-as-Vision Meets Semantic IDs in Generative Recommendation: An Empirical Study


92. CoScale-RL: Efficient Post-Training by Co-Scaling Data and Computation


93. Re-understanding Graph Unlearning through Memorization


94. Beyond Error-Based Optimization: Experience-Driven Symbolic Regression with Goal-Conditioned Reinforcement Learning


95. HCVR Scene Generation: High Compatibility Virtual Reality Environment Generation for Extended Redirected Walking


96. Transfer Learning from One Cancer to Another via Deep Learning Domain Adaptation


97. A comprehensive overview of deep learning models for object detection from videos/images


98. Efficient reformulations of ReLU deep neural networks for surrogate modelling in power system optimisation


99. GEGO: A Hybrid Golden Eagle and Genetic Optimization Algorithm for Efficient Hyperparameter Tuning in Resource-Constrained Environments


100. INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems


101. Calibrated uncertainty quantification for prosumer flexibility aggregation in ancillary service markets


102. NeuroFilter: Privacy Guardrails for Conversational LLM Agents


103. Say Anything but This: When Tokenizer Betrays Reasoning in LLMs


104. Forest-Chat: Adapting Vision-Language Agents for Interactive Forest Change Analysis


105. A Brain-inspired Embodied Intelligence for Fluid and Fast Reflexive Robotics Control


106. Scaling Ambiguity: Augmenting Human Annotation in Speech Emotion Recognition with Audio-Language Models


107. SearchGym: Bootstrapping Real-World Search Agents via Cost-Effective and High-Fidelity Environment Simulation


108. Communication-Efficient Federated Risk Difference Estimation for Time-to-Event Clinical Outcomes


109. Rethinking Reinforcement fine-tuning of LLMs: A Multi-armed Bandit Learning Perspective


110. HELIOS: Hierarchical Graph Abstraction for Structure-Aware LLM Decompilation


111. Optimality of Staircase Mechanisms for Vector Queries under Differential Privacy


112. IntelliSA: An Intelligent Static Analyzer for IaC Security Smell Detection Using Symbolic Rules and Neural Inference


113. Designing KRIYA: An AI Companion for Wellbeing Self-Reflection


114. Breaking the accuracy-resource dilemma: a lightweight adaptive video inference enhancement


115. Self-Blinding and Counterfactual Self-Simulation Mitigate Biases and Sycophancy in Large Language Models


116. Report for NSF Workshop on AI for Electronic Design Automation


117. Towards Execution-Grounded Automated AI Research


118. How Worst-Case Are Adversarial Attacks? Linking Adversarial and Statistical Robustness


119. GutenOCR: A Grounded Vision-Language Front-End for Documents


120. XD-MAP: Cross-Modal Domain Adaptation using Semantic Parametric Mapping


121. GPU-accelerated simulated annealing based on p-bits with real-world device-variability modeling


122. Real-Time Wildfire Localization on the NASA Autonomous Modular Sensor using Deep Learning


123. Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum


124. Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering


125. Diffusion Large Language Models for Black-Box Optimization


126. Vision-Based Natural Language Scene Understanding for Autonomous Driving: An Extended Dataset and a New Model for Traffic Scene Description Generation


127. Agentic AI Meets Edge Computing in Autonomous UAV Swarms


128. Quantum Super-resolution by Adaptive Non-local Observables


129. Measuring the State of Open Science in Transportation Using Large Language Models


130. Recursivism: An Artistic Paradigm for Self-Transforming Art in the Age of AI


131. If You Want Coherence, Orchestrate a Team of Rivals: Multi-Agent Models of Organizational Intelligence


132. DiSPA: Differential Substructure-Pathway Attention for Drug Response Prediction


133. CityCube: Benchmarking Cross-view Spatial Reasoning on Vision-Language Models in Urban Environments


134. Self-Supervised Score-Based Despeckling for SAR Imagery via Log-Domain Transformation


135. Layer-adaptive Expert Pruning for Pre-Training of Mixture-of-Experts Large Language Models


136. SilentDrift: Exploiting Action Chunking for Stealthy Backdoor Attacks on Vision-Language-Action Models


137. Tracing the Data Trail: A Survey of Data Provenance, Transparency and Traceability in LLMs


138. CORVUS: Red-Teaming Hallucination Detectors via Internal Signal Camouflage in Large Language Models


139. An Optimized Decision Tree-Based Framework for Explainable IoT Anomaly Detection


140. DDSA: Dual-Domain Strategic Attack for Spatial-Temporal Efficiency in Adversarial Robustness Testing


141. Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM)


142. RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension


143. DeepInflation: an AI agent for research and model discovery of inflation


144. Beyond Affinity: A Benchmark of 1D, 2D, and 3D Methods Reveals Critical Trade-offs in Structure-Based Drug Design


145. Hallucination-Free Automatic Question & Answer Generation for Intuitive Learning


146. On the Limits of Learned Importance Scoring for KV Cache Compression


147. Divide and Refine: Enhancing Multimodal Representation and Explainability for Emotion Recognition in Conversation


148. Opening the Black Box: A Survey on the Mechanisms of Multi-Step Reasoning in Large Language Models


149. The Slow Drift of Support: Boundary Failures in Multi-Turn Mental Health LLM Dialogues


150. Developmental trajectories of decision making and affective dynamics in large language models


151. From Textbook to Talkbot: A Case Study of a Greek-Language RAG-Based Chatbot in Higher Education


152. Call2Instruct: Automated Pipeline for Generating Q&A Datasets from Call Center Recordings for LLM Fine-Tuning


153. On Meta-Evaluation


154. A Cloud-Based Cross-Modal Transformer for Emotion Recognition and Adaptive Human-Computer Interaction