전체 AI 논문 - 2026-01-07

1. MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents


2. InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents


3. Automatic Prompt Engineering with No Task Cues and No Tuning


4. A framework for assuring the accuracy and fidelity of an AI-enabled Digital Twin of en route UK airspace


5. Explainable Fuzzy GNNs for Leak Detection in Water Distribution Networks


6. Rationale-Grounded In-Context Learning for Time Series Reasoning with Multimodal Large Language Models


7. Batch-of-Thought: Cross-Instance Learning for Enhanced LLM Reasoning


8. Logical Phase Transitions: Understanding Collapse in LLM Logical Reasoning


9. ReTreVal: Reasoning Tree with Validation - A Hybrid Framework for Enhanced LLM Multi-Step Reasoning


10. SimRPD: Optimizing Recruitment Proactive Dialogue Agents through Simulator-Based Data Evaluation and Selection


11. M3MAD-Bench: Are Multi-Agent Debates Really Effective Across Domains and Modalities?


12. Sample-Efficient Neurosymbolic Deep Reinforcement Learning


13. Quantum-enhanced long short-term memory with attention for spatial permeability prediction in oilfield reservoirs


14. Causal-Enhanced AI Agents for Medical Research Screening


15. HAL: Inducing Human-likeness in LLMs with Alignment


16. LLM Agent Framework for Intelligent Change Analysis in Urban Environment using Remote Sensing Imagery


17. The Path Ahead for Agentic AI: Challenges and Opportunities


18. Time-Scaling Is What Agents Need Now


19. Learning User Preferences Through Interaction for Long-Term Collaboration


20. Learning from Prompt itself: the Hierarchical Attribution Prompt Optimization


21. Inferring Causal Graph Temporal Logic Formulas to Expedite Reinforcement Learning in Temporally Extended Tasks


22. AWARE-US: Benchmark for Preference-Aware Resolution in Tool-Calling Agents


23. An Empirical Study of On-Device Translation for Real-Time Live-Stream Chat on Mobile Devices


24. Orchestral AI: A Framework for Agent Orchestration


25. SimpleMem: Efficient Lifelong Memory for LLM Agents


26. Textual Explanations and Their Evaluations for Reinforcement Learning Policy


27. Multi-RADS Synthetic Radiology Report Dataset and Head-to-Head Benchmarking of 41 Open-Weight and Proprietary Language Models


28. The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization


29. The Fake Friend Dilemma: Trust and the Political Economy of Conversational AI


30. Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers


31. UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward


32. Counterfactual Fairness with Graph Uncertainty


33. Recursive querying of neural networks via weighted structures


34. DIP: Dynamic In-Context Planner For Diffusion Language Models


35. UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision


36. AnatomiX, an Anatomy-Aware Grounded Multimodal Large Language Model for Chest X-Ray Interpretation


37. Decentralized Autoregressive Generation


38. Multi-Modal Data-Enhanced Foundation Models for Prediction and Control in Wireless Networks: A Survey


39. Rapid Augmentations for Time Series (RATS): A High-Performance Library for Time Series Augmentation


40. Prompt-Counterfactual Explanations for Generative AI System Behavior


41. Self-Verification is All You Need To Pass The Japanese Bar Examination


42. Limited Linguistic Diversity in Embodied AI Datasets


43. Unified Thinker: A General Reasoning Modular Core for Image Generation


44. LeafLife: An Explainable Deep Learning Framework with Robustness for Grape Leaf Disease Recognition


45. ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation


46. Transformers self-organize like newborn visual systems when trained in prenatal worlds


47. Who Laughs with Whom? Disentangling Influential Factors in Humor Preferences across User Clusters and LLMs


48. Text-Guided Layer Fusion Mitigates Hallucination in Multimodal LLMs


49. Grad-ELLM: Gradient-based Explanations for Decoder-only LLMs


50. Joint Encoding of KV-Cache Blocks for Scalable LLM Serving


51. Do LLMs Encode Functional Importance of Reasoning Tokens?


52. IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation


53. On the Intrinsic Limits of Transformer Image Embeddings in Non-Solvable Spatial Reasoning


54. Motion Blur Robust Wheat Pest Damage Detection with Dynamic Fuzzy Feature Fusion


55. Lil: Less is Less When Applying Post-Training Sparse-Attention Algorithms in Long-Decode Stage


56. PiDR: Physics-Informed Inertial Dead Reckoning for Autonomous Platforms


57. Validating Generalist Robots with Situation Calculus and STL Falsification


58. Causal Manifold Fairness: Enforcing Geometric Invariance in Representation Learning


59. Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis


60. In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior


61. SentGraph: Hierarchical Sentence Graph for Multi-hop Retrieval-Augmented Question Answering


62. JPU: Bridging Jailbreak Defense and Unlearning via On-Policy Path Rectification


63. Learning to Act Robustly with View-Invariant Latent Actions


64. Towards Faithful Reasoning in Comics for Small MLLMs


65. ULS+: Data-driven Model Adaptation Enhances Lesion Segmentation


66. LAMS-Edit: Latent and Attention Mixing with Schedulers for Improved Content Preservation in Diffusion-Based Image and Style Editing


67. Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning


68. Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders


69. Correct, Concise and Complete: Multi-stage Training For Adaptive Reasoning


70. MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free


71. The World is Not Mono: Enabling Spatial Understanding in Large Audio-Language Models


72. SastBench: A Benchmark for Testing Agentic SAST Triage


73. PrismVAU: Prompt-Refined Inference System for Multimodal Video Anomaly Understanding


74. DCG ReID: Disentangling Collaboration and Guidance Fusion Representations for Multi-modal Vehicle Re-Identification


75. RAL2M: Retrieval Augmented Learning-To-Match Against Hallucination in Compliance-Guaranteed Service Systems


76. TA-Prompting: Enhancing Video Large Language Models for Dense Video Captioning via Temporal Anchors


77. LOST-3DSG: Lightweight Open-Vocabulary 3D Scene Graphs with Semantic Tracking in Dynamic Environments


78. LongBench Pro: A More Realistic and Comprehensive Bilingual Long-Context Evaluation Benchmark


79. TiMem: Temporal-Hierarchical Memory Consolidation for Long-Horizon Conversational Agents


80. Breaking Self-Attention Failure: Rethinking Query Initialization for Infrared Small Target Detection


81. MiMo-V2-Flash Technical Report


82. Closing the Reality Gap: Zero-Shot Sim-to-Real Deployment for Dexterous Force-Based Grasping and Manipulation


83. UniSRCodec: Unified and Low-Bitrate Single Codebook Codec with Sub-Band Reconstruction


84. Netflix Artwork Personalization via LLM Post-training


85. Q-Regularized Generative Auto-Bidding: From Suboptimal Trajectories to Optimal Policies


86. Window-based Membership Inference Attacks Against Fine-tuned Large Language Models


87. Hypothesize-Then-Verify: Speculative Root Cause Analysis for Microservices with Pathwise Parallelism


88. Agentic Memory Enhanced Recursive Reasoning for Root Cause Localization in Microservices


89. Foreground-Aware Dataset Distillation via Dynamic Patch Selection


90. Privacy-Preserving AI-Enabled Decentralized Learning and Employment Records System


91. CREAM: Continual Retrieval on Dynamic Streaming Corpora with Adaptive Soft Memory


92. Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study


93. Multi-channel multi-speaker transformer for speech recognition


94. Topology-Independent Robustness of the Weighted Mean under Label Poisoning Attacks in Heterogeneous Decentralized Learning


95. Extracting books from production language models


96. When Do Tools and Planning Help LLMs Think? A Cost- and Latency-Aware Benchmark



98. Prioritized Replay for RL Post-training


99. DreamLoop: Controllable Cinemagraph Generation from a Single Photograph


100. Credit Assignment via Neural Manifold Noise Correlation


101. TAAF: A Trace Abstraction and Analysis Framework Synergizing Knowledge Graphs and LLMs


102. Improved Evidence Extraction for Document Inconsistency Detection with LLMs


103. LAsset: An LLM-assisted Security Asset Identification Framework for System-on-Chip (SoC) Verification


104. Hierarchical temporal receptive windows and zero-shot timescale generalization in biologically constrained scale-invariant deep networks


105. Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over Unsloth


106. LongDA: Benchmarking LLM Agents for Long-Document Data Analysis


107. Annealed Langevin Posterior Sampling (ALPS): A Rapid Algorithm for Image Restoration with Multiscale Energy Models


108. FlowPlan-G2P: A Structured Generation Framework for Transforming Scientific Papers into Patent Descriptions


109. Reconstructing Item Characteristic Curves using Fine-Tuned Large Language Models


110. Fact-Checking with Large Language Models via Probabilistic Certainty and Consistency


111. LendNova: Towards Automated Credit Risk Assessment with Language Models


112. AI-exposed jobs deteriorated before ChatGPT


113. Normalized Conditional Mutual Information Surrogate Loss for Deep Neural Classifiers


114. ModeX: Evaluator-Free Best-of-N Selection for Open-Ended Generation


115. Losses that Cook: Topological Optimal Transport for Structured Recipe Generation


116. Enhancing Debugging Skills with AI-Powered Assistance: A Real-Time Tool for Debugging Support


117. GEM-Style Constraints for PEFT with Dual Gradient Projection in LoRA


118. The Rise of Agentic Testing: Multi-Agent Systems for Robust Software Quality Assurance


119. mHC-GNN: Manifold-Constrained Hyper-Connections for Graph Neural Networks


120. VocalBridge: Latent Diffusion-Bridge Purification for Defeating Perturbation-Based Voiceprint Defenses


121. Evaluating the Diagnostic Classification Ability of Multimodal Large Language Models: Insights from the Osteoarthritis Initiative


122. Understanding Pure Textual Reasoning for Blind Image Quality Assessment


123. Mitigating Long-Tailed Anomaly Score Distributions with Importance-Weighted Loss


124. Focus on What Matters: Fisher-Guided Adaptive Multimodal Fusion for Vulnerability Detection


125. TAP-ViTs: Task-Adaptive Pruning for On-Device Deployment of Vision Transformers


126. WebCoderBench: Benchmarking Web Application Generation with Comprehensive and Interpretable Evaluation Metrics


127. A Dynamic Retrieval-Augmented Generation System with Selective Memory and Remembrance


128. NitroGen: An Open Foundation Model for Generalist Gaming Agents


129. A large-scale nanocrystal database with aligned synthesis and properties enabling generative inverse design


130. Watch Wider and Think Deeper: Collaborative Cross-modal Chain-of-Thought for Complex Visual Reasoning


131. Multimodal Sentiment Analysis based on Multi-channel and Symmetric Mutual Promotion Feature Fusion


132. MIAR: Modality Interaction and Alignment Representation Fuison for Multimodal Emotion


133. Socially-Aware Recommender Systems Mitigate Opinion Clusterization


134. SpikySpace: A Spiking State Space Model for Energy-Efficient Time Series Forecasting


135. The Vibe-Check Protocol: Quantifying Cognitive Offloading in AI Programming


136. Expert-Guided Explainable Few-Shot Learning with Active Sample Selection for Medical Image Analysis


137. PCEval: A Benchmark for Evaluating Physical Computing Capabilities of Large Language Models


138. ProSoftArena: Benchmarking Hierarchical Capabilities of Multimodal Agents in Professional Software Environments


139. AI-Native Integrated Sensing and Communications for Self-Organizing Wireless Networks: Architectures, Learning Paradigms, and System-Level Design


140. Self-Supervised Masked Autoencoders with Dense-Unet for Coronary Calcium Removal in limited CT Data


141. Tree of Preferences for Diversified Recommendation


142. Base Station Deployment under EMF constrain by Deep Reinforcement learning


143. How to Discover Knowledge for FutureG: Contextual RAG and LLM Prompting for O-RAN


144. The Refutability Gap: Challenges in Validating Reasoning by Large Language Models


145. Movement Primitives in Robotics: A Comprehensive Survey


146. LeafTutor: An AI Agent for Programming Assignment Tutoring


147. Permission Manifests for Web Agents


148. Distillation-based Scenario-Adaptive Mixture-of-Experts for the Matching Stage of Multi-scenario Recommendation


149. Cross-Platform Digital Discourse Analysis of the Israel-Hamas Conflict: Sentiment, Topics, and Event Dynamics


150. TextBridgeGNN: Pre-training Graph Neural Network for Cross-Domain Recommendation via Text-Guided Transfer


151. FUSE : Failure-aware Usage of Subagent Evidence for MultiModal Search and Recommendation


152. Towards Trustworthy LLM-Based Recommendation via Rationale Integration


153. The Impact of LLM-Generated Reviews on Recommender Systems: Textual Shifts, Performance Effects, and Strategic Platform Control


154. TWIST: Training-free and Label-free Short Text Clustering through Iterative Vector Updating with LLMs