All AI Papers - 2026-01-21

1. BoxMind: Closed-loop AI strategy optimization for elite boxing validated in the 2024 Olympics


2. Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning


3. Exploring LLM Features in Predictive Process Monitoring for Small-Scale Event-Logs


4. Hyperparameter Optimization of Constraint Programming Solvers


5. AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems


6. XChoice: Explainable Evaluation of AI-Human Alignment in LLM-based Constrained Choice Decision Making


7. Beyond Model Scaling: Test-Time Intervention for Efficient Deep Reasoning


8. Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems


9. TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech


10. Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems


11. ReCreate: Reasoning and Creating Domain Agents Driven by Experience


12. MiCA: A Mobility-Informed Causal Adapter for Lightweight Epidemic Forecasting


13. AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts



15. Efficient Protein Optimization via Structure-aware Hamiltonian Dynamics


16. AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing


17. What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge


18. ARC Prize 2025: Technical Report


19. Optimisation of complex product innovation processes based on trend models with three-valued logic


20. Explore with Long-term Memory: A Benchmark and Multimodal LLM-based Reinforcement Learning Framework for Embodied Exploration


21. CTHA: Constrained Temporal Hierarchical Architecture for Stable Multi-Agent LLM Systems


22. ORBITFLOW: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration


23. Building AI Agents to Improve Job Referral Requests to Strangers


24. Do You Trust Me? Cognitive-Affective Signatures of Trustworthiness in Large Language Models


25. Japanese AI Agent System on Human Papillomavirus Vaccination: System Design


26. Do explanations generalize across large reasoning models?


27. Building Production-Ready Probes For Gemini


28. MetaboNet: The Largest Publicly Available Consolidated Dataset for Type 1 Diabetes Management


29. The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents


30. MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention across Vision-Language Models


31. Interactive Narrative Analytics: Bridging Computational Narrative Extraction and Human Sensemaking


32. PRISM-CAFO: Prior-conditioned Remote-sensing Infrastructure Segmentation and Mapping for CAFOs


33. Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps


34. Hierarchical Orthogonal Residual Spread for Precise Massive Editing in Large Language Models


35. GenDA: Generative Data Assimilation on Complex Urban Areas via Classifier-Free Diffusion Guidance


36. Relational Linearity is a Predictor of Hallucinations


37. The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents


38. Topology-Guaranteed Image Segmentation: Enforcing Connectivity, Genus, and Width Constraints


39. Wetland mapping from sparse annotations with satellite image time series and temporal-aware segment anything model


40. Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences


41. Institutional AI: Governing LLM Collusion in Multi-Agent Cournot Markets via Public Governance Graphs


42. Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding


43. FEATHer: Fourier-Efficient Adaptive Temporal Hierarchy Forecaster for Time-Series Forecasting


44. How Much Would a Clinician Edit This Draft? Evaluating LLM Alignment for Patient Message Response Drafting


45. From SERPs to Sound: How Search Engine Result Pages and AI-generated Podcasts Interact to Influence User Attitudes on Controversial Topics


46. X-Distill: Cross-Architecture Vision Distillation for Visuomotor Learning


47. Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation


48. FactCorrector: A Graph-Inspired Approach to Long-Form Factuality Correction of Large Language Models


49. SDFLoRA: Selective Dual-Module LoRA for Federated Fine-tuning with Heterogeneous Clients


50. LoRA as Oracle


51. Epistemic Control and the Normativity of Machine Learning-Based Science


52. FAQ: Mitigating Quantization Error via Regenerating Calibration Data with Family-Aware Quantization


53. SD-RAG: A Prompt-Injection-Resilient Framework for Selective Disclosure in Retrieval-Augmented Generation


54. Artificial Intelligence and the US Economy: An Accounting Perspective on Investment and Production


55. Clustering High-dimensional Data: Balancing Abstraction and Representation Tutorial at AAAI 2026


56. Cross-Modal Attention Network with Dual Graph Learning in Multimodal Recommendation


57. Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration


58. Learning Quadrupedal Locomotion for a Heavy Hydraulic Robot Using an Actuator Model


59. Context-aware Graph Causality Inference for Few-Shot Molecular Property Prediction


60. Learn Before Represent: Bridging Generative and Contrastive Learning for Domain-Specific LLM Embeddings


61. Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning


62. Efficient Multilingual Name Type Classification Using Convolutional Networks


63. Visual Marker Search for Autonomous Drone Landing in Diverse Urban Environments


64. ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development


65. A3D: Adaptive Affordance Assembly with Dual-Arm Manipulation


66. Bridging Cognitive Neuroscience and Graph Intelligence: Hippocampus-Inspired Multi-View Hypergraph Learning for Web Finance Fraud


67. Fairness in Healthcare Processes: A Quantitative Analysis of Decision Making in Triage


68. H-AIM: Orchestrating LLMs, PDDL, and Behavior Trees for Hierarchical Multi-Robot Planning


69. Predicting Biased Human Decision-Making with Large Language Models in Conversational Settings


70. Spectral Characterization and Mitigation of Sequential Knowledge Editing Collapse


71. Your One-Stop Solution for AI-Generated Video Detection


72. IDDR-NGP: Incorporating Detectors for Distractor Removal with Instant Neural Radiance Field


73. Combating Spurious Correlations in Graph Interpretability via Self-Reflection


74. Finding the Translation Switch: Discovering and Exploiting the Task-Initiation Features in LLMs


75. Contextual Distributionally Robust Optimization with Causal and Continuous Structure: An Interpretable and Tractable Approach


76. When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs


77. Steering Language Models Before They Speak: Logit-Level Interventions


78. Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents


79. Multi-Stage Patient Role-Playing Framework for Realistic Clinical Interactions


80. PatientVLM Meets DocVLM: Pre-Consultation Dialogue Between Vision-Language Models for Efficient Diagnosis


81. Sparse Data Tree Canopy Segmentation: Fine-Tuning Leading Pretrained Models on Only 150 Images


82. Selecting Language Models for Social Science: Start Small, Start Open, and Validate


83. RobuMTL: Enhancing Multi-Task Learning Robustness Against Weather Conditions


84. Self-learned representation-guided latent diffusion model for breast cancer classification in deep ultraviolet whole surface images


85. Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation


86. Can Vision-Language Models Understand Construction Workers? An Exploratory Study


87. Approximately Optimal Global Planning for Contact-Rich SE(2) Manipulation on a Graph of Reachable Sets


88. Towards Reliable ML Feature Engineering via Planning in Constrained-Topology of LLM Agents


89. Digital Metabolism: Decoupling Logic from Facts via Regenerative Unlearning – Towards a Pure Neural Logic Core


90. Unified Optimization of Source Weights and Transfer Quantities in Multi-Source Transfer Learning: An Asymptotic Framework


91. LogicLens: Leveraging Semantic Code Graph to explore Multi Repository large systems


92. Unifying Speech Recognition, Synthesis and Conversion with Autoregressive Transformers


93. AnyECG: Evolved ECG Foundation Model for Holistic Health Profiling


94. Line-based Event Preprocessing: Towards Low-Energy Neuromorphic Computer Vision


95. Neuro-Symbolic Activation Discovery: Transferring Mathematical Structures from Physics to Ecology for Parameter-Efficient Neural Networks


96. Millimeter-Wave Gesture Recognition in ISAC: Does Reducing Sensing Airtime Hamper Accuracy?


97. DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion


98. EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting


99. Generative AI Purpose-built for Social and Mental Health: A Real-World Pilot