전체 AI 논문 - 2026-01-26

1. Empowering Medical Equipment Sustainability in Low-Resource Settings: An AI-Powered Diagnostic and Support Platform for Biomedical Technicians


2. Spatial-Agent: Agentic Geo-spatial Reasoning with Scientific Core Concepts


3. AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems


4. Preventing the Collapse of Peer Review Requires Verification-First AI


5. MAGE-KT: Multi-Agent Graph-Enhanced Knowledge Tracing with Subgraph Retrieval and Asymmetric Fusion


6. Mixture-of-Models: Unifying Heterogeneous Agents via N-Way Self-Evaluating Deliberation


7. Reasoning Promotes Robustness in Theory of Mind Tasks


8. An Efficient Insect-inspired Approach for Visual Point-goal Navigation


9. LongCat-Flash-Thinking-2601 Technical Report


10. AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning


11. LUMINA: Long-horizon Understanding for Multi-turn Interactive Agents


12. LLM is Not All You Need: A Systematic Evaluation of ML vs. Foundation Models for text and image based Medical Classification


13. SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters for Emergency Care


14. Doc2AHP: Inferring Structured Multi-Criteria Decision Models via Semantic Trees with LLMs


15. DSGym: A Holistic Framework for Evaluating and Training Data Science Agents


16. SemanticALLI: Caching Reasoning, Not Just Responses, in Agentic Systems


17. When Agents Fail to Act: A Diagnostic Framework for Tool Invocation Reliability in Multi-Agent LLM Systems


18. A Scalable Measure of Loss Landscape Curvature for Analyzing the Training Dynamics of LLMs


19. BONO-Bench: A Comprehensive Test Suite for Bi-objective Numerical Optimization with Traceable Pareto Sets


20. Information Representation Fairness in Long-Document Embeddings: The Peculiar Interaction of Positional and Language Bias


21. Nishpaksh: TEC Standard-Compliant Framework for Fairness Auditing and Certification of AI Models


22. LoL: Longer than Longer, Scaling Video Generation to Hour


23. GRIP: Algorithm-Agnostic Machine Unlearning for Mixture-of-Experts via Geometric Router Constraints


24. Evaluating Large Vision-language Models for Surgical Tool Detection


25. LLM-Based Adversarial Persuasion Attacks on Fact-Checking Systems


26. Explaining Group Recommendations via Counterfactuals


27. No Validation, No Problem: Predicting Model Performance from a Single Gradient


28. Boosting Deep Reinforcement Learning with Semantic Knowledge for Robotic Manipulators


29. Orbitopal Fixing in SAT


30. Uncertainty propagation through trained multi-layer perceptrons: Exact analytical results


31. Privacy in Human-AI Romantic Relationships: Concerns, Boundaries, and Agency


32. Trapped in the past? Disentangling fluid and crystallized intelligence of large language models using chess


33. Incorporating Eye-Tracking Signals Into Multimodal Deep Visual Models For Predicting User Aesthetic Experience In Residential Interiors


34. Will It Survive? Deciphering the Fate of AI-Generated Code in Open Source


35. SoS: Analysis of Surface over Semantics in Multilingual Text-To-Image Generation


36. REL-SF4PASS: Panoramic Semantic Segmentation with REL Depth Representation and Spherical Fusion


37. GTA: Generative Traffic Agents for Simulating Realistic Mobility Behavior


38. Do LLM hallucination detectors suffer from low-resource effect?


39. Curated endoscopic retrograde cholangiopancreatography images dataset


40. Standardizing Longitudinal Radiology Report Evaluation via Large Language Model Annotation


41. Dynamic Expert-Guided Model Averaging for Causal Discovery


42. Adoption of Generative Artificial Intelligence in the German Software Engineering Industry: An Empirical Study


43. Sim-to-Real Transfer via a Style-Identified Cycle Consistent Generative Adversarial Network: Zero-Shot Deployment on Robotic Manipulators through Visual Domain Adaptation



45. Revisiting the Role of Natural Language Code Comments in Code Translation


46. Provably Robust Bayesian Counterfactual Explanations under Model Changes


47. Generative Confidants: How do People Experience Trust in Emotional Support from Generative AI?


48. Sycophancy Hides Linearly in the Attention Heads


49. Dual-Prototype Disentanglement: A Context-Aware Enhancement Framework for Time Series Forecasting


50. E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory


51. Boundary and Position Information Mining for Aerial Small Object Detection


52. Attention-MoA: Enhancing Mixture-of-Agents via Inter-Agent Semantic Attention and Deep Residual Synthesis


53. Integrating Meteorological and Operational Data: A Novel Approach to Understanding Railway Delays in Finland


54. Emerging Threats and Countermeasures in Neuromorphic Systems: A Survey


55. Process-Tensor Tomography of SGD: Measuring Non-Markovian Memory via Back-Flow of Distinguishability


56. PRISM: Purified Representation and Integrated Semantic Modeling for Generative Sequential Recommendation


57. CORD: Bridging the Audio-Text Reasoning Gap via Weighted On-policy Cross-modal Distillation


58. Do Models Hear Like Us? Probing the Representational Alignment of Audio LLMs and Naturalistic EEG


59. A Collision-Free Hot-Tier Extension for Engram-Style Conditional Memory: A Controlled Study of Training Dynamics


60. Beyond Superficial Unlearning: Sharpness-Aware Robust Erasure of Hallucinations in Multimodal LLMs


61. TangramPuzzle: Evaluating Multimodal Large Language Models with Compositional Spatial Reasoning


62. Finite-Time Analysis of Gradient Descent for Shallow Transformers


63. kNN-Graph: An adaptive graph model for $k$-nearest neighbors


64. SafeThinker: Reasoning about Risk to Deepen Safety Beyond Shallow Alignment


65. LOGICAL-COMMONSENSEQA: A Benchmark for Logical Commonsense Reasoning


66. MRAG: Benchmarking Retrieval-Augmented Generation for Bio-medicine


67. EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration


68. Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic


69. DeepEra: A Deep Evidence Reranking Agent for Scientific Retrieval-Augmented Generated Question Answering


70. DeMark: A Query-Free Black-Box Attack on Deepfake Watermarking Defenses


71. Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding


72. AlphaFace: High Fidelity and Real-time Face Swapper Robust to Facial Pose


73. RENEW: Risk- and Energy-Aware Navigation in Dynamic Waterways


74. PyHealth 2.0: A Comprehensive Open-Source Toolkit for Accessible and Reproducible Clinical Deep Learning


75. Jacobian Scopes: token-level causal attributions in LLMs


76. Reasoning-Enhanced Rare-Event Prediction with Balanced Outcome Correction


77. Cite-While-You-Generate: Training-Free Evidence Attribution for Multimodal Clinical Summarization


78. ResAgent: Entropy-based Prior Point Discovery and Visual Reasoning for Referring Expression Segmentation


79. Cross-Lingual Activation Steering for Multilingual Language Models


80. Cognitively-Inspired Tokens Overcome Egocentric Bias in Multimodal Models


81. Improving the Accuracy of Community Detection on Signed Networks via Community Refinement and Contrastive Learning


82. Experience with Single Domain Generalization in Real World Medical Imaging Deployments


83. NOIR: Privacy-Preserving Generation of Code with Open-Source LLMs


84. Regional Bias in Large Language Models


85. DMAVA: Distributed Multi-Autonomous Vehicle Architecture Using Autoware


86. Where is the multimodal goal post? On the Ability of Foundation Models to Recognize Contextually Important Moments


87. DMV-AVP: Distributed Multi-Vehicle Autonomous Valet Parking using Autoware


88. Machine-Assisted Grading of Nationwide School-Leaving Essay Exams with LLMs and Statistical NLP



90. Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory


91. Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplication Made Simple


92. Generating Literature-Driven Scientific Theories at Scale


93. Better as Generators Than Classifiers: Leveraging LLMs and Synthetic Data for Low-Resource Multilingual Classification


94. GameTalk: Training LLMs for Strategic Conversation


95. Ordering-based Causal Discovery via Generalized Score Matching


96. A New Paradigm for Trusted Respiratory Monitoring Via Consumer Electronics-grade Radar Signals


97. VibeTensor: System Software for Deep Learning, Fully Generated by AI Agents


98. Computational Foundations for Strategic Coopetition: Formalizing Collective Action and Loyalty


99. Policy-Embedded Graph Expansion: Networked HIV Testing with Diffusion-Driven Network Samples


100. SoundBreak: A Systematic Study of Audio-Only Adversarial Attacks on Trimodal Models


101. Zero-Shot Speech LLMs for Multi-Aspect Evaluation of L2 Speech: Challenges and Opportunities


102. ES4R: Speech Encoding Based on Prepositive Affective Modeling for Empathetic Response Generation


103. Domain Specific Specialization in Low-Resource Settings: The Efficacy of Offline Response-Based Knowledge Distillation in Large Language Models


104. M3Kang: Evaluating Multilingual Multimodal Mathematical Reasoning in Vision-Language Models


105. ChiEngMixBench: Evaluating Large Language Models on Spontaneous and Natural Chinese-English Code-Mixed Generation


106. Interpretable Fine-Gray Deep Survival Model for Competing Risks: Predicting Post-Discharge Foot Complications for Diabetic Patients in Ontario