전체 AI 논문 - 2026-02-06

1. DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching


2. Learning Event-Based Shooter Models from Virtual Reality Experiments


3. AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions


4. Speech Emotion Recognition Leveraging OpenAI’s Whisper Representations and Attentive Pooling Methods


5. Geographically-aware Transformer-based Traffic Forecasting for Urban Motorway Digital Twins


6. Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem


7. A Guide to Large Language Models in Modeling and Simulation: From Core Techniques to Critical Challenges


8. Agent2Agent Threats in Safety-Critical LLM Assistants: A Human-Centric Taxonomy


9. Beyond Manual Planning: Seating Allocation for Large Organizations


10. BABE: Biology Arena BEnchmark


11. OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention


12. Learning Compact Boolean Networks


13. TKG-Thinker: Towards Dynamic Reasoning over Temporal Knowledge Graphs via Agentic Reinforcement Learning


14. STProtein: predicting spatial protein expression from multi-omics data


15. NEX: Neuron Explore-Exploit Scoring for Label-Free Chain-of-Thought Selection and Model Ranking


16. FiMI: A Domain-Specific Language Model for Indian Finance Ecosystem


17. RL-VLA$^3$: Reinforcement Learning VLA Accelerating via Full Asynchronism


18. RocqSmith: Can Automatic Optimization Forge Better Proof Agents?


19. LeakBoost: Perceptual-Loss-Based Membership Inference Attack


20. Mitigating Hallucination in Financial Retrieval-Augmented Generation via Fine-Grained Knowledge Verification


21. Anchored Policy Optimization: Mitigating Exploration Collapse Via Support-Constrained Rectification


22. Nonlinearity as Rank: Generative Low-Rank Adapter with Radial Basis Functions


23. Determining Energy Efficiency Sweet Spots in Production LLM Inference


24. Graph-based Agent Memory: Taxonomy, Techniques, and Applications


25. Generative Ontology: When Structured Knowledge Learns to Create


26. Reactive Knowledge Representation and Asynchronous Reasoning


27. BhashaSetu: Cross-Lingual Knowledge Transfer from High-Resource to Extreme Low-Resource Languages


28. Emulating Aggregate Human Choice Behavior and Biases with GPT Conversational Agents


29. TangramSR: Can Vision-Language Models Reason in Continuous Geometric Space?


30. Reasoning-guided Collaborative Filtering with Language Models for Explainable Recommendation


31. Conditional Diffusion Guidance under Hard Constraint: A Stochastic Analysis Approach


32. Split Personality Training: Revealing Latent Knowledge Through Alternate Personalities


33. A Unified Multimodal Framework for Dataset Construction and Model-Based Diagnosis of Ameloblastoma


34. SDFP: Speculative Decoding with FIT-Pruned Models for Training-Free and Plug-and-Play LLM Acceleration


35. Phi-Former: A Pairwise Hierarchical Approach for Compound-Protein Interactions Prediction


36. ALIVE: Awakening LLM Reasoning via Adversarial Learning and Instructive Verbal Evaluation


37. Refine and Purify: Orthogonal Basis Optimization with Null-Space Denoising for Conditional Representation Learning


38. Day-Ahead Electricity Price Forecasting for Volatile Markets Using Foundation Models with Regularization Strategy


39. M$^2$-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining



41. H-AdminSim: A Multi-Agent Simulator for Realistic Hospital Administrative Workflows with FHIR Integration


42. Advancing Opinion Dynamics Modeling with Neural Diffusion-Convection-Reaction Equation


43. Clinical Validation of Medical-based Large Language Model Chatbots on Ophthalmic Patient Queries with LLM-based Evaluation


44. RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs


45. PATHWAYS: Evaluating Investigation and Context Discovery in AI Web Agents


46. AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction


47. ProAct: Agentic Lookahead in Interactive Environments


48. PieArena: Frontier Language Agents Achieve MBA-Level Negotiation Performance and Reveal Novel Behavioral Differences


49. Aspect-Aware MOOC Recommendation in a Heterogeneous Network


50. Position: Universal Time Series Foundation Models Rest on a Category Error


51. Hallucination-Resistant Security Planning with a Large Language Model


52. Beyond Cosine Similarity


53. Automatic Cognitive Task Generation for In-Situ Evaluation of Embodied Agents


54. Explainable AI: A Combined XAI Framework for Explaining Brain Tumour Detection Models


55. Surgery: Mitigating Harmful Fine-Tuning for Large Language Models via Attention Sink


56. Traceable Cross-Source RAG for Chinese Tibetan Medicine Question Answering


57. First Proof


58. HugRAG: Hierarchical Causal Knowledge Graph Design for RAG


59. CAST-CKT: Chaos-Aware Spatio-Temporal and Cross-City Knowledge Transfer for Traffic Flow Prediction


60. SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers


61. Democratic Preference Alignment via Sortition-Weighted RLHF


62. Understanding LLM Evaluator Behavior: A Structured Multi-Evaluator Framework for Merchant Risk Assessment


63. GAMMS: Graph based Adversarial Multiagent Modeling Simulator


64. Evaluating Robustness and Adaptability in Learning-Based Mission Planning for Active Debris Removal


65. VERA-MH: Reliability and Validity of an Open-Source AI Safety Evaluation in Mental Health


66. Optimizing Mission Planning for Multi-Debris Rendezvous Using Reinforcement Learning with Refueling and Adaptive Collision Avoidance


67. Towards Reducible Uncertainty Modeling for Reliable Large Language Model Agents


68. Evaluating Large Language Models on Solved and Unsolved Problems in Graph Theory: Implications for Computing Education


69. MINT: Minimal Information Neuro-Symbolic Tree for Objective-Driven Knowledge-Gap Reasoning and Active Elicitation



71. Artificial Intelligence as Strange Intelligence: Against Linear Models of Intelligence


72. Shared LoRA Subspaces for almost Strict Continual Learning


73. CommCP: Efficient Multi-Agent Coordination via LLM-Based Communication with Conformal Prediction


74. Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory


75. Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering


76. Optimism Stabilizes Thompson Sampling for Adaptive Inference


77. GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks?


78. Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps


79. RISE-Video: Can Video Generators Decode Implicit World Rules?


80. Clifford Kolmogorov-Arnold Networks


81. Inverse Depth Scaling From Most Layers Being Similar


82. LSA: Localized Semantic Alignment for Enhancing Temporal Consistency in Traffic Video Generation


83. Learning to Share: Selective Memory for Efficient Parallel Agentic Systems


84. Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching


85. Compound Deception in Elite Peer Review: A Failure Mode Taxonomy of 100 Fabricated Citations at NeurIPS 2025


86. Verification of the Implicit World Model in a Generative Model via Adversarial Sequences


87. Regularized Calibration with Successive Rounding for Post-Training Quantization


88. Parity, Sensitivity, and Transformers


89. Metric Hedonic Games on the Line


90. Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations


91. Neural Implicit 3D Cardiac Shape Reconstruction from Sparse CT Angiography Slices Mimicking 2D Transthoracic Echocardiography Views


92. EuroLLM-22B: Technical Report


93. xList-Hate: A Checklist-Based Framework for Interpretable and Generalizable Hate Speech Detection


94. DLM-Scope: Mechanistic Interpretability of Diffusion Language Models via Sparse Autoencoders


95. DARWIN: Dynamic Agentically Rewriting Self-Improving Network


96. FHAIM: Fully Homomorphic AIM For Private Synthetic Data Generation


97. Allocentric Perceiver: Disentangling Allocentric Reasoning from Egocentric Visual Priors via Frame Instantiation


98. Bagging-Based Model Merging for Robust General Text Embeddings


99. ReText: Text Boosts Generalization in Image-Based Person Re-identification


100. Automated Customization of LLMs for Enterprise Code Repositories Using Semantic Scopes


101. Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance


102. TimelyFreeze: Adaptive Parameter Freezing Mechanism for Pipeline Parallelism


103. Learning to Inject: Automated Prompt Injection via Reinforcement Learning


104. CSRv2: Unlocking Ultra-Sparse Embeddings


105. Evaluating the impact of word embeddings on similarity scoring in practical information retrieval


106. CompactRAG: Reducing LLM Calls and Token Overhead in Multi-Hop Question Answering


107. Towards Green AI: Decoding the Energy of LLM Inference in Software Development


108. OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale


109. Poster: Camera Tampering Detection for Outdoor IoT Systems


110. Mining Generalizable Activation Functions


111. Exploring AI-Augmented Sensemaking of Patient-Generated Health Data: A Mixed-Method Study with Healthcare Professionals in Cardiac Risk Reduction


112. HyperPotter: Spell the Charm of High-Order Interactions in Audio Deepfake Detection


113. Stable but Wrong: When More Data Degrades Scientific Conclusions


114. Probabilistic Multi-Regional Solar Power Forecasting with Any-Quantile Recurrent Neural Networks


115. Alignment Verifiability in Large Language Models: Normative Indistinguishability under Behavioral Evaluation


116. Enhancing Personality Recognition by Comparing the Predictive Power of Traits, Facets, and Nuances


117. AI chatbots versus human healthcare professionals: a systematic review and meta-analysis of empathy in patient care


118. Mode-Dependent Rectification for Stable PPO Training


119. Path-Guided Flow Matching for Dataset Distillation


120. Shiva-DiT: Residual-Based Differentiable Top-$k$ Selection for Efficient Diffusion Transformers


121. CAViT – Channel-Aware Vision Transformer for Dynamic Feature Fusion


122. Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation


123. Multi-Task GRPO: Reliable LLM Reasoning Across Tasks


124. Steering Large Reasoning Models towards Concise Reasoning via Flow Matching


125. When Shared Knowledge Hurts: Spectral Over-Accumulation in Model Merging


126. AI Agent Systems for Supply Chains: Structured Decision Prompts and Memory Retrieval


127. Capture the Flags: Family-Based Evaluation of Agentic LLMs via Semantics-Preserving Transformations


128. DECO: Decoupled Multimodal Diffusion Transformer for Bimanual Dexterous Manipulation with a Plugin Tactile Adapter


129. XEmoGPT: An Explainable Multimodal Emotion Recognition Framework with Cue-Level Perception and Reasoning


130. Transport and Merge: Cross-Architecture Merging for Large Language Models


131. A Unified Framework for Rethinking Policy Divergence Measures in GRPO


132. LinguistAgent: A Reflective Multi-Model Platform for Automated Linguistic Annotation


133. Sovereign-by-Design A Reference Architecture for AI and Blockchain Enabled Systems


134. LMMRec: LLM-driven Motivation-aware Multimodal Recommendation


135. Thermodynamic Limits of Physical Intelligence


136. Ontology-Driven Robotic Specification Synthesis


137. Attention Retention for Continual Learning with Vision Transformers


138. Towards Segmenting the Invisible: An End-to-End Registration and Segmentation Framework for Weakly Supervised Tumour Analysis


139. DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching


140. Structured Context Engineering for File-Native Agentic Systems: Evaluating Schema Accuracy, Format Effectiveness, and Multi-File Navigation at Scale


141. Benchmarking Affordance Generalization with BusyBox


142. Disco: Densely-overlapping Cell Instance Segmentation via Adjacency-aware Collaborative Coloring


143. Reduced-Order Surrogates for Forced Flexible Mesh Coastal-Ocean Models


144. Enabling Automatic Disordered Speech Recognition: An Impaired Speech Dataset in the Akan Language


145. Optimal Bayesian Stopping for Efficient Inference of Consistent LLM Answers


146. Beyond Length: Context-Aware Expansion and Independence as Developmentally Sensitive Evaluation in Child Utterances


147. Assessing Electricity Demand Forecasting with Exogenous Data in Time Series Foundation Models


148. Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening


149. GAS: Enhancing Reward-Cost Balance of Generative Model-assisted Offline Safe RL


150. Formal Synthesis of Certifiably Robust Neural Lyapunov-Barrier Certificates


151. FlashBlock: Attention Caching for Efficient Long-Context Block Diffusion


152. Towards a Science of Collective AI: LLM-based Multi-Agent Systems Need a Transition from Blind Trial-and-Error to Rigorous Science


153. HealthMamba: An Uncertainty-aware Spatiotemporal Graph State Space Model for Effective and Reliable Healthcare Facility Visit Prediction


154. Hybrid Gated Flow (HGF): Stabilizing 1.58-bit LLMs via Selective Low-Rank Correction


155. CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs


156. EGSS: Entropy-guided Stepwise Scaling for Reliable Software Engineering


157. Balanced Anomaly-guided Ego-graph Diffusion Model for Inductive Graph Anomaly Detection


158. ZeroS: Zero-Sum Linear Attention for Efficient Transformers


159. Semantic Search over 9 Million Mathematical Theorems


160. ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference


161. Aligning Large Language Model Behavior with Human Citation Preferences


162. Double-P: Hierarchical Top-P Sparse Attention for Long-Context LLMs


163. Towards Worst-Case Guarantees with Scale-Aware Interpretability


164. Data-Centric Interpretability for LLM-based Multi-Agent Reinforcement Learning


165. Benchmarking Artificial Intelligence Models for Daily Coastal Hypoxia Forecasting


166. Total Variation Rates for Riemannian Flow Matching


167. EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization


168. Position: Capability Control Should be a Separate Goal From Alignment


169. CoSA: Compressed Sensing-Based Adaptation of Large Language Models


170. Cross-talk based multi-task learning for fault classification of physically coupled machine system


171. TIDE: Temporal Incremental Draft Engine for Self-Improving LLM Inference


172. Rethinking Rubric Generation for Improving LLM Judge and Reward Modeling for Open-ended Tasks


173. Autodiscover: A reinforcement learning recommendation system for the cold-start imbalance challenge in active learning, powered by graph-aware thompson sampling


174. Individual Fairness In Strategic Classification



176. Reliable Explanations or Random Noise? A Reliability Metric for XAI


177. Food Portion Estimation: From Pixels to Calories


178. E-Globe: Scalable $ε$-Global Verification of Neural Networks via Tight Upper Bounds and Pattern-Aware Branching


179. Bypassing AI Control Protocols via Agent-as-a-Proxy Attacks


180. ReFORM: Reflected Flows for On-support Offline RL via Noise Manipulation


181. VISTA: Enhancing Visual Conditioning via Track-Following Preference Optimization in Vision-Language-Action Models


182. Quality Model for Machine Learning Components


183. Differentiable Inverse Graphics for Zero-shot Scene Reconstruction and Robot Grasping


184. AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders


185. Laws of Learning Dynamics and the Core of Learners


186. Do Vision-Language Models Respect Contextual Integrity in Location Disclosure?


187. From Fragmentation to Integration: Exploring the Design Space of AI Agents for Human-as-the-Unit Privacy Management


188. Enhanced QKNorm normalization for neural transformers with the Lp norm


189. CoWork-X: Experience-Optimized Co-Evolution for Multi-Agent Collaboration System


190. EntRGi: Entropy Aware Reward Guidance for Diffusion Language Models


191. Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning


192. Near-Optimal Dynamic Matching via Coarsening with Application to Heart Transplantation


193. AI-Based Detection of In-Treatment Changes from Prostate MR-Linac Images


194. Stochastic hierarchical data-driven optimization: application to plasma-surface kinetics


195. Smart Diagnosis and Early Intervention in PCOS: A Deep Learning Approach to Women’s Reproductive Health


196. Graph–Theoretic Analysis of Phase Optimization Complexity in Variational Wave Functions for Heisenberg Antiferromagnets


197. Privileged Information Distillation for Language Models


198. Linear Model Merging Unlocks Simple and Scalable Multimodal Data Mixture Optimization


199. ASA: Activation Steering for Tool-Calling Domain Adaptation


200. Depth-Wise Emergence of Prediction-Centric Geometry in Large Language Models


201. Attack Selection Reduces Safety in Concentrated AI Control Settings against Trusted Monitoring


202. PriMod4AI: Lifecycle-Aware Privacy Threat Modeling for AI Systems using LLM


203. Internalizing LLM Reasoning via Discovery and Replay of Latent Actions


204. SLAY: Geometry-Aware Spherical Linearized Attention with Yat-Kernel


205. A$^2$-LLM: An End-to-end Conversational Audio Avatar Large Language Model


206. A logical re-conception of neural networks: Hamiltonian bitwise part-whole architecture


207. Temporal Pair Consistency for Variance-Reduced Flow Matching


208. Physics as the Inductive Bias for Causal Discovery


209. DCER: Dual-Stage Compression and Energy-Based Reconstruction


210. Momentum Attention: The Physics of In-Context Learning and Spectral Forensics for Mechanistic Interpretability


211. Evaluating Kubernetes Performance for GenAI Inference: From Automatic Speech Recognition to LLM Summarization


212. Phantom Transfer: Data-level Defences are Insufficient Against Data Poisoning


213. Semantic-level Backdoor Attack against Text-to-Image Diffusion Models


214. Steering Externalities: Benign Activation Steering Unintentionally Increases Jailbreak Risk for Large Language Models


215. Extracting Recurring Vulnerabilities from Black-Box LLM-Generated Software


216. A Causal Perspective for Enhancing Jailbreak Attack and Defense


217. Doc2Spec: Synthesizing Formal Programming Specifications from Natural Language via Grammar Induction


218. A General-Purpose Diversified 2D Seismic Image Dataset from NAMSS


219. Denoising diffusion networks for normative modeling in neuroimaging


220. Cold Start Problem: An Experimental Study of Knowledge Tracing Models with New Students