전체 AI 논문 - 2026-01-12

1. Open-Vocabulary 3D Instruction Ambiguity Detection


2. TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents


3. StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management


4. From Off-Policy to On-Policy: Enhancing GUI Agents via Bi-level Expert-to-Policy Assimilation


5. DynaDebate: Breaking Homogeneity in Multi-Agent Debate with Dynamic Path Generation


6. PII-VisBench: Evaluating Personally Identifiable Information Safety in Vision Language Models Along a Continuum of Visibility


7. Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding


8. Logic-Parametric Neuro-Symbolic NLI: Controlling Logical Formalisms for Verifiable LLM Reasoning


9. Circular Reasoning: Understanding Self-Reinforcing Loops in Large Reasoning Models


10. CHDP: Cooperative Hybrid Diffusion Policies for Reinforcement Learning in Parameterized Action Space


11. HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation


12. GenCtrl – A Formal Controllability Toolkit for Generative Models


13. Cumulative Path-Level Semantic Reasoning for Inductive Knowledge Graph Completion


14. A Causal Information-Flow Framework for Unbiased Learning-to-Rank


15. Reinforcement Learning of Large Language Models for Interpretable Credit Card Fraud Detection


16. Crisis-Bench: Benchmarking Strategic Ambiguity and Reputation Management in Large Language Models


17. WildSci: Advancing Scientific Reasoning from In-the-Wild Literature


18. Safety Not Found (404): Hidden Risks of LLM-Based Robotics Decision Making


19. Explainable AI: Learning from the Learners


20. The Evaluation Gap in Medicine, AI and LLMs: Navigating Elusive Ground Truth & Uncertainty via a Probabilistic Paradigm


21. MMUEChange: A Generalized LLM Agent Framework for Intelligent Multi-Modal Urban Environment Change Analysis


22. PRISMA: Reinforcement Learning Guided Two-Stage Policy Optimization in Multi-Agent Architecture for Open-Domain Multi-Hop Question Answering


23. ART: Adaptive Reasoning Trees for Explainable Claim Verification


24. On the Effect of Cheating in Chess


25. Conformity and Social Impact on AI Agents


26. The Persona Paradox: Medical Personas as Behavioral Priors in Clinical Language Models


27. Improving Enzyme Prediction with Chemical Reaction Equations by Hypergraph-Enhanced Knowledge Graph Embeddings


28. Effects of personality steering on cooperative behavior in Large Language Model agents


29. Mathematical Knowledge Graph-Driven Framework for Equation-Based Predictive and Reliable Additive Manufacturing


30. Naiad: Novel Agentic Intelligent Autonomous System for Inland Water Monitoring


31. AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs


32. The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning


33. VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction


34. Performance of a Deep Learning-Based Segmentation Model for Pancreatic Tumors on Public Endoscopic Ultrasound Datasets


35. Can We Predict Before Executing Machine Learning Agents?


36. Cedalion Tutorial: A Python-based framework for comprehensive analysis of multimodal fNIRS & DOT from the lab to the everyday world


37. Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset


38. Auditing Fairness under Model Updates: Fundamental Complexity and Property-Preserving Updates


39. Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency


40. Can AI mediation improve democratic deliberation?


41. An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift


42. Gender Bias in LLMs: Preliminary Evidence from Shared Parenting Scenario in Czech Family Law


43. Continual-learning for Modelling Low-Resource Languages from Large Language Models


44. IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck


45. CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning


46. LayerGS: Decomposition and Inpainting of Layered 3D Human Avatars via 2D Gaussian Splatting


47. Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs


48. Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals


49. DexterCap: An Affordable and Automated System for Capturing Dexterous Hand-Object Manipulation


50. Intelligent Singularity Avoidance in UR10 Robotic Arm Path Planning Using Hybrid Fuzzy Logic and Reinforcement Learning


51. Influence of Parallelism in Vector-Multiplication Units on Correlation Power Analysis


52. Decoding Workload and Agreement From EEG During Spoken Dialogue With Conversational AI


53. SceneFoundry: Generating Interactive Infinite 3D Worlds


54. EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis


55. Tensor-DTI: Enhancing Biomolecular Interaction Prediction with Contrastive Embedding Learning


56. SAFE: Secure and Accurate Federated Learning for Privacy-Preserving Brain-Computer Interfaces


57. Adaptive Disentangled Representation Learning for Incomplete Multi-View Multi-Label Classification


58. Variational Autoencoders for P-wave Detection on Strong Motion Earthquake Spectrograms


59. VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit


60. Analysing Differences in Persuasive Language in LLM-Generated Text: Uncovering Stereotypical Gender Patterns


61. The Echo Chamber Multi-Turn LLM Jailbreak


62. mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations


63. Visualising Information Flow in Word Embeddings with Diffusion Tensor Imaging


64. Multimodal In-context Learning for ASR of Low-resource Languages


65. AIBoMGen: Generating an AI Bill of Materials for Secure, Transparent, and Compliant Model Training


66. Joint Optimization of Neural Autoregressors via Scoring rules


67. AGDC: Autoregressive Generation of Variable-Length Sequences with Joint Discrete and Continuous Spaces


68. Advancing credit mobility through stakeholder-informed AI design and adoption


69. Stephanie2: Thinking, Waiting, and Making Decisions Like Humans in Step-by-Step AI Social Chat


70. A Framework for Personalized Persuasiveness Prediction via Context-Aware User Profiling


71. Open World Knowledge Aided Single-Cell Foundation Model with Robust Cross-Modal Cell-Language Pre-training


72. Transformer Is Inherently a Causal Learner


73. PiXTime: A Model for Federated Time Series Forecasting with Heterogeneous Data Structures Across Nodes


74. ACR: Adaptive Context Refactoring via Context Refactoring Operators for Multi-Turn Dialogue


75. Autoregressive Ranking: Bridging the Gap Between Dual and Cross Encoders


76. HogVul: Black-box Adversarial Code Generation Framework Against LM-based Vulnerability Detectors


77. GS-DMSR: Dynamic Sensitive Multi-scale Manifold Enhancement for Accelerated High-Quality 3D Gaussian Splatting


78. RISE: Rule-Driven SQL Dialect Translation via Query Reduction


79. ReasonAny: Incorporating Reasoning Capability to Any Model via Simple and Effective Model Merging


80. Semi-Supervised Facial Expression Recognition based on Dynamic Threshold and Negative Learning


81. VIB-Probe: Detecting and Mitigating Hallucinations in Vision-Language Models via Variational Information Bottleneck


82. Understanding LLM-Driven Test Oracle Generation


83. Scalable Heterogeneous Graph Learning via Heterogeneous-aware Orthogonal Prototype Experts


84. DeMa: Dual-Path Delay-Aware Mamba for Efficient Multivariate Time Series Analysis


85. Over-Searching in Search-Augmented Large Language Models


86. Evaluating the Use of LLMs for Automated DOM-Level Resolution of Web Performance Issues


87. Prompt-Free SAM-Based Multi-Task Framework for Breast Ultrasound Lesion Segmentation and Classification


88. Efficient Differentiable Causal Discovery via Reliable Super-Structure Learning


89. STELP: Secure Transpilation and Execution of LLM-Generated Programs


90. Jailbreaking Large Language Models through Iterative Tool-Disguised Attacks via Reinforcement Learning


91. Do LLMs Need Inherent Reasoning Before Reinforcement Learning? A Study in Korean Self-Correction


92. Tracing Moral Foundations in Large Language Models


93. Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization


94. Multi-task Cross-modal Learning for Chest X-ray Image Retrieval


95. Ensemble of radiomics and ConvNeXt for breast cancer diagnosis


96. Lost in Execution: On the Multilingual Robustness of Tool Calling in Large Language Models


97. STResNet & STYOLO : A New Family of Compact Classification and Object Detection Models for MCUs


98. PRISM: Protocol Refinement through Intelligent Simulation Modeling


99. A Bayesian Generative Modeling Approach for Arbitrary Conditional Inference


100. Multi-turn Jailbreaking Attack in Multi-Modal Large Language Models


101. Bi-Orthogonal Factor Decomposition for Vision Transformers


102. MoEBlaze: Breaking the Memory Wall for Efficient MoE Training on Modern GPUs


103. A Survey of Agentic AI and Cybersecurity: Challenges, Opportunities and Use-case Prototypes


104. On the Limits of Self-Improving in LLMs and Why AGI, ASI and the Singularity Are Not Near Without Symbolic Model Synthesis


105. Simulation-Free PSRO: Removing Game Simulation from Policy Space Response Oracles


106. Evolving Cognitive Architectures


107. Bayesian Recovery for Probabilistic Coalition Structures


108. LiveVectorLake: A Real-Time Versioned Knowledge Base Architecture for Streaming Vector Updates and Temporal Retrieval


109. Retrieval-Augmented Multi-LLM Ensemble for Industrial Part Specification Extraction


110. Cross-Document Topic-Aligned Chunking for Retrieval-Augmented Generation


111. Engineering the RAG Stack: A Comprehensive Review of the Architecture and Trust Frameworks for Retrieval-Augmented Generation Systems


112. LLM2IR: simple unsupervised contrastive learning makes long-context LLM great retriever


113. Quantifying Document Impact in RAG-LLMs



115. KP-Agent: Keyword Pruning in Sponsored Search Advertising via LLM-Powered Contextual Bandits


116. SP-Rank: A Dataset for Ranked Preferences with Secondary Information


117. Tiny Recursive Models on ARC-AGI-1: Inductive Biases, Identity Conditioning, and Test-Time Compute


118. Automating Deception: Scalable Multi-Turn LLM Jailbreaks


119. Towards Realistic Guarantees: A Probabilistic Certificate for SmoothLLM


120. EvoC2Rust: A Skeleton-guided Framework for Project-Level C-to-Rust Translation