전체 AI 논문 - 2025-12-05

1. Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol


2. Autonomous Agents and Policy Compliance: A Framework for Reasoning About Penalties


3. A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA)


4. Omni-AutoThink: Adaptive Multimodal Reasoning via Reinforcement Learning


5. RoCo: Role-Based LLMs Collaboration for Automatic Heuristic Design


6. MemVerse: Multimodal Memory for Lifelong Learning Agents


7. DeepRule: An Integrated Framework for Automated Business Rule Generation via Deep Predictive Modeling and Hybrid Search Optimization


8. EnCompass: Enhancing Agent Programming with Search Over Program Execution Paths


9. Reason-Plan-ReAct: A Reasoner-Planner Supervising a ReAct Executor for Complex Enterprise Tasks


10. PARC: An Autonomous Self-Reflective Coding Agent for Robust Execution of Long-Horizon Tasks


11. Multi-Agent Reinforcement Learning with Communication-Constrained Priors


12. Multimodal Reinforcement Learning with Agentic Verifier for AI Agents


13. Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia


14. Prior preferences in active inference agents: soft, hard, and goal shaping


15. When Do Symbolic Solvers Enhance Reasoning in Large Language Models?


16. Beyond the Black Box: A Cognitive Architecture for Explainable and Aligned AI


17. Exploring Syntropic Frameworks in AI Alignment: A Philosophical Investigation


18. SkillFactory: Self-Distillation For Learning Cognitive Behaviors


19. Fare Comparison App of Uber, Ola and Rapido


20. Polarization by Design: How Elites Could Shape Mass Preferences as AI Reduces Persuasion Costs


21. MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking


22. Fast & Efficient Normalizing Flows and Applications of Image Generative Models


23. Jina-VLM: Small Multilingual Vision Language Model


24. Large Language Models for Limited Noisy Data: A Gravitational Wave Identification Study


25. PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation


26. TARA Test-by-Adaptive-Ranks for Quantum Anomaly Detection with Conformal Prediction Guarantees


27. On the Temporality for Sketch Representation Learning


28. Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding


29. Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation


30. DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation


31. BlurDM: A Blur Diffusion Model for Image Deblurring


32. Sponsored Questions and How to Auction Them


33. Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning


34. A Theoretical Framework for Auxiliary-Loss-Free Load Balancing of Sparse Mixture-of-Experts in Large-Scale AI Models


35. Hierarchical Vision Language Action Model Using Success and Failure Demonstrations


36. Autonomous Reinforcement Learning Robot Control with Intel’s Loihi 2 Neuromorphic Hardware


37. BERnaT: Basque Encoders for Representing Natural Textual Diversity


38. Hyperdimensional Computing for Sustainable Manufacturing: An Initial Assessment


39. Scalable Decision Focused Learning via Online Trainable Surrogates


40. PULSE: A Unified Multi-Task Architecture for Cardiac Segmentation, Diagnosis, and Few-Shot Cross-Modality Clinical Adaptation


41. DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training


42. MPCFormer: A physics-informed data-driven approach for explainable socially-aware autonomous driving


43. AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition


44. Bayesian Optimization for Automatic Tuning of Torque-Level Nonlinear Model Predictive Control


45. In-Context Representation Hijacking


46. Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective


47. Research on Brain Tumor Classification Method Based on Improved ResNet34 Network


48. Out-of-the-box: Black-box Causal Attacks on Object Detectors


49. AI/ML in 3GPP 5G Advanced - Services and Architecture


50. Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMs


51. Over-the-Air Federated Learning: Rethinking Edge AI Through Signal Processing


52. Matrix Editing Meets Fair Clustering: Parameterized Algorithms and Complexity


53. Quantum Topological Graph Neural Networks for Detecting Complex Fraud Patterns


54. ToG-Bench: Task-Oriented Spatio-Temporal Grounding in Egocentric Videos


55. Dynamically Scaled Activation Steering


56. MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms


57. AlignCheck: a Semantic Open-Domain Metric for Factual Consistency Assessment


58. The promising potential of vision language models for the generation of textual weather forecasts


59. SELF: A Robust Singular Value and Eigenvalue Approach for LLM Fingerprinting


60. KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing


61. Fine-grained Narrative Classification in Biased News Articles


62. When, How Long and How Much? Interpretable Neural Networks for Time Series Regression by Learning to Mask and Aggregate


63. Machine Learning to Predict Slot Usage in TSCH Wireless Sensor Networks


64. State Space Models for Bioacoustics: A comparative Evaluation with Transformers


65. Dynamic Content Moderation in Livestreams: Combining Supervised Classification with MLLM-Boosted Similarity Matching


66. A Learning-based Control Methodology for Transitioning VTOL UAVs


67. V-ITI: Mitigating Hallucinations in Multimodal Large Language Models via Visual Inference-Time Intervention


68. CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation


69. Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation


70. M3DR: Towards Universal Multilingual Multimodal Document Retrieval


71. Physics-Driven Learning Framework for Tomographic Tactile Sensing


72. NAS-LoRA: Empowering Parameter-Efficient Fine-Tuning for Visual Foundation Models with Searchable Adaptation


73. Cell-cell communication inference and analysis: biological mechanisms, computational approaches, and future opportunities


74. ATHENA: Agentic Team for Hierarchical Evolutionary Numerical Algorithms


75. AsymPuzl: An Asymmetric Puzzle for multi-agent cooperation


76. Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models


77. Learning From Limited Data and Feedback for Cell Culture Process Monitoring: A Comparative Study


78. Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles


79. GalaxyDiT: Efficient Video Generation with Guidance Alignment and Adaptive Proxy in Diffusion Transformers


80. Multi-Aspect Knowledge-Enhanced Medical Vision-Language Pretraining with Multi-Agent Data Generation


81. World Models for Autonomous Navigation of Terrestrial Robots from LIDAR Observations


82. BookRAG: A Hierarchical Structure-aware Index-based Approach for Retrieval-Augmented Generation on Complex Documents


83. Better World Models Can Lead to Better Post-Training Performance


84. VS-Graph: Scalable and Efficient Graph Classification Using Hyperdimensional Computing


85. UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs


86. FireSentry: A Multi-Modal Spatio-temporal Benchmark Dataset for Fine-Grained Wildfire Spread Forecasting


87. HalluGen: Synthesizing Realistic and Controllable Hallucinations for Evaluating Image Restoration


88. Idea-Gated Transformers: Enforcing Semantic Coherence via Differentiable Vocabulary Pruning


89. ProtoEFNet: Dynamic Prototype Learning for Inherently Interpretable Ejection Fraction Estimation in Echocardiography


90. Single-Round Scalable Analytic Federated Learning


91. Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs


92. NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction


93. Retrofitting Earth System Models with Cadence-Limited Neural Operator Updates


94. Robust Tabular Foundation Models


95. HydroDCM: Hydrological Domain-Conditioned Modulation for Cross-Reservoir Inflow Prediction


96. Adaptive Regime-Switching Forecasts with Distribution-Free Uncertainty: Deep Switching State-Space Models Meet Conformal Prediction


97. BlendedNet++: A Large-Scale Blended Wing Body Aerodynamics Dataset and Benchmark


98. Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases


99. PyroFocus: A Deep Learning Approach to Real-Time Wildfire Detection in Multispectral Remote Sensing Imagery


100. Learning Network Sheaves for AI-native Semantic Communication


101. SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning


102. How to DP-fy Your Data: A Practical Guide to Generating Synthetic Data With Differential Privacy


103. InvertiTune: High-Quality Data Synthesis for Cost-Effective Single-Shot Text-to-Knowledge Graph Generation


104. Ultra-Strong Gradient Diffusion MRI with Self-Supervised Learning for Prostate Cancer Characterization


105. Plantain: Plan-Answer Interleaved Reasoning


106. Culture Affordance Atlas: Reconciling Object Diversity Through Functional Mapping


107. Atomic Diffusion Models for Small Molecule Structure Elucidation from NMR Spectra


108. Mitigating Intra- and Inter-modal Forgetting in Continual Learning of Unified Multimodal Models


109. Lost in Modality: Evaluating the Effectiveness of Text-Based Membership Inference Attacks on Large Multimodal Models


110. Beyond Additivity: Sparse Isotonic Shapley Regression toward Nonlinear Explainability


111. PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer


112. The BEAT-CF Causal Model: A model for guiding the design of trials and observational analyses of cystic fibrosis exacerbations


113. E-valuator: Reliable Agent Verifiers with Sequential Hypothesis Testing


114. Public Sentiment Analysis of Traffic Management Policies in Knoxville: A Social Media Driven Study


115. Dynamic Correction of Erroneous State Estimates via Diffusion Bayesian Exploration


116. ALARM: Automated MLLM-Based Anomaly Detection in Complex-EnviRonment Monitoring with Uncertainty Quantification


117. Ensemble Privacy Defense for Knowledge-Intensive LLMs against Membership Inference Attacks


118. QGShap: Quantum Acceleration for Faithful GNN Explanations


119. Community Quality and Influence Maximization: An Empirical Study


120. Password-Activated Shutdown Protocols for Misaligned Frontier Agents


121. When Harmful Content Gets Camouflaged: Unveiling Perception Failure of LVLMs with CamHarmTI


122. Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation


123. Alleviating Choice Supportive Bias in LLM with Reasoning Dependency Generation


124. AtomDisc: An Atom-level Tokenizer that Boosts Molecular LLMs and Reveals Structure–Property Associations


125. Irresponsible AI: big tech’s influence on AI research and associated impacts


126. Will Power Return to the Clouds? From Divine Authority to GenAI Authority


127. Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem


128. PretopoMD: Pretopology-based Mixed Data Hierarchical Clustering


129. Mixed Data Clustering Survey and Challenges


130. Hierarchical clustering of complex energy systems using pretopology


131. Echoes of AI Harms: A Human-LLM Synergistic Framework for Bias-Driven Harm Anticipation


132. Quantifying the Potential to Escape Filter Bubbles: A Behavior-Aware Measure via Contrastive Simulation


133. Optimizing Life Sciences Agents in Real-Time using Reinforcement Learning


134. A note on the impossibility of conditional PAC-efficient reasoning in large language models


135. Delta Sampling: Data-Free Knowledge Transfer Across Diffusion Models


136. Physics-informed self-supervised learning for predictive modeling of coronary artery digital twins


137. Energy-Efficient Federated Learning via Adaptive Encoder Freezing for MRI-to-CT Conversion: A Green AI-Guided Research


138. Mitigating hallucinations and omissions in LLMs for invertible problems: An application to hardware logic design automation


139. Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models


140. AI-Driven Document Redaction in UK Public Authorities: Implementation Gaps, Regulatory Challenges, and the Human Oversight Imperative


141. Class conditional conformal prediction for multiple inputs by p-value aggregation