전체 AI 논문 - 2025-12-26

1. RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic


2. A Real-World Evaluation of LLM Medication Safety Reviews in NHS Primary Care


3. Beyond Context: Large Language Models Failure to Grasp Users Intent


4. LLM Personas as a Substitute for Field Experiments in Method Benchmarking


5. Agentic Explainable Artificial Intelligence (Agentic XAI) Approach To Explore Better Explanation


6. TrafficSimAgent: A Hierarchical Agent Framework for Autonomous Traffic Simulation with MCP Control


7. FinAgent: An Agentic AI Framework Integrating Personal Finance and Nutrition Planning


8. A Blockchain-Monitored Agentic AI Architecture for Trusted Perception-Reasoning-Action Pipelines


9. The Silent Scholar Problem: A Probabilistic Framework for Breaking Epistemic Asymmetry in LLM Agents


10. MAR:Multi-Agent Reflexion Improves Reasoning Abilities in LLMs


11. Context-Sensitive Abstractions for Reinforcement Learning with Parameterized Actions


12. Safety Alignment of LMs via Non-cooperative Games


13. A Benchmark for Evaluating Outcome-Driven Constraint Violations in Autonomous AI Agents


14. AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent


15. From artificial to organic: Rethinking the roots of intelligence for digital health


16. From Pilots to Practices: A Scoping Review of GenAI-Enabled Personalization in Computer Science Education


17. Bridging the AI Trustworthiness Gap between Functions and Norms


18. Eidoku: A Neuro-Symbolic Verification Gate for LLM Reasoning via Structural Constraint Satisfaction


19. Quantifying Laziness, Decoding Suboptimality, and Context Degradation in Large Language Models


20. From Fake Focus to Real Precision: Confusion-Driven Adversarial Attention Learning in Transformers


21. AI-Driven Decision-Making System for Hiring Process


22. Memory Bear AI A Breakthrough from Memory to Cognition Toward Artificial General Intelligence


23. Mixture of Attention Schemes (MoAS): Learning to Route Between MHA, GQA, and MQA


24. AIAuditTrack: A Framework for AI Security system


25. Reasoning Relay: Evaluating Stability and Interchangeability of Large Language Models in Mathematical Reasoning


26. Erkang-Diagnosis-1.1 Technical Report


27. MicroProbe: Efficient Reliability Assessment for Foundation Models with Minimal Data


28. Proceedings of the 20th International Conference on Knowledge, Information and Creativity Support Systems (KICSS 2025)


29. MegaRAG: Multimodal Knowledge Graph-Based Retrieval Augmented Generation


30. Quantum-Inspired Multi Agent Reinforcement Learning for Exploration Exploitation Optimization in UAV-Assisted 6G Network Deployment


31. BitRL-Light: 1-bit LLM Agents with Deep Reinforcement Learning for Energy-Efficient Smart Home Lighting Optimization


32. Optimizing Decoding Paths in Masked Diffusion Models by Quantifying Uncertainty


33. C2LLM Technical Report: A New Frontier in Code Retrieval via Adaptive Cross-Attention Pooling


34. Measuring all the noises of LLM Evals


35. Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Consulting, Data Analyst, and Management Tasks


36. Model Merging via Multi-Teacher Knowledge Distillation


37. SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance


38. Learning Factors in AI-Augmented Education: A Comparative Study of Middle and High School Students


39. LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation


40. Improving the Convergence Rate of Ray Search Optimization for Query-Efficient Hard-Label Attacks


41. Casting a SPELL: Sentence Pairing Exploration for LLM Limitation-breaking


42. PhononBench:A Large-Scale Phonon-Based Benchmark for Dynamical Stability in Crystal Generation


43. Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval


44. SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation


45. Schrödinger’s Navigator: Imagining an Ensemble of Futures for Zero-Shot Object Navigation


46. BALLAST: Bandit-Assisted Learning for Latency-Aware Stable Timeouts in Raft


47. MODE: Multi-Objective Adaptive Coreset Selection


48. TGC-Net: A Structure-Aware and Semantically-Aligned Framework for Text-Guided Medical Image Segmentation


49. AutoBaxBuilder: Bootstrapping Code Security Benchmarking


50. STLDM: Spatio-Temporal Latent Diffusion Model for Precipitation Nowcasting


51. Semi-Supervised Learning for Large Language Models Safety and Content Moderation


52. Semantic Refinement with LLMs for Graph Representations


53. TexAvatars : Hybrid Texel-3D Representations for Stable Rigging of Photorealistic Gaussian Head Avatars


54. Understanding Scaling Laws in Deep Neural Networks via Feature Learning Dynamics


55. DexAvatar: 3D Sign Language Reconstruction with Hand and Body Pose Priors


56. Policy-Conditioned Policies for Multi-Agent Task Solving


57. Rethinking Supervised Fine-Tuning: Emphasizing Key Answer Tokens for Improved LLM Accuracy


58. LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics


59. Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation


60. Automatic Replication of LLM Mistakes in Medical Conversations


61. GenTSE: Enhancing Target Speaker Extraction via a Coarse-to-Fine Generative Language Model


62. Generalised Linear Models in Deep Bayesian RL with Learnable Basis Functions


63. Mesh-Attention: A New Communication-Efficient Distributed Attention with Improved Data Locality


64. Can Agentic AI Match the Performance of Human Data Scientists?


65. ReACT-Drug: Reaction-Template Guided Reinforcement Learning for de novo Drug Design


66. One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents


67. Reflection Pretraining Enables Token-Level Self-Correction in Biological Sequence Models


68. MultiMind at SemEval-2025 Task 7: Crosslingual Fact-Checked Claim Retrieval via Multi-Source Alignment


69. Neural Probe-Based Hallucination Detection for Large Language Models


70. A Multi-fidelity Double-Delta Wing Dataset and Empirical Scaling Laws for GNN-based Aerodynamic Field Surrogate


71. Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning


72. Guardrailed Elasticity Pricing: A Churn-Aware Forecasting Playbook for Subscription Strategy


73. RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks


74. DiEC: Diffusion Embedded Clustering


75. Embodied AI-Enhanced IoMT Edge Computing: UAV Trajectory Optimization and Task Offloading with Mobility Prediction


76. DGSAN: Dual-Graph Spatiotemporal Attention Network for Pulmonary Nodule Malignancy Prediction


77. Lightweight framework for underground pipeline recognition and spatial localization based on multi-view 2D GPR images


78. Memory-Efficient Acceleration of Block Low-Rank Foundation Models on Resource Constrained GPUs


79. NVIDIA Nemotron 3: Efficient and Open Intelligence


80. Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning


81. NotSoTiny: A Large, Living Benchmark for RTL Code Generation


82. MediEval: A Unified Medical Benchmark for Patient-Contextual and Knowledge-Grounded Reasoning in LLMs


83. X-GridAgent: An LLM-Powered Agentic AI System for Assisting Power Grid Analysis


84. NULLBUS: Multimodal Mixed-Supervision for Breast Ultrasound Segmentation via Nullable Global-Local Prompts


85. Towards Optimal Performance and Action Consistency Guarantees in Dec-POMDPs with Inconsistent Beliefs and Limited Communication


86. TS-Arena Technical Report – A Pre-registered Live Forecasting Platform


87. Generalization of RLVR Using Causal Reasoning as a Testbed


88. Bridging Efficiency and Safety: Formal Verification of Neural Networks with Early Exits


89. Stabilizing Multimodal Autoencoders: A Theoretical and Empirical Analysis of Fusion Strategies


90. A Physics Informed Neural Network For Deriving MHD State Vectors From Global Active Regions Observations


91. AI-Driven Green Cognitive Radio Networks for Sustainable 6G Communication


92. FEM-Bench: A Structured Scientific Reasoning Benchmark for Evaluating Code-Generating LLMs


93. SA-DiffuSeq: Addressing Computational and Scalability Challenges in Long-Document Generation with Sparse Attention


94. Mechanism-Based Intelligence (MBI): Differentiable Incentives for Rational Coordination and Guaranteed Alignment in Multi-Agent Systems


95. PHOTON: Hierarchical Autoregressive Modeling for Lightspeed and Memory-Efficient Language Generation


96. Revisiting the Learning Objectives of Vision-Language Reward Models


97. HyDRA: Hierarchical and Dynamic Rank Adaptation for Mobile Vision Language Model


98. Disentangling Fact from Sentiment: A Dynamic Conflict-Consensus Framework for Multimodal Fake News Detection


99. Improving Cardiac Risk Prediction Using Data Generation Techniques


100. Forward Only Learning for Orthogonal Neural Networks of any Depth


101. Dominating vs. Dominated: Generative Collapse in Diffusion Models


102. Managing the Stochastic: Foundations of Learning in Neuro-Symbolic Systems for Software Engineering


103. MaskOpt: A Large-Scale Mask Optimization Dataset to Advance AI in Integrated Circuit Manufacturing


104. Forecasting N-Body Dynamics: A Comparative Study of Neural Ordinary Differential Equations and Universal Differential Equations


105. Uncovering Competency Gaps in Large Language Models and Their Benchmarks


106. Data-Free Pruning of Self-Attention Layers in LLMs


107. Real Time Detection and Quantitative Analysis of Spurious Forgetting in Continual Learning


108. Enhancing Lung Cancer Treatment Outcome Prediction through Semantic Feature Engineering Using Large Language Models


109. Zero-Training Temporal Drift Detection for Transformer Sentiment Models: A Comprehensive Analysis on Authentic Social Media Streams


110. Learning Evolving Latent Strategies for Multi-Agent Language Systems without Model Fine-Tuning


111. Efficient Asynchronous Federated Evaluation with Strategy Similarity Awareness for Intent-Based Networking in Industrial Internet of Things


112. Parameter-Efficient Neural CDEs via Implicit Function Jacobians


113. Cooperation Through Indirect Reciprocity in Child-Robot Interactions


114. Inspection Planning Primitives with Implicit Models