All AI Papers - 2025-12-11

1. Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs


2. EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce


3. Interpolation in Knowledge Representation


4. CARLoS: Retrieval via Concise Assessment Representation of LoRAs at Scale


5. A Practical Guide for Designing, Developing, and Deploying Production-Grade Agentic AI Workflows


6. Performance Comparison of Aerial RIS and STAR-RIS in 3D Wireless Environments


7. Towards Foundation Models with Native Multi-Agent Intelligence


8. Deconstructing the Dual Black Box: A Plug-and-Play Cognitive Framework for Human-AI Collaborative Enhancement and Its Implications for AI Governance


9. Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology


10. See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm


11. Protein Secondary Structure Prediction Using Transformers


12. CogMCTS: A Novel Cognitive-Guided Monte Carlo Tree Search Framework for Iterative Heuristic Evolution with Large Language Models


13. The SMART+ Framework for AI Systems


14. Principles2Plan: LLM-Guided System for Operationalising Ethical Principles into Plans


15. A Lightweight Transfer Learning-Based State-of-Health Monitoring with Application to Lithium-ion Batteries in Unmanned Air Vehicles


16. Autonomous Issue Resolver: Towards Zero-Touch Code Maintenance


17. Using reinforcement learning to probe the role of feedback in skill acquisition


18. From Accuracy to Impact: The Impact-Driven AI Framework (IDAIF) for Aligning Engineering Architecture with Theory of Change


19. Prismatic World Model: Learning Compositional Dynamics for Planning in Hybrid Systems


20. DeepFeature: Iterative Context-aware Feature Generation for Wearable Biosignals


21. Reflecting with Two Voices: A Co-Adaptive Dual-Strategy Framework for LLM-Based Agent Decision Making


22. The High Cost of Incivility: Quantifying Interaction Inefficiency via Multi-Agent Monte Carlo Simulations


23. Enhancing Explainability of Graph Neural Networks Through Conceptual and Structural Analyses and Their Extensions


24. Soil Compaction Parameters Prediction Based on Automated Machine Learning Approach


25. Predicting California Bearing Ratio with Ensemble and Neural Network Models: A Case Study from Türkiye


26. rSIM: Incentivizing Reasoning Capabilities of LLMs via Reinforced Strategy Injection


27. Towards a Science of Scaling Agent Systems


28. AgentEval: Generative Agents as Reliable Proxies for Human Evaluation of AI-Generated Content


29. Reasoning Models Ace the CFA Exams


30. Beyond Traditional Diagnostics: Transforming Patient-Side Information into Predictive Insights with Knowledge Graphs and Prototypes


31. Empowerment Gain and Causal Model Construction: Children and adults are sensitive to controllability and variability in their causal interventions


32. Scalable Back-End for an AI-Based Diabetes Prediction Application


33. Large Language Models for Education and Research: An Empirical and User Survey-based Analysis


34. Toward an AI Reasoning-Enabled System for Patient-Clinical Trial Matching


35. SkipKV: Selective Skipping of KV Generation and Storage for Efficient Inference with Large Reasoning Models


36. Can AI autonomously build, operate, and use the entire data stack?


37. Impact of Data-Oriented and Object-Oriented Design on Performance and Cache Utilization with Artificial Intelligence Algorithms in Multi-Threaded CPUs


38. Astra: General Interactive World Model with Autoregressive Denoising


39. SAQ: Stabilizer-Aware Quantum Error Correction Decoder


40. Revisiting the Scaling Properties of Downstream Metrics in Large Language Model Training


41. Toward Faithful Retrieval-Augmented Generation with Sparse Autoencoders


42. No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers


43. DAO-GP Drift Aware Online Non-Linear Regression Gaussian-Process


44. When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation


45. Siamese-Driven Optimization for Low-Resolution Image Latent Embedding in Image Captioning


46. Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents


47. Differentially Private Synthetic Data Generation Using Context-Aware GANs


48. InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models


49. Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning


50. Do Depth-Grown Models Overcome the Curse of Depth? An In-Depth Analysis


51. Emovectors: assessing emotional content in jazz improvisations for creativity evaluation


52. Multicalibration for LLM-based Code Generation


53. PrivTune: Efficient and Privacy-Preserving Fine-Tuning of Large Language Models via Device-Cloud Collaboration


54. Democratizing ML for Enterprise Security: A Self-Sustained Attack Detection Framework


55. Can TabPFN Compete with GNNs for Node Classification via Graph Tabularization?


56. MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance


57. A Systematic Evaluation of Preference Aggregation in Federated RLHF for Pluralistic Alignment of LLMs


58. Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages


59. Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps


60. Data-Driven Dynamic Parameter Learning of manipulator robots


61. Mitigating Individual Skin Tone Bias in Skin Lesion Classification through Distribution-Aware Reweighting


62. Multi-domain performance analysis with scores tailored to user preferences


63. Automatic Essay Scoring and Feedback Generation in Basque Language Learning


64. Reusability in MLOps: Leveraging Ports and Adapters to Build a Microservices Architecture for the Maritime Domain


65. Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning


66. Decoupling Template Bias in CLIP: Harnessing Empty Prompts for Enhanced Few-Shot Learning


67. Examining Student Interactions with a Pedagogical AI-Assistant for Essay Writing and their Impact on Students Writing Quality


68. Mind to Hand: Purposeful Robotic Control via Embodied Reasoning


69. Disturbance-Free Surgical Video Generation from Multi-Camera Shadowless Lamps for Open Surgery


70. A Hybrid Model for Stock Market Forecasting: Integrating News Sentiment and Time Series Data with Graph Neural Networks


71. Bridging Scale Discrepancies in Robotic Control via Language-Based Action Representations


72. Curriculum Guided Massive Multi Agent System Solving For Robust Long Horizon Tasks


73. A Novel Wasserstein Quaternion Generative Adversarial Network for Color Image Generation


74. SensHRPS: Sensing Comfortable Human-Robot Proxemics and Personal Space With Eye-Tracking


75. Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models


76. Developing Distance-Aware Uncertainty Quantification Methods in Physics-Guided Neural Networks for Reliable Bearing Health Prediction


77. LLM-based Vulnerable Code Augmentation: Generate or Refactor?


78. Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform


79. ContextDrag: Precise Drag-Based Image Editing via Context-Preserving Token Injection and Position-Consistent Attention


80. Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models III: Implementing the Bacterial Biothreat Benchmark (B3) Dataset


81. Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models II: Benchmark Generation Process


82. Are generative AI text annotations systematically biased?


83. Conditional Morphogenesis: Emergent Generation of Structural Digits via Neural Cellular Automata


84. Robust Finetuning of Vision-Language-Action Robot Policies via Parameter Merging


85. Interpreting Structured Perturbations in Image Protection Methods for Diffusion Models


86. Argus: A Multi-Agent Sensitive Information Leakage Detection Framework Based on Hierarchical Reference Relationships


87. GeoDM: Geometry-aware Distribution Matching for Dataset Distillation


88. Terrain Diffusion: A Diffusion-Based Successor to Perlin Noise in Infinite, Real-Time Terrain Generation


89. Systematization of Knowledge: Security and Safety in the Model Context Protocol Ecosystem


90. Empowering smart app development with SolidGPT: an edge-cloud hybrid AI agent framework


91. Model-Based Diffusion Sampling for Predictive Control in Offline Decision Making


92. Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection


93. Residual-SwinCA-Net: A Channel-Aware Integrated Residual CNN-Swin Transformer for Malignant Lesion Segmentation in BUSI


94. HybridToken-VLM: Hybrid Token Compression for Vision-Language Models


95. SpeechQualityLLM: LLM-Based Multimodal Assessment of Speech Quality


96. MM-CoT: A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models


97. PR-CapsNet: Pseudo-Riemannian Capsule Network with Adaptive Curvature Routing for Graph Learning


98. ClinicalTrialsHub: Bridging Registries and Literature for Comprehensive Clinical Trial Access


99. Embodied Tree of Thoughts: Deliberate Manipulation Planning with Embodied World Model


100. A Practical Framework for Evaluating Medical AI Security: Reproducible Assessment of Jailbreaking and Privacy Vulnerabilities Across Clinical Specialties


101. Information-Dense Reasoning for Efficient and Auditable Security Alert Triage


102. LayerPipe2: Multistage Pipelining and Weight Recompute via Improved Exponential Moving Average for Training Neural Networks


103. TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models


104. Chat with UAV – Human-UAV Interaction Based on Large Language Models


105. Biothreat Benchmark Generation Framework for Evaluating Frontier AI Models I: The Task-Query Architecture


106. Long-only cryptocurrency portfolio management by ranking the assets: a neural network approach


107. Balanced Accuracy: The Right Metric for Evaluating LLM Judges - Explained through Youden’s J statistic


108. Scalable Offline Model-Based RL with Action Chunks


109. Training LLMs for Honesty via Confessions


110. Short-Context Dominance: How Much Local Context Natural Language Actually Needs?


111. Joint Activity Design Heuristics for Enhancing Human-Machine Collaboration


112. FRIEDA: Benchmarking Multi-Step Cartographic Reasoning in Vision-Language Models


113. A Gray Literature Study on Fairness Requirements in AI-enabled Software Engineering


114. Restrictive Hierarchical Semantic Segmentation for Stratified Tooth Layer Detection


115. An Empirical Framework for Evaluating Semantic Preservation Using Hugging Face


116. Near-real time fires detection using satellite imagery in Sudan conflict


117. DeepCode: Open Agentic Coding


118. CFD-copilot: leveraging domain-adapted large language model and model context protocol to enhance simulation automation


119. Harmonizing Community Science Datasets to Model Highly Pathogenic Avian Influenza (HPAI) in Birds in the Subantarctic


120. The Theory of Strategic Evolution: Games with Endogenous Players and Strategic Replicators


121. MARINE: Theoretical Optimization and Design for Multi-Agent Recursive IN-context Enhancement


122. Functional Random Forest with Adaptive Cost-Sensitive Splitting for Imbalanced Functional Data Classification


123. ByteStorm: a multi-step data-driven approach for Tropical Cyclones detection and tracking


124. GSPN-2: Efficient Parallel Sequence Modeling


125. Referenceless Proton Resonance Frequency Thermometry Using Deep Learning with Self-Attention


126. Artificial Intelligence-Driven Network-on-Chip Design Space Exploration: Neural Network Architectures for Design


127. Advancing physiological time series reconstruction and imputation via mixture of receptive fields and experts fusion


128. Quantum Circuit Reasoning Models: A Variational Framework for Differentiable Logical Inference


129. Manifolds and Modules: How Function Develops in a Neural Foundation Model


130. Bayesian Optimization for Function-Valued Responses under Min-Max Criteria


131. LLM-Generated Counterfactual Stress Scenarios for Portfolio Risk Simulation via Hybrid Prompt-RAG Pipeline


132. Command & Control (C2) Traffic Detection Via Algorithm Generated Domain (DGA) Classification Using Deep Learning And Natural Language Processing


133. GPU Memory Prediction for Multimodal Model Training


134. SABER: Small Actions, Big Errors - Safeguarding Mutating Steps in LLM Agents


135. MixLM: High-Throughput and Effective LLM Ranking via Text-Embedding Mix-Interaction


136. AudioScene: Integrating Object-Event Audio into 3D Scenes


137. Space Alignment Matters: The Missing Piece for Inducing Neural Collapse in Long-Tailed Learning


138. ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models


139. Automating High Energy Physics Data Analysis with LLM-Powered Agents


140. MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs