All AI Papers - 2025-12-17

1. MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph


2. Defending the Hierarchical Result Models of Precedential Constraint


3. neuralFOMO: Can LLMs Handle Being Second Best? Measuring Envy-Like Preferences in Multi-Agent Settings


4. Differentiable Evolutionary Reinforcement Learning


5. Behavior and Representation in Large Language Models for Combinatorial Optimization: From Feature Extraction to Algorithm Selection


6. Error-Driven Prompt Optimization for Arithmetic Reasoning


7. MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data


8. Reflective Preference Optimization (RPO): Enhancing On-Policy Alignment via Hint-Guided Reflection


9. Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows


10. SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning


11. MAC: A Multi-Agent Framework for Interactive User Clarification in Multi-turn Conversations


12. Can AI Understand What We Cannot Say? Measuring Multilevel Alignment Through Abortion Stigma Across Cognitive, Interpersonal, and Structural Levels


13. Towards Unified Co-Speech Gesture Generation via Hierarchical Implicit Periodicity Learning


14. Socratic Students: Teaching Language Models to Learn by Asking Questions


15. M-GRPO: Stabilizing Self-Supervised Reinforcement Learning for Large Language Models with Momentum-Anchored Policy Optimization


16. Towards Open Standards for Systemic Complexity in Digital Forensics


17. Satisfiability Modulo Theory Meets Inductive Logic Programming


18. Forgetful but Faithful: A Cognitive Memory Architecture and Benchmark for Privacy-Aware Generative Agents


19. Fault-Tolerant Sandboxing for AI Coding Agents: A Transactional Approach to Safe Autonomous Execution


20. Causal Counterfactuals Reconsidered


21. Personalized QoE Prediction: A Demographic-Augmented Machine Learning Framework for 5G Video Streaming Networks


22. Synergizing Code Coverage and Gameplay Intent: Coverage-Aware Game Playtesting with LLM-Guided Reinforcement Learning


23. WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment


24. Memoria: A Scalable Agentic Memory Framework for Personalized Conversational AI


25. Value-Aware Multiagent Systems


26. Modular and Multi-Path-Aware Offline Benchmarking for Mobile GUI Agents


27. AgentSHAP: Interpreting LLM Agent Tool Importance with Monte Carlo Shapley Value Estimation


28. Large Language Newsvendor: Decision Biases and Cognitive Mechanisms


29. World Models Unlock Optimal Foraging Strategies in Reinforcement Learning Agents


30. KidsArtBench: Multi-Dimensional Children’s Art Evaluation with Attribute-Aware MLLMs


31. SafeGen: Embedding Ethical Safeguards in Text-to-Image Generation


32. MetaHGNIE: Meta-Path Induced Hypergraph Contrastive Learning in Heterogeneous Knowledge Graphs


33. AI Transparency Atlas: Framework, Scoring, and Real-Time Model Card Evaluation Pipeline


34. Understanding Critical Thinking in Generative Artificial Intelligence Use: Development, Validation, and Correlates of the Critical Thinking in AI Use Scale


35. Feeling the Strength but Not the Source: Partial Introspection in LLMs


36. Entropy Collapse: A Universal Failure Mode of Intelligent Systems


37. Quantum-Aware Generative AI for Materials Discovery: A Framework for Robust Exploration Beyond DFT Biases


38. A Multi-Axial Mindset for Ontology Design Lessons from Wikidata’s Polyhierarchical Structure


39. A Geometric Theory of Cognition


40. TA-KAND: Two-stage Attention Triple Enhancement and U-KAN based Diffusion For Few-shot Knowledge Graph Completion


41. Floorplan2Guide: LLM-Guided Floorplan Parsing for BLV Indoor Navigation


42. Rethinking Label Consistency of In-Context Learning: An Implicit Transductive Label Propagation Perspective


43. Reliable Policy Iteration: Performance Robustness Across Architecture and Environment Perturbations


44. The Forecast Critic: Leveraging Large Language Models for Poor Forecast Identification


45. Context-Aware Agentic Power Resources Optimisation in EV using Smart2ChargeApp


46. Log Anomaly Detection with Large Language Models via Knowledge-Enriched Fusion


47. Hypergame Rationalisability: Solving Agent Misalignment In Strategic Play


48. AGAPI-Agents: An Open-Access Agentic AI Platform for Accelerated Materials Design on AtomGPT.org


49. CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving


50. Robustness of Probabilistic Models to Low-Quality Data: A Multi-Perspective Analysis


51. Causal Strengths and Leaky Beliefs: Interpreting LLM Reasoning via Noisy-OR Causal Bayes Nets


52. Structured Personalization: Modeling Constraints as Matroids for Data-Minimal LLM Agents


53. Mirror Mode in Fire Emblem: Beating Players at their own Game with Imitation and Reinforcement Learning


54. Solving Parallel Machine Scheduling With Precedences and Cumulative Resource Constraints With Calendars


55. A Monad-Based Clause Architecture for Artificial Age Score (AAS) in Large Language Models


56. DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders


57. Feedforward 3D Editing via Text-Steerable Image-to-3D


58. Embedding-Based Rankings of Educational Resources based on Learning Outcome Alignment: Benchmarking, Expert Validation, and Learner Performance


59. Large-Language Memorization During the Classification of United States Supreme Court Cases


60. World Models Can Leverage Human Videos for Dexterous Manipulation


61. From Code to Field: Evaluating the Robustness of Convolutional Neural Networks for Disease Diagnosis in Mango Leaves


62. Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models


63. DA-SSL: self-supervised domain adaptor to leverage foundational models in turbt histopathology slides


64. ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding


65. DP-CSGP: Differentially Private Stochastic Gradient Push with Compressed Communication


66. Superposition as Lossy Compression: Measure with Sparse Autoencoders and Connect to Adversarial Vulnerability


67. Memory in the Age of AI Agents


68. Verifying Rumors via Stance-Aware Structural Modeling


69. Behavior-Aware and Generalizable Defense Against Black-Box Adversarial Attacks for ML-Based IDS


70. SkipCat: Rank-Maximized Low-Rank Compression of Large Language Models via Shared Projection and Block Skipping


71. Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models


72. SSAS: Cross-subject EEG-based Emotion Recognition through Source Selection with Adversarial Strategy


73. From User Interface to Agent Interface: Efficiency Optimization of UI Representations for LLM Agents


74. End2Reg: Learning Task-Specific Segmentation for Markerless Registration in Spine Surgery


75. Detecting Emotion Drift in Mental Health Text Using Pre-Trained Transformers


76. Control of a Twin Rotor using Twin Delayed Deep Deterministic Policy Gradient (TD3)


77. FIN-bench-v2: A Unified and Robust Benchmark Suite for Evaluating Finnish Large Language Models


78. Security and Detectability Analysis of Unicode Text Watermarking Methods Against Large Language Models


79. Face Identity Unlearning for Retrieval via Embedding Dispersion


80. ALIGN-FL: Architecture-independent Learning through Invariant Generative component sharing in Federated Learning


81. No One Left Behind: How to Exploit the Incomplete and Skewed Multi-Label Data for Conversion Rate Prediction


82. MiniLingua: A Small Open-Source LLM for European Languages


83. Intrinsic-Motivation Multi-Robot Social Formation Navigation with Coordinated Exploration


84. LINA: Learning INterventions Adaptively for Physical Alignment and Generalization in Diffusion Models


85. CORE: Contrastive Masked Feature Reconstruction on Graphs


86. Efficient Adaptive Rejection Sampling for Accelerating Speculative Decoding in Large Language Models


87. WAY: Estimation of Vessel Destination in Worldwide AIS Trajectory


88. PolySet: Restoring the Statistical Ensemble Nature of Polymers for Machine Learning


89. Carrot, stick, or both? Price incentives for sustainable food choice in competitive environments


90. SACn: Soft Actor-Critic with n-step Returns


91. A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis


92. Intrinsic Image Fusion for Multi-View 3D Material Reconstruction


93. DePT3R: Joint Dense Point Tracking and 3D Reconstruction of Dynamic Scenes in a Single Forward Pass


94. From Overfitting to Reliability: Introducing the Hierarchical Approximate Bayesian Neural Network


95. Uncovering the Role of Initial Saliency in U-Shaped Attention Bias: Scaling Initial Token Weight for Enhanced Long-Text Processing


96. Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather


97. TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning


98. Harmonizing Generalization and Specialization: Uncertainty-Informed Collaborative Learning for Semi-supervised Medical Image Segmentation


99. OXE-AugE: A Large-Scale Robot Augmentation of OXE for Scaling Cross-Embodiment Policy Learning


100. Sequence of Expert: Boosting Imitation Planners for Autonomous Driving through Temporal Alternation


101. UniVCD: A New Method for Unsupervised Change Detection in the Open-Vocabulary Era


102. A Simple and Effective Framework for Symmetric Consistent Indexing in Large-Scale Dense Retrieval


103. LLM Rationalis? Measuring Bargaining Capabilities of AI Negotiators


104. GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training


105. Scaling Bidirectional Spans and Span Violations in Attention Mechanism


106. Calibrating Uncertainty for Zero-Shot Adversarial CLIP


107. Tackling Snow-Induced Challenges: Safe Autonomous Lane-Keeping with Robust Reinforcement Learning



109. Content Adaptive based Motion Alignment Framework for Learned Video Compression


110. Unified Interactive Multimodal Moment Retrieval via Cascaded Embedding-Reranking and Temporal-Aware Score Fusion


111. Investigating Data Pruning for Pretraining Biological Foundation Models at Scale


112. MADTempo: An Interactive System for Multi-Event Temporal Video Retrieval with Query Augmentation


113. Cisco Integrated AI Security and Safety Framework Report


114. CTIGuardian: A Few-Shot Framework for Mitigating Privacy Leakage in Fine-Tuned LLMs


115. Meta-GPT: Decoding the Metasurface Genome with Generative Artificial Intelligence


116. SignRAG: A Retrieval-Augmented System for Scalable Zero-Shot Road Sign Recognition


117. Optimal Labeler Assignment and Sampling for Active Learning in the Presence of Imperfect Labels


118. Counting Clues: A Lightweight Probabilistic Baseline Can Match an LLM


119. Information-Consistent Language Model Recommendations through Group Relative Policy Optimization


120. Selective Conformal Risk Control


121. SAGA: Open-World Mobile Manipulation via Structured Affordance Grounding


122. PRIVEE: Privacy-Preserving Vertical Federated Learning Against Feature Inference Attacks


123. Network Level Evaluation of Hangup Susceptibility of HRGCs using Deep Learning and Sensing Techniques: A Goal Towards Safer Future


124. Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners


125. Lemon: A Unified and Scalable 3D Multimodal Model for Universal Spatial Understanding


126. On the continuity of flows


127. Hindsight is 20/20: Building Agent Memory that Retains, Recalls, and Reflects


128. Decoding Human and AI Persuasion in National College Debate: Analyzing Prepared Arguments Through Aristotle’s Rhetorical Principles


129. Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, LLaMA


130. OPAL: Operator-Programmed Algorithms for Landscape-Aware Black-Box Optimization


131. From Small to Large: Generalization Bounds for Transformers on Variable-Size Inputs


132. A Disproof of Large Language Model Consciousness: The Necessity of Continual Learning for Consciousness


133. Liquid Reasoning Transformers: A Sudoku-Based Prototype for Chess-Scale Algorithmic Tasks


134. Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems


135. Unveiling Statistical Significance of Online Regression over Multiple Datasets


136. OLC-WA: Drift Aware Tuning-Free Online Classification with Weighted Average


137. State over Tokens: Characterizing the Role of Reasoning Tokens


138. Designing The Drive: Enhancing User Experience through Adaptive Interfaces in Autonomous Vehicles


139. Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models (ASTA)


140. CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence


141. Federated Learning with Feedback Alignment


142. Intelligent Scientific Literature Explorer using Machine Learning (ISLE)


143. Robust Motion Generation using Part-level Reliable Data from Videos


144. Co-Exploration and Co-Exploitation via Shared Structure in Multi-Task Bandits


145. Theoretical Foundations of Prompt Engineering: From Heuristics to Expressivity


146. Quantum Implicit Neural Representations for 3D Scene Reconstruction and Novel View Synthesis


147. Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches


148. Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling


149. DynaGen: Unifying Temporal Knowledge Graph Reasoning with Dynamic Subgraphs and Generative Regularization


150. PerNodeDrop: A Method Balancing Specialized Subnets and Regularization in Deep Neural Networks


151. Anatomy-Guided Representation Learning Using a Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images


152. DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Model


153. ORIBA: Exploring LLM-Driven Role-Play Chatbot as a Creativity Support Tool for Original Character Artists


154. Understanding Syllogistic Reasoning in LLMs from Formal and Natural Language Perspectives


155. Human-Inspired Learning for Large Language Models via Obvious Record and Maximum-Entropy Method Discovery


156. Content-Aware Ad Banner Layout Generation with Two-Stage Chain-of-Thought in Vision Language Models


157. Detecting Prompt Injection Attacks Against Application Using Classifiers


158. Coupled Variational Reinforcement Learning for Language Model General Reasoning


159. StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding


160. Skillful Subseasonal-to-Seasonal Forecasting of Extreme Events with a Multi-Sphere Coupled Probabilistic Model


161. Diverse LLMs vs. Vulnerabilities: Who Detects and Fixes Them Better?


162. Noise-robust Contrastive Learning for Critical Transition Detection in Dynamical Systems


163. Can You Keep a Secret? Exploring AI for Care Coordination in Cognitive Decline


164. Explainable Artificial Intelligence for Economic Time Series: A Comprehensive Review and a Systematic Taxonomy of Methods and Concepts


165. Explainable AI as a Double-Edged Sword in Dermatology: The Impact on Clinicians versus The Public


166. Mage: Cracking Elliptic Curve Cryptography with Cross-Axis Transformers


167. AI-Driven Real-Time Kick Classification in Olympic Taekwondo Using Sensor Fusion


168. Exploring the Design Space of Transition Matching


169. Dynamical modeling of nonlinear latent factors in multiscale neural activity with real-time inference


170. Cross-Modal Representational Knowledge Distillation for Enhanced Spike-Informed LFP Modeling


171. Rough Sets for Explainability of Spectral Graph Clustering


172. A Graph Attention Network-Based Framework for Reconstructing Missing LiDAR Beams


173. SCIR: A Self-Correcting Iterative Refinement Framework for Enhanced Information Extraction Based on Schema


174. Dynamic Homophily with Imperfect Recall: Modeling Resilience in Adversarial Networks


175. UniMark: Artificial Intelligence Generated Content Identification Toolkit


176. Fractional Differential Equation Physics-Informed Neural Network and Its Application in Battery State Estimation


177. V-Rex: Real-Time Streaming Video LLM Acceleration via Dynamic KV Cache Retrieval


178. GRC-Net: Gram Residual Co-attention Net for epilepsy prediction


179. Accurate de novo sequencing of the modified proteome with OmniNovo


180. Stochastic Volatility Modelling with LSTM Networks: A Hybrid Approach for S&P 500 Index Volatility Forecasting


181. Adversarially Probing Cross-Family Sound Symbolism in 27 Languages


182. Semantic Distance Measurement based on Multi-Kernel Gaussian Processes


183. Comparison of different segmentation algorithms on brain volume and fractal dimension in infant brain MRIs


184. Training Versatile Coding Agents in Synthetic Environments


185. Measuring What Matters: Scenario-Driven Evaluation for Trajectory Predictors in Autonomous Driving



187. ALERT Open Dataset and Input-Size-Agnostic Vision Transformer for Driver Activity Recognition using IR-UWB


188. Epistemoverse: Toward an AI-Driven Knowledge Metaverse for Intellectual Heritage Preservation


189. Thermal RGB Fusion for Micro-UAV Wildfire Perimeter Tracking with Minimal Comms



191. Extending the Context of Pretrained LLMs by Dropping Their Positional Embeddings


192. MeltwaterBench: Deep learning for spatiotemporal downscaling of surface meltwater


193. BaRISTA: Brain Scale Informed Spatiotemporal Representation of Human Intracranial Neural Activity


194. A Benchmark Dataset for Spatially Aligned Road Damage Assessment in Small Uncrewed Aerial Systems Disaster Imagery


195. MixtureKit: A General Framework for Composing, Training, and Visualizing Mixture-of-Experts Models


196. A neuro-symbolic framework for accountability in public-sector AI


197. Congestion Reduction in EV Charger Placement Using Traffic Equilibrium Models


198. Rethinking Jailbreak Detection of Large Vision Language Models with Representational Contrastive Scoring


199. The Instability of Safety: How Random Seeds and Temperature Expose Inconsistent LLM Refusal Behavior


200. Instruction-Tuning Open-Weight Language Models for BPMN Model Generation


201. AI as a Teaching Partner: Early Lessons from Classroom Codesign with Secondary Teachers


202. Semantic-Drive: Democratizing Long-Tail Data Curation via Open-Vocabulary Grounding and Neuro-Symbolic VLM Consensus


203. Hold Onto That Thought: Assessing KV Cache Compression On Reasoning


204. V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions


205. Evidence-Driven Decision Support for AI Model Selection in Research Software Engineering


206. Semantic search for 100M+ galaxy images using AI-generated captions


207. Designing The Internet of Agents: A Framework for Trustworthy, Transparent, and Collaborative Human-Agent Interaction (HAX)


208. Data-Driven Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations


209. A Review of Learning-Based Motion Planning: Toward a Data-Driven Optimal Control Approach


210. How AI Agents Follow the Herd of AI? Network Effects, History, and Machine Optimism


211. DynaPURLS: Dynamic Refinement of Part-aware Representations for Skeleton-based Zero-Shot Action Recognition


212. Unveiling User Perceptions in the Generative AI Era: A Sentiment-Driven Evaluation of AI Educational Apps’ Role in Digital Transformation of e-Teaching


213. The Agentic Regulator: Risks for AI in Finance and a Proposed Agent-based Framework for Governance


214. Mapping AI Risk Mitigations: Evidence Scan and Preliminary AI Risk Mitigation Taxonomy


215. Evolutionary Reinforcement Learning based AI tutor for Socratic Interdisciplinary Instruction


216. MONET – Virtual Cell Painting of Brightfield Images and Time Lapses Using Reference Consistent Diffusion


217. Gene regulatory network inference algorithm based on spectral signed directed graph convolution


218. FloraForge: LLM-Assisted Procedural Generation of Editable and Analysis-Ready 3D Plant Geometric Models For Agricultural Applications


219. Vibe Coding in Practice: Flow, Technical Debt, and Guidelines for Sustainable Use


220. Towards Accessible Physical AI: LoRA-Based Fine-Tuning of VLA Models for Real-World Robot Control


221. A fine-grained look at causal effects in causal spaces


222. Beyond Automation: Rethinking Work, Creativity, and Governance in the Age of Generative AI


223. Should AI Become an Intergenerational Civil Right?


224. Advancing Autonomous Driving System Testing: Demands, Challenges, and Future Directions


225. Aesthetic Alignment Risks Assimilation: How Image Generation and Reward Models Reinforce Beauty Bias and Ideological “Censorship”


226. An Experience Report on a Pedagogically Controlled, Curriculum-Constrained AI Tutor for SE Education


227. Understanding Structural Representation in Foundation Models for Polymers


228. It’s About Time: The Temporal and Modal Dynamics of Copilot Usage


229. WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving


230. Automated Plant Disease and Pest Detection System Using Hybrid Lightweight CNN-MobileViT Models for Diagnosis of Indigenous Crops


231. Using Socio-economic Indicators, Smart Transit Systems, and Urban Simulator to Accelerate ZEV Adoption and Reduce VMT


232. Industrial AI Robustness Card: Evaluating and Monitoring Time Series Models


233. On the Dangers of Bootstrapping Generation for Continual Learning and Beyond


234. Explainable Adversarial-Robust Vision-Language-Action Model for Robotic Manipulation


235. Expert Assessment: The Systemic Environmental Risks of Artificial Intelligence


236. Hierarchical Task Offloading and Trajectory Optimization in Low-Altitude Intelligent Networks Via Auction and Diffusion-based MARL


237. An Operator-Consistent Graph Neural Network for Learning Diffusion Dynamics on Irregular Meshes


238. Generative Stochastic Optimal Transport: Guided Harmonic Path-Integral Diffusion


239. Adaptive Path Integral Diffusion: AdaPID




242. Achieving Approximate Symmetry Is Exponentially Easier than Exact Symmetry


243. Rep Smarter, Not Harder: AI Hypertrophy Coaching with Wearable Sensors and Edge Neural Networks


244. Explainable AI for Smart Greenhouse Control: Interpretability of Temporal Fusion Transformer in the Internet of Robotic Things


245. KV Cache Recycling to Expand Usable Context Capacity in Low Parameter LLMs


246. KH-FUNSD: A Hierarchical and Fine-Grained Layout Analysis Dataset for Low-Resource Khmer Business Document


247. Airport Passenger Flow Forecasting via Deformable Temporal-Spectral Transformer Approach


248. Spiking Manifesto


249. Vision Foundry: A System for Training Foundational Vision AI Models


250. Semantic Nutrition Estimation: Predicting Food Healthfulness from Text Descriptions


251. Soft Decision Tree classifier: explainable and extendable PyTorch implementation


252. Performance and Efficiency of Climate In-Situ Data Reconstruction: Why Optimized IDW Outperforms kriging and Implicit Neural Representation


253. CR3G: Causal Reasoning for Patient-Centric Explanations in Radiology Report Generation


254. Active Inference with Reusable State-Dependent Value Profiles


255. Assessing Greenspace Attractiveness with ChatGPT, Claude, and Gemini: Do AI Models Reflect Human Perceptions?


256. The Ontological Dissonance Hypothesis: AI-Triggered Delusional Ideation as Folie a Deux Technologique


257. Totalitarian Technics: The Hidden Cost of AI Scribes in Healthcare


258. Enhancing Urban Visual Place Recognition for Crowdsourced Flood Imagery via LLM-Guided Attention


259. EMNLP: Educator-role Moral and Normative Large Language Models Profiling


260. A Multitask VAE for Time Series Preprocessing and Prediction of Blood Glucose Level