전체 AI 논문 - 2025-12-18

1. Universal Reasoning Model


2. Dynamic Learning Rate Scheduling based on Loss Changes Leads to Faster Convergence


3. Sparse Multi-Modal Transformer with Masking for Alzheimer’s Disease Classification


4. Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling


5. Context-Picker: Dynamic context selection using multi-stage reinforcement learning


6. Seismology modeling agent: A smart assistant for geophysical researchers


7. PortAgent: LLM-driven Vehicle Dispatching Agent for Port Terminals


8. Massive Editing for Large Language Models Based on Dynamic Weight Generation


9. TiCard: Deployable EXPLAIN-only Residual Learning for Cardinality Estimation


10. Leveraging LLMs for Collaborative Ontology Engineering in Parkinson Disease Monitoring and Alerting


11. Gödel’s Poetry


12. Georeferencing complex relative locality descriptions with large language models


13. Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis


14. Optimizing Multi-Tier Supply Chain Ordering with a Hybrid Liquid Neural Network and Extreme Gradient Boosting Model


15. HydroGEM: A Self Supervised Zero Shot Hybrid TCN Transformer Foundation Model for Continental Scale Streamflow Quality Control


16. Grammar Search for Multi-Agent Systems


17. RADAR: Accelerating Large Language Model Inference With RL-Based Dynamic Draft Trees


18. OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value


19. Intention Chain-of-Thought Prompting with Dynamic Routing for Code Generation


20. Evaluating Small Language Models for Agentic On-Farm Decision Support Systems


21. MobileWorldBench: Towards Semantic World Modeling For Mobile Agents


22. Sparsity-Controllable Dynamic Top-p MoE for Large Foundation Model Pre-training


23. ReflCtrl: Controlling LLM Reflection via Representation Engineering


24. Evaluating Frontier LLMs on PhD-Level Mathematical Reasoning: A Benchmark on a Textbook in Theoretical Computer Science about Randomized Algorithms


25. MURIM: Multidimensional Reputation-based Incentive Mechanism for Federated Learning


26. EvoLattice: Persistent Internal-Population Evolution through Multi-Alternative Quality-Diversity Graph Representations for LLM-Guided Program Discovery


27. Semantic Grounding Index: Geometric Bounds on Context Engagement in RAG Systems


28. Mathematics and Coding are Universal AI Benchmarks


29. State-Dependent Refusal and Learned Incapacity in RLHF-Aligned Language Models


30. Compressed Causal Reasoning: Quantization and GraphRAG Effects on Interventional and Counterfactual Accuracy


31. ValuePilot: A Two-Phase Framework for Value-Driven Decision-Making


32. Meta Hierarchical Reinforcement Learning for Scalable Resource Management in O-RAN


33. AI-Powered Annotation Pipelines for Stabilizing Large Language Models: A Human-AI Synergy Approach


34. LoopBench: Discovering Emergent Symmetry Breaking Strategies with LLM Swarms


35. Adjudicator: Correcting Noisy Labels with a KG-Informed Council of LLM Agents


36. Blind Radio Mapping via Spatially Regularized Bayesian Trajectory Inference


37. Leveraging LLMs for Structured Data Extraction from Unstructured Patient Records


38. TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs


39. Spherical Leech Quantization for Visual Tokenization and Generation


40. Native and Compact Structured Latents for 3D Generation


41. Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization


42. Bias-Variance Trade-off for Clipped Stochastic First-Order Methods: From Bounded Variance to Infinite Mean


43. VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image


44. gridfm-datakit-v1: A Python Library for Scalable and Realistic Power Flow and Optimal Power Flow Data Generation


45. A Multicenter Benchmark of Multiple Instance Learning Models for Lymphoma Subtyping from HE-stained Whole Slide Images


46. MuseCPBench: an Empirical Study of Music Editing Methods through Music Context Preservation


47. JMMMU-Pro: Image-based Japanese Multi-discipline Multimodal Understanding Benchmark via Vibe Benchmark Construction


48. Model-Based Reinforcement Learning in Discrete-Action Non-Markovian Reward Decision Processes


49. FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos


50. Towards Nepali-language LLMs: Efficient GPT training with a Nepali BPE tokenizer


51. Low-Resource, High-Impact: Building Corpora for Inclusive Language Technologies


52. Residual GRU+MHSA: A Lightweight Hybrid Recurrent Attention Model for Cardiovascular Disease Detection


53. Polypersona: Persona-Grounded LLM for Synthetic Survey Responses


54. CLNet: Cross-View Correspondence Makes a Stronger Geo-Localizationer



56. Dual Language Models: Balancing Training Efficiency and Overfitting Resilience


57. CAPRMIL: Context-Aware Patch Representations for Multiple Instance Learning


58. SASQ: Static Activation Scaling for Quantization-Aware Training in Large Language Models


59. TACK Tunnel Data (TTD): A Benchmark Dataset for Deep Learning-Based Defect Detection in Tunnels


60. Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space


61. Effect of Document Packing on the Latent Multi-Hop Reasoning Capabilities of Large Language Models


62. DISCODE: Distribution-Aware Score Decoder for Robust Automatic Evaluation of Image Captioning


63. RePo: Language Models with Context Re-Positioning


64. Causal Structure Learning for Dynamical Systems with Theoretical Score Analysis


65. Enhancing Interpretability for Vision Models via Shapley Value Optimization


66. Towards Transferable Defense Against Malicious Image Edits


67. Dual Attention Guided Defense Against Malicious Edits


68. Step-Tagging: Toward controlling the generation of Language Reasoning Models through step monitoring


69. Criminal Liability in AI-Enabled Autonomous Vehicles: A Comparative Study


70. A data-physics hybrid generative model for patient-specific post-stroke motor rehabilitation using wearable sensor data


71. Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity


72. From YOLO to VLMs: Advancing Zero-Shot and Few-Shot Detection of Wastewater Treatment Plants Using Satellite Imagery in MENA Region


73. A Threshold-Triggered Deep Q-Network-Based Framework for Self-Healing in Autonomic Software-Defined IIoT-Edge Networks


74. The Trust in AI-Generated Health Advice (TAIGHA) Scale and Short Version (TAIGHA-S): Development and Validation Study


75. SPARQL-LLM: Real-Time SPARQL Query Generation from Natural Language Questions


76. Explainable Preference Learning: a Decision Tree-based Surrogate Model for Preferential Bayesian Optimization


77. From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition


78. Beyond MMD: Evaluating Graph Generative Models with Geometric Deep Learning


79. PentestEval: Benchmarking LLM-based Penetration Testing with Modular and Stage-Level Design


80. Estimating problem difficulty without ground truth using Large Language Model comparisons


81. Error Bound Analysis of Physics-Informed Neural Networks-Driven T2 Quantification in Cardiac Magnetic Resonance Imaging


82. Understanding and Improving Hyperbolic Deep Reinforcement Learning


83. End-to-End Learning-based Video Streaming Enhancement Pipeline: A Generative AI Approach


84. Towards Explainable Quantum AI: Informing the Encoder Selection of Quantum Neural Networks via Visualization


85. A Comparative Analysis of Retrieval-Augmented Generation Techniques for Bengali Standard-to-Dialect Machine Translation Using LLMs


86. IntentMiner: Intent Inversion Attack via Tool Call Analysis in the Model Context Protocol


87. PathFinder: Advancing Path Loss Prediction for Single-to-Multi-Transmitter Scenario


88. TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models


89. LAPPI: Interactive Optimization with LLM-Assisted Preference-Based Problem Instantiation


90. UIXPOSE: Mobile Malware Detection via Intention-Behaviour Discrepancy Analysis


91. SportsGPT: An LLM-driven Framework for Interpretable Sports Motion Assessment and Training Guidance


92. Neurosymbolic Inference On Foundation Models For Remote Sensing Text-to-image Retrieval With Complex Queries


93. ProtoFlow: Interpretable and Robust Surgical Workflow Modeling with Learned Dynamic Scene Graph Prototypes


94. Arithmetic-Intensity-Aware Quantization


95. SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations


96. SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding


97. Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed


98. Real-time prediction of workplane illuminance distribution for daylight-linked controls using non-intrusive multimodal deep learning


99. FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling


100. OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving


101. ACE-SLAM: Scene Coordinate Regression for Neural Implicit Real-Time SLAM


102. Sample-Efficient Robot Skill Learning for Construction Tasks: Benchmarking Hierarchical Reinforcement Learning and Vision-Language-Action VLA Model


103. PerfCoder: Large Language Models for Interpretable Code Performance Optimization


104. KFS-Bench: Comprehensive Evaluation of Key Frame Sampling in Long Video Understanding


105. Professional Software Developers Don’t Vibe, They Control: AI Agent Use for Coding in 2025


106. Memo2496: Expert-Annotated Dataset and Dual-View Adaptive Framework for Music Emotion Recognition


107. Multi-Agent Collaborative Framework for Intelligent IT Operations: An AOI System with Context-Aware Compression and Dynamic Task Scheduling


108. Informing Acquisition Functions via Foundation Models for Molecular Discovery


109. Hierarchical Multi-agent Large Language Model Reasoning for Autonomous Functional Materials Discovery


110. Context Branching for LLM Conversations: A Version Control Approach to Exploratory Programming


111. Intelligent matter consisting of active particles


112. Exploring Machine Learning, Deep Learning, and Explainable AI Methods for Seasonal Precipitation Prediction in South America


113. Assessing High-Risk Systems: An EU AI Act Verification Framework


114. Generative AI for Video Translation: A Scalable Architecture for Multilingual Video Conferencing


115. One Permutation Is All You Need: Fast, Reliable Variable Importance and Model Stress-Testing


116. OPTIMA: Optimal One-shot Pruning for LLMs via Quadratic Programming Reconstruction


117. Privacy-Enhancing Infant Cry Classification with Federated Transformers and Denoising Regularization


118. Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors


119. Improvise, Adapt, Overcome – Telescopic Adapters for Efficient Fine-tuning of Vision Language Models in Medical Imaging


120. VajraV1 – The most accurate Real Time Object Detector of the YOLO family


121. EEG-D3: A Solution to the Hidden Overfitting Problem of Deep Learning Models


122. Beyond Procedural Compliance: Human Oversight as a Dimension of Well-being Efficacy in AI Governance


123. Towards Deep Learning Surrogate for the Forward Problem in Electrocardiology: A Scalable Alternative to Physics-Based Models


124. Network-Wide Traffic Volume Estimation from Speed Profiles using a Spatio-Temporal Graph Neural Network with Directed Spatial Attention


125. STAR: STacked AutoRegressive Scheme for Unified Multimodal Learning


126. MIDUS: Memory-Infused Depth Up-Scaling



128. Comparative Evaluation of Embedding Representations for Financial News Sentiment Analysis


129. Why Text Prevails: Vision May Undermine Multimodal Medical Decision Making


130. A Spatio-Temporal Hybrid Quantum-Classical Graph Convolutional Neural Network Approach for Urban Taxi Destination Prediction


131. Toward Noise-Aware Audio Deepfake Detection: Survey, SNR-Benchmarks, and Practical Recipes


132. DL$^3$M: A Vision-to-Language Framework for Expert-Level Medical Reasoning through Deep Learning and Large Language Models


133. The Laminar Flow Hypothesis: Detecting Jailbreaks via Semantic Turbulence in Large Language Models


134. Human-AI Collaboration Mechanism Study on AIGC Assisted Image Production for Special Coverage


135. Instilling Organisational Values in Firefighters through Simulation-Based Training


136. TF-MCL: Time-frequency Fusion and Multi-domain Cross-Loss for Self-supervised Depression Detection


137. DARTs: A Dual-Path Robust Framework for Anomaly Detection in High-Dimensional Multivariate Time Series


138. Plug-and-Play Parameter-Efficient Tuning of Embeddings for Federated Recommendation


139. Low-Rank Compression of Language Models via Differentiable Rank Selection


140. PIS: A Generalized Physical Inversion Solver for Arbitrary Sparse Observations via Set-Conditioned Diffusion


141. Complex Mathematical Expression Recognition: Benchmark, Large-Scale Dataset and Strong Baseline


142. Exploring the Modular Integration of “AI + Architecture” Pedagogy in Undergraduate Design Education: A Case Study of Architectural Design III/IV Courses at Zhejiang University


143. Composite Classifier-Free Guidance for Multi-Modal Conditioning in Wind Dynamics Super-Resolution


144. CurvaDion: Curvature-Adaptive Distributed Orthonormalization


145. Time-Constrained Recommendations: Reinforcement Learning Strategies for E-Commerce


146. Graph AI generates neurological hypotheses validated in molecular, organoid, and clinical systems


147. Made-in China, Thinking in America:U.S. Values Persist in Chinese LLMs


148. Federated Few-Shot Learning for Epileptic Seizure Detection Under Privacy Constraints


149. Scaling and Transferability of Annealing Strategies in Large Language Model Training


150. Safe2Harm: Semantic Isomorphism Attacks for Jailbreaking Large Language Models


151. Enhancing Transparency and Traceability in Healthcare AI: The AI Product Passport


152. Writing in Symbiosis: Mapping Human Creative Agency in the AI Era


153. MultiBanAbs: A Comprehensive Multi-Domain Bangla Abstractive Text Summarization Dataset


154. EDGC: Entropy-driven Dynamic Gradient Compression for Efficient LLM Training