전체 AI 논문 - 2025-12-24

1. LongVideoAgent: Multi-Agent Reasoning with Long Videos


2. Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent


3. Advancing Multimodal Teacher Sentiment Analysis:The Large-Scale T-MED Dataset & The Effective AAM-TSA Model


4. Benchmarking LLMs for Predictive Applications in the Intensive Care Units


5. Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale


6. Generative Digital Twins: Vision-Language Simulation Models for Executable Industrial Systems


7. A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice


8. SynCraft: Guiding Large Language Models to Predict Edit Sequences for Molecular Synthesizability Optimization


9. Synthesizing Procedural Memory: Challenges and Architectures in Automated Workflow Generation


10. ActionFlow: A Pipelined Action Acceleration for Vision Language Models on Edge


11. Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous Networks


12. MemR$^3$: Memory Retrieval via Reflective Reasoning for LLM Agents


13. TongSIM: A General Platform for Simulating Intelligent Machines


14. Offline Safe Policy Optimization From Heterogeneous Feedback


15. Concept Generalization in Humans and Large Language Models: Insights from the Number Game


16. A Bidirectional Gated Recurrent Unit Model for PUE Prediction in Data Centers


17. Enhancing Zero-Shot Time Series Forecasting in Off-the-Shelf LLMs via Noise Injection


18. MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization


19. Adaptive Financial Sentiment Analysis for NIFTY 50 via Instruction-Tuned LLMs , RAG and Reinforcement Learning Approaches


20. Reason2Decide: Rationale-Driven Multi-Task Learning


21. Scaling Reinforcement Learning for Content Moderation with Large Language Models


22. Towards Generative Location Awareness for Disaster Response: A Probabilistic Cross-view Geolocalization Approach


23. Learning Skills from Action-Free Videos


24. Discovering Lie Groups with Flow Matching


25. S$^3$IT: A Benchmark for Spatially Situated Social Intelligence Test


26. FGDCC: Fine-Grained Deep Cluster Categorization – A Framework for Intra-Class Variability Problems in Plant Classification


27. Zero-Shot Segmentation through Prototype-Guidance for Multi-Label Plant Species Identification


28. Interpolative Decoding: Exploring the Spectrum of Personality Traits in LLMs


29. A Branch-and-Price Algorithm for Fast and Equitable Last-Mile Relief Aid Distribution


30. PhysMaster: Building an Autonomous AI Physicist for Theoretical and Computational Physics Research


31. Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning


32. Cube Bench: A Benchmark for Spatial Visual Reasoning in MLLMs


33. Leveraging High-Fidelity Digital Models and Reinforcement Learning for Mission Engineering: A Case Study of Aerial Firefighting Under Perfect Information


34. Performative Policy Gradient: Optimality in Performative Reinforcement Learning


35. Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs


36. Distilling to Hybrid Attention Models via KL-Guided Layer Selection


37. LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving


38. SweRank+: Multilingual, Multi-Turn Code Ranking for Software Issue Localization


39. Dual-Encoder Transformer-Based Multimodal Learning for Ischemic Stroke Lesion Segmentation Using Diffusion MRI


40. Evasion-Resilient Detection of DNS-over-HTTPS Data Exfiltration: A Practical Evaluation and Toolkit


41. Simplifying Multi-Task Architectures Through Task-Specific Normalization


42. DETACH : Decomposed Spatio-Temporal Alignment for Exocentric Video and Ambient Sensors with Staged Learning


43. AUDRON: A Deep Learning Framework with Fused Acoustic Signatures for Drone Type Recognition


44. Identifying Appropriately-Sized Services with Deep Reinforcement Learning


45. Clust-PSI-PFL: A Population Stability Index Approach for Clustered Non-IID Personalized Federated Learning


46. Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen’s Kappa and Semantic Similarity for Qualitative Research Validation


47. Toward Explaining Large Language Models in Software Engineering Tasks


48. Deep Learning Classification of EEG Responses to Multi-Dimensional Transcranial Electrical Stimulation


49. TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning


50. KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System


51. Patterns vs. Patients: Evaluating LLMs against Mental Health Professionals on Personality Disorder Diagnosis through First-Person Narratives


52. TAVID: Text-Driven Audio-Visual Interactive Dialogue Generation


53. SlideTailor: Personalized Presentation Slide Generation for Scientific Papers


54. UbiQVision: Quantifying Uncertainty in XAI for Image Recognition


55. ${D}^{3}${ETOR}: ${D}$ebate-Enhanced Pseudo Labeling and Frequency-Aware Progressive ${D}$ebiasing for Weakly-Supervised Camouflaged Object ${D}$etection with Scribble Annotations


56. Memory as Resonance: A Biomimetic Architecture for Infinite Context Memory on Ergodic Phonetic Manifolds


57. Corpus of Cross-lingual Dialogues with Minutes and Detection of Misunderstandings


58. Asynchronous Fast-Slow Vision-Language-Action Policies for Whole-Body Robotic Manipulation


59. FaithLens: Detecting and Explaining Faithfulness Hallucination


60. Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography


61. AI Security Beyond Core Domains: Resume Screening as a Case Study of Adversarial Vulnerabilities in Specialized LLM Applications


62. AXIOM: Benchmarking LLM-as-a-Judge for Code via Rule-Based Perturbation and Multisource Quality Calibration


63. Fun-Audio-Chat Technical Report


64. Retrieval-augmented Prompt Learning for Pre-trained Foundation Models


65. M$^3$KG-RAG: Multi-hop Multimodal Knowledge Graph-enhanced Retrieval-Augmented Generation


66. Evolutionary Neural Architecture Search with Dual Contrastive Learning


67. ABBEL: LLM Agents Acting through Belief Bottlenecks Expressed in Language


68. Item Region-based Style Classification Network (IRSN): A Fashion Style Classifier Based on Domain Knowledge of Fashion Experts


69. Spatio-Temporal Graphs Beyond Grids: Benchmark for Maritime Anomaly Detection


70. QE-Catalytic: A Graph-Language Multimodal Base Model for Relaxed-Energy Prediction in Catalytic Adsorption


71. CBA: Communication-Bound-Aware Cross-Domain Resource Assignment for Pipeline-Parallel Distributed LLM Training in Dynamic Multi-DC Optical Networks


72. On the Effectiveness of Instruction-Tuning Local LLMs for Identifying Software Vulnerabilities


73. An Optimal Policy for Learning Controllable Dynamics by Exploration


74. Beyond Vision: Contextually Enriched Image Captioning with Multi-Modal Retrieva


75. DecoKAN: Interpretable Decomposition for Forecasting Cryptocurrency Market Dynamics


76. Bring My Cup! Personalizing Vision-Language-Action Models with Visual Attentive Prompting


77. IoT-based Android Malware Detection Using Graph Neural Network With Adversarial Defense


78. Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models


79. Neuron-Guided Interpretation of Code LLMs: Where, Why, and How?


80. Regression of Functions by Quantum Neural Networks Circuits


81. How Much 3D Do Video Foundation Models Encode?


82. Block-Recurrent Dynamics in Vision Transformers


83. Conditional Adversarial Fragility in Financial Machine Learning under Macroeconomic Stress


84. Vehicle-centric Perception via Multimodal Structured Pre-training


85. Unified Brain Surface and Volume Registration


86. Mitigating LLM Hallucination via Behaviorally Calibrated Reinforcement Learning


87. A Time-efficient Prioritised Scheduling Algorithm to Optimise Initial Flock Formation of Drones


88. Modeling Non-Ergodic Path Effects Using Conditional Generative Model for Fourier Amplitude Spectra


89. Demystifying LLM-as-a-Judge: Analytically Tractable Model for Inference-Time Scaling


90. Fine-Tuned In-Context Learners for Efficient Adaptation


91. HARMON-E: Hierarchical Agentic Reasoning for Multimodal Oncology Notes to Extract Structured Data


92. UCCL-EP: Portable Expert-Parallel Communication


93. Learned Digital Codes for Over-the-Air Computation in Federated Edge Learning


94. A K-Means, Ward and DBSCAN repeatability study


95. A Declarative Language for Building And Orchestrating LLM-Powered Agent Workflows


96. How Many Experts Are Enough? Towards Optimal Semantic Specialization for Mixture-of-Experts


97. Attention Distance: A Novel Metric for Directed Fuzzing with Large Language Models


98. QMBench: A Research Level Benchmark for Quantum Materials Research


99. From Theory to Throughput: CUDA-Optimized APML for Large-Batch 3D Learning


100. Simulation-Driven Railway Delay Prediction: An Imitation Learning Approach


101. CoPHo: Classifier-guided Conditional Topology Generation with Persistent Homology


102. High-Performance Self-Supervised Learning by Joint Training of Flow Matching


103. Tiny, On-Device Decision Makers with the MiniConv Library


104. Multiscale Dual-path Feature Aggregation Network for Remaining Useful Life Prediction of Lithium-Ion Batteries


105. Thermodynamic Focusing for Inference-Time Search: Practical Methods for Target-Conditioned Sampling and Prompted Inference


106. Development and external validation of a multimodal artificial intelligence mortality prediction model of critically ill patients using multicenter data


107. PHANTOM: PHysical ANamorphic Threats Obstructing Connected Vehicle Mobility


108. Bidirectional human-AI collaboration in brain tumour assessments improves both expert human and AI agent performance


109. Generative AI for Analysts


110. Large Language Models for EDA Cloud Job Resource and Lifetime Prediction


111. Automated Fault Detection in 5G Core Networks Using Large Language Models


112. QoS-Aware Dynamic CU Selection in O-RAN with Graph-Based Reinforcement Learning


113. Brain-Grounded Axes for Reading and Steering LLM States