전체 AI 논문 - 2025-12-08

1. SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code


2. Variational Quantum Rainbow Deep Q-Network for Optimizing Resource Allocation Problem


3. TRACE: A Framework for Analyzing and Enhancing Stepwise Reasoning in Vision-Language Models


4. PRiSM: An Agentic Multimodal Benchmark for Scientific Reasoning via Python-Grounded Evaluation


5. To Err Is Human: Systematic Quantification of Errors in Published AI Papers via LLM Analysis


6. Using Large Language Models to Create Personalized Networks From Therapy Sessions


7. Multimodal Oncology Agent for IDH1 Mutation Prediction in Low-Grade Glioma


8. The Missing Layer of AGI: From Pattern Alchemy to Coordination Physics


9. Evolutionary System 2 Reasoning: An Empirical Proof


10. A Fast Anti-Jamming Cognitive Radar Deployment Algorithm Based on Reinforcement Learning


11. KANFormer for Predicting Fill Probabilities via Survival Analysis in Limit Order Books


12. Enhancing Local Search for MaxSAT with Deep Differentiation Clause Weighting


13. Ontology Learning with LLMs: A Benchmark Study on Axiom Identification


14. CureAgent: A Training-Free Executor-Analyst Framework for Clinical Reasoning


15. MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models


16. The Seeds of Scheming: Weakness of Will in the Building Blocks of Agentic Systems


17. BEAVER: An Efficient Deterministic LLM Verifier


18. ChipMind: Retrieval-Augmented Reasoning for Long-Context Circuit Design Specifications


19. MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare


20. AI & Human Co-Improvement for Safer Co-Superintelligence


21. Resolving Zadehs Paradox Axiomatic Possibility Theory as a Foundation for Reliable Artificial Intelligence


22. On the Computability of Artificial General Intelligence


23. Bridging Traditional Machine Learning and Large Language Models: A Two-Part Course Design for Modern AI Education


24. Semantic Faithfulness and Entropy Production Measures to Tame Your LLM Demons and Manage Hallucinations


25. Documenting SME Processes with Conversational AI: From Tacit Knowledge to BPMN


26. Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms


27. Training-Time Action Conditioning for Efficient Real-Time Chunking


28. Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity


29. AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement


30. M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG


31. MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution


32. Trusted AI Agents in the Cloud


33. Impugan: Learning Conditional Generative Models for Robust Data Imputation


34. Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding


35. Measuring the Effect of Background on Classification and Feature Importance in Deep Learning for AV Perception


36. World Models That Know When They Don’t Know: Controllable Video Generation with Calibrated Uncertainty


37. Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures


38. Neural Coherence : Find higher performance to out-of-distribution tasks from few samples


39. Sparse Attention Post-Training for Mechanistic Interpretability


40. Optimizing Medical Question-Answering Systems: A Comparative Study of Fine-Tuned and Zero-Shot Large Language Models with RAG Framework


41. NEAT: Neighborhood-Guided, Efficient, Autoregressive Set Transformer for 3D Molecular Generation


42. Phase-OTDR Event Detection Using Image-Based Data Transformation and Deep Learning


43. Approximation of Box Decomposition Algorithm for Fast Hypervolume-Based Multi-Objective Optimization


44. Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling


45. 3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering


46. Mechanistic Interpretability of Antibody Language Models Using SAEs


47. Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding


48. Efficient Text Classification with Conformal In-Context Learning


49. Big Tech-Funded AI Papers Have Higher Citation Impact, Greater Insularity, and Larger Recency Bias


50. Bayesian Active Inference for Intelligent UAV Anti-Jamming and Adaptive Trajectory Planning


51. Faithfulness metric fusion: Improving the evaluation of LLM trustworthiness across domains


52. HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies


53. Retrieving Semantically Similar Decisions under Noisy Institutional Labels: Robust Comparison of Embedding Methods


54. InverseCrafter: Efficient Video ReCapture as a Latent Domain Inverse Problem


55. On Dynamic Programming Theory for Leader-Follower Stochastic Games


56. Feasibility of AI-Assisted Programming for End-User Development


57. Grounded Multilingual Medical Reasoning for Question Answering with Large Language Models


58. Modular Jets for Supervised Pipelines: Diagnosing Mirage vs Identifiability


59. A Comprehensive Framework for Automated Quality Control in the Automotive Industry


60. 2K-Characters-10K-Stories: A Quality-Gated Stylized Narrative Dataset with Disentangled Control and Sequence Consistency


61. Improving Local Fidelity Through Sampling and Modeling Nonlinearity


62. Conscious Gaze: Adaptive Attention Mechanisms for Hallucination Mitigation in Vision-Language Models


63. RoBoN: Routed Online Best-of-n for Test-Time Scaling with Multiple LLMs


64. On the Theoretical Foundation of Sparse Dictionary Learning in Mechanistic Interpretability


65. See in Depth: Training-Free Surgical Scene Segmentation with Monocular Depth Priors


66. User Negotiations of Authenticity, Ownership, and Governance on AI-Generated Video Platforms: Evidence from Sora


67. Matching Ranks Over Probability Yields Truly Deep Safety Alignment


68. Lyrics Matter: Exploiting the Power of Learnt Representations for Music Popularity Prediction


69. UniFS: Unified Multi-Contrast MRI Reconstruction via Frequency-Spatial Fusion


70. PERM EQ x GRAPH EQ: Equivariant Neural Networks for Quantum Molecular Learning


71. How Ensemble Learning Balances Accuracy and Overfitting: A Bias-Variance Perspective on Tabular Data


72. University Building Recognition Dataset in Thailand for the mission-oriented IoT sensor system


73. Dynamic Alignment for Collective Agency: Toward a Scalable Self-Improving Framework for Open-Ended LLM Alignment


74. Knowing Your Uncertainty – On the application of LLM in social sciences


75. Parajudica: An RDF-Based Reasoner and Metamodel for Multi-Framework Context-Dependent Data Compliance Assessments


76. IdealTSF: Can Non-Ideal Data Contribute to Enhancing the Performance of Time Series Forecasting Models?


77. Building Capacity for Artificial Intelligence in Africa: A Cross-Country Survey of Challenges and Governance Pathways


78. ArtistMus: A Globally Diverse, Artist-Centric Benchmark for Retrieval-Augmented Music Question Answering


79. Moving object detection from multi-depth images with an attention-enhanced CNN


80. A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to Enhance RAG Systems


81. Smart Timing for Mining: A Deep Learning Framework for Bitcoin Hardware ROI Prediction


82. Simulating Life Paths with Digital Twins: AI-Generated Future Selves Influence Decision-Making and Expand Human Choice


83. Generalization Beyond Benchmarks: Evaluating Learnable Protein-Ligand Scoring Functions on Unseen Targets


84. Fuzzing the brain: Automated stress testing for the safety of ML-driven neurostimulation


85. Mitigating Self-Preference by Authorship Obfuscation


86. China Regional 3km Downscaling Based on Residual Corrective Diffusion Model


87. Please Don’t Kill My Vibe: Empowering Agents with Data Flow Control


88. Text Rationalization for Robust Causal Effect Estimation


89. Invisible Load: Uncovering the Challenges of Neurodivergent Women in Software Engineering


90. SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling


91. Interaction Tensor Shap


92. The Effect of Document Summarization on LLM-Based Relevance Judgments


93. LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning


94. Robustness Test for AI Forecasting of Hurricane Florence Using FourCastNetv2 and Random Perturbations of the Initial Condition


95. To Think or Not to Think: The Hidden Cost of Meta-Training with Excessive CoT Examples


96. WhatsCode: Large-Scale GenAI Deployment for Developer Efficiency at WhatsApp


97. The Erosion of LLM Signatures: Can We Still Distinguish Human and LLM-Generated Scientific Ideas After Iterative Paraphrasing?


98. CFO: Learning Continuous-Time PDE Dynamics via Flow-Matched Neural Operators


99. Beyond Detection: A Comprehensive Benchmark and Study on Representation Learning for Fine-Grained Webshell Family Classification


100. From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model


101. XR-DT: Extended Reality-Enhanced Digital Twin for Agentic Mobile Robots


102. Uncertainty-Aware Data-Efficient AI: An Information-Theoretic Perspective


103. Learning to Code with Context: A Study-Based Approach


104. A Survey of Bugs in AI-Generated Code


105. MAR-FL: A Communication Efficient Peer-to-Peer Federated Learning System


106. Invariance Co-training for Robot Visual Generalization


107. Fine-Tuning BERT for Domain-Specific Question Answering: Toward Educational NLP Resources at University Scale


108. Towards A Cultural Intelligence and Values Inferences Quality Benchmark for Community Values and Common Knowledge


109. Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement Learning


110. Advanced Unsupervised Learning: A Comprehensive Overview of Multi-View Clustering Techniques


111. How to Tame Your LLM: Semantic Collapse in Continuous Systems


112. FlowEO: Generative Unsupervised Domain Adaptation for Earth Observation


113. ChromouVQA: Benchmarking Vision-Language Models under Chromatic Camouflaged Images


114. Fine-tuning an ECG Foundation Model to Predict Coronary CT Angiography Outcomes


115. Breaking Scale Anchoring: Frequency Representation Learning for Accurate High-Resolution Inference from Low-Resolution Training


116. AREA3D: Active Reconstruction Agent with Unified Feed-Forward 3D Perception and Vision-Language Guidance


117. GNSS Jammer Direction Finding in Dynamic Scenarios Using an Inertial-based Multi-Antenna System


118. SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model


119. PESTalk: Speech-Driven 3D Facial Animation with Personalized Emotional Styles


120. RAG-IGBench: Innovative Evaluation for RAG-based Interleaved Generation in Open-domain Question Answering