전체 AI 논문 - 2025-12-04

1. From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?


2. Invasive Context Engineering to Control Large Language Models


3. Martingale Score: An Unsupervised Metric for Bayesian Rationality in LLM Reasoning


4. The future of AI in critical mineral exploration


5. Radiologist Copilot: An Agentic Assistant with Orchestrated Tools for Radiology Reporting with Quality Control


6. Enhancing Automated Paper Reproduction via Prompt-Free Collaborative Agents


7. A Framework for Causal Concept-based Model Explanations


8. Self-Improving AI Agents through Self-Play


9. AuditCopilot: Leveraging LLMs for Fraud Detection in Double-Entry Bookkeeping


10. StockMem: An Event-Reflection Memory Framework for Stock Forecasting


11. Menta: A Small Language Model for On-Device Mental Health Prediction


12. Training Data Attribution for Image Generation using Ontology-Aligned Knowledge Graphs


13. Learning What to Attend First: Modality-Importance-Guided Reasoning for Reliable Multimodal Emotion Understanding


14. Exploring Depth Generalization in Large Language Models for Solving Recursive Logic Tasks


15. Zero-Shot Instruction Following in RL via Structured LTL Representations


16. Target-specific Adaptation and Consistent Degradation Alignment for Cross-Domain Remaining Useful Life Prediction


17. IACT: A Self-Organizing Recursive Model for General AI Agents: A Technical White Paper on the Architecture Behind kragent.ai


18. PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing


19. Empathy Level Prediction in Multi-Modal Scenario with Supervisory Documentation Assistance


20. Aetheria: A multimodal interpretable content safety framework based on multi-agent debate and collaboration


21. COPE: Chain-Of-Thought Prediction Engine for Open-Source Large Language Model Based Stroke Outcome Prediction from Clinical Notes


22. Guided Self-Evolving LLMs with Minimal Human Supervision


23. Semantic Trading: Agentic AI for Clustering and Relationship Discovery in Prediction Markets


24. Synthetic Error Injection Fails to Elicit Self-Correction In Language Models


25. Beyond Playtesting: A Generative Multi-Agent Simulation System for Massively Multiplayer Online Games


26. Reasoning Path and Latent State Analysis for Multi-view Visual Spatial Reasoning: A Cognitive Science Perspective


27. OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning


28. Breast Cell Segmentation Under Extreme Data Constraints: Quantum Enhancement Meets Adaptive Loss Stabilization


29. Model Recovery at the Edge under Resource Constraints for Physical AI


30. DialogGuard: Multi-Agent Psychosocial Safety Evaluation of Sensitive LLM Responses


31. Bridging the Gap: Toward Cognitive Autonomy in Artificial Intelligence


32. TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?


33. Benchmarking LLM Agents for Wealth-Management Workflows


34. STRIDE: A Systematic Framework for Selecting AI Modalities - Agentic AI, AI Assistants, or LLM Calls


35. From monoliths to modules: Decomposing transducers for efficient world modelling


36. Flowchart2Mermaid: A Vision-Language Model Powered System for Converting Flowcharts into Editable Diagram Code


37. The 4/$δ$ Bound: Designing Predictable LLM-Verifier Systems for Formal Method Guarantee


38. PPTArena: A Benchmark for Agentic PowerPoint Editing


39. Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation


40. ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation


41. SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control


42. The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models


43. LORE: A Large Generative Model for Search Relevance


44. TokenPowerBench: Benchmarking the Power Consumption of LLM Inference


45. Distribution-Calibrated Inference time compute for Thinking LLM-as-a-Judge


46. In-Context Sync-LoRA for Portrait Video Editing


47. Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic


48. Rethinking Generalized BCIs: Benchmarking 340,000+ Unique Algorithmic Configurations for EEG Mental Command Decoding


49. Lumos: Let there be Language Model System Certification


50. Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench


51. EGGS: Exchangeable 2D/3D Gaussian Splatting for Geometry-Appearance Balanced Novel View Synthesis


52. In Silico Development of Psychometric Scales: Feasibility of Representative Population Data Simulation with LLMs


53. MRD: Multi-resolution Retrieval-Detection Fusion for High-Resolution Image Understanding


54. Towards a fully differentiable digital twin for solar cells


55. VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling


56. FAIRY2I: Universal Extremely-Low Bit QAT framework via Widely-Linear Representation and Phase-Aware Quantization


57. Model-Based Diagnosis with Multiple Observations: A Unified Approach for C Software and Boolean Circuits


58. OptPO: Optimal Rollout Allocation for Test-time Policy Optimization


59. GraphMatch: Fusing Language and Graph Representations in a Dynamic Two-Sided Work Marketplace


60. Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages


61. ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning


62. Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach


63. A Comparative Study on How Data Normalization Affects Zero-Shot Generalization in Time Series Foundation Models


64. Defense That Attacks: How Robust Models Become Better Attackers


65. From Navigation to Refinement: Revealing the Two-Stage Nature of Flow-based Diffusion Models through Oracle Velocity


66. Phase-Adaptive LLM Framework with Multi-Stage Validation for Construction Robot Task Allocation: A Systematic Benchmark Against Traditional Optimization Algorithms


67. Perception of AI-Generated Music - The Role of Composer Identity, Personality Traits, Music Preferences, and Perceived Humanness


68. SurveyEval: Towards Comprehensive Evaluation of LLM-Generated Academic Surveys


69. Reasoning-Aware Multimodal Fusion for Hateful Video Detection


70. DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions


71. Emergent Bayesian Behaviour and Optimal Cue Combination in LLMs


72. Empirical Assessment of the Perception of Software Product Line Engineering by an SME before Migrating its Code Base


73. An Empirical Survey of Model Merging Algorithms for Social Bias Mitigation


74. Beyond Single-Agent Safety: A Taxonomy of Risks in LLM-to-LLM Interactions


75. SAND Challenge: Four Approaches for Dysartria Severity Classification


76. Graph VQ-Transformer (GVT): Fast and Accurate Molecular Generation via High-Fidelity Discrete Latents


77. Distill, Forget, Repeat: A Framework for Continual Unlearning in Text-to-Image Diffusion Models


78. Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-Supervised Pre-Training


79. CryptoQA: A Large-scale Question-answering Dataset for AI-assisted Cryptography


80. Feedback Loops and Code Perturbations in LLM-based Software Engineering: A Case Study on a C-to-Rust Translation System


81. From Panel to Pixel: Zoom-In Vision-Language Pretraining from Biomedical Scientific Literature


82. EZYer: A simulacrum of high school with generative agent


83. ADORE: Autonomous Domain-Oriented Relevance Engine for E-commerce


84. CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning


85. Sparse Computations in Deep Learning Inference


86. AskNearby: An LLM-Based Application for Neighborhood Information Retrieval and Personalized Cognitive-Map Recommendations


87. Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Understanding


88. UCAgents: Unidirectional Convergence for Visual Evidence Anchored Multi-Agent Medical Decision-Making


89. Q-BERT4Rec: Quantized Semantic-ID Representation Learning for Multimodal Recommendation


90. scCluBench: Comprehensive Benchmarking of Clustering Algorithms for Single-Cell RNA Sequencing



92. HouseLayout3D: A Benchmark and Training-Free Baseline for 3D Layout Estimation in the Wild


93. When Refusals Fail: Unstable Safety Mechanisms in Long-Context LLM Agents


94. Boosting Medical Vision-Language Pretraining via Momentum Self-Distillation under Limited Computing Resources


95. LightHCG: a Lightweight yet powerful HSIC Disentanglement based Causal Glaucoma Detection Model framework


96. WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning


97. Quantum feature encoding optimization


98. The brain-AI convergence: Predictive and generative world models for general-purpose computation


99. Vehicle Dynamics Embedded World Models for Autonomous Driving


100. MitUNet: Enhancing Floor Plan Recognition using a Hybrid Mix-Transformer and U-Net Architecture


101. Data Curation Through the Lens of Spectral Dynamics: Static Limits, Dynamic Acceleration, and Practical Oracles


102. WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate


103. Process-Centric Analysis of Agentic Software Systems


104. Multi-Domain Enhanced Map-Free Trajectory Prediction with Selective Attention


105. Tackling Tuberculosis: A Comparative Dive into Machine Learning for Tuberculosis Detection


106. Memory-Augmented Knowledge Fusion with Safety-Aware Decoding for Domain-Adaptive Question Answering


107. VACoT: Rethinking Visual Data Augmentation with VLMs


108. Understanding and Harnessing Sparsity in Unified Multimodal Models


109. FOVA: Offline Federated Reinforcement Learning with Mixed-Quality Data


110. Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision


111. COGNITION: From Evaluation to Defense against Multimodal LLM CAPTCHA Solvers


112. HealthContradict: Evaluating Biomedical Knowledge Conflicts in Language Models


113. Enhancing Cross Domain SAR Oil Spill Segmentation via Morphological Region Perturbation and Synthetic Label-to-SAR Generation


114. Progressive Image Restoration via Text-Conditioned Video Generation


115. Spatiotemporal Pyramid Flow Matching for Climate Emulation


116. DETAIL Matters: Measuring the Impact of Prompt Specificity on Reasoning in Large Language Models


117. See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models


118. Orchestration Framework for Financial Agents: From Algorithmic Trading to Agentic Trading


119. Improved Training Mechanism for Reinforcement Learning via Online Model Selection


120. Multifractal Recalibration of Neural Networks for Medical Imaging Segmentation


121. Bin2Vec: Interpretable and Auditable Multi-View Binary Analysis for Code Plagiarism Detection


122. A Knowledge-Based Language Model: Deducing Grammatical Knowledge in a Multi-Agent Language Acquisition Simulation


123. Enforcing Orderedness to Improve Feature Consistency


124. Story2MIDI: Emotionally Aligned Music Generation from Text


125. Think Before You Prune: Self-Reflective Structured Pruning for Reasoning Language Models


126. CLEF: Clinically-Guided Contrastive Learning for Electrocardiogram Foundation Models


127. Young Children’s Anthropomorphism of AI Chatbots and the Role of Parent Co-Presence


128. Feature Selection Empowered BERT for Detection of Hate Speech with Vocabulary Augmentation


129. Comparing Baseline and Day-1 Diffusion MRI Using Multimodal Deep Embeddings for Stroke Outcome Prediction


130. FDRMFL:Multi-modal Federated Feature Extraction Model Based on Information Maximization and Contrastive Learning


131. HTG-GCL: Leveraging Hierarchical Topological Granularity from Cellular Complexes for Graph Contrastive Learning


132. DPWMixer: Dual-Path Wavelet Mixer for Long-Term Time Series Forecasting


133. Large Language Model based Smart Contract Auditing with LLMBugScanner


134. Parallel Multi-Circuit Quantum Feature Fusion in Hybrid Quantum-Classical Convolutional Neural Networks for Breast Tumor Classification


135. Superpixel Attack: Enhancing Black-box Adversarial Attack with Image-driven Division Areas


136. Ada-MoGE: Adaptive Mixture of Gaussian Expert Model for Time Series Forecasting


137. Opening the Black Box: An Explainable, Few-shot AI4E Framework Informed by Physics and Expert Knowledge for Materials Engineering


138. Reversing Large Language Models for Efficient Training and Fine-Tuning


139. Leveraging AI multimodal geospatial foundation models for improved near-real-time flood mapping at a global scale


140. The Impact of Artificial Intelligence on Enterprise Decision-Making Process


141. Deep Research: A Systematic Survey


142. Statistical Arbitrage in Polish Equities Market Using Deep Learning Techniques


143. Integration of LSTM Networks in Random Forest Algorithms for Stock Market Trading Predictions


144. CONFIDE: Hallucination Assessment for Reliable Biomolecular Structure Prediction and Design


145. Characterizing Continuous and Discrete Hybrid Latent Spaces for Structural Connectomes


146. On the Difficulty of Token-Level Modeling of Dysfluency and Fluency Shaping Artifacts


147. Towards Sustainable Precision: Machine Learning for Laser Micromachining Optimization


148. DySTAN: Joint Modeling of Sedentary Activity and Social Context from Smartphone Sensors


149. Do Large Language Models Walk Their Talk? Measuring the Gap Between Implicit Associations, Self-Report, and Behavioral Altruism


150. Graphing the Truth: Structured Visualizations for Automated Hallucination Detection in LLMs


151. Mixed precision accumulation for neural network inference guided by componentwise forward error analysis