전체 AI 논문 - 2025-12-12

1. Bayesian Networks, Markov Networks, Moralisation, Triangulation: a Categorical Perspective


2. SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments


3. Human-in-the-Loop and AI: Crowdsourcing Metadata Vocabulary for Materials Science


4. Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing


5. Interpretation as Linear Transformation: A Cognitive-Geometric Model of Belief and Meaning


6. RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning


7. Analyzing Planner Design Trade-offs for MAPF under Realistic Simulation


8. Gaussian Process Aggregation for Root-Parallel Monte Carlo Tree Search with Continuous Actions


9. An End-to-end Planning Framework with Agentic LLMs and PDDL



11. Architectures for Building Agentic AI


12. Visual Categorization Across Minds and Models: Cognitive Analysis of Human Labeling and Neuro-Symbolic Integration


13. SDialog: A Python Toolkit for End-to-End Agent Building, User Simulation, Dialog Generation, and Evaluation


14. A Categorical Analysis of Large Language Models and Why LLMs Circumvent the Symbol Grounding Problem


15. AI TIPS 2.0: A Comprehensive Framework for Operationalizing AI Governance


16. Calibrated Trust in Dealing with LLM Hallucinations: A Qualitative Study


17. LISN: Language-Instructed Social Navigation with VLM-based Controller Modulating


18. FALCON: Few-step Accurate Likelihoods for Continuous Flows


19. Supervised learning pays attention


20. Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach


21. STACHE: Local Black-Box Explanations for Reinforcement Learning Policies


22. Visual Heading Prediction for Autonomous Aerial Vehicles


23. Provably Learning from Modern Language Models via Low Logit Rank


24. FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning


25. MedForget: Hierarchy-Aware Multimodal Unlearning Testbed for Medical AI



27. Composing Concepts from Images and Videos via Concept-prompt Binding


28. CHEM: Estimating and Understanding Hallucinations in Deep Learning for Image Processing


29. PathCo-LatticE: Pathology-Constrained Lattice-Of Experts Framework for Fully-supervised Few-Shot Cardiac MRI Segmentation


30. Quantifying Uncertainty in Machine Learning-Based Pervasive Systems: Application to Human Activity Recognition


31. Circuits, Features, and Heuristics in Molecular Transformers


32. Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs


33. Ethics Readiness of Artificial Intelligence: A Practical Evaluation Method


34. Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies


35. The Ky Fan Norms and Beyond: Dual Norms and Combinations for Matrix Optimization


36. Drawback of Enforcing Equivariance and its Compensation via the Lens of Expressive Power


37. Can LLMs Evaluate What They Cannot Annotate? Revisiting LLM Reliability in Hate Speech Detection


38. Rethinking Chain-of-Thought Reasoning for Videos


39. ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation


40. Stanford Sleep Bench: Evaluating Polysomnography Pre-training Methods for Sleep Foundation Models


41. Graph-Based Bayesian Optimization for Quantum Circuit Architecture Search with Uncertainty Calibrated Surrogates


42. Hands-on Evaluation of Visual Transformers for Object Recognition and Detection


43. Auto-BenchmarkCard: Automated Synthesis of Benchmark Documentation


44. Lazy Diffusion: Mitigating spectral collapse in generative diffusion-based stable autoregressive emulation of turbulent flows


45. The Gender Code: Gendering the Global Governance of Artificial Intelligence


46. System Report for CCL25-Eval Task 10: Prompt-Driven Large Language Model Merge for Fine-Grained Chinese Hate Speech Detection


47. SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs


48. NeuroSketch: An Effective Framework for Neural Decoding via Systematic Architectural Optimization


49. Representation Invariance and Allocation: When Subgroup Balance Matters


50. RouteRAG: Efficient Retrieval-Augmented Generation from Text and Graph via Reinforcement Learning


51. Advancing LLM-Based Security Automation with Customized Group Relative Policy Optimization for Zero-Touch Networks


52. Color encoding in Latent Space of Stable Diffusion Models


53. Temporal-Spatial Tubelet Embedding for Cloud-Robust MSI Reconstruction using MSI-SAR Fusion: A Multi-Head Self-Attention Video Vision Transformer Approach


54. Privacy-Preserving Computer Vision for Industry: Three Case Studies in Human-Centric Manufacturing


55. Cytoplasmic Strings Analysis in Human Embryo Time-Lapse Videos using Deep Learning Framework


56. Advancing Research via Human-AI Interactive Theorem Proving


57. Representation Calibration and Uncertainty Guidance for Class-Incremental Learning based on Vision Language Model


58. CourtPressGER: A German Court Decision to Press Release Summarization Dataset


59. ODMA: On-Demand Memory Allocation Framework for LLM Serving on LPDDR-Class Accelerators


60. H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos


61. Towards Resilient Transportation: A Conditional Transformer for Accident-Informed Traffic Forecasting


62. GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection


63. CONCUR: A Framework for Continual Constrained and Unconstrained Routing


64. BugSweeper: Function-Level Detection of Smart Contract Vulnerabilities Using Graph Neural Networks


65. Log NeRF: Comparing Spaces for Learning Radiance Fields


66. Branching Strategies Based on Subgraph GNNs: A Study on Theoretical Promise versus Practical Reality


67. Efficiency-Aware Computational Intelligence for Resource-Constrained Manufacturing Toward Edge-Ready Deployment


68. Simultaneous Genetic Evolution of Neural Networks for Optimal SFC Embedding


69. Functional Percolation: A Perspective on Criticality of Form and Function


70. Hetero-SplitEE: Split Learning of Neural Networks with Early Exits for Heterogeneous IoT Devices


71. Identifying Bias in Machine-generated Text Detection


72. FBA$^2$D: Frequency-based Black-box Attack for AI-generated Image Detection


73. GLACIA: Instance-Aware Positional Reasoning for Glacial Lake Segmentation via Multimodal Large Language Model


74. A Clinically Interpretable Deep CNN Framework for Early Chronic Kidney Disease Prediction Using Grad-CAM-Based Explainable AI


75. CORE: A Conceptual Reasoning Layer for Large Language Models


76. Tensor-Compressed and Fully-Quantized Training of Neural PDE Solvers


77. LLMs for Analog Circuit Design Continuum (ACDC)


78. Towards Optimal Valve Prescription for Transcatheter Aortic Valve Replacement (TAVR) Surgery: A Machine Learning Approach


79. Understanding Mental States in Active and Autonomous Driving with EEG


80. WOLF: Werewolf-based Observations for LLM Deception and Falsehoods


81. Learning Patient-Specific Disease Dynamics with Latent Flow Matching for Longitudinal Imaging Generation


82. Prompt-Based Continual Compositional Zero-Shot Learning


83. AI-Driven Expansion and Application of the Alexandria Database


84. WonderZoom: Multi-Scale 3D World Generation


85. MindShift: Analyzing Language Models’ Reactions to Psychological Prompts


86. Detecting Hallucinations in Graph Retrieval-Augmented Generation via Attention Patterns and Semantic Alignment


87. Integrated Pipeline for Coronary Angiography With Automated Lesion Profiling, Virtual Stenting, and 100-Vessel FFR Validation


88. Knowledge-Guided Large Language Model for Automatic Pediatric Dental Record Understanding and Safe Antibiotic Recommendation


89. Semantic Trajectory Generation for Goal-Oriented Spacecraft Rendezvous


90. Evolving Excellence: Automated Optimization of LLM-based Agents


91. Masked Generative Policy for Robotic Control


92. Mental Models of Autonomy and Sentience Shape Reactions to AI


93. Beyond the Hype: Comparing Lightweight and Deep Learning Models for Air Quality Forecasting


94. KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification


95. ORCA: Open-ended Response Correctness Assessment for Audio Question Answering


96. ShelfAware: Real-Time Visual-Inertial Semantic Localization in Quasi-Static Environments with Low-Cost Sensors


97. Monitoring Deployed AI Systems in Health Care


98. Towards Lossless Ultimate Vision Token Compression for VLMs


99. Llama-based source code vulnerability detection: Prompt engineering vs Fine tuning


100. Digital Modeling of Spatial Pathway Activity from Histology Reveals Tumor Microenvironment Heterogeneity


101. A Physics-Constrained, Design-Driven Methodology for Defect Dataset Generation in Optical Lithography


102. DermETAS-SNA LLM: A Dermatology Focused Evolutionary Transformer Architecture Search with StackNet Augmented LLM Assistant


103. Demo: Generative AI helps Radiotherapy Planning with User Preference


104. Enhanced Chest Disease Classification Using an Improved CheXNet Framework with EfficientNetV2-M and Optimization-Driven Learning


105. 3DID: Direct 3D Inverse Design for Aerodynamics with Physics-Aware Optimization


106. Explainable Fundus Image Curation and Lesion Detection in Diabetic Retinopathy


107. RAG-HAR: Retrieval Augmented Generation-based Human Activity Recognition


108. HSCP: A Two-Stage Spectral Clustering Framework for Resource-Constrained UAV Identification


109. Consist-Retinex: One-Step Noise-Emphasized Consistency Training Accelerates High-Quality Retinex Enhancement


110. Mitigating Bias with Words: Inducing Demographic Ambiguity in Face Recognition Templates by Text Encoding


111. Training Multi-Image Vision Agents via End2End Reinforcement Learning


112. What Happens When: Learning Temporal Orders of Events in Videos


113. Institutional AI Sovereignty Through Gateway Architecture: Implementation Report from Fontys ICT


114. Peek-a-Boo Reasoning: Contrastive Region Masking in MLLMs


115. Enhancing Automatic Speech Recognition Through Integrated Noise Detection Architecture


116. Learning Robust Representations for Malicious Content Detection via Contrastive Sampling and Uncertainty Estimation


117. CluCERT: Certifying LLM Robustness via Clustering-Guided Denoising Smoothing


118. Financial Instruction Following Evaluation (FIFE)


119. Resolving Conflicts in Lifelong Learning via Aligning Updates in Subspaces


120. EEG-Bench: A Benchmark for EEG Foundation Models in Clinical Applications


121. LUMOS: Large User MOdels for User Behavior Prediction


122. LLM4XCE: Large Language Models for Extremely Large-Scale Massive MIMO Channel Estimation


123. An Electrocardiogram Multi-task Benchmark with Comprehensive Evaluations and Insightful Findings


124. SimClinician: A Multimodal Simulation Testbed for Reliable Psychologist AI Collaboration in Mental Health Diagnosis


125. Learning When to Ask: Simulation-Trained Humanoids for Mental-Health Diagnosis


126. AI Co-Artist: A LLM-Powered Framework for Interactive GLSL Shader Animation Evolution


127. The Linguistic Architecture of Reflective Thought: Evaluation of a Large Language Model as a Tool to Isolate the Formal Structure of Mentalization


128. Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models


129. Beyond Technical Debt: How AI-Assisted Development Creates Comprehension Debt in Resource-Constrained Indie Teams


130. Assessing the Human-Likeness of LLM-Driven Digital Twins in Simulating Health Care System Trust


131. When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being


132. A Principle-based Framework for the Development and Evaluation of Large Language Models for Health and Wellness


133. Motion2Meaning: A Clinician-Centered Framework for Contestable LLM in Parkinson’s Disease Gait Interpretation


134. Agentic AI as Undercover Teammates: Argumentative Knowledge Construction in Hybrid Human-AI Collaborative Learning


135. Prediction-aware and Reinforcement Learning based Altruistic Cooperative Driving


136. Altruistic Maneuver Planning for Cooperative Autonomous Vehicles Using Multi-agent Advantage Actor-Critic