전체 AI 논문 - 2025-11-18

1. Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping


2. Experience-Guided Adaptation of Inference-Time Reasoning Strategies


3. CURENet: Combining Unified Representations for Efficient Chronic Disease Prediction


4. Robust and Efficient Communication in Multi-Agent Reinforcement Learning


5. MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism


6. KarmaTS: A Universal Simulation Platform for Multivariate Time Series with Functional Causal Dynamics


7. RLSLM: A Hybrid Reinforcement Learning Framework Aligning Rule-Based Social Locomotion Model with Human Social Norms


8. EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment


9. Can You Tell the Difference? Contrastive Explanations for ABox Entailments


10. A Workflow for Full Traceability of AI Decisions


11. AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery


12. UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios


13. STaR: Towards Cognitive Table Reasoning via Slow-Thinking Large Language Models


14. Multi-agent Undercover Gaming: Hallucination Removal via Counterfactual Test for Multimodal Reasoning


15. GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models


16. Satisficing and Optimal Generalised Planning via Goal Regression (Extended Version)


17. ARCTraj: A Dataset and Benchmark of Human Reasoning Trajectories for Abstract Problem Solving


18. Autonomous Vehicle Path Planning by Searching With Differentiable Simulation


19. Key Decision-Makers in Multi-Agent Debates: Who Holds the Power?


20. Faster Symmetry Breaking Constraints for Abstract Structures


21. AI Agent-Driven Framework for Automated Product Knowledge Graph Construction in E-Commerce


22. Requirements for Aligned, Dynamic Resolution of Conflicts in Operational Constraints



24. LLM enhanced graph inference for long-term disease progression modelling


25. Enhancing Demand-Oriented Regionalization with Agentic AI and Local Heterogeneous Data for Adaptation Planning


26. Advanced Tool for Traffic Crash Analysis: An AI-Driven Multi-Agent Approach to Pre-Crash Reconstruction


27. HyperComplEx: Adaptive Multi-Space Knowledge Graph Embeddings


28. HARNESS: Human-Agent Risk Navigation and Event Safety System for Proactive Hazard Forecasting in High-Risk DOE Environments


29. From Efficiency to Adaptivity: A Deeper Look at Adaptive Reasoning in Large Language Models


30. Potential Outcome Rankings for Counterfactual Decision Making


31. Structure-Aware Encodings of Argumentation Properties for Clique-width


32. Picking a Representative Set of Solutions in Multiobjective Optimization: Axioms, Algorithms, and Experiments


33. Co-EPG: A Framework for Co-Evolution of Planning and Grounding in Autonomous GUI Agents


34. The Second Law of Intelligence: Controlling Ethical Entropy in Autonomous Systems


35. Private Frequency Estimation Via Residue Number Systems


36. A Unified Convergence Analysis for Semi-Decentralized Learning: Sampled-to-Sampled vs. Sampled-to-All Communication


37. Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy


38. Volumetric Ergodic Control


39. PAS : Prelim Attention Score for Detecting Object Hallucinations in Large Vision–Language Models


40. Intrinsic Dimension Estimation for Radio Galaxy Zoo using Diffusion Models


41. ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation


42. Inferring response times of perceptual decisions with Poisson variational autoencoders


43. Context-aware Adaptive Visualizations for Critical Decision Making


44. Benchmarking Visual LLMs Resilience to Unanswerable Questions on Visually Rich Documents


45. Epistemic Error Decomposition for Multi-step Time Series Forecasting: Rethinking Bias-Variance in Recursive and Direct Strategies


46. Retrofit: Continual Learning with Bounded Forgetting for Security Applications


47. The Persistence of Cultural Memory: Investigating Multimodal Iconicity in Diffusion Models


48. Variational Quantum Algorithms for Particle Track Reconstruction


49. Privacy Challenges and Solutions in Retrieval-Augmented Generation-Enhanced LLMs for Healthcare Chatbots: A Review of Applications, Risks, and Future Directions


50. M-DAIGT: A Shared Task on Multi-Domain Detection of AI-Generated Text


51. NOVA: An Agentic Framework for Automated Histopathology Analysis and Discovery


52. LAET: A Layer-wise Adaptive Ensemble Tuning Framework for Pretrained Language Models


53. Large-scale modality-invariant foundation models for brain MRI analysis: Application to lesion segmentation


54. iMAD: Intelligent Multi-Agent Debate for Efficient and Accurate LLM Inference


55. MOON Embedding: Multimodal Representation Learning for E-commerce Search Advertising


56. AUVIC: Adversarial Unlearning of Visual Concepts for Multi-modal Large Language Models


57. Experiences from Benchmarking Vision-Language-Action Models for Robotic Manipulation


58. Building the Web for Agents: A Declarative Framework for Agent-Web Interaction


59. D-GAP: Improving Out-of-Domain Robustness via Dataset-Agnostic and Gradient-Guided Augmentation in Amplitude and Pixel Spaces


60. SQuaD: The Software Quality Dataset


61. KGQuest: Template-Driven QA Generation from Knowledge Graphs with LLM-Based Refinement


62. Toward Gaze Target Detection of Young Autistic Children


63. HealSplit: Towards Self-Healing through Adversarial Distillation in Split Federated Learning


64. Virtual Width Networks


65. 3D Gaussian and Diffusion-Based Gaze Redirection


66. Enhancing Group Recommendation using Soft Impute Singular Value Decomposition


67. Refine and Align: Confidence Calibration through Multi-Agent Interaction in VQA


68. OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation


69. Specification, Application, and Operationalization of a Metamodel of Fairness


70. Utilizing LLMs for Industrial Process Automation: A Case Study on Modifying RAPID Programs


71. AV-Dialog: Spoken Dialogue Models with Audio-Visual Input


72. VIDEOP2R: Video Understanding from Perception to Reasoning


73. Scalable Population Training for Zero-Shot Coordination


74. S2D-ALIGN: Shallow-to-Deep Auxiliary Learning for Anatomically-Grounded Radiology Report Generation


75. From Retinal Pixels to Patients: Evolution of Deep Learning Research in Diabetic Retinopathy Screening


76. LiteAttention: A Temporal Sparse Attention for Diffusion Transformers


77. PINGS-X: Physics-Informed Normalized Gaussian Splatting with Axes Alignment for Efficient Super-Resolution of 4D Flow MRI


78. Enhancing Graph Representations with Neighborhood-Contextualized Message-Passing


79. Correcting Mean Bias in Text Embeddings: A Refined Renormalization with Training-Free Improvements on MMTEB


80. SemanticNN: Compressive and Error-Resilient Semantic Offloading for Extremely Weak Devices


81. CrossMed: A Multimodal Cross-Task Benchmark for Compositional Generalization in Medical Imaging


82. Algorithms Trained on Normal Chest X-rays Can Predict Health Insurance Types


83. AirCopBench: A Benchmark for Multi-drone Collaborative Embodied Perception and Reasoning


84. Data Poisoning Vulnerabilities Across Healthcare AI Architectures: A Security Threat Analysis


85. Automata-Based Steering of Large Language Models for Diverse Structured Generation


86. VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models


87. MSMT-FN: Multi-segment Multi-task Fusion Network for Marketing Audio Classification


88. DialogGraph-LLM: Graph-Informed LLMs for End-to-End Audio Dialogue Intent Recognition


89. When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets


90. DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains


91. Binary Verification for Zero-Shot Vision


92. PAS: A Training-Free Stabilizer for Temporal Encoding in Video LLMs


93. How Data Quality Affects Machine Learning Models for Credit Risk Assessment


94. Text-guided Weakly Supervised Framework for Dynamic Facial Expression Recognition



96. GraphToxin: Reconstructing Full Unlearned Graphs from Graph Unlearning


97. Synthetic Voices, Real Threats: Evaluating Large Text-to-Speech Models in Generating Harmful Audio


98. Evaluating Large Language Models on Rare Disease Diagnosis: A Case Study using House M.D


99. Automated Analysis of Learning Outcomes and Exam Questions Based on Bloom’s Taxonomy


100. Expert-Guided Prompting and Retrieval-Augmented Generation for Emergency Medical Service Question Answering


101. CLIPPan: Adapting CLIP as A Supervisor for Unsupervised Pansharpening


102. DINOv3 as a Frozen Encoder for CRPS-Oriented Probabilistic Rainfall Nowcasting


103. MCN-CL: Multimodal Cross-Attention Network and Contrastive Learning for Multimodal Emotion Recognition


104. A Multifaceted Analysis of Negative Bias in Large Language Models through the Lens of Parametric Knowledge


105. Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations


106. Short-Window Sliding Learning for Real-Time Violence Detection via LLM-based Auto-Labeling


107. Generative Artificial Intelligence Adoption Among Bangladeshi Journalists: Exploring Journalists’ Awareness, Acceptance, Usage, and Organizational Stance on Generative AI


108. Accuracy-Preserving CNN Pruning Method under Limited Data Availability


109. HPCAgentTester: A Multi-Agent LLM Approach for Enhanced HPC Unit Test Generation


110. Adaptive Digital Twin of Sheet Metal Forming via Proper Orthogonal Decomposition-Based Koopman Operator with Model Predictive Control


111. Leveraging Parameter Space Symmetries for Reasoning Skill Transfer in LLMs


112. STAMP: Spatial-Temporal Adapter with Multi-Head Pooling


113. Reinforcing Stereotypes of Anger: Emotion AI on African American Vernacular English


114. Optimal Welfare in Noncooperative Network Formation under Attack


115. Behaviour Policy Optimization: Provably Lower Variance Return Estimates for Off-Policy Reinforcement Learning


116. FlowPath: Learning Data-Driven Manifolds with Invertible Flows for Robust Irregularly-sampled Time Series Classification


117. The Map of Misbelief: Tracing Intrinsic and Extrinsic Hallucinations Through Attention Patterns


118. Discounted Cuts: A Stackelberg Approach to Network Disruption


119. Fast Neural Tangent Kernel Alignment, Norm and Effective Rank via Trace Estimation


120. TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English


121. Surrogate-Based Differentiable Pipeline for Shape Optimization


122. Understanding the Nature of Depth-1 Equivariant Quantum Circuit


123. PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization


124. BadThink: Triggered Overthinking Attacks on Chain-of-Thought Reasoning in Large Language Models


125. Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging


126. Towards Uncertainty Quantification in Generative Model Learning


127. Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning


128. $π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling


129. Do AI Voices Learn Social Nuances? A Case of Politeness and Speech Rate


130. Evaluating from Benign to Dynamic Adversarial: A Squid Game for Large Language Models


131. Saying the Unsaid: Revealing the Hidden Language of Multimodal Systems Through Telephone Games


132. Equilibrium Dynamics and Mitigation of Gender Bias in Synthetically Generated Data


133. Who Gets the Reward, Who Gets the Blame? Evaluation-Aligned Training Signals for Multi-LLM Agents


134. Pre-Attention Expert Prediction and Prefetching for Mixture-of-Experts Large Language Models


135. Learn to Select: Exploring Label Distribution Divergence for In-Context Demonstration Selection in Text Classification


136. Continual Learning of Domain Knowledge from Human Feedback in Text-to-SQL


137. Towards Fine-Grained Code-Switch Speech Translation with Semantic Space Alignment


138. Evaluating LLM Understanding via Structured Tabular Decision Simulations


139. Guarding the Meaning: Self-Supervised Training for Semantic Robustness in Guard Models


140. Evaluating Modern Large Language Models on Low-Resource and Morphologically Rich Languages:A Cross-Lingual Benchmark Across Cantonese, Japanese, and Turkish


141. Test-Time Steering for Lossless Text Compression via Weighted Product of Experts


142. Evaluating Open-Weight Large Language Models for Structured Data Extraction from Narrative Medical Reports Across Multiple Use Cases and Languages


143. Preference Orchestrator: Prompt-Aware Multi-Objective Alignment for Large Language Models


144. Spectral Neuro-Symbolic Reasoning II: Semantic Node Merging, Entailment Filtering, and Knowledge Graph Alignment


145. Empirical Characterization of Temporal Constraint Processing in LLMs


146. Hybrid Quantum Transformer for Language Generation


147. Cognitively-Inspired Episodic Memory Architectures for Accurate and Efficient Character AI


148. Data Analysis and Performance Evaluation of Simulation Deduction Based on LLMs


149. Unsupervised Cycle Detection in Agentic Applications


150. Assessing the Capabilities of LLMs in Humor:A Multi-dimensional Analysis of Oogiri Generation and Evaluation