All AI Papers - 2025-11-28

1. Agentic Learner with Grow-and-Refine Multimodal Semantic Memory


2. Bridging the Unavoidable A Priori: A Framework for Comparative Causal Modeling


3. On the Limits of Innate Planning in Large Language Models


4. From Prediction to Foresight: The Role of AI in Designing Responsible Futures


5. Self-Transparency Failures in Expert-Persona LLMs: A Large-Scale Behavioral Audit


6. Pessimistic Verification for Open Ended Math Questions


7. SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition


8. MADRA: Multi-Agent Debate for Risk-Aware Embodied Planning


9. EWE: An Agentic Framework for Extreme Weather Analysis


10. Conversational no-code and multi-agentic disease module identification and drug repurposing prediction with ChatDRex


11. New Hybrid Heuristics for Pseudo-Boolean Propagation


12. Prune4Web: DOM Tree Pruning Programming for Web Agent


13. Causality Without Causal Models


14. OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection



16. ICPO: Intrinsic Confidence-Driven Group Relative Preference Optimization for Efficient Reinforcement Learning


17. Improving Procedural Skill Explanations via Constrained Generation: A Symbolic-LLM Hybrid Architecture


18. ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction


19. Guaranteed Optimal Compositional Explanations for Neurons


20. Representation Interventions Enable Lifelong Unstructured Knowledge Control


21. OpenApps: Simulating Environment Variations to Measure UI-Agent Reliability


22. Learning Multi-Access Point Coordination in Agentic AI Wi-Fi with Large Language Models


23. Cross Domain Evaluation of Multimodal Chain-of-Thought Reasoning of different datasets into the Amazon CoT Framework


24. Paraconsistent-Lib: an intuitive PAL2v algorithm Python Library


25. A Brief History of Digital Twin Technology


26. Reasoning With a Star: A Heliophysics Dataset and Benchmark for Agentic Scientific Reasoning


27. $A^2$Flow: Automating Agentic Workflow Generation via Self-Adaptive Abstraction Operators


28. AssurAI: Experience with Constructing Korean Socio-cultural Datasets to Discover Potential Risks of Generative AI


29. Minimizing Hyperbolic Embedding Distortion with LLM-Guided Hierarchy Restructuring


30. Revisiting Generalization Across Difficulty Levels: It’s Not So Easy


31. ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration


32. G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning


33. Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework


34. Through the telecom lens: Are all training samples important?


35. Escaping the Verifier: Learning to Reason via Demonstrations


36. Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models


37. Continual Error Correction on Low-Resource Devices


38. Mechanisms of Non-Monotonic Scaling in Vision Transformers


39. Qwen3-VL Technical Report


40. Scale-Agnostic Kolmogorov-Arnold Geometry in Neural Networks


41. On the Origin of Algorithmic Progress in AI


42. Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining


43. Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving


44. HarmonicAttack: An Adaptive Cross-Domain Audio Watermark Removal


45. Multimodal Robust Prompt Distillation for 3D Point Cloud Models


46. BAMAS: Structuring Budget-Aware Multi-Agent Systems


47. VacuumVLA: Boosting VLA Capabilities via a Unified Suction and Gripping Tool for Complex Robotic Manipulation


48. Predictive Safety Shield for Dyna-Q Reinforcement Learning


49. Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation


50. Mechanistic Interpretability for Transformer-based Time Series Classification


51. Tool-RoCo: An Agent-as-Tool Self-organization Large Language Model Benchmark in Multi-robot Cooperation


52. Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning


53. Frequency-Aware Token Reduction for Efficient Vision Transformer


54. Going with the Speed of Sound: Pushing Neural Surrogates into Highly-turbulent Transonic Regimes


55. Hierarchical Ranking Neural Network for Long Document Readability Assessment


56. Constructing and Benchmarking: a Labeled Email Dataset for Text-Based Phishing and Spam Detection Framework


57. EvRainDrop: HyperGraph-guided Completion for Effective Frame and Event Stream Aggregation


58. From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings


59. SAM Guided Semantic and Motion Changed Region Mining for Remote Sensing Change Captioning


60. Automated Dynamic AI Inference Scaling on HPC-Infrastructure: Integrating Kubernetes, Slurm and vLLM


61. Subjective Depth and Timescale Transformers: Learning Where and When to Compute


62. Training Introspective Behavior: Fine-Tuning Induces Reliable Internal State Detection in a 7B Model


63. Do Reasoning Vision-Language Models Inversely Scale in Test-Time Compute? A Distractor-centric Empirical Analysis


64. Monet: Reasoning in Latent Visual Space Beyond Images and Language


65. RIA: A Ranking-Infused Approach for Optimized listwise CTR Prediction


66. FITRep: Attention-Guided Item Representation via MLLMs


67. Anomaly Detection with Adaptive and Aggressive Rejection for Contaminated Training Data


68. The Directed Prediction Change - Efficient and Trustworthy Fidelity Assessment for Local Feature Attribution Methods


69. Hybrid-AIRL: Enhancing Inverse Reinforcement Learning with Supervised Expert Guidance


70. Generating Separated Singing Vocals Using a Diffusion Model Conditioned on Music Mixtures


71. SurgMLLMBench: A Multimodal Large Language Model Benchmark Dataset for Surgical Scene Understanding


72. Hybrid SIFT-SNN for Efficient Anomaly Detection of Traffic Flow-Control Infrastructure


73. The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment


74. SONAR: Spectral-Contrastive Audio Residuals for Generalizable Deepfake Detection


75. TALES: A Taxonomy and Analysis of Cultural Representations in LLM-generated Stories


76. Improvement of Collision Avoidance in Cut-In Maneuvers Using Time-to-Collision Metrics


77. Self-Guided Defense: Adaptive Safety Alignment for Reasoning Models via Synthesized Guidelines


78. BotaCLIP: Contrastive Learning for Botany-Aware Representation of Earth Observation Data


79. When Robots Obey the Patch: Universal Transferable Patch Attacks on Vision-Language-Action Models


80. Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation


81. Privacy in Federated Learning with Spiking Neural Networks


82. CAHS-Attack: CLIP-Aware Heuristic Search Attack Method for Stable Diffusion


83. LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs


84. Maglev-Pentabot: Magnetic Levitation System for Non-Contact Manipulation using Deep Reinforcement Learning


85. Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning


86. SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation


87. Which Layer Causes Distribution Deviation? Entropy-Guided Adaptive Pruning for Diffusion and Flow Models


88. Beyond Patch Aggregation: 3-Pass Pyramid Indexing for Vision-Enhanced Document Retrieval


89. Learning Cell-Aware Hierarchical Multi-Modal Representations for Robust Molecular Modeling


90. Deformation-aware Temporal Generation for Early Prediction of Alzheimers Disease


91. Dynamic Stratified Contrastive Learning with Upstream Augmentation for MILP Branching


92. From Bits to Rounds: Parallel Decoding with Exploration for Diffusion Language Models


93. Pygmalion Effect in Vision: Image-to-Clay Translation for Reflective Geometry Reconstruction


94. MNM: Multi-level Neuroimaging Meta-analysis with Hyperbolic Brain-Text Representations


95. MLPMoE: Zero-Shot Architectural Metamorphosis of Dense LLM MLPs into Static Mixture-of-Experts


96. Enhancing Burmese News Classification with Kolmogorov-Arnold Network Head Fine-tuning


97. Data-Driven Assessment of Concrete Slab Integrity via Impact-Echo Signals and Neural Networks


98. Aligning LLMs with Biomedical Knowledge using Balanced Fine-Tuning


99. Context-Aware Pragmatic Metacognitive Prompting for Sarcasm Detection


100. Breaking the Safety-Capability Tradeoff: Reinforcement Learning with Verifiable Rewards Maintains Safety Guardrails in LLMs


101. FedAPA: Federated Learning with Adaptive Prototype Aggregation Toward Heterogeneous Wi-Fi CSI-based Crowd Counting


102. Semantic Anchors in In-Context Learning: Why Small LLMs Cannot Flip Their Labels


103. Structure-Aware Prototype Guided Trusted Multi-View Classification


104. Probabilistic Wildfire Spread Prediction Using an Autoregressive Conditional Generative Adversarial Network


105. Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning


106. FANoise: Singular Value-Adaptive Noise Modulation for Robust Multimodal Representation Learning


107. GuardTrace-VL: Detecting Unsafe Multimodel Reasoning via Iterative Safety Supervision


108. Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning


109. Even with AI, Bijection Discovery is Still Hard: The Opportunities and Challenges of OpenEvolve for Novel Bijection Construction


110. AI4X Roadmap: Artificial Intelligence for the advancement of scientific pursuit and its future directions


111. Towards Audio Token Compression in Large Audio Language Models


112. BUSTR: Breast Ultrasound Text Reporting with a Descriptor-Aware Vision-Language Model


113. SpaceX: Exploring metrics with the SPACE model for developer productivity


114. Resilient Charging Infrastructure via Decentralized Coordination of Electric Vehicles at Scale


115. Open Vocabulary Compositional Explanations for Neuron Alignment


116. Exploring Time-Step Size in Reinforcement Learning for Sepsis Treatment


117. Evolved SampleWeights for Bias Mitigation: Effectiveness Depends on Optimization Objectives


118. Dynamic Test-Time Compute Scaling in Control Policy: Difficulty-Aware Stochastic Interpolant Policy


119. A Taxonomy of Pix Fraud in Brazil: Attack Methodologies, AI-Driven Amplification, and Defensive Strategies


120. Test-Time Alignment of Text-to-Image Diffusion Models via Null-Text Embedding Optimisation


121. Selecting Belief-State Approximations in Simulators with Latent States


122. Computing Evolutionarily Stable Strategies in Multiplayer Games


123. Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory


124. Unsupervised Memorability Modeling from Tip-of-the-Tongue Retrieval Queries


125. MODEST: Multi-Optics Depth-of-Field Stereo Dataset


126. Length-MAX Tokenizer for Language Models


127. NOIR 2.0: Neural Signal Operated Intelligent Robots for Everyday Activities


128. Pre-train to Gain: Robust Learning Without Clean Labels


129. A Review of Pseudospectral Optimal Control: From Theory to Flight


130. Primal: A Unified Deterministic Framework for Quasi-Orthogonal Hashing and Manifold Learning


131. Structured Prompting Enables More Robust, Holistic Evaluation of Language Models


132. RefTr: Recurrent Refinement of Confluent Trajectories for 3D Vascular Tree Centerline Graphs


133. Training-Free Diffusion Priors for Text-to-Image Generation via Optimization-based Visual Inversion


134. SPHINX: A Synthetic Environment for Visual Perception and Reasoning


135. Conformal Safety Monitoring for Flight Testing: A Case Study in Data-Driven Safety Learning


136. Memories Retrieved from Many Paths: A Multi-Prefix Framework for Robust Detection of Training Data Leakage in Large Language Models


137. Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model


138. Revisiting KRISP: A Lightweight Reproduction and Analysis of Knowledge-Enhanced Vision-Language Models


139. Adversarial Multi-Task Learning for Liver Tumor Segmentation, Dynamic Enhancement Regression, and Classification


140. CANVAS: A Benchmark for Vision-Language Models on Tool-Based User Interface Design



142. InvisibleBench: A Deployment Gate for Caregiving Relationship AI


143. Data-Driven Methods and AI in Engineering Design: A Systematic Literature Review Focusing on Challenges and Opportunities


144. Spatio-Temporal Trajectory Foundation Model - Recent Advances and Future Directions


145. Learning from Risk: LLM-Guided Generation of Safety-Critical Scenarios with Prior Knowledge


146. Gradient Descent Algorithm Survey


147. DinoLizer: Learning from the Best for Generative Inpainting Localization


148. Foundry: Distilling 3D Foundation Models for the Edge


149. DeeAD: Dynamic Early Exit of Vision-Language Action for Efficient Autonomous Driving


150. ST-PPO: Stabilized Off-Policy Proximal Policy Optimization for Multi-Turn Agents Training


151. Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation


152. Active Slice Discovery in Large Language Models


153. Are Neuro-Inspired Multi-Modal Vision-Language Models Resilient to Membership Inference Privacy Leakage?


154. DUALGUAGE: Automated Joint Security-Functionality Benchmarking for Secure Code Generation


155. Solving Diffusion Inverse Problems with Restart Posterior Sampling


156. PropensityBench: Evaluating Latent Safety Risks in Large Language Models via an Agentic Approach


157. Post-Pruning Accuracy Recovery via Data-Free Knowledge Distillation


158. In Defense of the Turing Test and its Legacy


159. On the Role of Hidden States of Modern Hopfield Network in Transformer


160. Musical Score Understanding Benchmark: Evaluating Large Language Models’ Comprehension of Complete Musical Scores


161. Prototype-Guided Non-Exemplar Continual Learning for Cross-subject EEG Decoding


162. Morality in AI. A plea to embed morality in LLM architectures and frameworks


163. Hybrid coupling with operator inference and the overlapping Schwarz alternating method


164. Cognitive bias in LLM reasoning compromises interpretation of clinical oncology notes


165. MindSET: Advancing Mental Health Benchmarking through Large-Scale Social Media Data



167. MTTR-A: Measuring Cognitive Recovery Latency in Multi-Agent Systems


168. Transforming Higher Education with AI-Powered Video Lectures



170. Context-Aware Visual Prompting: Automating Geospatial Web Dashboards with Large Language Models and Agent Self-Validation for Decision Support


171. CodeVaani: A Multilingual, Voice-Based Code Learning Assistant


172. Domain-Grounded Evaluation of LLMs in International Student Knowledge


173. When LLMs Can’t Help: Real-World Evaluation of LLMs in Nutrition