전체 AI 논문 - 2025-09-26

1. SAGE: A Realistic Benchmark for Semantic Understanding


2. VC-Agent: An Interactive Agent for Customized Video Dataset Collection


3. Grounding AI Explanations in Experience: A Reflective Cognitive Architecture for Clinical Decision Support


4. What Do LLM Agents Do When Left Alone? Evidence of Spontaneous Meta-Cognitive Patterns


5. A Fano-Style Accuracy Upper Bound for LLM Single-Pass Reasoning in Multi-Hop QA


6. Distributed Specialization: Rare-Token Neurons in Large Language Models


7. Embodied Representation Alignment with Mirror Neurons


8. ToMPO: Training LLM Strategic Decision Making from a Multi-Agent Perspective


9. RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs


10. Expanding Reasoning Potential in Foundation Model by Learning Diverse Chains of Thought Patterns


11. TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them


12. Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution


13. Disagreements in Reasoning: How a Model’s Thinking Process Dictates Persuasion in Multi-Agent Systems


14. Combinatorial Creativity: A New Frontier in Generalization Abilities


15. CLAUSE: Agentic Neuro-Symbolic Knowledge Graph Reasoning via Dynamic Learnable Context Engineering


16. Who Gets Cited Most? Benchmarking Long-Context Language Models on Scientific Articles


17. CORE: Full-Path Evaluation of LLM Agents Beyond Final State



19. Beyond Stars: Bridging the Gap Between Ratings and Review Sentiment with LLM


20. GALAX: Graph-Augmented Language Model for Explainable Reinforcement-Guided Subgraph Reasoning in Precision Medicine


21. DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning


22. LogReasoner: Empowering LLMs with Expert-like Coarse-to-Fine Reasoning for Log Analysis Tasks


23. Meta-Memory: Retrieving and Integrating Semantic-Spatial Memories for Robot Spatial Reasoning


24. Parallel Thinking, Sequential Answering: Bridging NAR and AR for Efficient Reasoning


25. Fairy: Interactive Mobile Assistant to Real-world Tasks via LMM-based Multi-agent


26. An Automated Retrieval-Augmented Generation LLaMA-4 109B-based System for Evaluating Radiotherapy Treatment Plans


27. Accelerate Creation of Product Claims Using Generative AI


28. Adaptive Cybersecurity Architecture for Digital Product Ecosystems Using Agentic AI


29. SAMULE: Self-Learning Agents Enhanced by Multi-level Reflection


30. A Compound Classification System Based on Fuzzy Relations Applied to the Noise-Tolerant Control of a Bionic Hand via EMG Signal Recognition


31. Adaptive Approach to Enhance Machine Learning Scheduling Algorithms During Runtime Using Reinforcement Learning in Metascheduling Applications


32. Reconstruction-Based Adaptive Scheduling Using AI Inferences in Safety-Critical Systems


33. InsightGUIDE: An Opinionated AI Assistant for Guided Critical Reading of Scientific Literature


34. Philosophy-informed Machine Learning


35. LATTS: Locally Adaptive Test-Time Scaling


36. An Approach to Checking Correctness for Agentic Systems


37. RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards


38. SD3.5-Flash: Distribution-Guided Distillation of Generative Flows


39. No Prior, No Leakage: Revisiting Reconstruction Attacks in Trained Neural Networks


40. DisCoCLIP: A Distributional Compositional Tensor Network Encoder for Vision-Language Understanding


41. It’s Not You, It’s Clipping: A Soft Trust-Region via Probability Smoothing for LLM RL


42. Does FLUX Already Know How to Perform Physically Plausible Image Composition?


43. Data-Centric Elastic Pipeline Parallelism for Efficient Long-Context LLM Training


44. MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation


45. A Causality-Aware Spatiotemporal Model for Multi-Region and Multi-Pollutant Air Quality Forecasting


46. Semantic Edge-Cloud Communication for Real-Time Urban Traffic Surveillance with ViT and LLMs over Mobile Networks


47. Instruction-tuned Self-Questioning Framework for Multimodal Reasoning


48. Decipher-MR: A Vision-Language Foundation Model for 3D MRI Representations


49. Learning to Look: Cognitive Attention Alignment with Vision-Language Models


50. Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets


51. Explaining Fine Tuned LLMs via Counterfactuals A Knowledge Graph Driven Framework


52. Tree Search for LLM Agent Reinforcement Learning


53. Evading Overlapping Community Detection via Proxy Node Injection


54. Eigen-1: Adaptive Multi-Agent Refinement with Monitor-Based RAG for Scientific Reasoning


55. Towards Foundation Models for Zero-Shot Time Series Anomaly Detection: Leveraging Synthetic Data and Relative Context Discrepancy


56. Human-like Navigation in a World Built for Humans


57. Adoption, usability and perceived clinical value of a UK AI clinical reference platform (iatroX): a mixed-methods formative evaluation of real-world usage and a 1,223-respondent user survey


58. Can Less Precise Be More Reliable? A Systematic Evaluation of Quantization’s Impact on CLIP Beyond Accuracy


59. Fine-Tuning LLMs to Analyze Multiple Dimensions of Code Review: A Maximum Entropy Regulated Long Chain-of-Thought Approach


60. GRPO is Secretly a Process Reward Model


61. WAVECLIP: Wavelet Tokenization for Adaptive-Resolution CLIP


62. LAVA: Explainability for Unsupervised Latent Embeddings


63. Emerging Paradigms for Securing Federated Learning Systems


64. UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice


65. Teaching RL Agents to Act Better: VLM as Action Advisor for Online Reinforcement Learning


66. Cross-Modal Instructions for Robot Motion Generation


67. GraphUniverse: Enabling Systematic Evaluation of Inductive Generalization


68. Best-of-$\infty$ – Asymptotic Performance of Test-Time Compute


69. Vision Transformers: the threat of realistic adversarial patches


70. TyphoonMLA: A Mixed Naive-Absorb MLA Kernel For Shared Prefix


71. Which Cultural Lens Do Models Adopt? On Cultural Positioning Bias and Agentic Mitigation in LLMs


72. Communication Bias in Large Language Models: A Regulatory Perspective


73. ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning


74. EnGraf-Net: Multiple Granularity Branch Network with Fine-Coarse Graft Grained for Classification Task


75. GeoRef: Referring Expressions in Geometry via Task Formulation, Synthetic Supervision, and Reinforced MLLM-based Solutions


76. Reinforcement Learning Fine-Tuning Enhances Activation Intensity and Diversity in the Internal Circuitry of LLMs


77. Generative AI for FFRDCs


78. SupCLAP: Controlling Optimization Trajectory Drift in Audio-Text Contrastive Learning with Support Vector Regularization


79. Efficient Ensemble Conditional Independence Test Framework for Causal Discovery


80. The Use of the Simplex Architecture to Enhance Safety in Deep-Learning-Powered Autonomous Systems


81. Predicting LLM Reasoning Performance with Small Proxy Model


82. Mechanism of Task-oriented Information Removal in In-context Learning


83. Automatic Red Teaming LLM-based Agents with Model Context Protocol Tools


84. ExMolRL: Phenotype-Target Joint Generation of De Novo Molecules via Multi-Objective Reinforcement Learning


85. Marching Neurons: Accurate Surface Extraction for Neural Implicit Shapes


86. AnywhereVLA: Language-Conditioned Exploration and Mobile Manipulation


87. Lossless Compression: A New Benchmark for Time Series Model Evaluation


88. Binary Autoencoder for Mechanistic Interpretability of Large Language Models


89. Fast-SEnSeI: Lightweight Sensor-Independent Cloud Masking for On-board Multispectral Sensors


90. Rejuvenating Cross-Entropy Loss in Knowledge Distillation for Recommender Systems


91. SiNGER: A Clearer Voice Distills Vision Transformers Further


92. Analysis of instruction-based LLMs’ capabilities to score and judge text-input problems in an academic setting


93. FracAug: Fractional Augmentation boost Graph-level Anomaly Detection under Limited Supervision


94. Knowledgeable Language Models as Black-Box Optimizers for Personalized Medicine


95. Dual-Path Phishing Detection: Integrating Transformer-Based NLP with Structural URL Analysis


96. i-LAVA: Insights on Low Latency Voice-2-Voice Architecture for Agents


97. Unlocking Financial Insights: An advanced Multimodal Summarization with Multimodal Output Framework for Financial Advisory Videos


98. Flow Matching in the Low-Noise Regime: Pathologies and a Contrastive Remedy


99. CTI Dataset Construction from Telegram


100. Deep Learning for Crime Forecasting: The Role of Mobility at Fine-grained Spatiotemporal Scales


101. FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies


102. Improving Early Sepsis Onset Prediction Through Federated Learning


103. Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering


104. On Theoretical Interpretations of Concept-Based In-Context Learning


105. SCRA-VQA: Summarized Caption-Rerank for Augmented Large Language Models in Visual Question Answering


106. Model-Based Reinforcement Learning under Random Observation Delays


107. StyleBench: Evaluating thinking styles in Large Language Models


108. Federated Markov Imputation: Privacy-Preserving Temporal Imputation in Multi-Centric ICU Environments


109. TasselNetV4: A vision foundation model for cross-scene, cross-scale, and cross-species plant counting


110. FHRFormer: A Self-supervised Transformer Approach for Fetal Heart Rate Inpainting and Forecasting


111. Robust Multi-Omics Integration from Incomplete Modalities Significantly Improves Prediction of Alzheimer’s Disease


112. ImaginationPolicy: Towards Generalizable, Precise and Reliable End-to-End Policy for Robotic Manipulation


113. Verification Limits Code LLM Training


114. Security-aware Semantic-driven ISAC via Paired Adversarial Residual Networks


115. Trustworthy Semantic Communication for Vehicular Networks: Challenges and Solutions


116. CaTS-Bench: Can Language Models Describe Numeric Time Series?


117. Even More Kawaii than Real-Person-Driven VTubers? Understanding How Viewers Perceive AI-Driven VTubers


118. Revolutionizing Precise Low Back Pain Diagnosis via Contrastive Learning


119. Leveraging What’s Overfixed: Post-Correction via LLM Grammatical Error Overcorrection


120. DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation


121. Towards Atoms of Large Language Models


122. IConv: Focusing on Local Variation with Channel Independent Convolution for Multivariate Time Series Forecasting


123. CusEnhancer: A Zero-Shot Scene and Controllability Enhancement Method for Photo Customization via ResInversion


124. Provenance Analysis of Archaeological Artifacts via Multimodal RAG Systems


125. Measuring LLM Sensitivity in Transformer-based Tabular Data Synthesis


126. Seeing Through Words, Speaking Through Pixels: Deep Representational Alignment Between Vision and Language Models


127. Confidence-guided Refinement Reasoning for Zero-shot Question Answering


128. AI-Enabled Crater-Based Navigation for Lunar Mapping


129. Imagining Design Workflows in Agentic AI Futures


130. RobotDancing: Residual-Action Reinforcement Learning Enables Robust Long-Horizon Humanoid Motion Tracking


131. Beyond the Individual: Introducing Group Intention Forecasting with SHOT Dataset


132. Joint Flow Trajectory Optimization For Feasible Robot Motion Generation from Video Demonstrations


133. Incorporating LLM Embeddings for Variation Across the Human Genome


134. Learning to Align Molecules and Proteins: A Geometry-Aware Approach to Binding Affinity


135. Addressing Gradient Misalignment in Data-Augmented Training for Robust Speech Deepfake Detection


136. Efficient Construction of Implicit Surface Models From a Single Image for Motion Generation


137. QAMO: Quality-aware Multi-centroid One-class Learning For Speech Deepfake Detection


138. Bispectral OT: Dataset Comparison using Symmetry-Aware Optimal Transport


139. Understanding Mode Switching in Human-AI Collaboration: Behavioral Insights and Predictive Modeling


140. Look Before you Leap: Estimating LLM Benchmark Scores from Descriptions


141. A Framework for Rapidly Developing and Deploying Protection Against Large Language Model Attacks


142. Learning Terrain-Specialized Policies for Adaptive Locomotion in Challenging Environments


143. Recidivism and Peer Influence with LLM Text Embeddings in Low Security Correctional Facilities


144. Personalized Federated Dictionary Learning for Modeling Heterogeneity in Multi-site fMRI Data


145. FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models


146. MMG: Mutual Information Estimation via the MMSE Gap in Diffusion


147. Experience Deploying Containerized GenAI Services at an HPC Center


148. An LLM-based Agentic Framework for Accessible Network Control


149. Every Character Counts: From Vulnerability to Defense in Phishing Detection


150. Hierarchical Resolution Transformers: A Wavelet-Inspired Architecture for Multi-Scale Language Understanding


151. Dynamic Reasoning Chains through Depth-Specialized Mixture-of-Experts in Transformer Architectures


152. MechStyle: Augmenting Generative AI with Mechanical Simulation to Create Stylized and Structurally Viable 3D Models


153. PIRF: Physics-Informed Reward Fine-Tuning for Diffusion Models


154. SwasthLLM: a Unified Cross-Lingual, Multi-Task, and Meta-Learning Zero-Shot Framework for Medical Diagnosis Using Contrastive Representations


155. Perspectra: Choosing Your Experts Enhances Critical Thinking in Multi-Agent Research Ideation


156. GraspFactory: A Large Object-Centric Grasping Dataset


157. Understanding and Improving Adversarial Robustness of Neural Probabilistic Circuits


158. InstructVTON: Optimal Auto-Masking and Natural-Language-Guided Interactive Style Control for Inpainting-Based Virtual Try-On


159. CHOIR: A Chatbot-mediated Organizational Memory Leveraging Communication in University Research Labs


160. Complexity-Driven Policy Optimization


161. MARS: toward more efficient multi-agent collaboration for LLM reasoning


162. Boosting Zero-Shot VLN via Abstract Obstacle Map-Based Waypoint Prediction with TopoGraph-and-VisitInfo-Aware Prompting


163. AI-Specific Code Smells: From Specification to Detection


164. CoSupFormer : A Contrastive Supervised learning approach for EEG signal Classification


165. Shared Neural Space: Unified Precomputed Feature Encoding for Multi-Task and Cross Domain Vision


166. Wartime Media Dynamics in Emerging Democracies: Case Study of Pakistani Media in May 2025 Indo-Pak Conflict


167. A Taxonomy of Data Risks in AI and Quantum Computing (QAI) - A Systematic Review


168. Adversarial Defense in Cybersecurity: A Systematic Review of GANs for Threat Detection and Mitigation


169. Defending against Stegomalware in Deep Neural Networks with Permutation Symmetry


170. Variational Low-Rank Adaptation for Personalized Impaired Speech Recognition


171. Data-Efficient ASR Personalization for Non-Normative Speech Using an Uncertainty-Based Phoneme Difficulty Score for Guided Sampling


172. Centralized vs. Decentralized Security for Space AI Systems? A New Look


173. Blueprints of Trust: AI System Cards for End to End Transparency and Governance


174. The Secret Agenda: LLMs Strategically Lie and Our Current Safety Tools Are Blind


175. Can You Trust Your Copilot? A Privacy Scorecard for AI Coding Assistants


176. Dynamic ReAct: Scalable Tool Selection for Large-Scale MCP Environments


177. R1-Fuzz: Specializing Language Models for Textual Fuzzing via Reinforcement Learning


178. MARS: A Malignity-Aware Backdoor Defense in Federated Learning


179. Lightweight MobileNetV1+GRU for ECG Biometric Authentication: Federated and Adversarial Evaluation


180. USB-Rec: An Effective Framework for Improving Conversational Recommendation Capability of Large Language Model


181. ACCeLLiuM: Supervised Fine-Tuning for Automated OpenACC Pragma Generation


182. Beyond Global Emotion: Fine-Grained Emotional Speech Synthesis with Dynamic Word-Level Modulation


183. SKILL-RAG: Self-Knowledge Induced Learning and Filtering for Retrieval-Augmented Generation


184. ConceptViz: A Visual Analytics Approach for Exploring Concepts in Large Language Models


185. Assessing Classical Machine Learning and Transformer-based Approaches for Detecting AI-Generated Research Text


186. CFD-LLMBench: A Benchmark Suite for Evaluating Large Language Models in Computational Fluid Dynamics


187. AI-driven formative assessment and adaptive learning in data-science education: Evaluating an LLM-powered virtual teaching assistant


188. Interpreting Public Sentiment in Diplomacy Events: A Counterfactual Analysis Framework Using Large Language Models