전체 AI 논문 - 2025-10-15

1. Ax-Prover: A Deep Reasoning Agentic Framework for Theorem Proving in Mathematics and Quantum Physics


2. CTRL-Rec: Controlling Recommender Systems With Natural Language


3. Clutch Control: An Attention-based Combinatorial Bandit for Efficient Mutation in JavaScript Engine Fuzzing


4. Towards Robust Artificial Intelligence: Self-Supervised Learning Approach for Out-of-Distribution Detection


5. CAMNet: Leveraging Cooperative Awareness Messages for Vehicle Trajectory Prediction


6. Multi-Agent Debate for LLM Judges with Adaptive Stability Detection


7. ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning


8. Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks


9. HardcoreLogic: Challenging Large Reasoning Models with Long-tail Logic Puzzle Games


10. Inclusive Fitness as a Key Step Towards More Advanced Social Behaviors in Multi-Agent Reinforcement Learning Settings


11. ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification


12. Artificial Intelligence Virtual Cells: From Measurements to Decisions across Modality, Scale, Dynamics, and Evaluation


13. Using Medical Algorithms for Task-Oriented Dialogue in LLM-Based Medical Interviews


14. Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems


15. Biased-Attention Guided Risk Prediction for Safe Decision-Making at Unsignalized Intersections


16. MTOS: A LLM-Driven Multi-topic Opinion Simulation Framework for Exploring Echo Chamber Dynamics


17. PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks


18. A Survey of Vibe Coding with Large Language Models


19. O-Forge: An LLM + Computer Algebra Framework for Asymptotic Analysis


20. RAG-Anything: All-in-One RAG Framework


21. Tensor Logic: The Language of AI


22. $\mathbf{T^3}$: Reducing Belief Deviation in Reinforcement Learning for Active Reasoning


23. PromptFlow: Training Prompts Like Neural Networks


24. MedKGEval: A Knowledge Graph-Based Multi-Turn Evaluation Framework for Open-Ended Patient Interactions with Clinical LLMs


25. GOAT: A Training Framework for Goal-Oriented Agent with Tools


26. On the Design and Evaluation of Human-centered Explainable AI Systems: A Systematic Review and Taxonomy


27. ResearStudio: A Human-Intervenable Framework for Building Controllable Deep-Research Agents


28. Evolution of meta’s llama models and parameter-efficient fine-tuning of large language models: a survey


29. MatSciBench: Benchmarking the Reasoning Ability of Large Language Models in Materials Science


30. Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing


31. ToPolyAgent: AI Agents for Coarse-Grained Topological Polymer Simulations


32. One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration


33. Evaluating the Quality of Randomness and Entropy in Tasks Supported by Large Language Models


34. BeSTAD: Behavior-Aware Spatio-Temporal Anomaly Detection for Human Mobility Data


35. EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making


36. HiCoTraj:Zero-Shot Demographic Reasoning via Hierarchical Chain-of-Thought Prompting from Trajectory


37. AI Agents as Universal Task Solvers


38. ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization


39. Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response


40. Do Large Language Models Respect Contracts? Evaluating and Enforcing Contract-Adherence in Code Generation


41. CausalTrace: A Neurosymbolic Causal Analysis Agent for Smart Manufacturing


42. Asking Clarifying Questions for Preference Elicitation With Large Language Models


43. CGBench: Benchmarking Language Model Scientific Reasoning for Clinical Genetics Research


44. Holistic Agent Leaderboard: The Missing Infrastructure for AI Agent Evaluation


45. Beyond Consensus: Mitigating the Agreeableness Bias in LLM Judge Evaluations


46. AI Agents for the Dhumbal Card Game: A Comparative Study


47. DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving


48. CuMPerLay: Learning Cubical Multiparameter Persistence Vectorizations


49. UniFusion: Vision-Language Model as Unified Encoder in Image Generation


50. MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars


51. Dr.LLM: Dynamic Layer Routing in LLMs


52. Uncertainty Matters in Dynamic Gaussian Splatting for Monocular 4D Reconstruction


53. Disentangling Neurodegeneration with Brain Age Gap Prediction Models: A Graph Signal Processing Perspective


54. VQArt-Bench: A semantically rich VQA Benchmark for Art and Cultural Heritage


55. Hey, wait a minute: on at-issue sensitivity in Language Models


56. HYPE: Hybrid Planning with Ego Proposal-Conditioned Predictions


57. Hierarchical Federated Learning for Crop Yield Prediction in Smart Agricultural Production Systems


58. Artificial intelligence for simplified patient-centered dosimetry in radiopharmaceutical therapies


59. Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning


60. Hybrid Explanation-Guided Learning for Transformer-Based Chest X-Ray Diagnosis


61. Beyond Postconditions: Can Large Language Models infer Formal Contracts for Automatic Software Verification?


62. Topological Signatures of ReLU Neural Network Activation Patterns


63. Generation Space Size: Understanding and Calibrating Open-Endedness of LLM Generations


64. Who is a Better Matchmaker? Human vs. Algorithmic Judge Assignment in a High-Stakes Startup Competition


65. DiffEM: Learning from Corrupted Data with Diffusion Models via Expectation Maximization


66. From Delegates to Trustees: How Optimizing for Long-Term Interests Shapes Bias and Alignment in LLM


67. Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?


68. SG-XDEAT: Sparsity-Guided Cross-Dimensional and Cross-Encoding Attention with Target-Aware Conditioning in Tabular Learning


69. Reasoning Pattern Matters: Learning to Reason without Human Rationales


70. Aixel: A Unified, Adaptive and Extensible System for AI-powered Data Analysis


71. Laminar: A Scalable Asynchronous RL Post-Training Framework


72. Designing Tools with Control Confidence


73. Learning-To-Measure: In-context Active Feature Acquisition


74. Rethinking Knowledge Distillation: A Data Dependent Regulariser With a Negative Asymmetric Payoff


75. StyleDecipher: Robust and Explainable Detection of LLM-Generated Texts with Stylistic Analysis



77. Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space


78. Evaluation of Real-Time Preprocessing Methods in AI-Based ECG Signal Analysis


79. Unconditional Human Motion and Shape Generation via Balanced Score-Based Diffusion


80. BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)


81. The Robustness of Differentiable Causal Discovery in Misspecified Scenarios


82. PubSub-VFL: Towards Efficient Two-Party Split Learning in Heterogeneous Environments via Publisher/Subscriber Architecture


83. A Text-Image Fusion Method with Data Augmentation Capabilities for Referring Medical Image Segmentation


84. When Personalization Tricks Detectors: The Feature-Inversion Trap in Machine-Generated Text Detection


85. A Function Centric Perspective On Flat and Sharp Minima


86. Low-Field Magnetic Resonance Image Quality Enhancement using a Conditional Flow Matching Model


87. Tokenization Disparities as Infrastructure Bias: How Subword Systems Create Inequities in LLM Access and Efficiency


88. Phenome-Wide Multi-Omics Integration Uncovers Distinct Archetypes of Human Aging


89. LiteVPNet: A Lightweight Network for Video Encoding Control in Quality-Critical Applications


90. Deep Attention-guided Adaptive Subsampling


91. LLM-REVal: Can We Trust LLM Reviewers Yet?


92. (R)evolution of Programming: Vibe Coding as a Post-Coding Paradigm


93. Finite-time Convergence Analysis of Actor-Critic with Evolving Reward


94. Simple Projection Variants Improve ColBERT Performance


95. Causal Inspired Multi Modal Recommendation


96. Deep SPI: Safe Policy Improvement via World Models


97. Chinese ModernBERT with Whole-Word Masking


98. Quantum Annealing for Staff Scheduling in Educational Environments


99. TFGA-Net: Temporal-Frequency Graph Attention Network for Brain-Controlled Speaker Extraction


100. HiLoRA: Adaptive Hierarchical LoRA Routing for Training-Free Domain Generalization


101. Human-in-the-Loop Bandwidth Estimation for Quality of Experience Optimization in Real-Time Video Communication


102. Shallow Robustness, Deep Vulnerabilities: Multi-Turn Evaluation of Medical LLMs


103. Diffusion Models for Reinforcement Learning: Foundations, Taxonomy, and Development


104. PromptLocate: Localizing Prompt Injection Attacks


105. MoRA: On-the-fly Molecule-aware Low-Rank Adaptation Framework for LLM-based Multi-Modal Molecular Assistant


106. Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability


107. HALF: Harm-Aware LLM Fairness Evaluation Aligned with Deployment


108. DE3S: Dual-Enhanced Soft-Sparse-Shape Learning for Medical Early Time-Series Classification


109. Revisiting Meta-Learning with Noisy Labels: Reweighting Dynamics and Theoretical Guarantees


110. CompoDistill: Attention Distillation for Compositional Reasoning in Multimodal LLMs


111. From Knowledge to Treatment: Large Language Model Assisted Biomedical Concept Representation for Drug Repurposing


112. Budget-constrained Active Learning to Effectively De-censor Survival Data


113. Credal Transformer: A Principled Approach for Quantifying and Mitigating Hallucinations in Large Language Models


114. SafeMT: Multi-turn Safety for Multimodal Language Models


115. Understanding the Modality Gap: An Empirical Study on the Speech-Text Alignment Mechanism of Large Speech Language Models


116. Chimera: State Space Models Beyond Sequences


117. Deep Associations, High Creativity: A Simple yet Effective Metric for Evaluating Large Language Models


118. An AI-Based Behavioral Health Safety Filter and Dataset for Identifying Mental Health Crises in Text-Based Conversations


119. Enhancing Neural Code Representation with Additional Context


120. A Review on Domain Adaption and Generative Adversarial Networks(GANs)


121. MEASURE: Multi-scale Minimal Sufficient Representation Learning for Domain Generalization in Sleep Staging


122. Your VAR Model is Secretly an Efficient and Explainable Generative Classifier


123. APCE: Adaptive Progressive Context Expansion for Long Context Processing


124. Generative AI and Firm Productivity: Field Experiments in Online Retail


125. Hierarchical Alignment: Surgical Fine-Tuning via Functional Layer Specialization in Large Language Models


126. Multi-stage Prompt Refinement for Mitigating Hallucinations in Large Language Models


127. CPR: Mitigating Large Language Model Hallucinations with Curative Prompt Refinement


128. PanoTPS-Net: Panoramic Room Layout Estimation via Thin Plate Spline Transformation


129. Conjecturing: An Overlooked Step in Formal Mathematical Reasoning


130. Learning Dynamics of VLM Finetuning


131. CTIArena: Benchmarking LLM Knowledge and Reasoning Across Heterogeneous Cyber Threat Intelligence


132. Direct Multi-Token Decoding


133. Y-shaped Generative Flows


134. Sculpting Latent Spaces With MMD: Disentanglement With Programmable Priors


135. TopoAlign: A Framework for Aligning Code to Math via Topological Decomposition


136. Discrepancy Detection at the Data Level: Toward Consistent Multilingual Question Answering


137. Indoor Localization using Compact, Telemetry-Agnostic, Transfer-Learning Enabled Decoder-Only Transformer


138. Integrating Sequential and Relational Modeling for User Events: Datasets and Prediction Tasks


139. MammoDINO: Anatomically Aware Self-Supervision for Mammographic Images


140. Countermind: A Multi-Layered Security Architecture for Large Language Models


141. Data or Language Supervision: What Makes CLIP Better than DINO?


142. Combining Euclidean and Hyperbolic Representations for Node-level Anomaly Detection


143. Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning


144. BlackIce: A Containerized Red Teaming Toolkit for AI Security Testing


145. PHANTOM RECALL: When Familiar Puzzles Fool Smart Models


146. GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving


147. Audio-Guided Visual Perception for Audio-Visual Navigation


148. AwareCompiler: Agentic Context-Aware Compiler Optimization via a Synergistic Knowledge-Data Driven Framework


149. The Adoption Paradox: A Comparative Analysis of Veterinary AI Adoption in China and the North America


150. Artificial Intelligence for Optimal Learning: A Comparative Approach towards AI-Enhanced Learning Environments


151. Zero-Shot Large Language Model Agents for Fully Automated Radiotherapy Treatment Planning


152. Fast and Interpretable Protein Substructure Alignment via Optimal Transport


153. Celebrity Profiling on Short Urdu Text using Twitter Followers’ Feed


154. SeeingSounds: Learning Audio-to-Visual Alignment via Text


155. Scaling Law in LLM Simulated Personality: More Detailed and Realistic Persona Profile Is All You Need


156. Serial-Parallel Dual-Path Architecture for Speaking Style Recognition


157. Modeling Hypergraph Using Large Language Models


158. Dual Perspectives on Non-Contrastive Self-Supervised Learning


159. Leveraging LLMs, IDEs, and Semantic Embeddings for Automated Move Method Refactoring