All AI Papers - 2025-10-22

1. Decoding Funded Research: Comparative Analysis of Topic Models and Uncovering the Effect of Gender and Geographic Location


2. Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation


3. Sherlock Your Queries: Learning to Ask the Right Questions for Dialogue-Based Retrieval


4. Query Decomposition for RAG: Balancing Exploration-Exploitation


5. Comparative Expressivity for Structured Argumentation Frameworks with Uncertain Rules and Premises


6. Leveraging Association Rules for Better Predictions and Better Explanations


7. VAR: Visual Attention Reasoning via Structured Search and Backtracking


8. QuantEvolve: Automating Quantitative Strategy Discovery through Multi-Agent Evolutionary Framework


9. Extracting alignment data in open models


10. SOCIA-Nabla: Textual Gradient Meets Multi-Agent Orchestration for Automated Simulator Generation


11. Physics-guided Emulators Reveal Resilience and Fragility under Operational Latencies and Outages


12. Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models


13. Crucible: Quantifying the Potential of Control Algorithms through LLM Agents


14. AndroidControl-Curated: Revealing the True Potential of GUI Agents through Benchmark Purification


15. StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking


16. LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources


17. Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents


18. CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs


19. PlanU: Large Language Model Decision Making through Planning under Uncertainty


20. AlphaOPT: Formulating Optimization Programs with Self-Improving LLM Experience Library


21. Automated urban waterlogging assessment and early warning through a mixture of foundation models


22. Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents


23. Deep Learning-Based Control Optimization for Glass Bottle Forming


24. Heterogeneous Adversarial Play in Interactive Environments


25. Memory-Augmented State Machine Prompting: A Novel LLM Agent Framework for Real-Time Strategy Games


26. ShortcutBreaker: Low-Rank Noisy Bottleneck with Global Perturbation Attention for Multi-Class Unsupervised Anomaly Detection


27. Earth AI: Unlocking Geospatial Insights with Foundation Models and Cross-Modal Reasoning


28. Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming


29. Illusions of reflection: open-ended task reveals systematic failures in Large Language Models’ reflective reasoning


30. ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning


31. A Definition of AGI


32. FST.ai 2.0: An Explainable AI Ecosystem for Fair, Fast, and Inclusive Decision-Making in Olympic and Paralympic Taekwondo


33. Local Coherence or Global Validity? Investigating RLVR Traces in Math Domains


34. AgentChangeBench: A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI


35. Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model


36. LLM-Based Multi-Agent System for Simulating and Analyzing Marketing and Consumer Behavior


37. Annotating the Chain-of-Thought: A Behavior-Labeled Dataset for AI Safety


38. Learning from Generalization Patterns: An Evaluation-Driven Approach to Enhanced Data Augmentation for Fine-Tuning Small Language Models


39. Measuring Reasoning in LLMs: a New Dialectical Angle


40. SMaRT: Select, Mix, and ReinvenT - A Strategy Fusion Framework for LLM-Driven Reasoning and Planning


41. Planned Diffusion


42. CompactPrompt: A Unified Pipeline for Prompt Data Compression in LLM Workflows


43. Subject-Event Ontology Without Global Time: Foundations and Execution Semantics


44. OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning


45. FABRIC: Framework for Agent-Based Realistic Intelligence Creation


46. Beyond More Context: Retrieval Diversity Boosts Multi-Turn Intent Understanding


47. Activation Manifold Projection: Liberating Task-Specific Behaviors from LLM Architectures


48. Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs


49. How Do LLMs Use Their Depth?


50. LightMem: Lightweight and Efficient Memory-Augmented Generation


51. Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model


52. Lyapunov-Aware Quantum-Inspired Reinforcement Learning for Continuous-Time Vehicle Control: A Feasibility Study


53. DP$^2$O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution


54. Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning


55. Actor-Free Continuous Control via Structurally Maximizable Q-Functions


56. An Explainable Hybrid AI Framework for Enhanced Tuberculosis and Symptom Detection


57. Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring


58. Online SFT for LLM Reasoning: Surprising Effectiveness of Self-Tuning without Rewards


59. Computational Foundations for Strategic Coopetition: Formalizing Interdependence and Complementarity


60. Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation


61. HarmNet: A Framework for Adaptive Multi-Turn Jailbreak Attacks on Large Language Models


62. Causally Perturbed Fairness Testing


63. Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options


64. Fetch.ai: An Architecture for Modern Multi-Agent Systems


65. Exploring Membership Inference Vulnerabilities in Clinical Large Language Models


66. Reasoning Language Model Inference Serving Unveiled: An Empirical Study


67. Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression


68. ε-Seg: Sparsely Supervised Semantic Segmentation of Microscopy Data


69. C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression


70. Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views


71. A Rectification-Based Approach for Distilling Boosted Trees into Decision Trees


72. The Cost-Benefit of Interdisciplinarity in AI for Mental Health


73. Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model


74. Large language models for folktale type automation based on motifs: Cinderella case study


75. WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality


76. RAISE: A Unified Framework for Responsible AI Scoring and Evaluation


77. EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval


78. Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation


79. Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation


80. One Size Fits All? A Modular Adaptive Sanitization Kit (MASK) for Customizable Privacy-Preserving Phone Scam Detection


81. Benchmarking Fairness-aware Graph Neural Networks in Knowledge Graphs


82. CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment


83. Simple and Efficient Heterogeneous Temporal Graph Neural Network


84. DeLoad: Demand-Driven Short-Video Preloading with Scalable Watch-Time Estimation


85. ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization


86. ScaleNet: Scaling up Pretrained Neural Networks with Incremental Parameters


87. Optimistic Higher-Order Superposition


88. On AI Verification in Open RAN


89. Learning from N-Tuple Data with M Positive Instances: Unbiased Risk Estimation and Theoretical Guarantees


90. Automated Wicket-Taking Delivery Segmentation and Weakness Detection in Cricket Videos Using OCR-Guided YOLOv8 and Trajectory Modeling


91. MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models


92. S2AP: Score-space Sharpness Minimization for Adversarial Pruning


93. PGTT: Phase-Guided Terrain Traversal for Perceptive Legged Locomotion


94. Scalable, Explainable and Provably Robust Anomaly Detection with One-Step Flow Matching


95. MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation


96. Higher Embedding Dimension Creates a Stronger World Model for a Simple Sorting Task


97. From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering


98. Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs


99. StreamingTOM: Streaming Token Compression for Efficient Video Understanding


100. Latent-Info and Low-Dimensional Learning for Human Mesh Recovery and Parallel Optimization


101. SPIKE: Stable Physics-Informed Kernel Evolution Method for Solving Hyperbolic Conservation Laws


102. Learning under Quantization for High-Dimensional Linear Regression


103. NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective


104. DelvePO: Direction-Guided Self-Evolving Framework for Flexible Prompt Optimization


105. Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery


106. Finding the Sweet Spot: Optimal Data Augmentation Ratio for Imbalanced Credit Scoring Using ADASYN


107. Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs


108. EVER: Edge-Assisted Auto-Verification for Mobile MR-Aided Operation


109. The Emergence of Complex Behavior in Large-Scale Ecological Environments


110. VLSU: Mapping the Limits of Joint Multimodal Understanding for AI Safety


111. Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge


112. RadDiagSeg-M: A Vision Language Model for Joint Diagnosis and Multi-Target Segmentation in Radiology


113. VelocityNet: Real-Time Crowd Anomaly Detection via Person-Specific Velocity Analysis


114. ActivationReasoning: Logical Reasoning in Latent Activation Spaces


115. Automatic Prompt Generation via Adaptive Selection of Prompting Techniques


116. SafeCoop: Unravelling Full Stack Safety in Agentic Collaborative Driving


117. Latent Discrete Diffusion Models


118. From AutoRecSys to AutoRecLab: A Call to Build, Evaluate, and Govern Autonomous Recommender-Systems Research Labs


119. Enhancing mortality prediction in cardiac arrest ICU patients through meta-modeling of structured clinical data from MIMIC-IV


120. Accelerating Vision Transformers with Adaptive Patch Sizes


121. R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations


122. RL-Driven Security-Aware Resource Allocation Framework for UAV-Assisted O-RAN


123. Any-Depth Alignment: Unlocking Innate Safety Alignment of LLMs to Any-Depth


124. R2L: Reliable Reinforcement Learning: Guaranteed Return & Reliable Policies in Reinforcement Learning


125. Fine-tuning Flow Matching Generative Models with Intermediate Feedback


126. SPACeR: Self-Play Anchoring with Centralized Reference Models


127. Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models


128. Measure-Theoretic Anti-Causal Representation Learning


129. Language Models as Semantic Augmenters for Sequential Recommenders


130. Cross-Domain Long-Term Forecasting: Radiation Dose from Sparse Neutron Sensor via Spatio-Temporal Operator Network


131. TriggerNet: A Novel Explainable AI Framework for Red Palm Mite Detection and Multi-Model Comparison and Heuristic-Guided Annotation


132. SAVANT: Semantic Analysis with Vision-Augmented Anomaly deTection


133. From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models


134. DynaQuery: A Self-Adapting Framework for Querying Structured and Multimodal Data


135. Is Multilingual LLM Watermarking Truly Multilingual? A Simple Back-Translation Solution


136. BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool LLM Reviewers?


137. SimBA: Simplifying Benchmark Analysis Using Performance Matrices Alone


138. Universal Spectral Tokenization via Self-Supervised Panchromatic Representation Learning


139. Studying the Effects of Robot Intervention on School Shooters in Virtual Reality


140. PLAGUE: Plug-and-play framework for Lifelong Adaptive Generation of Multi-turn Exploits


141. Intuitionistic $j$-Do-Calculus in Topos Causal Models


142. Trust in foundation models and GenAI: A geographic perspective


143. Believe It or Not: How Deeply do LLMs Believe Implanted Facts?


144. The Integration of Artificial Intelligence in Undergraduate Medical Education in Spain: Descriptive Analysis and International Perspectives


145. UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts


146. XDXD: End-to-end crystal structure determination with low resolution X-ray diffraction


147. AtlasKV: Augmenting LLMs with Billion-Scale Knowledge Graphs in 20GB VRAM


148. From Observations to Parameters: Detecting Changepoint in Nonlinear Dynamics with Simulation-based Inference


149. From Charts to Code: A Hierarchical Benchmark for Multimodal Models


150. Attracting Commercial Artificial Intelligence Firms to Support National Security through Collaborative Contracts


151. Diagnosing Representation Dynamics in NER Model Extension


152. EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning


153. SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion


154. Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs


155. Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning


156. Select-Then-Decompose: From Empirical Analysis to Adaptive Selection Strategy for Task Decomposition in Large Language Models


157. CLAWS: Creativity detection for LLM-generated solutions using Attention Window of Sections


158. CBINNS: Cancer Biology-Informed Neural Network for Unknown Parameter Estimation and Missing Physics Identification


159. ParaVul: A Parallel Large Language Model and Retrieval-Augmented Framework for Smart Contract Vulnerability Detection


160. JT-Safe: Intrinsically Enhancing the Safety and Trustworthiness of LLMs


161. Data Unlearning Beyond Uniform Forgetting via Diffusion Time and Frequency Selection


162. Self-Evidencing Through Hierarchical Gradient Decomposition: A Dissipative System That Maintains Non-Equilibrium Steady-State by Minimizing Variational Free Energy


163. Uncertainty-Aware Post-Hoc Calibration: Mitigating Confidently Incorrect Predictions Beyond Calibration Metrics


164. NeuCo-Bench: A Novel Benchmark Framework for Neural Embeddings in Earth Observation


165. TACLA: An LLM-Based Multi-Agent Tool for Transactional Analysis Training in Education


166. Interpretability Framework for LLMs in Undergraduate Calculus


167. BreakFun: Jailbreaking LLMs via Schema Exploitation


168. The Sherpa.ai Blind Vertical Federated Learning Paradigm to Minimize the Number of Communications



170. Automated Algorithm Design for Auto-Tuning Optimizers


171. L-MoE: End-to-End Training of a Lightweight Mixture of Low-Rank Adaptation Experts


172. Long-Context Attention Benchmark: From Kernel Efficiency to Distributed Context Parallelism


173. Hierarchical Federated Unlearning for Large Language Models


174. MIN-Merging: Merge the Important Neurons for Model Merging


175. Hey Pentti, We Did It!: A Fully Vector-Symbolic Lisp


176. Metrics and evaluations for computational and sustainable AI efficiency


177. When Intelligence Fails: An Empirical Study on Why LLMs Struggle with Password Cracking


178. From Flows to Words: Can Zero-/Few-Shot LLMs Detect Network Intrusions? A Grammar-Constrained, Calibrated Evaluation on UNSW-NB15


179. Does GenAI Rewrite How We Write? An Empirical Study on Two-Million Preprints


180. POPI: Personalizing LLMs via Optimized Natural Language Preference Inference


181. Outraged AI: Large language models prioritise emotion over cost in fairness enforcement


182. Decoding Listeners Identity: Person Identification from EEG Signals Using a Lightweight Spiking Transformer


183. DRL-Based Resource Allocation for Energy-Efficient IRS-Assisted UAV Spectrum Sharing Systems


184. 3D Weakly Supervised Semantic Segmentation via Class-Aware and Geometry-Guided Pseudo-Label Refinement


185. Repairing Tool Calls Using Post-tool Execution Reflection and RAG


186. Auditing and Mitigating Bias in Gender Classification Algorithms: A Data-Centric Approach


187. A Survey of Recursive and Recurrent Neural Networks


188. MUSE: Model-based Uncertainty-aware Similarity Estimation for zero-shot 2D Object Detection and Segmentation


189. Deploying Atmospheric and Oceanic AI Models on Chinese Hardware and Framework: Migration Strategies, Performance Optimization and Analysis


190. Pre to Post-Treatment Glioblastoma MRI Prediction using a Latent Diffusion Model


191. CARLE: A Hybrid Deep-Shallow Learning Framework for Robust and Explainable RUL Estimation of Rolling Element Bearings


192. MAT-Agent: Adaptive Multi-Agent Training Optimization


193. Modeling Layered Consciousness with Multi-Agent Large Language Models


194. GRETEL: A Goal-driven Retrieval and Execution-based Trial Framework for LLM Tool Selection Enhancing


195. Brain-Language Model Alignment: Insights into the Platonic Hypothesis and Intermediate-Layer Advantage


196. Synthetic EEG Generation using Diffusion Models for Motor Imagery Tasks


197. Multi-Agent Design Assistant for the Simulation of Inertial Fusion Energy


198. Speak to a Protein: An Interactive Multimodal Co-Scientist for Protein Analysis


199. Carbon-Aware Orchestration of Integrated Satellite Aerial Terrestrial Networks via Digital Twin


200. A Biophysical-Model-Informed Source Separation Framework For EMG Decomposition


201. LLM Assisted Alpha Fairness for 6 GHz WiFi and NR_U Coexistence: An Agentic Orchestrator for Throughput, Energy, and SLA


202. Visual Space Optimization for Zero-shot Learning