전체 AI 논문 - 2025-09-24

1. Cross-Cultural Transfer of Commonsense Reasoning in LLMs: Evidence from the Arab World


2. AgentInit: Initializing LLM-based Multi-Agent Systems via Diversity and Expertise Orchestration for Effective and Efficient Collaboration


3. Code Driven Planning with Domain-Adaptive Critic


4. Towards Causal Representation Learning with Observable Sources as Auxiliaries


5. Landmarks, Monuments, and Beacons: Understanding Generative Calls to Action


6. Remaining Time Prediction in Outbound Warehouse Processes: A Case Study (Short Paper)


7. From latent factors to language: a user study on LLM-generated explanations for an inherently interpretable matrix-based recommender system


8. LLM-based Agents Suffer from Hallucinations: A Survey of Taxonomy, Methods, and Directions


9. Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning


10. How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective


11. LongCat-Flash-Thinking Technical Report


12. Memory in Large Language Models: Mechanisms, Evaluation and Evolution


13. Conf-Profile: A Confidence-Driven Reasoning Paradigm for Label-Free User Profiling


14. MAPO: Mixed Advantage Policy Optimization


15. Model selection meets clinical semantics: Optimizing ICD-10-CM prediction via LLM-as-Judge evaluation, redundancy-aware sampling, and section-aware fine-tuning


16. Bounded PCTL Model Checking of Large Language Model Outputs


17. The AGNTCY Agent Directory Service: Architecture and Implementation


18. Experience Scaling: Post-Deployment Evolution For Large Language Models


19. Autonomous Data Agents: A New Opportunity for Smart Data


20. Advances in Large Language Models for Medicine


21. Implementation of airborne ML models with semantics preservation


22. TERAG: Token-Efficient Graph-Based Retrieval-Augmented Generation


23. Adaptive Learning in Spatial Agent-Based Models for Climate Risk Assessment: A Geospatial Framework with Evolutionary Economic Agents


24. Solving Math Word Problems Using Estimation Verification and Equation Generation


25. LLMZ+: Contextual Prompt Whitelist Principles for Agentic LLMs


26. FERA: Foil Fencing Referee Assistant Using Pose-Based Multi-Label Move Recognition and Rule Reasoning


27. Memory-QA: Answering Recall Questions Based on Multimodal Memories


28. Instruction-Following Evaluation in Function Calling for Large Language Models


29. ATLAS: Benchmarking and Adapting LLMs for Global Trade via Harmonized Tariff Code Classification


30. Gödel Test: Can Large Language Models Solve Easy Conjectures?


31. Evaluating the Safety and Skill Reasoning of Large Reasoning Models Under Compute Constraints


32. The Illusion of Readiness: Stress Testing Large Frontier Models on Multimodal Medical Benchmarks


33. Towards General Computer Control with Hierarchical Agents and Multi-Level Action Spaces


34. An N-Plus-1 GPT Agency for Critical Solution of Mechanical Engineering Analysis Problems


35. From “What to Eat?” to Perfect Recipe: ChefMind’s Chain-of-Exploration for Ambiguous User Intent in Recipe Recommendation


36. Multimodal Health Risk Prediction System for Chronic Diseases via Vision-Language Fusion and Large Language Models


37. Similarity Field Theory: A Mathematical Framework for Intelligence


38. nDNA – the Semantic Helix of Artificial Cognition


39. Change in Quantitative Bipolar Argumentation: Sufficient, Necessary, and Counterfactual Explanations


40. MMCD: Multi-Modal Collaborative Decision-Making for Connected Autonomy with Knowledge Distillation


41. An Outcome-Based Educational Recommender System


42. Synthesizing Attitudes, Predicting Actions (SAPA): Behavioral Theory-Guided LLMs for Ridesourcing Mode Choice Modeling


43. Large Language Models and Operations Research: A Structured Survey


44. Foam-Agent: An End-to-End Composable Multi-Agent Framework for Automating CFD Simulation in OpenFOAM


45. HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics


46. Position Paper: Integrating Explainability and Uncertainty Estimation in Medical AI


47. SPADE: A Large Language Model Framework for Soil Moisture Pattern Recognition and Anomaly Detection in Precision Agriculture


48. A Cost-Benefit Analysis of On-Premise Large Language Model Deployment: Breaking Even with Commercial LLM Services


49. Audio-Based Pedestrian Detection in the Presence of Vehicular Noise


50. SOE: Sample-Efficient Robot Policy Self-Improvement via On-Manifold Exploration


51. MOIS-SAM2: Exemplar-based Segment Anything Model 2 for multilesion interactive segmentation of neurobromas in whole-body MRI


52. WolBanking77: Wolof Banking Speech Intent Classification Dataset


53. SloPalSpeech: A 2,8000-Hour Slovak Speech Corpus from Parliamentary Data


54. Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps


55. Reinforcement Learning on Pre-Training Data


56. Finding My Voice: Generative Reconstruction of Disordered Speech for Automated Clinical Evaluation


57. MsFIN: Multi-scale Feature Interaction Network for Traffic Accident Anticipation


58. Systematic Comparative Analysis of Large Pretrained Language Models on Contextualized Medication Event Extraction


59. FedFusion: Federated Learning with Diversity- and Cluster-Aware Encoders for Robust Adaptation under Label Scarcity


60. HyKid: An Open MRI Dataset with Expert-Annotated Multi-Structure and Choroid Plexus in Pediatric Hydrocephalus


61. Steering Multimodal Large Language Models Decoding for Context-Aware Safety


62. YAC: Bridging Natural Language and Interactive Visual Exploration with Generative AI for Biomedical Data Discovery


63. Soft Tokens, Hard Truths


64. RoSe: Robust Self-supervised Stereo Matching under Adverse Weather Conditions


65. Generative Propaganda


66. Anecdoctoring: Automated Red-Teaming Across Language and Place


67. On the Soundness and Consistency of LLM Agents for Executing Test Cases Written in Natural Language


68. GSTM-HMU: Generative Spatio-Temporal Modeling for Human Mobility Understanding


69. Analysis on distribution and clustering of weight


70. FedFiTS: Fitness-Selected, Slotted Client Scheduling for Trustworthy Federated Learning in Healthcare AI


71. Towards Practical Multi-label Causal Discovery in High-Dimensional Event Sequences via One-Shot Graph Aggregation


72. FUNCanon: Learning Pose-Aware Action Primitives via Functional Object Canonicalization for Generalizable Robotic Manipulation


73. Algorithms for Adversarially Robust Deep Learning


74. Pathways of Thoughts: Multi-Directional Thinking for Long-form Personalized Question Answering


75. Training Flow Matching Models with Reliable Labels via Self-Purification


76. Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning


77. A Mega-Study of Digital Twins Reveals Strengths, Weaknesses and Opportunities for Further Improvement


78. Graph Neural Networks with Similarity-Navigated Probabilistic Feature Copying


79. World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation


80. Beyond Backpropagation: Exploring Innovative Algorithms for Energy-Efficient Deep Neural Network Training


81. Reduced-Order Model-Guided Reinforcement Learning for Demonstration-Free Humanoid Locomotion


82. Fully Learnable Neural Reward Machines


83. Pure Vision Language Action (VLA) Models: A Comprehensive Survey


84. VIR-Bench: Evaluating Geospatial and Temporal Understanding of MLLMs via Travel Video Itinerary Reconstruction


85. Eva-VLA: Evaluating Vision-Language-Action Models’ Robustness Under Real-World Physical Variations


86. Towards Privacy-Aware Bayesian Networks: A Credal Approach


87. No Labels Needed: Zero-Shot Image Classification with Collaborative Self-Learning



89. Tackling GNARLy Problems: Graph Neural Algorithmic Reasoning Reimagined through Reinforcement Learning


90. LiDAR Point Cloud Image-based Generation Using Denoising Diffusion Probabilistic Models


91. The AI Literacy Heptagon: A Structured Approach to AI Literacy in Higher Education


92. Diversity Boosts AI-Generated Text Detection


93. When Ads Become Profiles: Large-Scale Audit of Algorithmic Biases and LLM Profiling Risks


94. NGRPO: Negative-enhanced Group Relative Policy Optimization


95. Failure Makes the Agent Stronger: Enhancing Accuracy through Structured Reflection for Reliable Tool Interactions


96. Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters


97. A Kernel Space-based Multidimensional Sparse Model for Dynamic PET Image Denoising


98. Detection of security smells in IaC scripts through semantics-aware code and language processing


99. VGGT-DP: Generalizable Robot Control via Vision Foundation Models


100. AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field


101. Financial Risk Relation Identification through Dual-view Adaptation


102. DiSSECT: Structuring Transfer-Ready Medical Image Representations through Discrete Self-Supervision


103. When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models


104. Security smells in infrastructure as code: a taxonomy update beyond the seven sins


105. Complexity of Activity Patterns in a Bio-Inspired Hopfield-Type Network in Different Topologies


106. MV-UMI: A Scalable Multi-View Interface for Cross-Embodiment Learning


107. COLT: Enhancing Video Large Language Models with Continual Tool Usage


108. A Generalized Bisimulation Metric of State Similarity between Markov Decision Processes: From Theoretical Propositions to Applications


109. MemOrb: A Plug-and-Play Verbal-Reinforcement Memory Layer for E-Commerce Customer Service


110. RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images


111. An overview of neural architectures for self-supervised audio representation learning from masked spectrograms


112. LEAF-Mamba: Local Emphatic and Adaptive Fusion State Space Model for RGB-D Salient Object Detection


113. NaviSense: A Multimodal Assistive Mobile application for Object Retrieval by Persons with Visual Impairment


114. SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer


115. Do You Need Proprioceptive States in Visuomotor Policies?


116. Learning neuroimaging models from health system-scale data


117. Generalizable Domain Adaptation for Sim-and-Real Policy Co-Training


118. HyperAdapt: Simple High-Rank Adaptation


119. BRAID: Input-Driven Nonlinear Dynamical Modeling of Neural-Behavioral Data


120. The Case for Negative Data: From Crash Reports to Counterfactuals for Reasonable Driving


121. Flow marching for a generative PDE foundation model


122. End-to-End Crop Row Navigation via LiDAR-Based Deep Reinforcement Learning


123. FlexSED: Towards Open-Vocabulary Sound Event Detection


124. SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering


125. OraPO: Oracle-educated Reinforcement Learning for Data-efficient and Factual Radiology Report Generation


126. VLN-Zero: Rapid Exploration and Cache-Enabled Neurosymbolic Vision-Language Planning for Zero-Shot Transfer in Robot Navigation


127. TsqLoRA: Towards Sensitivity and Quality Low-Rank Adaptation for Efficient Fine-Tuning


128. LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA


129. The Ranking Blind Spot: Decision Hijacking in LLM-based Text Ranking


130. Interaction Topological Transformer for Multiscale Learning in Porous Materials


131. Explore the Reinforcement Learning for the LLM based ASR and TTS system


132. CPCLDETECTOR: Knowledge Enhancement and Alignment Selection for Chinese Patronizing and Condescending Language Detection


133. SoundCompass: Navigating Target Sound Extraction With Effective Directional Clue Integration In Complex Acoustic Scenes


134. Global Minimizers of Sigmoid Contrastive Loss


135. Symphony-MoE: Harmonizing Disparate Pre-trained Models into a Coherent Mixture-of-Experts


136. CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs


137. No Verifiable Reward for Prosody: Toward Preference-Guided Prosody Learning in TTS


138. Automatic coherence-driven inference on arguments


139. APRIL: Active Partial Rollouts in Reinforcement Learning to tame long-tail generation


140. Coherence-driven inference for cybersecurity


141. A Rhythm-Aware Phrase Insertion for Classical Arabic Poetry Composition


142. Dynamical Modeling of Behaviorally Relevant Spatiotemporal Patterns in Neural Imaging Data


143. Hyperbolic Coarse-to-Fine Few-Shot Class-Incremental Learning


144. LAWCAT: Efficient Distillation from Quadratic to Linear Attention with Convolution across Tokens for Long Context Modeling


145. Zero-Shot Visual Deepfake Detection: Can AI Predict and Prevent Fake Content Before It’s Created?


146. CogniLoad: A Synthetic Natural Language Reasoning Benchmark With Tunable Length, Intrinsic Difficulty, and Distractor Density


147. PrioriTouch: Adapting to User Contact Preferences for Whole-Arm Physical Human-Robot Interaction


148. Developing an AI framework to automatically detect shared decision-making in patient-doctor conversations


149. Scattering Transformer: A Training-Free Transformer Architecture for Heart Murmur Detection


150. Context Lineage Assurance for Non-Human Identities in Critical Multi-Agent Systems


151. Assistive Decision-Making for Right of Way Navigation at Uncontrolled Intersections


152. Check Field Detection Agent (CFD-Agent) using Multimodal Large Language and Vision Language Models


153. An Artificial Intelligence Value at Risk Approach: Metrics and Models


154. Graph Enhanced Trajectory Anomaly Detection


155. Align Where the Words Look: Cross-Attention-Guided Patch Alignment with Contrastive and Transport Regularization for Bengali Captioning


156. Multi-Worker Selection based Distributed Swarm Learning for Edge IoT with Non-i.i.d. Data


157. FastMTP: Accelerating LLM Inference with Enhanced Multi-Token Prediction


158. Reading Between the Lines: Scalable User Feedback via Implicit Sentiment in Developer Prompts


159. Chiplet-Based RISC-V SoC with Modular AI Acceleration


160. A Single Image Is All You Need: Zero-Shot Anomaly Localization Without Training Data


161. Brittleness and Promise: Knowledge Graph Based Reward Modeling for Diagnostic Reasoning


162. Evaluating Large Language Models for Detecting Antisemitism


163. PEEK: Guiding and Minimal Image Representations for Zero-Shot Generalization of Robot Manipulation Policies


164. Perceptions of AI Across Sectors: A Comparative Review of Public Attitudes


165. Enhanced Interpretable Knowledge Tracing for Students Performance Prediction with Human understandable Feature Space


166. Automatic Classification of Magnetic Chirality of Solar Filaments from H-Alpha Observations


167. Variational Task Vector Composition


168. Conversational Orientation Reasoning: Egocentric-to-Allocentric Navigation with Multimodal Chain-of-Thought


169. MNV-17: A High-Quality Performative Mandarin Dataset for Nonverbal Vocalization Recognition in Speech


170. TinyEcoWeedNet: Edge Efficient Real-Time Aerial Agricultural Weed Detection


171. HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing


172. Qianfan-VL: Domain-Enhanced Universal Vision-Language Models


173. V-SenseDrive: A Privacy-Preserving Road Video and In-Vehicle Sensor Fusion Framework for Road Safety & Driver Behaviour Modelling


174. Visionerves: Automatic and Reproducible Hybrid AI for Peripheral Nervous System Recognition Applied to Endometriosis Cases


175. VLA-LPAF: Lightweight Perspective-Adaptive Fusion for Vision-Language-Action to Enable More Unconstrained Robotic Manipulation


176. The Describe-Then-Generate Bottleneck: How VLM Descriptions Alter Image Generation Outcomes


177. A Framework for Generating Artificial Datasets to Validate Absolute and Relative Position Concepts


178. Developing Training Procedures for Piecewise-linear Spline Activation Functions in Neural Networks


179. Event Causality Identification with Synthetic Control


180. WLFM: A Well-Logs Foundation Model for Multi-Task and Cross-Well Geological Interpretation


181. HyperNAS: Enhancing Architecture Representation for NAS Predictor via Hypernetwork


182. Sparse Training Scheme for Multimodal LLM


183. Augmenting Limited and Biased RCTs through Pseudo-Sample Matching-Based Observational Data Fusion Method


184. ConceptFlow: Hierarchical and Fine-grained Concept-Based Explanation for Convolutional Neural Networks


185. Early Prediction of Multi-Label Care Escalation Triggers in the Intensive Care Unit Using Electronic Health Records


186. AdaSTI: Conditional Diffusion Models with Adaptive Dependency Modeling for Spatio-Temporal Imputation


187. Weight Mapping Properties of a Dual Tree Single Clock Adiabatic Capacitive Neuron


188. KM-GPT: An Automated Pipeline for Reconstructing Individual Patient Data from Kaplan-Meier Plots


189. A Machine Learning Framework for Pathway-Driven Therapeutic Target Discovery in Metabolic Disorders


190. LoRALib: A Standardized Benchmark for Evaluating LoRA-MoE Methods


191. From Parameters to Performance: A Data-Driven Study on LLM Structure and Development


192. SDGF: Fusing Static and Multi-Scale Dynamic Correlations for Multivariate Time Series Forecasting


193. Self-Evolving LLMs via Continual Instruction Tuning


194. Two ways to knowledge?


195. Research on Metro Transportation Flow Prediction Based on the STL-GRU Combined Model


196. Safe-SAIL: Towards a Fine-grained Safety Landscape of Large Language Models via Sparse Autoencoder Interpretation Framework


197. Anomaly Detection in Electric Vehicle Charging Stations Using Federated Learning


198. NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment


199. A Coopetitive-Compatible Data Generation Framework for Cross-silo Federated Learning


200. MobileRL: Online Agentic Reinforcement Learning for Mobile GUI Agents


201. Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization


202. Prompt Optimization Meets Subspace Representation Learning for Few-shot Out-of-Distribution Detection


203. Solve it with EASE


204. BULL-ODE: Bullwhip Learning with Neural ODEs and Universal Differential Equations under Stochastic Demand


205. Data Valuation and Selection in a Federated Model Marketplace