All AI Papers - 2025-10-02

1. Generalized Parallel Scaling with Interdependent Generations


2. Apriel-1.5-15b-Thinker


3. Exploring Network-Knowledge Graph Duality: A Case Study in Agentic Supply Chain Risk Analysis


4. PRISM-Consult: A Panel-of-Experts Architecture for Clinician-Aligned Diagnosis


5. Optimizing Fairness in Production Planning: A Human-Centric Approach to Machine and Workforce Allocation


6. Safety Instincts: LLMs Learn to Trust Their Internal Compass for Self-Defense


7. Typed Chain-of-Thought: A Curry-Howard Framework for Verifying LLM Reasoning


8. Activation-Deactivation: A General Framework for Robust Post-hoc Explainable AI


9. Uncovering the Computational Ingredients of Human-Like Representations in LLMs


10. Shape Happens: Automatic Feature Manifold Discovery in LLMs via Supervised Multi-Dimensional Scaling


11. Integrating AI and Ensemble Forecasting: Explainable Materials Planning with Scorecards and Trend Insights for a Large-Scale Manufacturer


12. Adaptive Federated Few-Shot Rare-Disease Diagnosis with Energy-Aware Secure Aggregation


13. QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL


14. A Neuro-Fuzzy System for Interpretable Long-Term Stock Market Forecasting


15. Test-Time Search in Neural Graph Coarsening Procedures for the Capacitated Vehicle Routing Problem


16. On Discovering Algorithms for Adversarial Imitation Learning


17. FusionAdapter for Few-Shot Relation Learning in Multimodal Knowledge Graphs


18. Unveiling Interesting Insights: Monte Carlo Tree Search for Knowledge Discovery


19. Learning Compact Representations of LLM Abilities via Item Response Theory


20. Improving Cryptocurrency Pump-and-Dump Detection through Ensemble-Based Models and Synthetic Oversampling Techniques


21. Benchmarking Machine Learning Models for Fault Classification and Localization in Power System Protection


22. Logical Consistency Between Disagreeing Experts and Its Role in AI Safety


23. Semantic Bridges Between First Order c-Representations and Cost-Based Semantics: An Initial Perspective


24. Benchmarking Agentic Systems in Automated Scientific Information Extraction with ChemX


25. AI in data science education: experiences from the classroom


26. DIA: The Adversarial Exposure of Deterministic Inversion in Diffusion Models


27. EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty


28. AttentionDep: Domain-Aware Attention for Explainable Depression Severity Assessment


29. ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning


30. Relevance-Zone Reduction in Game Solving


31. Batch-CAM: Introduction to better reasoning in convolutional deep learning models


32. Expected Attention: KV Cache Compression by Estimating Attention from Future Queries Distribution


33. Collaborative-Distilled Diffusion Models (CDDM) for Accelerated and Lightweight Trajectory Prediction


34. Is Model Editing Built on Sand? Revealing Its Illusory Success and Fragile Foundation


35. HARPA: A Testability-Driven, Literature-Grounded Framework for Research Ideation


36. ACON: Optimizing Context Compression for Long-horizon LLM Agents


37. Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability


38. Data Quality Challenges in Retrieval-Augmented Generation


39. VIRTUE: Visual-Interactive Text-Image Universal Embedder


40. Rethinking Reward Models for Multi-Domain Test-Time Scaling


41. Expandable Decision-Making States for Multi-Agent Deep Reinforcement Learning in Soccer Tactical Analysis


42. Automated Evaluation can Distinguish the Good and Bad AI Responses to Patient Questions about Hospitalization


43. Towards Self-Evolving Benchmarks: Synthesizing Agent Trajectories via Test-Time Exploration under Validate-by-Reproduce Paradigm


44. Semantic-Driven AI Agent Communications: Challenges and Solutions


45. Hierarchical Reasoning Model: A Critical Supplementary Material


46. When Hallucination Costs Millions: Benchmarking AI Agents in High-Stakes Adversarial Financial Markets


47. BiasBusters: Uncovering and Mitigating Tool Selection Bias in Large Language Models


48. ICL Optimized Fragility


49. MAGIC-MASK: Multi-Agent Guided Inter-Agent Collaboration with Mask-Based Explainability for Reinforcement Learning


50. DualTune: Decoupled Fine-Tuning for On-Device Agentic Systems


51. Thinkquel: A Model Dedicated to Text-to-dbt Using Synthetic Data and a Span-Aware Objective


52. Object-Centric Case-Based Reasoning via Argumentation


53. Drones that Think on their Feet: Sudden Landing Decisions with Embodied AI


54. AuditAgent: Expert-Guided Multi-Agent Reasoning for Cross-Document Fraudulent Evidence Discovery


55. Judging by Appearances? Auditing and Intervening Vision-Language Models for Bail Prediction


56. Towards a Framework for Supporting the Ethical and Regulatory Certification of AI Systems


57. NeurIPS should lead scientific consensus on AI policy


58. ARS: Adaptive Reasoning Suppression for Efficient Large Reasoning Language Models


59. ToolBrain: A Flexible Reinforcement Learning Framework for Agentic Tools


60. Learning to Lead Themselves: Agentic AI in MAS using MARL


61. TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments


62. COM-BOM: Bayesian Exemplar Search for Efficiently Exploring the Accuracy-Calibration Pareto Frontier


63. Code2Video: A Code-centric Paradigm for Educational Video Generation


64. EditTrack: Detecting and Attributing AI-assisted Image Editing


65. Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity


66. Fiaingen: A financial time series generative method matching real-world data quality


67. Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards


68. GRAD: Generative Retrieval-Aligned Demonstration Sampler for Efficient Few-Shot Reasoning


69. Social Welfare Function Leaderboard: When LLM Agents Allocate Social Welfare


70. Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?


71. mR3: Multilingual Rubric-Agnostic Reward Reasoning Models


72. TabINR: An Implicit Neural Representation Framework for Tabular Data Imputation


73. A Practitioner’s Guide to Multi-turn Agentic Reinforcement Learning


74. Rethinking Thinking Tokens: LLMs as Improvement Operators



76. Hybrid Dialogue State Tracking for Persian Chatbots: A Language Model-Based Approach


77. GEM: A Gym for Agentic LLMs


78. Interpreting Language Models Through Concept Descriptions: A Survey


79. Authentic Discrete Diffusion Model


80. CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs


81. The Good, the Bad, and the Sampled: a No-Regret Approach to Safe Online Classification


82. TextCAM: Explaining Class Activation Map with Text



84. Bridging the Gap Between Simulated and Real Network Data Using Transfer Learning


85. Benchmarking Foundation Models with Retrieval-Augmented Generation in Olympic-Level Physics Problem Solving


86. Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers


87. RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training


88. “We are not Future-ready”: Understanding AI Privacy Risks and Existing Mitigation Strategies from the Perspective of AI Developers in Europe


89. Bridging Language Gaps: Advances in Cross-Lingual Information Retrieval with Multilingual LLMs


90. TubeDAgger: Reducing the Number of Expert Interventions with Stochastic Reach-Tubes


91. Span-level Detection of AI-generated Scientific Text via Contrastive Learning and Structural Calibration


92. GLAI: GreenLightningAI for Accelerated Training through Knowledge Decoupling


93. Advancing Automated Ethical Profiling in SE: a Zero-Shot Evaluation of LLM Reasoning


94. A Technique Based on Trade-off Maps to Visualise and Analyse Relationships Between Objectives in Optimisation Problems


95. Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model


96. Erase to Improve: Erasable Reinforcement Learning for Search-Augmented LLMs


97. Can World Models Benefit VLMs for World Dynamics?


98. Mechanistic Interpretability as Statistical Estimation: A Variance Analysis of EAP-IG


99. Feature Identification for Hierarchical Contrastive Learning


100. Towards Verifiable Federated Unlearning: Framework, Challenges, and The Road Ahead


101. Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning


102. What You See is What You Ask: Evaluating Audio Descriptions


103. MG2FlowNet: Accelerating High-Reward Sample Generation via Enhanced MCTS and Greediness Control


104. Fast, Secure, and High-Capacity Image Watermarking with Autoencoded Text Vectors


105. Solar PV Installation Potential Assessment on Building Facades Based on Vision and Language Foundation Models


106. MetaLogic: Robustness Evaluation of Text-to-Image Models via Logically Equivalent Prompts


107. Uncertainty-Aware Concept Bottleneck Models with Enhanced Interpretability


108. UniverSR: Unified and Versatile Audio Super-Resolution via Vocoder-Free Flow Matching


109. Multi-Objective Task-Aware Predictor for Image-Text Alignment


110. From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling


111. Neural Diffusion Processes for Physically Interpretable Survival Prediction


112. Extreme Blind Image Restoration via Prompt-Conditioned Information Bottleneck


113. CroSTAta: Cross-State Transition Attention Transformer for Robotic Manipulation



115. Inclusive Easy-to-Read Generation for Individuals with Cognitive Impairments


116. Facilitating Cognitive Accessibility with LLMs: A Multi-Task Approach to Easy-to-Read Text Generation


117. Align Your Tangent: Training Better Consistency Models via Manifold-Aligned Tangents


118. Tenyidie Syllabification corpus creation and deep learning applications


119. FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression


120. What Did I Learn? Operational Competence Assessment for AI-Based Trajectory Planners


121. Hybrid Training for Vision-Language-Action Models


122. AI-Driven Self-Evolving Software: A Promising Path Toward Software Automation


123. U-DFA: A Unified DINOv2-Unet with Dual Fusion Attention for Multi-Dataset Medical Segmentation


124. SAGE-LD: Towards Scalable and Generalizable End-to-End Language Diarization via Simulated Data Augmentation


125. Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning


126. Panorama: Fast-Track Nearest Neighbors


127. Memory Determines Learning Direction: A Theory of Gradient-Based Optimization in State Space Models


128. PromptPilot: Improving Human-AI Collaboration Through LLM-Enhanced Prompt Engineering


129. On Predictability of Reinforcement Learning Dynamics for Large Language Models


130. EMR-AGENT: Automating Cohort and Feature Extraction from EMR Databases


131. Forestpest-YOLO: A High-Performance Detection Framework for Small Forestry Pests


132. Architectural Transformations and Emerging Verification Demands in AI-Enabled Cyber-Physical Systems


133. Adaptive Data-Knowledge Alignment in Genetic Perturbation Prediction


134. Copy-Paste to Mitigate Large Language Model Hallucinations


135. Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs


136. Relative-Absolute Fusion: Rethinking Feature Extraction in Image-Based Iterative Method Selection for Solving Sparse Linear Systems


137. MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance


138. Normal-Abnormal Guided Generalist Anomaly Detection


139. Exploring System 1 and 2 communication for latent reasoning in LLMs


140. From Human Hands to Robot Arms: Manipulation Skills Transfer via Trajectory Alignment


141. Black-Box Time-Series Domain Adaptation via Cross-Prompt Foundation Models


142. PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation


143. Make a Video Call with LLM: A Measurement Campaign over Five Mainstream Apps


144. Analyzing Latent Concepts in Code Language Models


145. Feature Identification via the Empirical NTK


146. Integrating Offline Pre-Training with Online Fine-Tuning: A Reinforcement Learning Approach for Robot Social Navigation


147. TimeEmb: A Lightweight Static-Dynamic Disentanglement Framework for Time Series Forecasting


148. UrbanGraph: Physics-Informed Spatio-Temporal Dynamic Heterogeneous Graphs for Urban Microclimate Prediction


149. Measuring and Controlling the Spectral Bias for Self-Supervised Image Denoising


150. Cloud Investigation Automation Framework (CIAF): An AI-Driven Approach to Cloud Forensics


151. A Call to Action for a Secure-by-Design Generative AI Paradigm


152. Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment


153. Automated Structured Radiology Report Generation with Rich Clinical Context


154. Domain-Specialized Interactive Segmentation Framework for Meningioma Radiotherapy Planning


155. David and Goliath in Medical Vision: Convolutional Networks vs Biomedical Vision Language Models


156. EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy Observations


157. AbsTopK: Rethinking Sparse Autoencoders For Bidirectional Features


158. Physics-Informed Neural Controlled Differential Equations for Scalable Long Horizon Multi-Agent Motion Forecasting


159. SAGE-Music: Low-Latency Symbolic Music Generation via Attribute-Specialized Key-Value Head Sharing


160. Train on Validation (ToV): Fast data selection with applications to fine-tuning


161. Discrete Wavelet Transform as a Facilitator for Expressive Latent Space Representation in Variational Autoencoders in Satellite Imagery


162. Combining Large Language Models and Gradient-Free Optimization for Automatic Control Policy Synthesis


163. Attribution Gradients: Incrementally Unfolding Citations for Critical Examination of Attributed AI Answers


164. DiSA-IQL: Offline Reinforcement Learning for Robust Soft Robot Control under Distribution Shifts


165. In-Context Curiosity: Distilling Exploration for Decision-Pretrained Transformers on Bandit Tasks


166. Navigating the Synchrony-Stability Frontier in Adaptive Chatbots


167. Structural Refinement of Bayesian Networks for Efficient Model Parameterisation


168. Reasoning-Aware Prompt Orchestration: A Foundation Model for Multi-Agent Language Model Coordination


169. A Framework for Selection of Machine Learning Algorithms Based on Performance Metrices and Akaike Information Criteria in Healthcare, Telecommunication, and Marketing Sector


170. DecepChain: Inducing Deceptive Reasoning in Large Language Models


171. MAVUL: Multi-Agent Vulnerability Detection via Contextual Reasoning and Interactive Refinement


172. Digital Domination: A Case for Republican Liberty in Artificial Intelligence


173. Barriers for Learning in an Evolving World: Mathematical Understanding of Loss of Plasticity


174. Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models


175. o-MEGA: Optimized Methods for Explanation Generation and Analysis


176. Data driven approaches in nanophotonics: A review of AI-enabled metadevices


177. SLogic: Subgraph-Informed Logical Rule Learning for Knowledge Graph Completion


178. Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction


179. Retrieval-Augmented Generation for Electrocardiogram-Language Models


180. Learning Energy-based Variational Latent Prior for VAEs


181. A Hierarchical Agentic Framework for Autonomous Drone-Based Visual Inspection


182. TASER: Translation Assessment via Systematic Evaluation and Reasoning


183. Can AI agents understand spoken conversations about data visualizations in online meetings?


184. SecureBERT 2.0: Advanced Language Model for Cybersecurity Intelligence


185. Debunk the Myth of SFT Generalization


186. BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses


187. The Pitfalls of KV Cache Compression


188. TGPO: Temporal Grounded Policy Optimization for Signal Temporal Logic Tasks


189. Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space


190. Directed-MAML: Meta Reinforcement Learning Algorithm with Task-directed Approximation


191. LoRAFusion: Efficient LoRA Fine-Tuning for LLMs


192. GRPO-$λ$: Credit Assignment improves LLM Reasoning


193. PrunedLoRA: Robust Gradient-Based structured pruning for Low-rank Adaptation in Fine-tuning


194. Why Can’t Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls


195. A Systematic Study of Large Language Models for Task and Motion Planning With PDDLStream


196. CHAI: Command Hijacking against embodied AI


197. Personalized Reasoning: Just-In-Time Personalization and Why LLMs Fail At It


198. Privacy-Preserving Learning-Augmented Data Structures


199. Partial Identification Approach to Counterfactual Fairness Assessment


200. RoboPilot: Generalizable Dynamic Robotic Manipulation with Dual-thinking Modes


201. Stealing AI Model Weights Through Covert Communication Channels


202. Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback


203. Optimizing What Matters: AUC-Driven Learning for Robust Neural Retrieval


204. Nonparametric Identification of Latent Concepts


205. BigBang-Proton Technical Report: Next-Word-Prediction is Scientific Multitask Learner


206. Direct Token Optimization: A Self-contained Approach to Large Language Model Unlearning


207. Simulating Student Success in the Age of GenAI: A Kantian-Axiomatic Perspective


208. SoREX: Towards Self-Explainable Social Recommendation with Relevant Ego-Path Extraction


209. Adaptive and Resource-efficient Agentic AI Systems for Mobile and Embedded Devices: A Survey


210. Identifying All ε-Best Arms in (Misspecified) Linear Bandits


211. Geo-R1: Unlocking VLM Geospatial Reasoning with Cross-View Reinforcement Learning


212. Intelligent 5S Audit: Application of Artificial Intelligence for Continuous Improvement in the Automotive Industry


213. AstroMMBench: A Benchmark for Evaluating Multimodal Large Language Models Capabilities in Astronomy


214. Efficient CNN Compression via Multi-method Low Rank Factorization and Feature Map Similarity


215. Survey of AI-Powered Approaches for Osteoporosis Diagnosis in Medical Imaging


216. Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving


217. FSDENet: A Frequency and Spatial Domains based Detail Enhancement Network for Remote Sensing Semantic Segmentation


218. HiDe: Rethinking The Zoom-IN method in High Resolution MLLMs via Hierarchical Decoupling


219. Object-AVEdit: An Object-level Audio-Visual Editing Model


220. AI-Based Stroke Rehabilitation Domiciliary Assessment System with ST_GCN Attention


221. Deep Learning Approaches with Explainable AI for Differentiating Alzheimer Disease and Mild Cognitive Impairment


222. Explanation-Driven Counterfactual Testing for Faithfulness in Vision-Language Model Explanations


223. Reinforcement Learning-Based Prompt Template Stealing for Text-to-Image Models


224. Beyond the Prompt: Gender Bias in Text-to-Image Models, with a Case Study on Hospital Professions


225. Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness


226. Uncovering Intrinsic Capabilities: A Paradigm for Data Curation in Vision-Language Models


227. AutoPK: Leveraging LLMs and a Hybrid Similarity Metric for Advanced Retrieval of Pharmacokinetic Data from Complex Tables and Documents


228. DexBench: Benchmarking LLMs for Personalized Decision Making in Diabetes Management


229. On Robustness of Vision-Language-Action Model against Multi-Modal Perturbations


230. Deep Learning-Based Pneumonia Detection from Chest X-ray Images: A CNN Approach with Performance Analysis and Clinical Implications


231. Review of Hallucination Understanding in Large Language and Vision Models


232. Hybrid Deep Learning for Hyperspectral Single Image Super-Resolution


233. WaveMind: Towards a Conversational EEG Foundation Model Aligned to Textual and Visual Modalities


234. VibeCodeHPC: An Agent-Based Iterative Prompting Auto-Tuner for HPC Code Generation Using LLMs


235. Temporal-Aware Iterative Speech Model for Dementia Detection


236. Enhancing Safety in Diabetic Retinopathy Detection: Uncertainty-Aware Deep Learning Models with Rejection Capabilities


237. Rethinking RoPE Scaling in Quantized LLM: Theory, Outlier, and Channel-Band Analysis with Weight Rescaling


238. Learning Inter-Atomic Potentials without Explicit Equivariance


239. EpidemIQs: Prompt-to-Paper LLM Agents for Epidemic Modeling and Analysis


240. IA aplicada al análisis del conflicto Irán-Israel: Mapeo de discursos en YouTube


241. Methodological Framework for Quantifying Semantic Test Coverage in RAG Systems


242. Autonomous Multi-Robot Infrastructure for AI-Enabled Healthcare Delivery and Diagnostics


243. MARS: Audio Generation via Multi-Channel Autoregression on Spectrograms


244. PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models


245. ReLumix: Extending Image Relighting to Video via Video Diffusion Models


246. EVO-LRP: Evolutionary Optimization of LRP for Interpretable Model Explanations