전체 AI 논문 - 2025-09-12

1. The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs


2. Boosting Embodied AI Agents through Perception-Generation Disaggregation and Asynchronous Pipeline Execution


3. Compositional Concept Generalization with Variational Quantum Circuits


4. SEDM: Scalable Self-Evolving Distributed Memory for Agents


5. Inteligencia Artificial jurídica y el desafío de la veracidad: análisis de alucinaciones, optimización de RAG y principios para una integración responsable


6. TORSO: Template-Oriented Reasoning Towards General Tasks


7. Curriculum-Based Multi-Tier Semantic Exploration via Deep Reinforcement Learning


8. Towards Adaptive ML Benchmarks: Web-Agent-Driven Construction, Domain Expansion, and Metric Optimization


9. Measuring Implicit Spatial Coordination in Teams: Effects on Collective Intelligence and Performance


10. Explaining Tournament Solutions with Minimal Supports


11. LightAgent: Production-level Open-source Agentic AI Framework


12. Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for Multistep Reasoning


13. Fusing Knowledge and Language: A Comparative Study of Knowledge Graph-Based Question Answering with LLMs



15. Enabling Regulatory Multi-Agent Collaboration: Architecture, Challenges, and Solutions


16. ProgD: Progressive Multi-scale Decoding with Dynamic Graphs for Joint Multi-agent Motion Forecasting


17. Mind Meets Space: Rethinking Agentic Spatial Intelligence from a Neuroscience-inspired Perspective


18. Anti-Money Laundering Machine Learning Pipelines; A Technical Analysis on Identifying High-risk Bank Clients with Supervised Learning


19. Understanding Economic Tradeoffs Between Human and AI Agents in Bargaining Games


20. Instructional Prompt Optimization for Few-Shot LLM-Based Recommendations on Cold-Start Users


21. Uncertainty Awareness and Trust in Explainable AI- On Trust Calibration using Local and Global Explanations


22. ForTIFAI: Fending Off Recursive Training Induced Failure for AI Models


23. Global Constraint LLM Agents for Text-to-Model Translation


24. Automated Unity Game Template Generation from GDDs via NLP and Multi-Modal LLMs


25. An Interval Type-2 Version of Bayes Theorem Derived from Interval Probability Range Estimates Provided by Subject Matter Experts


26. ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms


27. CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models


28. SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning


29. Feasibility-Guided Fair Adaptive Offline Reinforcement Learning for Medicaid Care Management


30. Retrieval-Augmented Generation for Reliable Interpretation of Radio Regulations


31. Explaining Concept Drift through the Evolution of Group Counterfactuals


32. LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering


33. Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth


34. Graph Alignment via Dual-Pass Spectral Encoding and Latent Space Communication


35. ObjectReact: Learning Object-Relative Control for Visual Navigation


36. Fluent but Unfeeling: The Emotional Blind Spots of Language Models


37. Invisible Attributes, Visible Biases: Exploring Demographic Shortcuts in MRI-based Alzheimer’s Disease Classification


38. An improved educational competition optimizer with multi-covariance learning operators for global optimization problems


39. Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders


40. A modified RIME algorithm with covariance learning and diversity enhancement for numerical optimization


41. Towards Explainable Job Title Matching: Leveraging Semantic Textual Relatedness and Knowledge Graphs


42. Explainable AI for Accelerated Microstructure Imaging: A SHAP-Guided Protocol on the Connectome 2.0 scanner


43. Incorporating AI Incident Reporting into Telecommunications Law and Policy: Insights from India


44. OpenFake: An Open Dataset and Platform Toward Large-Scale Deepfake Detection


45. Prompt Pirates Need a Map: Stealing Seeds helps Stealing Prompts


46. Resource-Efficient Glioma Segmentation on Sub-Saharan MRI


47. ENSI: Efficient Non-Interactive Secure Inference for Large Language Models


48. We’re Still Doing It (All) Wrong: Recommender Systems, Fifteen Years Later


49. LLMs Don’t Know Their Own Decision Boundaries: The Unreliability of Self-Generated Counterfactual Explanations


50. MetaLLMix : An XAI Aided LLM-Meta-learning Based Approach for Hyper-parameters Optimization


51. Robust Non-Linear Correlations via Polynomial Regression


52. Classification of Driver Behaviour Using External Observation Techniques for Autonomous Vehicles


53. MoSE: Unveiling Structural Patterns in Graphs via Mixture of Subgraph Experts


54. OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning


55. Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization


56. Modality-Agnostic Input Channels Enable Segmentation of Brain lesions in Multimodal MRI with Sequences Unavailable During Training


57. Adaptive Knowledge Distillation using a Device-Aware Teacher for Low-Complexity Acoustic Scene Classification


58. CoAtNeXt:An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification


59. Virtual staining for 3D X-ray histology of bone implants


60. Vejde: A Framework for Inductive Deep Reinforcement Learning Based on Factor Graph Color Refinement


61. Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning


62. Bona fide Cross Testing Reveals Weak Spot in Audio Deepfake Detection Systems


63. Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function


64. Efficient Trie-based Biasing using K-step Prediction for Rare Word Recognition


65. On Integrating Large Language Models and Scenario-Based Programming for Improving Software Reliability


66. Probing Pre-trained Language Models on Code Changes: Insights from ReDef, a High-Confidence Just-in-Time Defect Prediction Dataset


67. Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection


68. EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs


69. Adaptive Pareto-Optimal Token Merging for Edge Transformer Models in Semantic Communication


70. Target-oriented Multimodal Sentiment Classification with Counterfactual-enhanced Debiasing


71. A Knowledge Noise Mitigation Framework for Knowledge-based Visual Question Answering


72. HISPASpoof: A New Dataset For Spanish Speech Forensics


73. OCELOT 2023: Cell Detection from Cell-Tissue Interaction Challenge


74. Video Understanding by Design: How Datasets Shape Architectures and Insights


75. Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation


76. ViRanker: A BGE-M3 & Blockwise Parallel Transformer Cross-Encoder for Vietnamese Reranking


77. Automated Classification of Tutors’ Dialogue Acts Using Generative AI: A Case Study Using the CIMA Corpus


78. Character-Level Perturbations Disrupt LLM Watermarks


79. DP-FedLoRA: Privacy-Enhanced Federated Fine-Tuning for On-Device Large Language Models


80. Towards Confidential and Efficient LLM Inference with Dual Privacy Protection


81. SQAP-VLA: A Synergistic Quantization-Aware Pruning Framework for High-Performance Vision-Language-Action Models


82. KoopMotion: Learning Almost Divergence Free Koopman Flow Fields for Motion Planning


83. STRIDE: Scalable and Interpretable XAI via Subset-Free Functional Decomposition


84. Improving LLM Safety and Helpfulness using SFT and DPO: A Study on OPT-350M


85. A Scoping Review of Machine Learning Applications in Power System Protection and Disturbance Management


86. MoWE : A Mixture of Weather Experts


87. Stated Preference for Interaction and Continued Engagement (SPICE): Evaluating an LLM’s Willingness to Re-engage in Conversation


88. Envy-Free but Still Unfair: Envy-Freeness Up To One Item (EF-1) in Personalized Recommendation


89. Personalized Sleep Prediction via Deep Adaptive Spatiotemporal Modeling and Sparse Data


90. Can Vision-Language Models Solve Visual Math Equations?


91. Open-sci-ref-0.01: open and reproducible reference baselines for language model and dataset comparison


92. Implicit Neural Representations of Intramyocardial Motion and Strain


93. Similarity-based Outlier Detection for Noisy Object Re-Identification Using Beta Mixtures


94. Instance-Optimal Matrix Multiplicative Weight Update and Its Quantum Applications


95. PromptGuard: An Orchestrated Prompting Framework for Principled Synthetic Text Generation for Vulnerable Populations using LLMs with Enhanced Safety, Fairness, and Controllability


96. Recurrence Meets Transformers for Universal Multimodal Retrieval


97. Benchmarking Energy Efficiency of Large Language Models Using vLLM


98. Investigating Student Interaction Patterns with Large Language Model-Powered Course Assistants in Computer Science Courses


99. Multi Robot Coordination in Highly Dynamic Environments: Tackling Asymmetric Obstacles and Limited Communication


100. A vibe coding learning design to enhance EFL students’ talking to, through, and about AI


101. Safe and Certifiable AI Systems: Concepts, Challenges, and Lessons Learned


102. Uncertainty Estimation using Variance-Gated Distributions


103. Deep opacity and AI: A threat to XAI and to privacy protection mechanisms


104. PerFairX: Is There a Balance Between Fairness and Personality in Large Language Model Recommendations?