전체 AI 논문 - 2025-09-18

1. Hierarchical Learning for Maze Navigation: Emergence of Mental Representations via Second-Order Learning


2. CrowdAgent: Multi-Agent Managed Multi-Source Annotation System


3. Exploring Major Transitions in the Evolution of Biological Cognition With Artificial Neural Networks


4. An Exhaustive DPLL Approach to Model Counting over Integer Linear Constraints with Simplification Techniques


5. MIRA: Empowering One-Touch AI Services on Smartphones with MLLM-based Instruction Recommendation


6. THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning


7. InfraMind: A Novel Exploration-based GUI Agentic Framework for Mission-critical Industrial Management


8. See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles


9. Programmable Cognitive Bias in Social Agents


10. Gen AI in Proof-based Math Courses: A Pilot Study


11. AI Agents with Human-Like Collaborative Tools: Adaptive Strategies for Enhanced Problem-Solving


12. SteeringControl: Holistic Evaluation of Alignment Steering in LLMs


13. From Next Token Prediction to (STRIPS) World Models – Preliminary Results


14. The Art of Saying “Maybe”: A Conformal Lens for Uncertainty Benchmarking in VLMs


15. $Agent^2$: An Agent-Generates-Agent Framework for Reinforcement Learning Automation


16. Asterisk Operator


17. Semantic Fusion with Fuzzy-Membership Features for Controllable Language Modelling


18. Agentic UAVs: LLM-Driven Autonomy with Integrated Tool-Calling and Cognitive Reasoning


19. Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning


20. OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft


21. Imagined Autocurricula


22. Position: AI Safety Must Embrace an Antifragile Perspective


23. FRIT: Using Causal Importance to Improve Chain-of-Thought Faithfulness


24. Evaluation Awareness Scales Predictably in Open-Weights Large Language Models


25. Explicit Reasoning Makes Better Judges: A Systematic Study on Accuracy, Efficiency, and Robustness


26. Apertus: Democratizing Open and Compliant LLMs for Global Language Environments


27. Language models’ activations linearly encode training-order recency


28. A Universal Banach–Bregman Framework for Stochastic Iterations: Unifying Stochastic Mirror Descent, Learning and LLM Training


29. Dense Video Understanding with Gated Residual Tokenization


30. Bridging Past and Future: Distribution-Aware Alignment for Time Series Forecasting


31. Synthesizing Behaviorally-Grounded Reasoning Chains: A Data-Generation Framework for Personal Finance LLMs


32. TGPO: Tree-Guided Preference Optimization for Robust Web Agent Reinforcement Learning


33. Where Do Tokens Go? Understanding Pruning Behaviors in STEP at High Resolutions


34. Reasoning Efficiently Through Adaptive Chain-of-Thought Compression: A Self-Optimizing Framework


35. Queen Detection in Beehives via Environmental Sensor Fusion for Low-Power Edge Computing


36. Machines are more productive than humans until they aren’t, and vice versa


37. Comprehensive Evaluation of CNN-Based Audio Tagging Models on Resource-Constrained Devices


38. Prompt2Auto: From Motion Prompt to Automated Control via Geometry-Invariant One-Shot Gaussian Process Learning


39. PhenoGnet: A Graph-Based Contrastive Learning Framework for Disease Similarity Prediction


40. SSL-SSAW: Self-Supervised Learning with Sigmoid Self-Attention Weighting for Question-Based Sign Language Translation


41. You Are What You Train: Effects of Data Composition on Training Context-aware Machine Translation Models


42. Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale


43. RFM-Editing: Rectified Flow Matching for Text-guided Audio Editing


44. MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment


45. Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency


46. Differential Privacy in Federated Learning: Mitigating Inference Attacks with Randomized Response


47. LLM Agents for Interactive Workflow Provenance: Reference Architecture and Evaluation Methodology


48. An Empirical Study on Failures in Automated Issue Solving


49. DSpAST: Disentangled Representations for Spatial Audio Reasoning with Large Language Models


50. MAP: End-to-End Autonomous Driving with Map-Assisted Planning


51. Ensemble of Pre-Trained Models for Long-Tailed Trajectory Prediction


52. Do Large Language Models Understand Word Senses?


53. FedSSG: Expectation-Gated and History-Aware Drift Alignment for Federated Learning


54. Synthetic Data Generation for Screen Time and App Usage


55. Combating Biomedical Misinformation through Multi-modal Claim Detection and Evidence-based Verification


56. Combining Evidence and Reasoning for Biomedical Fact-Checking


57. Masked Diffusion Models as Energy Minimization


58. Understanding the Process of Human-AI Value Alignment


59. Towards a Physics Foundation Model


60. Bridging the Synthetic-Real Gap: Supervised Domain Adaptation for Robust Spacecraft 6-DoF Pose Estimation


61. Teaching According to Talents! Instruction Tuning LLMs with Competence-Aware Curriculum Learning


62. BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching


63. Who is Introducing the Failure? Automatically Attributing Failures of Multi-Agent Systems via Spectrum Analysis


64. Exploring Data and Parameter Efficient Strategies for Arabic Dialect Identifications


65. Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning


66. State Space Models over Directed Graphs


67. Mitigating Query Selection Bias in Referring Video Object Segmentation


68. Automated Triaging and Transfer Learning of Incident Learning Safety Reports Using Large Language Representational Models


69. DSCC-HS: A Dynamic Self-Reinforcing Framework for Hallucination Suppression in Large Language Models


70. CraftMesh: High-Fidelity Generative Mesh Manipulation via Poisson Seamless Fusion


71. Improving Context Fidelity via Native Retrieval-Augmented Reasoning


72. Prompt Stability in Code LLMs: Measuring Sensitivity across Emotion- and Personality-Driven Variations


73. AgentCTG: Harnessing Multi-Agent Collaboration for Fine-Grained Precise Control in Text Generation


74. Re-purposing SAM into Efficient Visual Projectors for MLLM-Based Referring Image Segmentation


75. CL$^2$GEC: A Multi-Discipline Benchmark for Continual Learning in Chinese Literature Grammatical Error Correction


76. DREAM: Domain-aware Reasoning for Efficient Autonomous Underwater Monitoring


77. Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs


78. Deep Lookup Network


79. GitHub’s Copilot Code Review: Can AI Spot Security Flaws Before You Commit?


80. DeepLogit: A sequentially constrained explainable deep learning modeling approach for transport policy analysis


81. Secure, Scalable and Privacy Aware Data Strategy in Cloud


82. Mind the Gap: Aligning Knowledge Bases with User Needs to Enhance Mental Health Retrieval


83. A reduced-order derivative-informed neural operator for subsurface fluid-flow


84. Modernizing Facebook Scoped Search: Keyword and Embedding Hybrid Retrieval with LLM Evaluation


85. Agentic JWT: A Secure Delegation Protocol for Autonomous AI Agents


86. Intelligent Healthcare Imaging Platform An VLM-Based Framework for Automated Medical Image Analysis and Clinical Report Generation


87. TreeIRL: Safe Urban Driving with Tree Search and Inverse Reinforcement Learning


88. Dense-Jump Flow Matching with Non-Uniform Time Scheduling for Robotic Policies: Mitigating Multi-Step Inference Degradation


89. Complexity Bounds for Smooth Convex Multiobjective Optimization


90. ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors


91. Reproducible workflow for online AI in digital health


92. Prompt2DAG: A Modular Methodology for LLM-Based Data Enrichment Pipeline Generation



94. MapAnything: Universal Feed-Forward Metric 3D Reconstruction


95. Justice in Judgment: Unveiling (Hidden) Bias in LLM-assisted Peer Reviews


96. EdiVal-Agent: An Object-Centric Framework for Automated, Scalable, Fine-Grained Evaluation of Multi-Turn Editing


97. The threat of analytic flexibility in using large language models to simulate human data: A call to attention


98. TICL: Text-Embedding KNN For Speech In-Context Learning Unlocks Speech Recognition Abilities of Large Multimodal Models


99. The Intercepted Self: How Generative AI Challenges the Dynamics of the Relational Self


100. A Domain Knowledge Informed Approach for Anomaly Detection of Electric Vehicle Interior Sounds


101. Landcover classification and change detection using remote sensing and machine learning: a case study of Western Fiji


102. Uncovering AI Governance Themes in EU Policies using BERTopic and Thematic Analysis


103. ASTREA: Introducing Agentic Intelligence for Orbital Thermal Autonomy


104. An Empirical Analysis of VLM-based OOD Detection: Mechanisms, Advantages, and Sensitivity


105. Generative AI Pipeline for Interactive Prompt-driven 2D-to-3D Vascular Reconstruction for Fontan Geometries from Contrast-Enhanced X-Ray Fluoroscopy Imaging


106. The Provenance Problem: LLMs and the Breakdown of Citation Norms


107. Evaluating undergraduate mathematics examinations in the era of generative AI: a curriculum-level case study


108. Synthetic Data and the Shifting Ground of Truth


109. Hybrid Quantum-Classical Model for Image Classification


110. Label-Efficient Grasp Joint Prediction with Point-JEPA


111. Accuracy Paradox in Large Language Models: Regulating Hallucination Risks in Generative AI


112. Real World Robotic Exploration using Deep Neural Networks Trained in Photorealistic Reconstructed Environments


113. Proximity-Based Evidence Retrieval for Uncertainty-Aware Neural Networks


114. Explainable AI-Enhanced Supervisory Control for High-Precision Spacecraft Formation


115. Dual Actor DDPG for Airborne STAR-RIS Assisted Communications


116. Prognosis of COVID-19 using Artificial Intelligence: A Systematic Review and Meta-analysis


117. Joint data imputation and mechanistic modelling for simulating heart-brain interactions in incomplete datasets