All AI Papers - 2025-08-29

1. Model Science: getting serious about verification, explanation and control of AI systems


2. SWIRL: A Staged Workflow for Interleaved Reinforcement Learning in Mobile GUI Control


3. Flocking Behavior: An Innovative Inspiration for the Optimization of Production Plants


4. CASE: An Agentic AI Framework for Enhancing Scam Intelligence in Digital Payments


5. Tracking World States with Language Models: State-Based Evaluation Using Chess


6. Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?


7. InquireMobile: Teaching VLM-based Mobile Agent to Request Human Assistance via Reinforcement Fine-Tuning


8. Instructional Agents: LLM Agents on Automated Course Material Generation for Teaching Faculties


9. ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding


10. Skill-based Explanations for Serendipitous Course Recommendation


11. Democracy-in-Silico: Institutional Design as Alignment in AI-Governed Polities


12. Caught in the Act: a mechanistic approach to detecting deception


13. SLIM: Subtrajectory-Level Elimination for More Effective Reasoning


14. Reliable Weak-to-Strong Monitoring of LLM Agents


15. Quantized but Deceptive? A Multi-Dimensional Truthfulness Evaluation of Quantized LLMs


16. Aleks: AI powered Multi Agent System for Autonomous Scientific Discovery via Data-Driven Approaches in Plant Science


17. Sycophancy as compositions of Atomic Psychometric Traits


18. CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning


19. Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning


20. Patch Progression Masked Autoencoder with Fusion CNN Network for Classifying Evolution Between Two Pairs of 2D OCT Slices


21. DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis


22. Large Language Models (LLMs) for Electronic Design Automation (EDA)


23. Symphony: A Decentralized Multi-Agent Framework for Scalable Collective Intelligence


24. HPC Digital Twins for Evaluating Scheduling Policies, Incentive Structures and their Impact on Power and Cooling


25. Decomposing Behavioral Phase Transitions in LLMs: Order Parameters for Emergent Misalignment


26. Cross-Platform E-Commerce Product Categorization and Recategorization: A Multimodal Hierarchical Classification Approach


27. Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation


28. MathBuddy: A Multimodal System for Affective Math Tutoring


29. Diffusion Language Models Know the Answer Before Decoding


30. GLSim: Detecting Object Hallucinations in LVLMs via Global-Local Similarity


31. Dhati+: Fine-tuned Large Language Models for Arabic Subjectivity Evaluation


32. WaveHiT-SR: Hierarchical Wavelet Network for Efficient Image Super-Resolution


33. The Next Layer: Augmenting Foundation Models with Structure-Preserving and Attention-Guided Learning for Local Patches to Global Context Awareness in Computational Pathology


34. Logical Reasoning with Outcome Reward Models for Test-Time Scaling


35. The Information Dynamics of Generative Diffusion


36. AI-Powered Detection of Inappropriate Language in Medical School Curricula


37. Generative AI for Testing of Autonomous Driving Systems: A Survey


38. Multispectral LiDAR data for extracting tree points in urban and suburban areas



40. PSO-Merging: Merging Models Based on Particle Swarm Optimization


41. Gradient Rectification for Robust Calibration under Distribution Shift


42. From Research to Reality: Feasibility of Gradient Inversion Attacks in Federated Learning


43. ERSR: An Ellipse-constrained pseudo-label refinement and symmetric regularization framework for semi-supervised fetal head segmentation in ultrasound images


44. Bootstrapping Learned Cost Models with Synthetic SQL Queries


45. A bag of tricks for real-time Mitotic Figure detection


46. NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks


47. Attention is also needed for form design


48. Safety Alignment Should Be Made More Than Just A Few Attention Heads


49. Topological Uncertainty for Anomaly Detection in the Neural-network EoS Inference with Neutron Star Data


50. Survey of Specialized Large Language Model


51. Arbitrary Precision Printed Ternary Neural Networks with Holistic Evolutionary Approximation


52. Intellectual Property in Graph-Based Machine Learning as a Service: Attacks and Defenses


53. Beyond BEV: Optimizing Point-Level Tokens for Collaborative Perception


54. Invited Paper: Feature-to-Classifier Co-Design for Mixed-Signal Smart Flexible Wearables for Healthcare at the Extreme Edge


55. Divide, Weight, and Route: Difficulty-Aware Optimization with Dynamic Expert Fusion for Long-tailed Recognition


56. Training for Obsolescence? The AI-Driven Education Trap


57. Towards Instance-wise Personalized Federated Learning via Semi-Implicit Bayesian Prompt Tuning


58. A Scenario-Oriented Survey of Federated Recommender Systems: Techniques, Challenges, and Future Directions


59. LFD: Layer Fused Decoding to Exploit External Knowledge in Retrieval-Augmented Generation


60. FinCast: A Foundation Model for Financial Time-Series Forecasting


61. IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation


62. CompLex: Music Theory Lexicon Constructed by Autonomous Agents for Automatic Music Generation


63. Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities


64. Hallucinating with AI: AI Psychosis as Distributed Delusions


65. Towards stable AI systems for Evaluating Arabic Pronunciations


66. Towards a Holistic and Automated Evaluation Framework for Multi-Level Comprehension of LLMs in Book-Length Contexts


67. Interact-Custom: Customized Human Object Interaction Image Generation


68. Multimodal Prototype Alignment for Semi-supervised Pathology Image Segmentation


69. Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era


70. Energy-Efficient Learning-Based Beamforming for ISAC-Enabled V2X Networks


71. FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection


72. Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models


73. Just Because You Can, Doesn’t Mean You Should: LLMs for Data Fitting


74. Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference


75. Language Models Identify Ambiguities and Exploit Loopholes


76. WEBEYETRACK: Scalable Eye-Tracking for the Browser via On-Device Few-Shot Personalization


77. Orchid: Orchestrating Context Across Creative Workflows with Generative AI


78. A Self-Supervised Mixture-of-Experts Framework for Multi-behavior Recommendation


79. Learning Game-Playing Agents with Generative Code Optimization


80. Servant, Stalker, Predator: How An Honest, Helpful, And Harmless (3H) Agent Unlocks Adversarial Skills


81. Sat2Flow: A Structure-Aware Diffusion Framework for Human Flow Generation from Satellite Imagery


82. PoolFlip: A Multi-Agent Reinforcement Learning Security Environment for Cyber Defense


83. Data-Efficient Symbolic Regression via Foundation Model Distillation


84. Improving Low-Resource Translation with Dictionary-Guided Fine-Tuning and RL: A Spanish-to-Wayuunaiki Study


85. Concurrent validity of computer-vision artificial intelligence player tracking software using broadcast footage


86. Automatic Question & Answer Generation Using Generative Large Language Model (LLM)


87. SIExVulTS: Sensitive Information Exposure Vulnerability Detection System using Transformer Models and Static Analysis



89. Incentivized Lipschitz Bandits


90. Addressing Weak Authentication like RFID, NFC in EVs and EVCs using AI-powered Adaptive Authentication


91. Bridging Language Gaps: Enhancing Few-Shot Language Adaptation


92. “She was useful, but a bit too optimistic”: Augmenting Design with Interactive Virtual Personas


93. Data-Augmented Few-Shot Neural Stencil Emulation for System Identification of Computer Models


94. A perishable ability? The future of writing in the face of generative artificial intelligence


95. Even Heads Fix Odd Errors: Mechanistic Discovery and Surgical Repair in Transformer Attention


96. One Joke to Rule them All? On the (Im)possibility of Generalizing Humor


97. Fine-Tuning Vision-Language Models for Neutrino Event Analysis in High-Energy Physics Experiments


98. Database Entity Recognition with Data Augmentation and Deep Learning


99. Inference of Human-derived Specifications of Object Placement via Demonstration


100. Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs


101. LongReasonArena: A Long Reasoning Benchmark for Large Language Models


102. Atrial Fibrillation Prediction Using a Lightweight Temporal Convolutional and Selective State Space Architecture


103. Reflective Agreement: Combining Self-Mixture of Agents with a Sequence Tagger for Robust Event Extraction


104. Re:Frame – Retrieving Experience From Associative Memory


105. Quantum Entanglement as Super-Confounding: From Bell’s Theorem to Robust Machine Learning


106. Deep Data Hiding for ICAO-Compliant Face Images: A Survey


107. AT-CXR: Uncertainty-Aware Agentic Triage for Chest X-rays


108. An Investigation on Group Query Hallucination Attacks


109. MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation


110. MedVQA-TREE: A Multimodal Reasoning and Retrieval Framework for Sarcopenia Prediction


111. (DEMO) Deep Reinforcement Learning Based Resource Allocation in Distributed IoT Systems


112. What Makes AI Applications Acceptable or Unacceptable? A Predictive Moral Framework


113. Automated classification of natural habitats using ground-level imagery


114. Are Companies Taking AI Risks Seriously? A Systematic Analysis of Companies’ AI Risk Disclosures in SEC 10-K forms


115. Sistema de Reconocimiento Facial Federado en Conjuntos Abiertos basado en OpenMax


116. Advancements in Crop Analysis through Deep Learning and Explainable AI


117. Geo2Vec: Shape- and Distance-Aware Neural Representation of Geospatial Entities


118. Epistemic Trade-Off: An Analysis of the Operational Breakdown and Ontological Limits of “Certainty-Scope” in AI


119. 2D Ultrasound Elasticity Imaging of Abdominal Aortic Aneurysms Using Deep Neural Networks


120. CellINR: Implicitly Overcoming Photo-induced Artifacts in 4D Live Fluorescence Microscopy


121. DemoBias: An Empirical Study to Trace Demographic Biases in Vision Foundation Models


122. Object Detection with Multimodal Large Vision-Language Models: An In-depth Review


123. Stand on The Shoulders of Giants: Building JailExpert from Previous Attack Experience


124. Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation


125. Seeing Like a Designer Without One: A Study on Unsupervised Slide Quality Assessment via Designer Cue Augmentation


126. Tricking LLM-Based NPCs into Spilling Secrets


127. Prompt-in-Content Attacks: Exploiting Uploaded Inputs to Hijack LLM Behavior


128. RL-Finetuned LLMs for Privacy-Preserving Synthetic Rewriting


129. CORE: Lossless Compression for Retrieval-Augmented LLMs via Reinforcement Learning


130. CORTEX: Composite Overlay for Risk Tiering and Exposure in Operational AI Systems


131. FLAIRR-TS – Forecasting LLM-Agents with Iterative Refinement and Retrieval for Time Series


132. Towards Production-Worthy Simulation for Autonomous Cyber Operations


133. POT: Inducing Overthinking in LLMs via Black-Box Iterative Optimization


134. MixGAN: A Hybrid Semi-Supervised and Generative Approach for DDoS Detection in Cloud-Integrated IoT Networks


135. Rethinking Reasoning in LLMs: Neuro-Symbolic Local RetoMaton Beyond ICL and CoT


136. Whisper based Cross-Lingual Phoneme Recognition between Vietnamese and English


137. Should LLMs be WEIRD? Exploring WEIRDness and Human Rights in Large Language Models


138. MultiPL-MoE: Multi-Programming-Lingual Extension of Large Language Models through Hybrid Mixture-of-Experts


139. The Aegis Protocol: A Foundational Security Framework for Autonomous AI Agents


140. A Theory of Information, Variation, and Artificial Intelligence


141. Lossless Compression of Neural Network Components: Weights, Checkpoints, and K/V Caches in Low-Precision Formats


142. Emotional Manipulation by AI Companions


143. TTF-VLA: Temporal Token Fusion via Pixel-Attention Integration for Vision-Language-Action Models


144. Real-Time Intuitive AI Drawing System for Collaboration: Enhancing Human Creativity through Formal and Contextual Intent Integration


145. MuSpike: A Benchmark and Evaluation Framework for Symbolic Music Generation with Spiking Neural Networks


146. Federated Fine-Tuning of Sparsely-Activated Large Language Models on Resource-Constrained Devices


147. MovieCORE: COgnitive REasoning in Movies