전체 AI 논문 - 2025-08-26

1. LLM-Based Agents for Competitive Landscape Mapping in Drug Asset Due Diligence


2. Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning


3. Modular Embedding Recomposition for Incremental Learning



5. Causal Beam Selection for Reliable Initial Access in AI-driven Beam Management


6. Do What? Teaching Vision-Language-Action Models to Reject the Impossible


7. AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications


8. The next question after Turing’s question: Introducing the Grow-AI test


9. Competition and Attraction Improve Model Fusion


10. Graph RAG as Human Choice Model: Building a Data-Driven Mobility Agent with Preference Chain


11. Bridging the Gap in Ophthalmic AI: MM-Retinal-Reason Dataset and OphthaReason Model toward Dynamic Multimodal Reasoning


12. Extending FKG.in: Towards a Food Claim Traceability Network


13. IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra


14. InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles


15. Integrating Time Series into LLMs via Multi-layer Steerable Embedding Fusion for Enhanced Forecasting


16. Urban Comfort Assessment in the Era of Digital Planning: A Multidimensional, Data-driven, and AI-assisted Framework


17. Generative Foundation Model for Structured and Unstructured Electronic Health Records


18. MMAPG: A Training-Free Framework for Multimodal Multi-hop Question Answering via Adaptive Planning Graphs


19. CoFE: A Framework Generating Counterfactual ECG for Explainable Cardiac AI-Diagnostics


20. T-ILR: a Neurosymbolic Integration for LTLf


21. MV-RAG: Retrieval Augmented Multiview Diffusion


22. Hierarchical Decision-Making for Autonomous Navigation: Integrating Deep Reinforcement Learning and Fuzzy Logic in Four-Wheel Independent Steering and Driving Systems


23. A Disease-Centric Vision-Language Foundation Model for Precision Oncology in Kidney Cancer


24. Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders


25. Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution


26. Enhanced NIRMAL Optimizer With Damped Nesterov Acceleration: A Comparative Analysis


27. RL Is Neither a Panacea Nor a Mirage: Understanding Supervised vs. Reinforcement Learning Fine-Tuning for LLMs


28. Towards Open World Detection: A Survey


29. Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation


30. Comparative Analysis of UAV Path Planning Algorithms for Efficient Navigation in Urban 3D Environments


31. FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline


32. On Zero-Shot Reinforcement Learning


33. Post Hoc Regression Refinement via Pairwise Rankings


34. SafeSpace: An Integrated Web Application for Digital Safety and Emotional Well-being


35. FraPPE: Fast and Efficient Preference-based Pure Exploration


36. Disentangled Multi-modal Learning of Histology and Transcriptomics for Cancer Characterization


37. HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images


38. PediatricsMQA: a Multi-modal Pediatrics Question Answering Benchmark


39. OPERA: A Reinforcement Learning–Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval


40. Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish


41. A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection


42. Domain-aligned generative downscaling enhances projections of extreme climate events


43. MedQARo: A Large-Scale Benchmark for Medical Question Answering in Romanian



45. Confusion is the Final Barrier: Rethinking Jailbreak Evaluation and Investigating the Real Misuse Threat of LLMs


46. Uppaal Coshy: Automatic Synthesis of Compact Shields for Hybrid Systems


47. Unsupervised Online Detection of Pipe Blockages and Leakages in Water Distribution Networks


48. Vevo2: Bridging Controllable Speech and Singing Voice Generation via Unified Prosody Learning


49. LLMSymGuard: A Symbolic Safety Guardrail Framework Leveraging Interpretable Jailbreak Concepts



51. Retrieval Enhanced Feedback via In-context Neural Error-book


52. Exploiting Information Redundancy in Attention Maps for Extreme Quantization of Vision Transformers


53. A Multimodal-Multitask Framework with Cross-modal Relation and Hierarchical Interactive Attention for Semantic Comprehension


54. Representation Learning of Auxiliary Concepts for Improved Student Modeling and Exercise Recommendation


55. From Confidence to Collapse in LLM Factual Robustness


56. MCPVerse: An Expansive, Real-World Benchmark for Agentic Tool Use


57. A Reduction of Input/Output Logics to SAT


58. A XAI-based Framework for Frequency Subband Characterization of Cough Spectrograms in Chronic Respiratory Disease


59. FlexMUSE: Multimodal Unification and Semantics Enhancement Framework with Flexible interaction for Creative Writing


60. An Investigation of Visual Foundation Models Robustness


61. OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models


62. SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning


63. Set Transformer Architectures and Synthetic Data Generation for Flow-Guided Nanoscale Localization


64. A Relay-Chain-Powered Ciphertext-Policy Attribute-Based Encryption in Intelligent Transportation Systems


65. LLM-Assisted Semantic Alignment and Integration in Collaborative Model-Based Systems Engineering Using SysML v2


66. Motor Imagery EEG Signal Classification Using Minimally Random Convolutional Kernel Transform and Hybrid Deep Learning


67. EGRA:Toward Enhanced Behavior Graphs and Representation Alignment for Multimodal Recommendation


68. Towards Recommending Usability Improvements with Multimodal Large Language Models


69. STA-GANN: A Valid and Generalizable Spatio-Temporal Kriging Approach


70. Through the Looking Glass: A Dual Perspective on Weakly-Supervised Few-Shot Segmentation


71. Beyond Human-prompting: Adaptive Prompt Tuning with Semantic Alignment for Anomaly Detection


72. On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models


73. Take That for Me: Multimodal Exophora Resolution with Interactive Questioning for Ambiguous Out-of-View Instructions


74. Machine Learning in Micromobility: A Systematic Review of Datasets, Techniques, and Applications


75. CommonKV: Compressing KV Cache with Cross-layer Parameter Sharing


76. The Fools are Certain; the Wise are Doubtful: Exploring LLM Confidence in Code Completion


77. Spacetime-GR: A Spacetime-Aware Generative Model for Large Scale Online POI Recommendation


78. ANSC: Probabilistic Capacity Health Scoring for Datacenter-Scale Reliability


79. CYCLE-INSTRUCT: Fully Seed-Free Instruction Tuning via Dual Self-Training and Cycle Consistency


80. GPLight+: A Genetic Programming Method for Learning Symmetric Traffic Signal Control Policy


81. Two-flow Feedback Multi-scale Progressive Generative Adversarial Network


82. On Task Vectors and Gradients


83. Cooperative Design Optimization through Natural Language Interaction


84. From Benchmark Data To Applicable Program Repair: An Experience Report


85. OpenWHO: A Document-Level Parallel Corpus for Health Translation in Low-Resource Languages


86. Enhanced predictions of the Madden-Julian oscillation using the FuXi-S2S machine learning model: Insights into physical mechanisms


87. Pareto Actor-Critic for Communication and Computation Co-Optimization in Non-Cooperative Federated Learning Services


88. Time Series Based Network Intrusion Detection using MTF-Aided Transformer


89. CoVeRaP: Cooperative Vehicular Perception through mmWave FMCW Radars


90. Breaking Barriers in Software Testing: The Power of AI-Driven Automation


91. Automated Multi-label Classification of Eleven Retinal Diseases: A Benchmark of Modern Architectures and a Meta-Ensemble on a Large Synthetic Dataset


92. Panoptic Segmentation of Environmental UAV Images : Litter Beach


93. Representation Learning with Adaptive Superpixel Coding


94. ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation


95. Strategic Sample Selection for Improved Clean-Label Backdoor Attacks in Text Classification


96. Noise, Adaptation, and Strategy: Assessing LLM Fidelity in Decision-Making


97. Probabilistic Forecasting Cryptocurrencies Volatility: From Point to Quantile Forecasts


98. HyperFlexis: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling


99. Information Ecosystem Reengineering via Public Sector Knowledge Representation


100. Evaluating Structured Decoding for Text-to-Table Generation: Evidence from Three Datasets


101. Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search


102. Beyond Imaging: Vision Transformer Digital Twin Surrogates for 3D+T Biological Tissue Dynamics


103. TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference


104. Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs


105. Annif at the GermEval-2025 LLMs4Subjects Task: Traditional XMTC Augmented by Efficient LLMs


106. DeepMEL: A Multi-Agent Collaboration Framework for Multimodal Entity Linking


107. NEAT: Concept driven Neuron Attribution in LLMs


108. Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning


109. CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning


110. Securing Swarms: Cross-Domain Adaptation for ROS2-based CPS Anomaly Detection


111. Beyond Individuals: Collective Predictive Coding for Memory, Attention, and the Emergence of Language


112. Building and Measuring Trust between Large Language Models


113. MGSC: A Multi-granularity Consistency Framework for Robust End-to-end Asr


114. Coarse-to-Fine Personalized LLM Impressions for Streamlined Radiology Reports


115. CIA+TA Risk Assessment for AI Reasoning Vulnerabilities


116. Statistical Comparative Analysis of Semantic Similarities and Model Transferability Across Datasets for Short Answer Grading


117. MorphNAS: Differentiable Architecture Search for Morphologically-Aware Multilingual NER


118. Alvorada-Bench: Can Language Models Solve Brazilian University Entrance Exams?


119. A Functionality-Grounded Benchmark for Evaluating Web Agents in E-commerce Domains


120. Who’s Asking? Investigating Bias Through the Lens of Disability Framed Queries in LLMs


121. DAIQ: Auditing Demographic Attribute Inference from Question in LLMs


122. Mini-Omni-Reasoner: Token-Level Thinking-in-Speaking in Large Speech Models


123. An Auditable Pipeline for Fuzzy Full-Text Screening in Systematic Reviews: Integrating Contrastive Semantic Highlighting and LLM Judgment


124. Straggler-Resilient Federated Learning over A Hybrid Conventional and Pinching Antenna Network


125. Research on intelligent generation of structural demolition suggestions based on multi-model collaboration


126. User-Assistant Bias in LLMs


127. SCOPE: A Generative Approach for LLM Prompt Compression


128. From Clicks to Preference: A Multi-stage Alignment Framework for Generative Query Suggestion in Conversational System


129. Detecting Hope, Hate, and Emotion in Arabic Textual Speech and Multi-modal Memes Using Large Language Models


130. Chain-of-Query: Unleashing the Power of LLMs in SQL-Aided Table Understanding via Multi-Agent Collaboration


131. Uplifted Attackers, Human Defenders: The Cyber Offense-Defense Balance for Trailing-Edge Organizations


132. KL-based self-distillation for large language models


133. SurfaceLogicKV: Surface and Logic Attention Behaviors are All You Need for Robust KV Cache Compression


134. ALAS: Autonomous Learning Agent for Self-Updating Language Models


135. ReportBench: Evaluating Deep Research Agents via Academic Survey Tasks


136. MAC: A Live Benchmark for Multimodal Large Language Models in Scientific Understanding


137. LingVarBench: Benchmarking LLM for Automated Named Entity Recognition in Structured Synthetic Spoken Transcriptions


138. Persuasiveness and Bias in LLM: Investigating the Impact of Persuasiveness and Reinforcement of Bias in Language Models


139. Benchmarking the Medical Understanding and Reasoning of Large Language Models in Arabic Healthcare Tasks



141. InteChar: A Unified Oracle Bone Character List for Ancient Chinese Language Modeling


142. KG-o1: Enhancing Multi-hop Question Answering in Large Language Models via Knowledge Graph Integration


143. Learning in Focus: Detecting Behavioral and Collaborative Engagement Using Vision Transformers