All AI Papers - 2025-09-19

1. Generalizable Geometric Image Caption Synthesis


2. Internalizing Self-Consistency in Language Models: Multi-Agent Consensus Alignment


3. From Sea to System: Exploring User-Centered Explainable AI for Maritime Decision Support


4. Calibrated Generative AI as Meta-Reviewer: A Systemic Functional Linguistics Discourse Analysis of Reviews of Peer Reviews


5. A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making


6. Set Contribution Functions for Quantitative Bipolar Argumentation and their Principles


7. Sentinel Agents for Secure and Trustworthy Agentic AI in Multi-Agent Systems


8. Explainable AI for Infection Prevention and Control: Modeling CPE Acquisition and Patient Outcomes in an Irish Hospital with Transformers


9. OpenLens AI: Fully Autonomous Research Agent for Health Informatics


10. Enhancing Retrieval Augmentation via Adversarial Collaboration


11. The NazoNazo Benchmark: A Cost-Effective and Extensible Test of Insight-Based Reasoning in LLMs


12. RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning


13. Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld’s Episode Theory


14. AgentCompass: Towards Reliable Evaluation of Agentic Workflows in Production


15. SynBench: A Benchmark for Differentially Private Text Generation


16. (P)rior(D)yna(F)low: A Priori Dynamic Workflow Construction via Multi-Agent Collaboration


17. Rationality Check! Benchmarking the Rationality of Large Language Models


18. DeKeyNLU: Enhancing Natural Language to SQL Generation through Task Decomposition and Keyword Extraction


19. Beyond the high score: Prosocial ability profiles of multi-agent populations


20. From Mimicry to True Intelligence (TI) - A New Paradigm for Artificial General Intelligence


21. VCBench: Benchmarking LLMs in Venture Capital


22. Detecting Pipeline Failures through Fine-Grained Analysis of Web Agents


23. From Capabilities to Performance: Evaluating Key Functional Properties of LLM Architectures in Penetration Testing


24. Unified Crew Planning and Replanning Optimization in Multi-Line Metro Systems Considering Workforce Heterogeneity


25. Explicit Context-Driven Neural Acoustic Modeling for High-Fidelity RIR Generation


26. FlowRL: Matching Reward Distributions for LLM Reasoning


27. Orion: Fuzzing Workflow Automation


28. TITAN: A Trajectory-Informed Technique for Adaptive Parameter Freezing in Large-Scale VQE


29. Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning


30. SMARTER: A Data-efficient Framework to Improve Toxicity Detection with Explanation via Self-augmenting Large Language Models


31. Watermarking and Anomaly Detection in Machine Learning Models for LORA RF Fingerprinting


32. Semi-Supervised 3D Medical Segmentation from 2D Natural Images Pretrained Model


33. Leveraging Geometric Visual Illusions as Perceptual Inductive Biases for Vision Models


34. Exploring How Audio Effects Alter Emotion with Foundation Models


35. WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance


36. The mechanization of science illustrated by the Lean formalization of the multi-graded Proj construction


37. Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning


38. TextMine: LLM-Powered Knowledge Extraction for Humanitarian Mine Action


39. Listening, Imagining & Refining: A Heuristic Optimized ASR Correction Framework with LLMs


40. Communication Efficient Split Learning of ViTs with Attention-based Double Compression


41. Balancing Sparse RNNs with Hyperparameterization Benefiting Meta-Learning


42. Credit Card Fraud Detection


43. Reinforcement Learning Agent for a 2D Shooter Game


44. From Patterns to Predictions: A Shapelet-Based Framework for Directional Forecasting in Noisy Financial Markets


45. Sample Efficient Experience Replay in Non-stationary Environments


46. CLEAR: A Comprehensive Linguistic Evaluation of Argument Rewriting by Large Language Models


47. Attention Beyond Neighborhoods: Reviving Transformer for Graph Clustering


48. Sea-ing Through Scattered Rays: Revisiting the Image Formation Model for Realistic Underwater Image Generation


49. Blockchain-Enabled Explainable AI for Trusted Healthcare Systems


50. The Role of Touch: Towards Optimal Tactile Sensing Distribution in Anthropomorphic Hands for Dexterous In-Hand Manipulation


51. M4Diffuser: Multi-View Diffusion Policy with Manipulability-Aware Control for Robust Mobile Manipulation


52. RoboEye: Enhancing 2D Robotic Object Identification with Selective 3D Geometric Keypoint Matching


53. Discrete optimal transport is a strong audio adversarial attack


54. Estimating Respiratory Effort from Nocturnal Breathing Sounds for Obstructive Sleep Apnoea Screening


55. Cross-Modal Knowledge Distillation for Speech Large Language Models


56. Patent Language Model Pretraining with ModernBERT


57. Back to Ear: Perceptually Driven High Fidelity Music Reconstruction


58. A Multi-To-One Interview Paradigm for Efficient MLLM Evaluation


59. AI-Driven Multi-Agent Vehicular Planning for Battery Efficiency and QoS in 6G Smart Cities


60. DPANet: Dual Pyramid Attention Network for Multivariate Time Series Forecasting


61. Exploring the Global-to-Local Attention Scheme in Graph Transformers: An Empirical Study


62. MARIC: Multi-Agent Reasoning for Image Classification


63. MeanFlowSE: one-step generative speech enhancement via conditional mean flow


64. Empathy-R1: A Chain-of-Empathy and Reinforcement Learning Framework for Long-Form Mental Health Support


65. [Re] Improving Interpretation Faithfulness for Vision Transformers


66. Not All Degradations Are Equal: A Targeted Feature Denoising Framework for Generalizable Image Super-Resolution


67. Diffusion-Based Scenario Tree Generation for Multivariate Time Series Prediction and Multistage Stochastic Optimization


68. ProtoMedX: Towards Explainable Multi-Modal Prototype Learning for Bone Health Classification


69. Template-Based Cortical Surface Reconstruction with Minimal Energy Deformation


70. OnlineMate: An LLM-Based Multi-Agent Companion System for Cognitive Support in Online Learning


71. Structure-Aware Contrastive Learning with Fine-Grained Binding Representations for Drug Discovery


72. TableDART: Dynamic Adaptive Multi-Modal Routing for Table Understanding


73. Spatial Audio Motion Understanding and Reasoning


74. Threat Modeling for Enhancing Security of IoT Audio Classification Devices under a Secure Protocols Framework


75. MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models


76. DeCoP: Enhancing Self-Supervised Time Series Representation with Dependency Controlled Pre-training


77. Mitigating Intra-Speaker Variability in Diarization with Style-Controllable Speech Augmentation


78. Towards Human-like Multimodal Conversational Agent by Generating Engaging Speech


79. Reveal and Release: Iterative LLM Unlearning with Self-generated Data


80. Automating Modelica Module Generation Using Large Language Models: A Case Study on Building Control Description Language


81. Adversarial Distilled Retrieval-Augmented Guarding Model for Online Malicious Intent Detection


82. LSTC-MDA: A Unified Framework for Long-Short Term Temporal Convolution and Mixed Data Augmentation in Skeleton-Based Action Recognition


83. Enterprise AI Must Enforce Participant-Aware Access Control


84. A Case for Computing on Unstructured Data


85. ATLANTIS: AI-driven Threat Localization, Analysis, and Triage Intelligence System


86. Can I Trust This Chatbot? Assessing User Privacy in AI-Healthcare Chatbot Applications


87. Do Vision-Language Models See Urban Scenes as People Do? An Urban Perception Benchmark


88. VisMoDAl: Visual Analytics for Evaluating and Improving Corruption Robustness of Vision-Language Models


89. LLM Jailbreak Detection for (Almost) Free!


90. Catch Me If You Can? Not Yet: LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday Authors


91. ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference


92. Leveraging Artificial Intelligence as a Strategic Growth Catalyst for Small and Medium-sized Enterprises


93. Delta Knowledge Distillation for Large Language Models


94. BEACON: Behavioral Malware Classification with Large Language Model Embeddings and Deep Learning


95. Introducing OmniGEC: A Silver Multilingual Dataset for Grammatical Error Correction


96. Process-Supervised Reinforcement Learning for Interactive Multimodal Tool-Use Agents


97. AToken: A Unified Tokenizer for Vision


98. Correct-Detect: Balancing Performance and Ambiguity Through the Lens of Coreference Resolution in LLMs


99. Simulating a Bias Mitigation Scenario in Large Language Models


100. When Content is Goliath and Algorithm is David: The Style and Semantic Effects of Generative Search Engine


101. A Taxonomy of Prompt Defects in LLM Systems


102. Q-ROAR: Outlier-Aware Rescaling for RoPE Position Interpolation in Quantized Long-Context LLMs


103. eIQ Neutron: Redefining Edge-AI Inference with Integrated NPU and Compiler Innovations


104. Embodied sensorimotor control: computational modeling of the neural control of movement


105. DreamControl: Human-Inspired Whole-Body Humanoid Control for Scene Interaction via Guided Diffusion


106. Near-Real-Time Resource Slicing for QoS Optimization in 5G O-RAN using Deep Reinforcement Learning


107. Beyond Classification: Evaluating LLMs for Fine-Grained Automatic Malware Behavior Auditing


108. Deploying UDM Series in Real-Life Stuttered Speech Applications: A Clinical Evaluation Framework


109. FlowDrive: Energy Flow Field for End-to-End Autonomous Driving


110. Property-Isometric Variational Autoencoders for Sequence Modeling and Design


111. The Sum Leaks More Than Its Parts: Compositional Privacy Risks and Mitigations in Multi-Agent Collaboration


112. SCoGen: Scenario-Centric Graph-Based Synthesis of Real-World Code Problems


113. Towards Robust Agentic CUDA Kernel Benchmarking, Verification, and Optimization


114. Beyond Data Privacy: New Privacy Risks for Large Language Models


115. Constructive Conflict-Driven Multi-Agent Reinforcement Learning for Strategic Diversity


116. FedMentor: Domain-Aware Differential Privacy for Heterogeneous Federated LLMs in Mental Health


117. Discovering New Theorems via LLMs with In-Context Proof Learning in Lean


118. SpeechWeave: Diverse Multilingual Synthetic Text & Audio Data Generation Pipeline for Training Text to Speech Models


119. SparseDoctor: Towards Efficient Chat Doctor with Mixture of Experts Enhanced Large Language Models


120. DetectAnyLLM: Towards Generalizable and Robust Detection of Machine-Generated Text Across Domains and Models


121. Graph-Enhanced Retrieval-Augmented Question Answering for E-Commerce Customer Support


122. Efficient Hate Speech Detection: Evaluating 38 Models from Traditional Methods to Transformers


123. Evolution of Kernels: Automated RISC-V Kernel Optimization with Large Language Models


124. Shutdown Resistance in Large Language Models


125. From Correction to Mastery: Reinforced Distillation of Large Language Model Agents


126. JU-NLP at Touché: Covert Advertisement in Conversational AI-Generation and Detection Strategies


127. Opening the Black Box: Interpretable LLMs via Semantic Resonance Architecture


128. Hallucination Detection with the Internal Layers of LLMs


129. CrossPT: Exploring Cross-Task Transferability through Multi-Task Prompt Tuning


130. LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures


131. Advancing Conversational AI with Shona Slang: A Dataset and Hybrid Model for Digital Inclusion