전체 AI 논문 - 2025-10-30

1. TheraMind: A Strategic and Adaptive Agent for Longitudinal Psychological Counseling


2. BambooKG: A Neurobiologically-inspired Frequency-Weight Knowledge Graph


3. Navigation in a Three-Dimensional Urban Flow using Deep Reinforcement Learning


4. ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents


5. Counterfactual-based Agent Influence Ranker for Agentic AI Workflows


6. Standardization of Psychiatric Diagnoses – Role of Fine-tuned LLM Consortium and OpenAI-gpt-oss Reasoning LLM Enabled Decision Support System


7. Off-policy Reinforcement Learning with Model-based Exploration Augmentation


8. Zero Reinforcement Learning Towards General Domains


9. Retrieval Augmented Generation (RAG) for Fintech: Agentic Design and Evaluation


10. Predicate Renaming via Large Language Models


11. MTIR-SQL: Multi-turn Tool-Integrated Reasoning Reinforcement Learning for Text-to-SQL


12. Multi-Objective Search: Algorithms, Applications, and Emerging Directions


13. Instrumental goals in advanced AI systems: Features to be managed and not failures to be eliminated?


14. Agentic AI: A Comprehensive Survey of Architectures, Applications, and Future Directions


15. Grouping Nodes With Known Value Differences: A Lossless UCT-based Abstraction Algorithm


16. GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning


17. From Medical Records to Diagnostic Dialogues: A Clinical-Grounded Approach and Dataset for Psychiatric Comorbidity


18. FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data


19. RAVR: Reference-Answer-guided Variational Reasoning for Large Language Models


20. Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision


21. Agentic Moderation: Multi-Agent Design for Safer Vision-Language Models


22. KnowCoder-A1: Incentivizing Agentic Reasoning Capability with Outcome Supervision for KBQA


23. H3M-SSMoEs: Hypergraph-based Multimodal Learning with LLM Reasoning and Style-Structured Mixture of Experts


24. Reasoning-Aware GRPO using Process Mining


25. Aligning Large Language Models with Procedural Rules: An Autoregressive State-Tracking Prompting for In-Game Trading


26. Taming the Real-world Complexities in CPT E/M Coding with Large Language Models


27. Cyclic Counterfactuals under Shift-Scale Interventions


28. Scheduling Your LLM Reinforcement Learning with Reasoning Trees


29. Gaperon: A Peppered English-French Generative Language Model Suite


30. E-Scores for (In)Correctness Assessment of Generative Model Outputs


31. Task Completion Agents are Not Ideal Collaborators


32. The Limits of Obliviate: Evaluating Unlearning in LLMs via Stimulus-Knowledge Entanglement-Behavior Framework


33. LieSolver: A PDE-constrained solver for IBVPs using Lie symmetries


34. Physics-Guided Conditional Diffusion Networks for Microwave Image Reconstruction


35. The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution


36. Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents


37. Graph Network-based Structural Simulator: Graph Neural Networks for Structural Dynamics


38. User Misconceptions of LLM-Based Conversational Programming Assistants


39. Subgraph Federated Learning via Spectral Methods


40. Learning to Plan & Schedule with Reinforcement-Learned Bimanual Robot Skills


41. Are Language Models Efficient Reasoners? A Perspective from Logic Programming


42. FARSIQA: Faithful and Advanced RAG System for Islamic Question Answering


43. Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization


44. BOLT-GAN: Bayes-Optimal Loss for Stable GAN Training


45. INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats


46. Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry


47. RegionE: Adaptive Region-Aware Generation for Efficient Image Editing


48. Lost in Phonation: Voice Quality Variation as an Evaluation Dimension for Speech Foundation Models


49. Leveraging an Atmospheric Foundational Model for Subregional Sea Surface Temperature Forecasting


50. Hybrid Quantum-Classical Recurrent Neural Networks



52. Comparative Study of UNet-based Architectures for Liver Tumor Segmentation in Multi-Phase Contrast-Enhanced Computed Tomography


53. FaCT: Faithful Concept Traces for Explaining Neural Network Decisions


54. Reflections on the Reproducibility of Commercial LLM Performance in Empirical Software Engineering Studies


55. TempoPFN: Synthetic Pre-training of Linear RNNs for Zero-shot Time Series Forecasting


56. An In-Depth Analysis of Cyber Attacks in Secured Platforms


57. Fine-Tuned Language Models for Domain-Specific Summarization and Tagging


58. Scalable Utility-Aware Multiclass Calibration


59. Grounded in Reality: Learning and Deploying Proactive LLM from Offline Logs


60. Alibaba International E-commerce Product Search Competition DcuRAGONs Team Technical Report


61. RLMEval: Evaluating Research-Level Neural Theorem Proving


62. Implicature in Interaction: Understanding Implicature Improves Alignment in Human-LLM Interaction


63. Improving Temporal Consistency and Fidelity at Inference-time in Perceptual Video Restoration by Zero-shot Image-based Diffusion Models


64. Adaptive End-to-End Transceiver Design for NextG Pilot-Free and CP-Free Wireless Systems


65. BhashaBench V1: A Comprehensive Benchmark for the Quadrant of Indic Domains


66. GPTOpt: Towards Efficient LLM-Based Black-Box Optimization



68. Hallucinations in Bibliographic Recommendation: Citation Frequency as a Proxy for Training Data Redundancy


69. Position: Biology is the Challenge Physics-Informed ML Needs to Evolve


70. A Convexity-dependent Two-Phase Training Algorithm for Deep Neural Networks


71. Multi-party Agent Relation Sampling for Multi-party Ad Hoc Teamwork


72. MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding


73. 4-Doodle: Text to 3D Sketches that Move!


74. Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning


75. SynHLMA:Synthesizing Hand Language Manipulation for Articulated Object with Discrete Human Object Interaction Representation


76. IBNorm: Information-Bottleneck Inspired Normalization for Representation Learning


77. TV-Rec: Time-Variant Convolutional Filter for Sequential Recommendation


78. Scaling Up Bayesian DAG Sampling


79. One-shot Humanoid Whole-body Motion Learning


80. Learning Disentangled Speech- and Expression-Driven Blendshapes for 3D Talking Face Animation


81. Studies for : A Human-AI Co-Creative Sound Artwork Using a Real-time Multi-channel Sound Generation Model


82. Cost-Sensitive Unbiased Risk Estimation for Multi-Class Positive-Unlabeled Learning


83. GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction


84. Human Resilience in the AI Era – What Machines Can’t Replace


85. Fed-PELAD: Communication-Efficient Federated Learning for Massive MIMO CSI Feedback with Personalized Encoders and a LoRA-Adapted Shared Decoder


86. SFMS-ALR: Script-First Multilingual Speech Synthesis with Adaptive Locale Resolution


87. Transformers in Medicine: Improving Vision-Language Alignment for Medical Image Captioning



89. Lipschitz-aware Linearity Grafting for Certified Robustness


90. Bridging the Divide: End-to-End Sequence-Graph Learning


91. Learning Low Rank Neural Representations of Hyperbolic Wave Dynamics from Data


92. The Neural Differential Manifold: An Architecture with Explicit Geometric Structure


93. Learning Fair Graph Representations with Multi-view Information Bottleneck


94. Monopoly Deal: A Benchmark Environment for Bounded One-Sided Response Games


95. GAPMAP: Mapping Scientific Knowledge Gaps in Biomedical Literature Using Large Language Models


96. Scalable predictive processing framework for multitask caregiving robots


97. Efficient License Plate Recognition via Pseudo-Labeled Supervision with Grounding DINO and YOLOv8


98. StorageXTuner: An LLM Agent-Driven Automatic Tuning Framework for Heterogeneous Storage Systems


99. Towards Human-AI Synergy in Requirements Engineering: A Framework and Preliminary Study


100. Emergence of Minimal Circuits for Indirect Object Identification in Attention-Only Transformers


101. Epileptic Seizure Detection and Prediction from EEG Data: A Machine Learning Approach with Clinical Validation


102. FaRAccel: FPGA-Accelerated Defense Architecture for Efficient Bit-Flip Attack Resilience in Transformer Models


103. LRT-Diffusion: Calibrated Risk-Aware Guidance for Diffusion Policies


104. FT-ARM: Fine-Tuned Agentic Reflection Multimodal Language Model for Pressure Ulcer Severity Classification with Reasoning


105. Hammering the Diagnosis: Rowhammer-Induced Stealthy Trojan Attacks on ViT-Based Medical Imaging


106. Sequences of Logits Reveal the Low Rank Structure of Language Models


107. SCOUT: A Lightweight Framework for Scenario Coverage Assessment in Autonomous Driving


108. Finding Culture-Sensitive Neurons in Vision-Language Models


109. KAN-GCN: Combining Kolmogorov-Arnold Network with Graph Convolution Network for an Accurate Ice Sheet Emulator


110. Trust Dynamics in Strategic Coopetition: Computational Foundations for Requirements Engineering in Multi-Agent Systems


111. Understanding Multi-View Transformers


112. Fair Indivisible Payoffs through Shapley Value


113. Efficiency Without Cognitive Change: Evidence from Human Interaction with Narrow AI Systems


114. The Narrative Continuity Test: A Conceptual Framework for Evaluating Identity Persistence in AI Systems


115. The Generation Phases of Flow Matching: a Denoising Perspective


116. Do Chatbots Walk the Talk of Responsible AI?


117. Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation


118. SafeEditor: Unified MLLM for Efficient Post-hoc T2I Safety Editing


119. Towards a Method for Synthetic Generation of PWA Transcripts


120. Perception, Understanding and Reasoning, A Multimodal Benchmark for Video Fake News Detection


121. Deep Feature Optimization for Enhanced Fish Freshness Assessment


122. DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts


123. ProofSketch: Efficient Verified Reasoning for Large Language Models


124. COMMUNITYNOTES: A Dataset for Exploring the Helpfulness of Fact-Checking Explanations


125. CT-Less Attenuation Correction Using Multiview Ensemble Conditional Diffusion Model on High-Resolution Uncorrected PET Images


126. MASPRM: Multi-Agent System Process Reward Model


127. From Narrative to Action: A Hierarchical LLM-Agent Framework for Human Mobility Generation


128. Fortytwo: Swarm Inference with Peer-Ranked Consensus


129. Large Language Models Report Subjective Experience Under Self-Referential Processing


130. Mutual Wanting in Human–AI Interaction: Empirical Evidence from Large-Scale Analysis of GPT Model Transitions


131. A Survey on Efficient Vision-Language-Action Models


132. SwiftEmbed: Ultra-Fast Text Embeddings via Static Token Lookup for Real-Time Applications


133. PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models


134. The Underappreciated Power of Vision Models for Graph Structural Understanding


135. ESCA: Enabling Seamless Codec Avatar Execution through Algorithm and Hardware Co-Optimization for Virtual Reality


136. AI & Data Competencies: Scaffolding holistic AI literacy in Higher Education


137. Cross-Enhanced Multimodal Fusion of Eye-Tracking and Facial Features for Alzheimer’s Disease Diagnosis


138. Confidence is Not Competence


139. DMVFC: Deep Learning Based Functionally Consistent Tractography Fiber Clustering Using Multimodal Diffusion MRI and Functional MRI


140. Combining SAR Simulators to Train ATR Models with Synthetic Data


141. Towards Fine-Grained Human Motion Video Captioning


142. Topic-aware Large Language Models for Summarizing the Lived Healthcare Experiences Described in Health Stories


143. Dual-Domain Deep Learning-Assisted NOMA-CSK Systems for Secure and Efficient Vehicular Communications


144. Falcon: A Comprehensive Chinese Text-to-SQL Benchmark for Enterprise-Grade Evaluation


145. Dingtalk DeepResearch: A Unified Multi Agent Framework for Adaptive Intelligence in Enterprise Environments


146. Stable-by-Design Neural Network-Based LPV State-Space Models for System Identification


147. Beyond Function-Level Search: Repository-Aware Dual-Encoder Code Retrieval with Adversarial Verification


148. EcoScaleNet: A Lightweight Multi Kernel Network for Long Sequence 12 lead ECG Classification


149. PulseFi: A Low Cost Robust Machine Learning System for Accurate Cardiopulmonary and Apnea Monitoring Using Channel State Information


150. Cardi-GPT: An Expert ECG-Record Processing Chatbot


151. Flows, straight but not so fast: Exploring the design space of Rectified Flows in Protein Design


152. Beyond Models: A Framework for Contextual and Cultural Intelligence in African AI Deployment


153. AmarDoctor: An AI-Driven, Multilingual, Voice-Interactive Digital Health Application for Primary Care Triage and Patient Management to Bridge the Digital Health Divide for Bengali Speakers


154. The Epistemic Suite: A Post-Foundational Diagnostic Methodology for Assessing AI Knowledge Claims


155. Modelling the Interplay of Eye-Tracking Temporal Dynamics and Personality for Emotion Detection in Face-to-Face Settings


156. Large-Scale Network Embedding in Apache Spark