All AI Papers - 2025-10-24

1. Real Deep Research for AI, Robotics and Beyond


2. A Coherence-Based Measure of AGI


3. Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs


4. The Shape of Reasoning: Topological Analysis of Reasoning Traces in Large Language Models


5. Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges


6. Fluidity Index: Next-Generation Super-intelligence Benchmarks


7. Towards Reliable Evaluation of Large Language Models for Multilingual and Multimodal E-Commerce Applications


8. Towards the Formalization of a Trustworthy AI for Mining Interpretable Models explOiting Sophisticated Algorithms


9. Efficient Algorithms for Computing Random Walk Centrality


10. What Defines Good Reasoning in LLMs? Dissecting Reasoning Steps with Multi-Aspect Evaluation


11. Transferable Graph Learning for Transmission Congestion Management via Busbar Splitting


12. Lost in Translation: Policymakers are not really listening to Citizen Concerns about AI


13. FLORA: Unsupervised Knowledge Graph Alignment by Fuzzy Logic


14. Neural Reasoning for Robust Instance Retrieval in $\mathcal{SHOIQ}$


15. A computational model and tool for generating more novel opportunities in professional innovation processes


16. IKnow: Instruction-Knowledge-Aware Continual Pretraining for Effective Domain Adaptation


17. LLM-empowered knowledge graph construction: A survey


18. Collateral Damage Assessment Model for AI System Target Engagement in Military Operations


19. Bias by Design? How Data Practices Shape Fairness in AI Healthcare Systems


20. Multi-Step Reasoning for Embodied Question Answering via Tool Augmentation


21. Classical Feature Embeddings Help in BERT-Based Human Mobility Prediction


22. Using Large Language Models for Abstraction of Planning Domains - Extended Version


23. Individualized Cognitive Simulation in Large Language Models: Evaluating Different Cognitive Representation Methods


24. Merge and Conquer: Evolutionarily Optimizing AI for 2048


25. The Lock-In Phase Hypothesis: Identity Consolidation as a Precursor to AGI


26. TRUST: A Decentralized Framework for Auditing Large Language Model Reasoning



28. Human-Centered LLM-Agent System for Detecting Anomalous Digital Asset Transactions


29. AI PB: A Grounded Generative Agent for Personalized Investment Insights


30. LLMs can hide text in other text of the same length.ipynb


31. AI-Driven Personalized Learning: Predicting Academic Performance Through Leadership Personality Traits


32. A new wave of vehicle insurance fraud fueled by generative AI


33. RELATE: A Schema-Agnostic Perceiver Encoder for Multimodal Relational Graphs


34. Surfer 2: The Next Generation of Cross-Platform Computer Use Agents


35. DAG-Math: Graph-Guided Mathematical Reasoning in LLMs


36. Branch-and-Browse: Efficient and Controllable Web Exploration with Tree-Structured Reasoning and Action Memory


37. Benchmarking Reasoning Reliability in Artificial Intelligence Models for Energy-System Analysis


38. A Quantum-Inspired Algorithm for Solving Sudoku Puzzles and the MaxCut Problem


39. Towards General Modality Translation with Contrastive and Predictive Latent Diffusion Bridge


40. VAMOS: A Hierarchical Vision-Language-Action Model for Capability-Modulated and Steerable Navigation


41. GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation


42. Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation


43. On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?


44. The Reality Gap in Robotics: Challenges, Solutions, and Best Practices


45. Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples


46. Simple Context Compression: Mean-Pooling and Multi-Ratio Training


47. Bayesian Inference of Primordial Magnetic Field Parameters from CMB with Spherical Graph Neural Networks


48. A Use-Case Specific Dataset for Measuring Dimensions of Responsible Performance in LLM-generated Text


49. Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost


50. FieldGen: From Teleoperated Pre-Manipulation Trajectories to Field-Guided Data Generation


51. RAGRank: Using PageRank to Counter Poisoning in CTI LLM Pipelines


52. Reinforcement Learning and Consumption-Savings Behavior


53. Empathic Prompting: Non-Verbal Context Integration for Multimodal LLM Conversations


54. Thought Communication in Multiagent Collaboration


55. Co-Designing Quantum Codes with Transversal Diagonal Gates via Multi-Agent Systems



57. User Perceptions of Privacy and Helpfulness in LLM Responses to Privacy-Sensitive Scenarios


58. Unsupervised Anomaly Prediction with N-BEATS and Graph Neural Network in Multi-variate Semiconductor Process Time Series


59. Real-Time Gait Adaptation for Quadrupeds using Model Predictive Control and Reinforcement Learning


60. Fusing Narrative Semantics for Financial Volatility Forecasting


61. Exploring Large Language Models for Access Control Policy Synthesis and Summarization


62. Neural Diversity Regularizes Hallucinations in Small Models


63. A Scalable, Causal, and Energy Efficient Framework for Neural Decoding with Spiking Neural Networks


64. R2-SVC: Towards Real-World Robust and Expressive Zero-shot Singing Voice Conversion


65. GRACE: GRaph-based Addiction Care prEdiction


66. Finding the Sweet Spot: Trading Quality, Cost, and Speed During Inference-Time LLM Reflection


67. The Reasoning Lingua Franca: A Double-Edged Sword for Multilingual AI


68. Why Did Apple Fall To The Ground: Evaluating Curiosity In Large Language Model


69. Deep Learning in Dental Image Analysis: A Systematic Review of Datasets, Methodologies, and Emerging Challenges


70. Quantum Processing Unit (QPU) processing time Prediction with Machine Learning


71. Equitable Survival Prediction: A Fairness-Aware Survival Modeling (FASM) Approach


72. Black Box Absorption: LLMs Undermining Innovative Ideas


73. PSO-XAI: A PSO-Enhanced Explainable AI Framework for Reliable Breast Cancer Detection


74. BUSTED at AraGenEval Shared Task: A Comparative Study of Transformer-Based Models for Arabic AI-Generated Text Detection


75. Practical Code RAG at Scale: Task-Aware Retrieval Design Choices under Compute Budgets


76. Generalizable Reasoning through Compositional Energy Minimization


77. OnlineSplatter: Pose-Free Online 3D Reconstruction for Free-Moving Objects


78. Resounding Acoustic Fields with Reciprocity


79. Unsupervised Domain Adaptation via Similarity-based Prototypes for Cross-Modality Segmentation


80. Can ChatGPT Code Communication Data Fairly?: Empirical Evidence from Multiple Collaborative Tasks


81. Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence


82. AdaDoS: Adaptive DoS Attack via Deep Adversarial Reinforcement Learning in SDN


83. Structural Invariance Matters: Rethinking Graph Rewiring through Graph Metrics


84. GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning


85. The Dog the Cat Chased Stumped the Model: Measuring When Language Models Abandon Structure for Shortcuts


86. ARC-Encoder: learning compressed text representations for large language models


87. Fake-in-Facext: Towards Fine-Grained Explainable DeepFake Analysis


88. Metis-HOME: Hybrid Optimized Mixture-of-Experts for Multimodal Reasoning


89. Hierarchical Sequence Iteration for Heterogeneous Question Answering


90. Steering Evaluation-Aware Language Models To Act Like They Are Deployed


91. Hurdle-IMDL: An Imbalanced Learning Framework for Infrared Rainfall Retrieval


92. RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging


93. Structures generated in a multiagent system performing information fusion in peer-to-peer resource-constrained networks


94. Transferable Black-Box One-Shot Forging of Watermarks via Image Preference Models


95. Symbolic Regression and Differentiable Fits in Beyond the Standard Model Physics


96. MolBridge: Atom-Level Joint Graph Refinement for Robust Drug-Drug Interaction Event Prediction


97. UniSE: A Unified Framework for Decoder-only Autoregressive LM-based Speech Enhancement


98. Dynamic Weight Adjustment for Knowledge Distillation: Leveraging Vision Transformer for High-Accuracy Lung Cancer Detection and Real-Time Deployment


99. Balancing Specialization and Centralization: A Multi-Agent Reinforcement Learning Benchmark for Sequential Industrial Control


100. FLAS: a combination of proactive and reactive auto-scaling architecture for distributed services


101. Relative-Based Scaling Law for Neural Language Models



103. The Impact of Negated Text on Hallucination with Large Language Models


104. Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models


105. What do AI-Generated Images Want?


106. Teaching Language Models to Reason with Tools


107. Multi-Task Deep Learning for Surface Metrology


108. GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?


109. MemER: Scaling Up Memory for Robot Control via Experience Retrieval


110. LEGO: A Lightweight and Efficient Multiple-Attribute Unlearning Framework for Recommender Systems


111. Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses


112. DB-FGA-Net: Dual Backbone Frequency Gated Attention Network for Multi-Class Classification with Grad-CAM Interpretability


113. RAG-Stack: Co-Optimizing RAG Quality and Performance From the Vector Database Perspective


114. A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization


115. Breakdance Video classification in the age of Generative AI


116. UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning


117. Context-level Language Modeling by Learning Predictive Context Embeddings


118. Limits of PRM-Guided Tree Search for Mathematical Reasoning with LLMs


119. Towards AI Agents for Course Instruction in Higher Education: Early Experiences from the Field


120. What Does It Take to Build a Performant Selective Classifier?


121. Tri-Modal Severity Fused Diagnosis across Depression and Post-traumatic Stress Disorders


122. Multi-Objective Reinforcement Learning with Max-Min Criterion: A Game-Theoretic Approach


123. Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context


124. Federated Learning via Meta-Variational Dropout


125. QKCV Attention: Enhancing Time Series Forecasting with Static Categorical Embeddings for Both Lightweight and Pre-trained Foundation Models


126. FinCARE: Financial Causal Analysis with Reasoning and Evidence


127. High-order Interactions Modeling for Interpretable Multi-Agent Q-Learning


128. Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents


129. Assessing the Feasibility of Early Cancer Detection Using Routine Laboratory Data: An Evaluation of Machine Learning Approaches on an Imbalanced Dataset


130. Stuck in the Matrix: Probing Spatial Reasoning in Large Language Models


131. PPMStereo: Pick-and-Play Memory Construction for Consistent Dynamic Stereo Matching


132. Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding


133. Collective Communication for 100k+ GPUs


134. IB-GAN: Disentangled Representation Learning with Information Bottleneck Generative Adversarial Networks


135. Are Stereotypes Leading LLMs’ Zero-Shot Stance Detection?


136. SAID: Empowering Large Language Models with Self-Activating Internal Defense


137. Leveraging the Power of Large Language Models in Entity Linking via Adaptive Routing and Targeted Reasoning


138. On the Structure of Stationary Solutions to McKean-Vlasov Equations with Applications to Noisy Transformers


139. StableSketcher: Enhancing Diffusion Model for Pixel-based Sketch Generation via Visual Question Answering Feedback


140. CreativityPrism: A Holistic Benchmark for Large Language Model Creativity


141. ShapeX: Shapelet-Driven Post Hoc Explanations for Time Series Classification Models


142. Ask What Your Country Can Do For You: Towards a Public Red Teaming Model


143. Approximate Model Predictive Control for Microgrid Energy Management via Imitation Learning


144. Beyond One-Way Influence: Bidirectional Opinion Dynamics in Multi-Turn Human-LLM Interactions


145. The Temporal Graph of Bitcoin Transactions


146. Optimized Distortion in Linear Social Choice


147. Forging GEMs: Advancing Greek NLP through Quality-Based Corpus Curation and Specialized Pre-training


148. Beyond MedQA: Towards Real-world Clinical Decision Making in the Era of LLMs


149. A Framework for the Adoption and Integration of Generative AI in Midsize Organizations and Enterprises (FAIGMOE)


150. LLM-Augmented Symbolic NLU System for More Reliable Continuous Causal Statement Interpretation


151. Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations


152. A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks


153. LyriCAR: A Difficulty-Aware Curriculum Reinforcement Learning Framework For Controllable Lyric Translation


154. On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization


155. Robust Reinforcement Learning in Finance: Modeling Market Impact with Elliptic Uncertainty Sets


156. Learning from Supervision with Semantic and Episodic Memory: A Reflective Approach to Agent Adaptation


157. Large Language Model enabled Mathematical Modeling


158. Can They Dixit? Yes they Can! Dixit as a Playground for Multimodal Language Model Capabilities


159. From Optimization to Prediction: Transformer-Based Path-Flow Estimation to the Traffic Assignment Problem


160. Quantifying Feature Importance for Online Content Moderation


161. Stream: Scaling up Mechanistic Interpretability to Long Context in LLMs via Sparse Attention


162. From Large to Small: Transferring CUDA Optimization Expertise via Reasoning Graph


163. An Evaluation of the Pedagogical Soundness and Usability of AI-Generated Lesson Plans Across Different Models and Prompt Frameworks in High-School Physics


164. Can Reasoning Models Obfuscate Reasoning? Stress-Testing Chain-of-Thought Monitorability


165. Prompt Decorators: A Declarative and Composable Syntax for Reasoning, Formatting, and Control in LLMs


166. CourtGuard: A Local, Multiagent Prompt Injection Classifier


167. SSL-SE-EEG: A Framework for Robust Learning from Unlabeled EEG Data with Self-Supervised Learning and Squeeze-Excitation Networks


168. SLYKLatent: A Learning Framework for Gaze Estimation Using Deep Facial Feature Learning