전체 AI 논문 - 2025-10-29

1. Bridging Tool Dependencies and Domain Knowledge: A Graph-Based Framework for In-Context Planning


2. OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs


3. Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning


4. FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling


5. Generative AI for Healthcare: Fundamentals, Challenges, and Perspectives


6. From Cross-Task Examples to In-Task Prompts: A Graph-Based Pseudo-Labeling Framework for In-context Learning


7. Adaptive Surrogate Gradients for Sequential Reinforcement Learning in Spiking Neural Networks


8. Affordance Representation and Recognition for Autonomous Agents



10. Human-Level Reasoning: A Comparative Study of Large Language Models on Logical and Abstract Reasoning


11. OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows


12. APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training


13. Improving LLM Reasoning via Dependency-Aware Query Decomposition and Logic-Parallel Content Expansion


14. Policy Cards: Machine-Readable Runtime Governance for Autonomous AI Agents


15. An N-of-1 Artificial Intelligence Ecosystem for Precision Medicine


16. A Unified Geometric Space Bridging AI Models and the Human Brain


17. VDSAgents: A PCS-Guided Multi-Agent System for Veridical Data Science Automation


18. Generative Large Language Models (gLLMs) in Content Analysis: A Practical Guide for Communication Research


19. Retrieval and Argumentation Enhanced Multi-Agent LLMs for Judgmental Forecasting


20. Verifying Large Language Models’ Reasoning Paths via Correlation Matrix Rank


21. Investigating Intra-Abstraction Policies For Non-exact Abstraction Algorithms


22. MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools


23. MGA: Memory-Driven GUI Agent for Observation-Centric Interaction


24. UniPlanner: A Unified Motion Planning Framework for Autonomous Vehicle Decision-Making Systems via Multi-Dataset Integration


25. BLM$_1$: A Boundless Large Model for Cross-Space, Cross-Task, and Cross-Embodiment Learning


26. BMGQ: A Bottom-up Method for Generating Complex Multi-hop Reasoning Questions from Semi-structured Data


27. From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems


28. HistoLens: An Interactive XAI Toolkit for Verifying and Mitigating Flaws in Vision-Language Models for Histopathology


29. Modeling Electric Vehicle Car-Following Behavior: Classical vs Machine Learning Approach


30. LLMLogAnalyzer: A Clustering-Based Log Analysis Chatbot using Large Language Models


31. OneCast: Structured Decomposition and Modular Generation for Cross-Domain Time Series Forecasting


32. Discovering Heuristics with Large Language Models (LLMs) for Mixed-Integer Programs: Single-Machine Scheduling


33. Learning Individual Movement Shifts After Urban Disruptions with Social Infrastructure Reliance


34. The Sign Estimator: LLM Alignment in the Face of Choice Heterogeneity


35. Decentralized Causal Discovery using Judo Calculus


36. Latent Chain-of-Thought for Visual Reasoning


37. Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges


38. Hybrid Modeling, Sim-to-Real Reinforcement Learning, and Large Language Model Driven Control for Digital Twins


39. Generating Creative Chess Puzzles


40. From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production


41. Decentralized Multi-Agent Goal Assignment for Path Planning using Large Language Models


42. ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents


43. Why Foundation Models in Pathology Are Failing


44. Evaluating In Silico Creativity: An Expert Review of AI Chess Compositions


45. Test-Time Tuned Language Models Enable End-to-end De Novo Molecular Structure Generation from MS/MS Spectra


46. Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability


47. AI and the Decentering of Disciplinary Creativity


48. Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents


49. Does Object Binding Naturally Emerge in Large Pretrained Vision Transformers?


50. ComboBench: Can LLMs Manipulate Physical Devices to Play Virtual Reality Games?


51. Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents


52. Tongyi DeepResearch Technical Report


53. Greedy Sampling Is Provably Efficient for RLHF


54. AgentFold: Long-Horizon Web Agents with Proactive Context Management


55. ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking


56. Repurposing Synthetic Data for Fine-grained Search Agent Supervision


57. Fast algorithms enabling optimization and deep learning for photoacoustic tomography in a circular detection geometry


58. Dissecting Role Cognition in Medical LLMs via Neuronal Ablation


59. Learning to Drive Safely with Hybrid Options


60. Multi-Agent Scenario Generation in Roundabouts with a Transformer-enhanced Conditional Variational Autoencoder


61. InteractComp: Evaluating Search Agents With Ambiguous Queries


62. The Cost of Robustness: Tighter Bounds on Parameter Complexity for Robust Memorization in ReLU Nets


63. Causal Ordering for Structure Learning From Time Series


64. All in one timestep: Enhancing Sparsity and Energy efficiency in Multi-level Spiking Neural Networks


65. Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation


66. DistDF: Time-Series Forecasting Needs Joint-Distribution Wasserstein Alignment


67. LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis


68. Quantum-Resistant Networks Using Post-Quantum Cryptography


69. Audio Signal Processing Using Time Domain Mel-Frequency Wavelet Coefficient


70. Local Performance vs. Out-of-Distribution Generalization: An Empirical Analysis of Personalized Federated Learning in Heterogeneous Data Environments


71. Design and Optimization of Cloud Native Homomorphic Encryption Workflows for Privacy-Preserving ML Inference


72. Online neural fusion of distortionless differential beamformers for robust speech enhancement


73. Diffusion Models for Wireless Transceivers: From Pilot-Efficient Channel Estimation to AI-Native 6G Receivers


74. A word association network methodology for evaluating implicit biases in LLMs compared to humans


75. Sample-efficient and Scalable Exploration in Continuous-Time RL


76. Mitigating Hallucination in Large Language Models (LLMs): An Application-Oriented Survey on RAG, Reasoning, and Agentic Systems


77. Iterative Critique-Refine Framework for Enhancing LLM Personalization


78. Charting the European LLM Benchmarking Landscape: A New Taxonomy and a Set of Best Practices


79. Rethinking Visual Intelligence: Insights from Video Pretraining


80. Can LLMs Write Faithfully? An Agent-Based Evaluation of LLM-generated Islamic Content


81. MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation


82. Metadata-Driven Retrieval-Augmented Generation for Financial Question Answering


83. Perception Learning: A Formal Separation of Sensory Representation Learning from Decision Learning


84. LongWeave: A Long-Form Generation Benchmark Bridging Real-World Relevance and Verifiability


85. Beyond MCQ: An Open-Ended Arabic Cultural QA Benchmark with Dialect Variants


86. Few-Shot Remote Sensing Image Scene Classification with CLIP and Prompt Learning


87. Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning


88. Transformers can do Bayesian Clustering


89. ViPER: Empowering the Self-Evolution of Visual Perception Abilities in Vision-Language Model


90. Training-free Source Attribution of AI-generated Images via Resynthesis


91. Survey and Tutorial of Reinforcement Learning Methods in Process Systems Engineering


92. DynaRend: Learning 3D Dynamics via Masked Future Rendering for Robotic Manipulation


93. Trajectory Design for UAV-Based Low-Altitude Wireless Networks in Unknown Environments: A Digital Twin-Assisted TD3 Approach


94. Enabling Near-realtime Remote Sensing via Satellite-Ground Collaboration of Large Vision-Language Models


95. MAGNET: A Multi-Graph Attentional Network for Code Clone Detection


96. PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling


97. Closing Gaps: An Imputation Analysis of ICU Vital Signs


98. MuSaG: A Multimodal German Sarcasm Dataset with Full-Modal Annotations


99. SymMaP: Improving Computational Efficiency in Linear Solvers through Symbolic Preconditioning


100. Self-supervised Synthetic Pretraining for Inference of Stellar Mass Embedded in Dense Gas


101. Enhancing Vision-Language Models for Autonomous Driving through Task-Specific Prompting and Spatial Reasoning


102. Ko-MuSR: A Multistep Soft Reasoning Benchmark for LLMs Capable of Understanding Korean


103. Beyond Line-Level Filtering for the Pretraining Corpora of LLMs


104. VC4VG: Optimizing Video Captions for Text-to-Video Generation


105. Compositional Image Synthesis with Inference-Time Scaling


106. LagMemo: Language 3D Gaussian Splatting Memory for Multi-modal Open-vocabulary Multi-goal Visual Navigation


107. Taming the Tail: NoI Topology Synthesis for Mixed DL Workloads on Chiplet-Based Accelerators


108. Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation


109. Learning Parameterized Skills from Demonstrations


110. Covert Surveillance in Smart Devices: A SCOUR Framework Analysis of Youth Privacy Implications


111. FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic


112. PULSE: Privileged Knowledge Transfer from Electrodermal Activity to Low-Cost Sensors for Stress Monitoring


113. SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration


114. Learning from History: A Retrieval-Augmented Framework for Spatiotemporal Prediction


115. Causal-Aware Generative Adversarial Networks with Reinforcement Learning


116. Geometric Algorithms for Neural Combinatorial Optimization with Constraints


117. ResNet: Enabling Deep Convolutional Neural Networks through Residual Learning


118. Improved Accuracy of Robot Localization Using 3-D LiDAR in a Hippocampus-Inspired Model


119. Spatio-temporal Multivariate Time Series Forecast with Chosen Variables


120. NeuroPathNet: Dynamic Path Trajectory Learning for Brain Functional Connectivity Analysis


121. SpecKD: Speculative Decoding for Effective Knowledge Distillation of LLMs


122. Teaching LLMs to Abstain via Fine-Grained Semantic Confidence Reward


123. Lifecycle-Aware code generation: Leveraging Software Engineering Phases in LLMs


124. Training-Free Safe Text Embedding Guidance for Text-to-Image Diffusion Models


125. Mars-Bench: A Benchmark for Evaluating Foundation Models for Mars Science Tasks


126. STNet: Spectral Transformation Network for Solving Operator Eigenvalue Problem


127. HyperGraphX: Graph Transductive Learning with Hyperdimensional Computing and Message Passing


128. Diffusion Adaptive Text Embedding for Text-to-Image Diffusion Models


129. An efficient probabilistic hardware architecture for diffusion-like models


130. SafeVision: Efficient Image Guardrail with Robust Policy Adherence and Explainability


131. Neural USD: An object-centric framework for iterative editing and control


132. Uncovering the Potential Risks in Unlearning: Danger of English-only Unlearning in Multilingual LLMs


133. ChessQA: Evaluating Large Language Models for Chess Understanding


134. Auto prompting without training labels: An LLM cascade for product quality assessment in e-commerce catalogs


135. Modeling Biological Multifunctionality with Echo State Networks


136. Scalable GPU-Based Integrity Verification for Large Machine Learning Models


137. MFiSP: A Multimodal Fire Spread Prediction Framework


138. Agent-based Automated Claim Matching with Instruction-following LLMs


139. Key and Value Weights Are Probably All You Need: On the Necessity of the Query, Key, Value weight Triplet in Decoder-Only Transformers


140. DynaStride: Dynamic Stride Windowing with MMCoT for Instructional Multi-Scene Captioning


141. Group Interventions on Deep Networks for Causal Discovery in Subsystems


142. RS-ORT: A Reduced-Space Branch-and-Bound Algorithm for Optimal Regression Trees


143. Evaluating the effectiveness of LLM-based interoperability


144. PRO: Enabling Precise and Robust Text Watermark for Open-Source LLMs


145. OraPlan-SQL: A Planning-Centric Framework for Complex Bilingual NL2SQL Reasoning


146. A PDE-Informed Latent Diffusion Model for 2-m Temperature Downscaling


147. Can LLMs Narrate Tabular Data? An Evaluation Framework for Natural Language Representations of Text-to-SQL System Outputs


148. A Neural Model for Contextual Biasing Score Learning and Filtering


149. CRADLE Bench: A Clinician-Annotated Benchmark for Multi-Faceted Mental Health Crisis and Safety Risk Detection


150. A geometric and deep learning reproducible pipeline for monitoring floating anthropogenic debris in urban rivers using in situ cameras


151. CountFormer: A Transformer Framework for Learning Visual Repetition and Structure in Class-Agnostic Object Counting


152. Explainable Detection of AI-Generated Images with Artifact Localization Using Faster-Than-Lies and Vision-Language Models for Edge Devices


153. TDFlow: Agentic Workflows for Test Driven Software Engineering


154. Explaining Robustness to Catastrophic Forgetting Through Incremental Concept Formation


155. Debiasing Reward Models by Representation Learning with Guarantees


156. On the Societal Impact of Machine Learning


157. Parallel BiLSTM-Transformer networks for forecasting chaotic dynamics


158. Beyond Prompt Engineering: Neuro-Symbolic-Causal Architecture for Robust Multi-Objective AI Agents


159. QueryIPI: Query-agnostic Indirect Prompt Injection on Coding Agents


160. RefleXGen:The unexamined code is not worth using


161. MCPGuard : Automatically Detecting Vulnerabilities in MCP Servers


162. Sparsity and Superposition in Mixture of Experts


163. What Work is AI Actually Doing? Uncovering the Drivers of Generative AI Adoption


164. Traffic flow forecasting, STL decomposition, Hybrid model, LSTM, ARIMA, XGBoost, Intelligent transportation systems


165. Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization


166. Transformers from Compressed Representations


167. Agentsway – Software Development Methodology for AI Agents-based Teams


168. Quanvolutional Neural Networks for Pneumonia Detection: An Efficient Quantum-Assisted Feature Extraction Paradigm


169. Aligning Diffusion Language Models via Unpaired Preference Optimization


170. Error Adjustment Based on Spatiotemporal Correlation Fusion for Traffic Forecasting


171. The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models


172. Beyond Hidden-Layer Manipulation: Semantically-Aware Logit Interventions for Debiasing LLMs


173. Efficient Low Rank Attention for Long-Context Inference in Large Language Models


174. RoGBot: Relationship-Oblivious Graph-based Neural Network with Contextual Knowledge for Bot Detection


175. SAND: A Self-supervised and Adaptive NAS-Driven Framework for Hardware Trojan Detection


176. VisCoder2: Building Multi-Language Visualization Coding Agents


177. Spatially Aware Linear Transformer (SAL-T) for Particle Jet Tagging


178. Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning


179. Integrating Genomics into Multimodal EHR Foundation Models


180. Bridging Function Approximation and Device Physics via Negative Differential Resistance Networks


181. Combining Textual and Structural Information for Premise Selection in Lean


182. Flight Delay Prediction via Cross-Modality Adaptation of Large Language Models and Aircraft Trajectory Representation


183. Help the machine to help you: an evaluation in the wild of egocentric data cleaning via skeptical learning


184. Monotone and Separable Set Functions: Characterizations and Neural Models


185. Noise is All You Need: Solving Linear Inverse Problems by Noise Combination Sampling with Diffusion Models


186. LLMComp: A Language Modeling Paradigm for Error-Bounded Scientific Data Compression


187. Beyond Pairwise: Empowering LLM Alignment With Ranked Choice Modeling


188. NUM2EVENT: Interpretable Event Reasoning from Numerical time-series


189. Chain of Execution Supervision Promotes General Reasoning in Large Language Models


190. AI-Driven Development of a Publishing Imprint: Xynapse Traces


191. From Detection to Discovery: A Closed-Loop Approach for Simultaneous and Continuous Medical Knowledge Expansion and Depression Detection on Social Media


192. Speeding Up MACE: Low-Precision Tricks for Equivarient Force Fields


193. Genotype-Phenotype Integration through Machine Learning and Personalized Gene Regulatory Networks for Cancer Metastasis Prediction


194. Short Ticketing Detection Framework Analysis Report


195. An Enhanced Dual Transformer Contrastive Network for Multimodal Sentiment Analysis


196. Feedback Lunch: Deep Feedback Codes for Wiretap Channels


197. Preference Learning with Response Time: Robust Losses and Guarantees


198. Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide