All AI Papers - 2026-02-27

3. Toward Expert Investment Teams: A Multi-Agent LLM System with Fine-Grained Trading Tasks


2. LLM Novice Uplift on Dual-Use, In Silico Biology Tasks


3. Generalized Rapid Action Value Estimation in Memory-Constrained Environments


4. Invariant Transformation and Resampling based Epistemic-Uncertainty Reduction


5. The logic of KM belief update is contained in the logic of AGM belief revision


6. ODEBrain: Continuous-Time EEG Graph for Modeling Dynamic Brain Networks


7. CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays


8. Evaluating Stochasticity in Deep Research Agents


9. AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning


10. Mitigating Legibility Tax with Decoupled Prover-Verifier Games


11. A Model-Free Universal AI


12. Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive


13. ReCoN-Ipsundrum: An Inspectable Recurrent Persistence Loop Agent with Affect-Coupled Control and Mechanism-Linked Consciousness Indicator Assays


14. SC-Arena: A Natural Language Benchmark for Single-Cell Reasoning with Knowledge-Augmented Evaluation


15. ESAA: Event Sourcing for Autonomous Agents in LLM-Based Software Engineering


16. A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring


17. PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering


18. The Trinity of Consistency as a Defining Principle for General World Models


19. On Sample-Efficient Generalized Planning via Learned Transition Models


20. Multi-Agent Large Language Model Based Emotional Detoxification Through Personalized Intensity Control for Consumer Protection


21. Three AI-agents walk into a bar... ‘Lord of the Flies’ tribalism emerges among smart AI-Agents


22. Enhancing CVRP Solver through LLM-driven Automatic Heuristic Design


23. Learning-based Multi-agent Race Strategies in Formula 1



25. RepSPD: Enhancing SPD Manifold Representation in EEGs via Dynamic Graphs


26. Modeling Expert AI Diagnostic Alignment via Immutable Inference Snapshots


27. SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy


28. Certified Circuits: Stability Guarantees for Mechanistic Circuits


29. FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning


30. General Agent Evaluation


31. OmniGAIA: Towards Native Omni-Modal AI Agents


32. Towards LLM-Empowered Knowledge Tracing via LLM-Student Hierarchical Behavior Alignment in Hyperbolic Space


33. The AI Research Assistant: Promise, Peril, and a Proof of Concept


34. DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation


35. FlexMS is a flexible framework for benchmarking deep learning-based mass spectrum prediction tools in metabolomics


36. When Should an AI Act? A Human-Centered Model of Scene, Context, and Behavior for Agentic AI Design


37. MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks


38. ClinDet-Bench: Beyond Abstention, Evaluating Judgment Determinability of LLMs in Clinical Decision-Making


39. AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications


40. Decomposing Physician Disagreement in HealthBench


41. Know What You Know: Metacognitive Entropy Calibration for Verifiable RL Reasoning


42. Generative Data Transformation: From Mixed to Unified Data


43. RLHFless: Serverless Computing for Efficient RLHF


44. Knob: A Physics-Inspired Gating Interface for Interpretable and Controllable Neural Dynamics


45. Toward Personalized LLM-Powered Agents: Foundations, Evaluation, and Future Directions


46. AHBid: An Adaptable Hierarchical Bidding Framework for Cross-Channel Advertising


47. MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios


48. SideQuest: Model-Driven KV Cache Management for Long-Horizon Agentic Reasoning


49. Correcting Human Labels for Rater Effects in AI Evaluation: An Item Response Theory Approach


50. Strategy Executability in Mathematical Reasoning: Leveraging Human-Model Differences for Effective Guidance


51. CourtGuard: A Model-Agnostic Framework for Zero-Shot Policy Adaptation in LLM Safety


52. Requesting Expert Reasoning: Augmenting LLM Agents with Learned Collaborative Intervention


53. Agentic AI for Intent-driven Optimization in Cell-free O-RAN


54. Cognitive Models and AI Algorithms Provide Templates for Designing Language Agents


55. A Mathematical Theory of Agency and Intelligence


56. Mirroring the Mind: Distilling Human-Like Metacognitive Strategies into Large Language Models


57. Mapping the Landscape of Artificial Intelligence in Life Cycle Assessment Using Large Language Models


58. VeRO: An Evaluation Harness for Agents to Optimize Agents


59. ConstraintBench: Benchmarking LLM Constraint Reasoning on Direct Optimization


60. CWM: Contrastive World Models for Action Feasibility Learning in Embodied Agent Pipelines


61. A Framework for Assessing AI Agent Decisions and Outcomes in AutoML Pipelines


62. How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision?


63. ArchAgent: Agentic AI-driven Computer Architecture Discovery


64. Epistemic Filtering and Collective Hallucination: A Jury Theorem for Confidence-Calibrated Agents


65. Exploring Human Behavior During Abstract Rule Inference and Problem Solving with the Cognitive Abstraction and Reasoning Corpus


66. Towards Autonomous Memory Agents


67. Vibe Researching as Wolf Coming: Can AI Agents with Skills Replace or Augment Social Scientists?


68. Agent Behavioral Contracts: Formal Specification and Runtime Enforcement for Reliable Autonomous AI Agents


69. Multi-Level Causal Embeddings


70. FIRE: A Comprehensive Benchmark for Financial Intelligence and Reasoning Evaluation


71. Graph Your Way to Inspiration: Integrating Co-Author Graphs with Retrieval-Augmented Generation for Large Language Model Based Scientific Idea Generation


72. Model Agreement via Anchoring


73. SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation


74. SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport


75. FlashOptim: Optimizers for Memory Efficient Training


76. Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset


77. Bitwise Systolic Array Architecture for Runtime-Reconfigurable Multi-precision Quantized Multiplication on Hardware Accelerators


78. Utilizing LLMs for Industrial Process Automation


79. Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction


80. Conformalized Neural Networks for Federated Uncertainty Quantification under Dual Heterogeneity


81. SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables


82. Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving


83. Spatio-Temporal Token Pruning for Efficient High-Resolution GUI Agents


84. Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments


85. MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction


86. Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?


87. ColoDiff: Integrating Dynamic Consistency With Content Awareness for Colonoscopy Video Generation


88. Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking


89. Efficient Encoder-Free Fourier-based 3D Large Multimodal Model


90. Modality Collapse as Mismatched Decoding: Information-Theoretic Limits of Multimodal LLMs


91. DyGnROLE: Modeling Asymmetry in Dynamic Graphs with Node-Role-Oriented Latent Encoding


92. Automated Vulnerability Detection in Source Code Using Deep Representation Learning


93. Delving into Adversarial Transferability on Image Classification: Review, Benchmark, and Evaluation


94. Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds


95. Quantity Convergence, Quality Divergence: Disentangling Fluency and Accuracy in L2 Mandarin Prosody


96. Make It Hard to Hear, Easy to Learn: Long-Form Bengali ASR and Speaker Diarization via Extreme Augmentation and Perfect Alignment


97. MoDora: Tree-Based Semi-Structured Document Analysis System


98. Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention


99. LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure


100. Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization


101. Scattering Transform for Auditory Attention Decoding


102. Residual Koopman Spectral Profiling for Predicting and Preventing Transformer Training Instability


103. Discovery of Interpretable Physical Laws in Materials via Language-Model-Guided Symbolic Regression


104. MM-NeuroOnco: A Multimodal Benchmark and Instruction Dataset for MRI-Based Brain Tumor Diagnosis


105. pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation


106. A Holistic Framework for Robust Bangla ASR and Speaker Diarization with Optimized VAD and CTC Alignment


107. NoRA: Breaking the Linear Ceiling of Low-Rank Adaptation via Manifold Expansion


108. Learning Tangent Bundles and Characteristic Classes with Autoencoder Atlases


109. Test-Time Scaling with Diffusion Language Models via Reward-Guided Stitching


110. MEDNA-DFM: A Dual-View FiLM-MoE Model for Explainable DNA Methylation Prediction


111. Decentralized Ranking Aggregation: Gossip Algorithms for Borda and Copeland Consensus


112. Moral Preferences of LLMs Under Directed Contextual Influence


113. TCM-DiffRAG: Personalized Syndrome Differentiation Reasoning Method for Traditional Chinese Medicine based on Knowledge Graph and Chain of Thought


114. Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks


115. Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving


116. Natural Language Declarative Prompting (NLD-P): A Modular Governance Method for Prompt Design Under Model Drift


117. Probing for Knowledge Attribution in Large Language Models


118. QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning


119. TherapyProbe: Generating Design Knowledge for Relational Safety in Mental Health Chatbots Through Adversarial Simulation


120. Distributed LLM Pretraining During Renewable Curtailment Windows: A Feasibility Study


121. Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction


122. AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation


123. Simulation-based Optimization for Augmented Reading


124. AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification


125. SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs


126. Same Words, Different Judgments: Modality Effects on Preference Alignment


127. IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation


128. Tokenization, Fusion and Decoupling: Bridging the Granularity Mismatch Between Large Language Models and Knowledge Graphs


129. Reinforcing Real-world Service Agents: Balancing Utility and Cost in Task-oriented Dialogue


130. SUPERGLASSES: Benchmarking Vision Language Models as Intelligent Agents for AI Smart Glasses


131. ViCLIP-OT: The First Foundation Vision-Language Model for Vietnamese Image-Text Retrieval with Optimal Transport


132. dLLM: Simple Diffusion Language Modeling


133. Instruction-based Image Editing with Planning, Reasoning, and Generation


134. ContextRL: Enhancing MLLM’s Knowledge Discovery Efficiency with Context-Augmented RL


135. CGSA: Class-Guided Slot-Aware Adaptation for Source-Free Object Detection


136. CoLyricist: Enhancing Lyric Writing with AI through Workflow-Aligned Support


137. Transformers converge to invariant algorithmic cores


138. BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model


139. TabDLM: Free-Form Tabular Data Generation via Joint Numerical-Language Diffusion


140. S2O: Early Stopping for Sparse Attention via Online Permutation


141. Guidance Matters: Rethinking the Evaluation Pitfall for Text-to-Image Generation


142. Quality-Aware Robust Multi-View Clustering for Heterogeneous Observation Noise


143. Addressing Climate Action Misperceptions with Generative AI


144. Operationalizing Fairness: Post-Hoc Threshold Optimization Under Hard Resource Limits


145. Stable Adaptive Thinking via Advantage Shaping and Length-Aware Gradient Regulation


146. Autoregressive Visual Decoding from EEG Signals


147. DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation


148. DisQ-HNet: A Disentangled Quantized Half-UNet for Interpretable Multimodal Image Synthesis Applications to Tau-PET Synthesis from T1 and FLAIR MRI


149. HARU-Net: Hybrid Attention Residual U-Net for Edge-Preserving Denoising in Cone-Beam Computed Tomography


150. Ruyi2 Technical Report


151. Generative Agents Navigating Digital Libraries


152. Predicting Tennis Serve directions with Machine Learning


153. Iterative Prompt Refinement for Dyslexia-Friendly Text Summarization Using GPT-4o


154. Efficient Dialect-Aware Modeling and Conditioning for Low-Resource Taiwanese Hakka Speech Processing


155. SignVLA: A Gloss-Free Vision-Language-Action Framework for Real-Time Sign Language-Guided Robotic Manipulation


156. Reinforcement-aware Knowledge Distillation for LLM Reasoning


157. From Shallow Bayesian Neural Networks to Gaussian Processes: General Convergence, Identifiability and Scalable Inference


158. Explainability-Aware Evaluation of Transfer Learning Models for IoT DDoS Detection Under Resource Constraints


159. Importance of Prompt Optimisation for Error Detection in Medical Notes Using Language Models


160. Sydney Telling Fables on AI and Humans: A Corpus Tracing Memetic Transfer of Persona between LLMs


161. Beyond Dominant Patches: Spatial Credit Redistribution For Grounded Vision-Language Models


162. Automating the Detection of Requirement Dependencies Using Large Language Models


163. Silent Egress: When Implicit Prompt Injection Makes LLM Agents Leak Without a Trace


164. A Fusion of context-aware based BanglaBERT and Two-Layer Stacked LSTM Framework for Multi-Label Cyberbullying Detection


165. ECHO: Encoding Communities via High-order Operators


166. From Bias to Balance: Fairness-Aware Paper Recommendation for Equitable Peer Review


167. veScale-FSDP: Flexible and High-Performance FSDP at Scale


168. GetBatch: Distributed Multi-Object Retrieval for ML Data Loading


169. Calibrated Test-Time Guidance for Bayesian Inference


170. HubScan: Detecting Hubness Poisoning in Retrieval-Augmented Generation Systems


171. Revisiting Chebyshev Polynomial and Anisotropic RBF Models for Tabular Regression


172. Contextual Memory Virtualisation: DAG-Based State Management and Structurally Lossless Trimming for LLM Agents


173. Enhancing Renal Tumor Malignancy Prediction: Deep Learning with Automatic 3D CT Organ Focused Attention


174. AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction


175. EyeLayer: Integrating Human Attention Patterns into LLM-Based Code Summarization


176. Learning geometry-dependent lead-field operators for forward ECG modeling


177. Scaling In, Not Up? Testing Thick Citation Context Analysis with GPT-5 and Fragile Prompts


178. GRAU: Generic Reconfigurable Activation Unit Design for Neural Network Hardware Accelerators


179. Decoder-based Sense Knowledge Distillation


180. Enabling clinical use of foundation models in histopathology


181. Structure and Redundancy in Large Language Models: A Spectral Study via Random Matrix Theory


182. A 1/R Law for Kurtosis Contrast in Balanced Mixtures


183. Training Agents to Self-Report Misbehavior


184. Decoding the Hook: A Multimodal LLM Framework for Analyzing the Hooking Period of Video Ads


185. AviaSafe: A Physics-Informed Data-Driven Model for Aviation Safety-Critical Cloud Forecasts


186. Learning Rewards, Not Labels: Adversarial Inverse Reinforcement Learning for Machinery Fault Detection


187. UpSkill: Mutual Information Skill Learning for Structured Response Diversity in LLMs


188. Manifold of Failure: Behavioral Attraction Basins in Language Models


189. Early Risk Stratification of Dosing Errors in Clinical Trials Using Machine Learning


190. Integrating Machine Learning Ensembles and Large Language Models for Heart Disease Prediction Using Voting Fusion


191. Learning to reconstruct from saturated data: audio declipping and high-dynamic range imaging


192. Positional-aware Spatio-Temporal Network for Large-Scale Traffic Prediction


193. CryoNet.Refine: A One-step Diffusion Model for Rapid Refinement of Structural Models with Cryo-EM Density Map Restraints


194. Poisoned Acoustics


195. Deep Sequence Modeling with Quantum Dynamics: Language as a Wave Function


196. Causal Direction from Convergence Time: Faster Training in the True Causal Direction


197. Zatom-1: A Multimodal Flow Foundation Model for 3D Molecules and Materials


198. Multi-Dimensional Spectral Geometry of Biological Knowledge in Single-Cell Transformer Representations


199. Analysis of LLMs Against Prompt Injection and Jailbreak Attacks


200. From Prompts to Performance: Evaluating LLMs for Task-based Parallel Code Generation


201. TT-SEAL: TTD-Aware Selective Encryption for Adversarially-Robust and Low-Latency Edge AI


202. Optimized Disaster Recovery for Distributed Storage Systems: Lightweight Metadata Architectures to Overcome Cryptographic Hashing Bottleneck


203. Unsupervised Denoising of Diffusion-Weighted Images with Bias and Variance Corrected Noise Modeling


204. FM-RME: Foundation Model Empowered Radio Map Estimation


205. To Deceive is to Teach? Forging Perceptual Robustness via Adversarial Reinforcement Learning


206. SmartChunk Retrieval: Query-Aware Chunk Compression with Planning for Efficient Document RAG


207. DS SERVE: A Framework for Efficient and Scalable Neural Retrieval


208. Misinformation Exposure in the Chinese Web: A Cross-System Evaluation of Search Engines, LLMs, and AI Overviews


209. What Makes an Ideal Quote? Recommending “Unexpected yet Rational” Quotations via Novelty


210. Comparative Analysis of Neural Retriever-Reranker Pipelines for Retrieval-Augmented Generation over Knowledge Graphs in E-commerce Applications


211. RAGdb: A Zero-Dependency, Embeddable Architecture for Multimodal Retrieval-Augmented Generation on the Edge


212. Retrieval-Augmented Generation Assistant for Anatomical Pathology Laboratories


213. Enriching Taxonomies Using Large Language Models


214. Survey on Neural Routing Solvers


215. Duel-Evolve: Reward-Free Test-Time Scaling via LLM Self-Preferences