전체 AI 논문 - 2026-04-02

1. HippoCamp: Benchmarking Contextual Agents on Personal Computers


2. Therefore I am. I Think


3. Detecting Multi-Agent Collusion Through Multi-Agent Interpretability


4. Adversarial Moral Stress Testing of Large Language Models


5. OmniMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory


6. PsychAgent: An Experience-Driven Lifelong Learning Agent for Self-Evolving Psychological Counselor


7. Experience as a Compass: Multi-agent RAG with Evolving Orchestration and Agent Prompts


8. Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models


9. Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants


10. Preference Guided Iterated Pareto Referent Optimisation for Accessible Route Planning


11. RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning


12. UK AISI Alignment Evaluation Case-Study


13. CircuitProbe: Predicting Reasoning Circuits in Transformers via Stability Zone Detection


14. Agent psychometrics: Task-level performance prediction in agentic coding benchmarks


15. Ontology-Constrained Neural Reasoning in Enterprise Agentic Systems: A Neurosymbolic Architecture for Domain-Grounded AI Agents


16. BloClaw: An Omniscient, Multi-Modal Agentic Workspace for Next-Generation Scientific Discovery


17. Does Unification Come at a Cost? Uni-SafeBench: A Safety Benchmark for Unified Multimodal Large Models


18. Adaptive Parallel Monte Carlo Tree Search for Efficient Test-time Compute Scaling


19. The Silicon Mirror: Dynamic Behavioral Gating for Anti-Sycophancy in LLM Agents


20. Logarithmic Scores, Power-Law Discoveries: Disentangling Measurement from Coverage in Agent-Based Evaluation


21. Towards Reliable Truth-Aligned Uncertainty Estimation in Large Language Models


22. Execution-Verified Reinforcement Learning for Optimization Modeling


23. Self-Routing: Parameter-Free Expert Routing from Hidden States


24. Decision-Centric Design for LLM Systems


25. In harmony with gpt-oss


26. Signals: Trajectory Sampling and Triage for Agentic Interactions


27. Collaborative AI Agents and Critics for Fault Detection and Cause Analysis in Network Telemetry


28. Improvisational Games as a Benchmark for Social Intelligence of AI Agents: The Case of Connections


29. Human-in-the-Loop Control of Objective Drift in LLM-Assisted Computer Science Education


30. A Safety-Aware Role-Orchestrated Multi-Agent LLM Framework for Behavioral Health Communication Simulation


31. Open, Reliable, and Collective: A Community-Driven Framework for Tool-Using AI Agents


32. One Panel Does Not Fit All: Case-Adaptive Multi-Agent Deliberation for Clinical Prediction


33. How Emotion Shapes the Behavior of LLMs and Agents: A Mechanistic Study


34. LAtent Phase Inference from Short time sequences using SHallow REcurrent Decoders (LAPIS-SHRED)


35. The Recipe Matters More Than the Kitchen:Mathematical Foundations of the AI Weather Prediction Pipeline


36. $\texttt{YC-Bench}$: Benchmarking AI Agents for Long-Term Planning and Consistent Execution


37. CliffSearch: Structured Agentic Co-Evolution over Theory and Code for Scientific Algorithm Discovery


38. Neural Harmonic Textures for High-Quality Primitive Based Neural Reconstruction


39. ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget


40. A ROS 2 Wrapper for Florence-2: Multi-Mode Local Vision-Language Inference for Robotic Systems


41. Screening Is Enough


42. Online Reasoning Calibration: Test-Time Training Enables Generalizable Conformal LLM Reasoning


43. AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation


44. Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning


45. Looking into a Pixel by Nonlinear Unmixing – A Generative Approach


46. Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers


47. Lightweight Prompt-Guided CLIP Adaptation for Monocular Depth Estimation


48. Trust and Reliance on AI in Education: AI Literacy and Need for Cognition as Moderators


49. Approximating Pareto Frontiers in Stochastic Multi-Objective Optimization via Hashing and Randomization


50. Temporal Dependencies in In-Context Learning: The Role of Induction Heads


51. TRACE: Training-Free Partial Audio Deepfake Detection via Embedding Trajectory Analysis of Speech Foundation Models


52. VibeGuard: A Security Gate Framework for AI-Generated Code


53. Adversarial Attacks in AI-Driven RAN Slicing: SLA Violations and Recovery


54. Automated Framework to Evaluate and Harden LLM System Instructions against Encoding Attacks


55. Aligning Recommendations with User Popularity Preferences


56. Revision or Re-Solving? Decomposing Second-Pass Gains in Multi-LLM Pipelines


57. Fast and Accurate Probing of In-Training LLMs’ Downstream Performances


58. Transfer learning for nonparametric Bayesian networks


59. OrgAgent: Organize Your Multi-Agent System like a Company


60. Query-Conditioned Evidential Keyframe Sampling for MLLM-Based Long-Form Video Understanding


61. EgoSim: Egocentric World Simulator for Embodied Interaction Generation


62. Multimodal Analysis of State-Funded News Coverage of the Israel-Hamas War on YouTube Shorts


63. Bridging Structured Knowledge and Data: A Unified Framework with Finance Applications


64. Do Phone-Use Agents Respect Your Privacy?


65. Dual Optimal: Make Your LLM Peer-like with Dignity


66. Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization


67. WARP: Guaranteed Inner-Layer Repair of NLP Transformers


68. Learning Quantised Structure-Preserving Motion Representations for Dance Fingerprinting


69. Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis


70. Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time


71. PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding


72. KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection


73. Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies


74. Emotion Entanglement and Bayesian Inference for Multi-Dimensional Emotion Understanding


75. DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale


76. Routing-Free Mixture-of-Experts


77. Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer


78. Thinking Wrong in Silence: Backdoor Attacks on Continuous Latent Reasoning


79. IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models


80. BioCOMPASS: Integrating Biomarkers into Transformer-Based Immunotherapy Response Prediction


81. Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction


82. A CEFR-Inspired Classification Framework with Fuzzy C-Means To Automate Assessment of Programming Skills in Scratch


83. GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization


84. To Memorize or to Retrieve: Scaling Laws for RAG-Considerate Pretraining


85. AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications


86. Learning to Hint for Reinforcement Learning


87. Internal APIs Are All You Need: Shadow APIs, Shared Discovery, and the Case Against Browser-First Agent Architectures


88. Procela: Epistemic Governance in Mechanistic Simulations Under Structural Uncertainty


89. Streaming Model Cascades for Semantic SQL


90. UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems


91. HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation


92. MATHENA: Mamba-based Architectural Tooth Hierarchical Estimator and Holistic Evaluation Network for Anatomy


93. Optimsyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation


94. Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding


95. Toward Optimal Sampling Rate Selection and Unbiased Classification for Precise Animal Activity Recognition


96. MAESIL: Masked Autoencoder for Enhanced Self-supervised Medical Image Learning


97. MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding


98. Towards Initialization-dependent and Non-vacuous Generalization Bounds for Overparameterized Shallow Neural Networks


99. A Reasoning-Enabled Vision-Language Foundation Model for Chest X-ray Interpretation


100. Executing as You Generate: Hiding Execution Latency in LLM Code Generation


101. Not My Truce: Personality Differences in AI-Mediated Workplace Negotiation


102. First Logit Boosting: Visual Grounding Method to Mitigate Object Hallucination in Large Vision-Language Models


103. Polysemanticity or Polysemy? Lexical Identity Confounds Superposition Metrics


104. G-Drift MIA: Membership Inference via Gradient-Induced Feature Drift in LLMs


105. Learning Humanoid Navigation from Human Data


106. COTTA: Context-Aware Transfer Adaptation for Trajectory Prediction in Autonomous Driving


107. Improving Generalization of Deep Learning for Brain Metastases Segmentation Across Institutions


108. Deep Networks Favor Simple Data


109. EvolveTool-Bench: Evaluating the Quality of LLM-Generated Tool Libraries as Software Artifacts


110. RAGShield: Provenance-Verified Defense-in-Depth Against Knowledge Base Poisoning in Government Retrieval-Augmented Generation Systems


111. Go Big or Go Home: Simulating Mobbing Behavior with Braitenbergian Robots


112. The Persistent Vulnerability of Aligned AI Systems


113. Prompt-Guided Prefiltering for VLM Image Compression


114. Robust Multimodal Safety via Conditional Decoding


115. Asymmetric Actor-Critic for Multi-turn LLM Agents


116. SANA I2I: A Text Free Flow Matching Framework for Paired Image to Image Translation with a Case Study in Fetal MRI Artifact Reduction


117. VeriAct: Beyond Verifiability – Agentic Synthesis of Correct and Complete Formal Specifications


118. The Geometry of Compromise: Unlocking Generative Capabilities via Controllable Modality Alignment


119. Hybrid Energy-Based Models for Physical AI: Provably Stable Identification of Port-Hamiltonian Dynamics


120. Benchmarking Interaction, Beyond Policy: a Reproducible Benchmark for Collaborative Instance Object Navigation


121. LLM Essay Scoring Under Holistic and Analytic Rubrics: Prompt Effects and Bias


122. Hierarchical Apprenticeship Learning from Imperfect Demonstrations with Evolving Rewards


123. REM-CTX: Automated Peer Review via Reinforcement Learning with Auxiliary Context


124. Softmax gradient policy for variance minimization and risk-averse multi armed bandits


125. AI-Mediated Explainable Regulation for Justice


126. MAC-Attention: a Match-Amend-Complete Scheme for Fast and Accurate Attention Computation


127. Diversity-Aware Reverse Kullback-Leibler Divergence for Large Language Model Distillation


128. QUEST: A robust attention formulation using query-modulated spherical attention


129. Making Sense of AI Agents Hype: Adoption, Architectures, and Takeaways from Practitioners


130. Explainable AI for Blind and Low-Vision Users: Navigating Trust, Modality, and Interpretability in the Agentic Era


131. Agentic AI and Occupational Displacement: A Multi-Regional Task Exposure Analysis of Emerging Labor Market Disruption


132. NFC based inventory control system for secure and efficient communication


133. Unified Architecture Metamodel of Information Systems Developed by Generative AI


134. Neural-Assisted in-Motion Self-Heading Alignment


135. A Study on the Impact of Fault localization Granularity for Repository-Scale Code Repair Tasks


136. Epileptic Seizure Detection in Separate Frequency Bands Using Feature Analysis and Graph Convolutional Neural Network (GCN) from Electroencephalogram (EEG) Signals


137. Oblivion: Self-Adaptive Agentic Memory Control through Decay-Driven Activation


138. From Domain Understanding to Design Readiness: a playbook for GenAI-supported learning in Software Engineering


139. Hierarchical Pre-Training of Vision Encoders with Large Language Models


140. Beyond Symbolic Control: Societal Consequences of AI-Driven Workforce Displacement and the Imperative for Genuine Human Oversight Architectures


141. Learning to Play Blackjack: A Curriculum Learning Perspective


142. Terminal Agents Suffice for Enterprise Automation


143. Empirical Validation of the Classification-Verification Dichotomy for AI Safety Gates


144. Brain MR Image Synthesis with Multi-contrast Self-attention GAN


145. Perspective: Towards sustainable exploration of chemical spaces with machine learning


146. Temporal Memory for Resource-Constrained Agents: Continual Learning via Stochastic Compress-Add-Smooth


147. GenoBERT: A Language Model for Accurate Genotype Imputation


148. Towards Automatic Soccer Commentary Generation with Knowledge-Enhanced Visual Reasoning


149. The Energy Footprint of LLM-Based Environmental Analysis: LLMs and Domain Products


150. Task-Centric Personalized Federated Fine-Tuning of Language Models


151. Whittaker-Henderson smoother for long satellite image time series interpolation


152. DriftScript: A Domain-Specific Language for Programming Non-Axiomatic Reasoning Agents


153. When and Where: A Model Hippocampal Network Unifies Formation of Time Cells and Place Cells


154. “Who Am I, and Who Else Is Here?” Behavioral Differentiation Without Role Assignment in Multi-Agent LLM Systems


155. Brevity Constraints Reverse Performance Hierarchies in Language Models


156. WHBench: Evaluating Frontier LLMs with Expert-in-the-Loop Validation on Women’s Health Topics


157. Criterion Validity of LLM-as-Judge for Business Outcomes in Conversational Commerce


158. How Do Language Models Process Ethical Instructions? Deliberation, Consistency, and Other-Recognition Across Four Models


159. The Chronicles of RiDiC: Generating Datasets with Controlled Popularity Distribution for Long-form Factuality Evaluation


160. Think Twice Before You Write – an Entropy-based Decoding Strategy to Enhance LLM Reasoning


161. Are they human? Detecting large language models by probing human memory constraints


162. MSA-Thinker: Discrimination-Calibration Reasoning with Hint-Guided Reinforcement Learning for Multimodal Sentiment Analysis


163. Finding and Reactivating Post-Trained LLMs’ Hidden Safety Mechanisms


164. Quantifying Gender Bias in Large Language Models: When ChatGPT Becomes a Hiring Manager


165. Can LLMs Perceive Time? An Empirical Investigation


166. Eyla: Toward an Identity-Anchored LLM Architecture with Integrated Biological Priors – Vision, Implementation Attempt, and Lessons from AI-Assisted Development


167. How Trustworthy Are LLM-as-Judge Ratings for Interpretive Responses? Implications for Qualitative Research Workflows


168. Dynin-Omni: Omnimodal Unified Large Diffusion Language Model


169. LinearARD: Linear-Memory Attention Distillation for RoPE Restoration


170. A Reliability Evaluation of Hybrid Deterministic-LLM Based Approaches for Academic Course Registration PDF Information Extraction


171. Benchmark for Assessing Olfactory Perception of Large Language Models


172. Two-Stage Optimizer-Aware Online Data Selection for Large Language Models


173. Agentic AI – Physicist Collaboration in Experimental Particle Physics: A Proof-of-Concept Measurement with LEP Open Data