All AI Papers - 2026-05-01

1. Synthetic Computers at Scale for Long-Horizon Productivity Simulation


2. LLM as Clinical Graph Structure Refiner: Enhancing Representation Learning in EEG Seizure Diagnosis


3. Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists


4. Normativity and Productivism: Ableist Intelligence? A Degrowth Analysis of AI Sign Language Translation Tools for Deaf People


5. Splitting Argumentation Frameworks with Collective Attacks and Supports


6. Mapping the Methodological Space of Classroom Interaction Research: Scale, Duration, and Modality in an Age of AI


7. What Makes a Good Terminal-Agent Benchmark Task: A Guideline for Adversarial, Difficult, and Legible Evaluation Design


8. Characterizing the Consistency of the Emergent Misalignment Persona


9. RHyVE: Competence-Aware Verification and Phase-Aware Deployment for LLM-Generated Reward Hypotheses


10. Agent-Agnostic Evaluation of SQL Accuracy in Production Text-to-SQL Systems


11. Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents


12. SpecVQA: A Benchmark for Spectral Understanding and Visual Question Answering in Scientific Images


13. A Pattern Language for Resilient Visual Agents


14. Exploring Interaction Paradigms for LLM Agents in Scientific Visualization


15. D3-Gym: Constructing Real-World Verifiable Environments for Data-Driven Discovery


16. From LLM-Driven Trading Card Generation to Procedural Relatedness: A Pokémon Case Study


17. Splitting Assumption-Based Argumentation Frameworks


18. Language Models Refine Mechanical Linkage Designs Through Symbolic Reflection and Modular Optimisation


19. LLMs as ASP Programmers: Self-Correction Enables Task-Agnostic Nonmonotonic Reasoning


20. GUI Agents with Reinforcement Learning: Toward Digital Inhabitants


21. The Effects of Visual Priming on Cooperative Behavior in Vision-Language Models


22. A Collective Variational Principle Unifying Bayesian Inference, Game Theory, and Thermodynamics


23. MM-StanceDet: Retrieval-Augmented Multi-modal Multi-agent Stance Detection


24. Taming the Centaur(s) with LAPITHS: a framework for a theoretically grounded interpretation of AI performances


25. From Unstructured Recall to Schema-Grounded Memory: Reliable AI Memory via Iterative, Schema-Aware Extraction


26. Simulating clinical interventions with a generative multimodal model of human physiology


27. Graph World Models: Concepts, Taxonomy, and Future Directions


28. In-Context Prompting Obsoletes Agent Orchestration for Procedural Tasks


29. Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs


30. Modeling Clinical Concern Trajectories in Language Model Agents


31. KellyBench: A Benchmark for Long-Horizon Sequential Decision Making


32. Rethinking Agentic Reinforcement Learning In Large Language Models


33. A Grid-Aware Agent-Based Model for Analyzing Electric Vehicle Charging Systems


34. ObjectGraph: From Document Injection to Knowledge Traversal – A Native File Format for the Agentic Era


35. MCPHunt: An Evaluation Framework for Cross-Boundary Data Propagation in Multi-Server MCP Agents


36. Focus Session: Autonomous Systems Dependability in the era of AI: Design Challenges in Safety, Security, Reliability and Certification


37. Post-Optimization Adaptive Rank Allocation for LoRA


38. WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments


39. Intent2Tx: Benchmarking LLMs for Translating Natural Language Intents into Ethereum Transactions


40. Autonomous Traffic Signal Optimization Using Digital Twin and Agentic AI for Real-Time Decision-Making


41. Consumer Attitudes Towards AI in Digital Health: A Mixed-Methods Survey in Australia


42. Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering


43. Auditing Frontier Vision-Language Models for Trustworthy Medical VQA: Grounding Failures, Format Collapse, and Domain Adaptation


44. Knowledge Graph Representations for LLM-Based Policy Compliance Reasoning


45. Contextual Agentic Memory is a Memo, Not True Memory


46. Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents


47. When Agents Evolve, Institutions Follow


48. The TEA Nets framework combines AI and cognitive network science to model targets, events and actors in text


49. Fairness for distribution network operations and planning


50. From Context to Skills: Can Language Models Learn from Context Skillfully?


51. Optimization before Evaluation: Evaluation with Unoptimised Prompts Can be Misleading


52. Generative structure search for efficient and diverse discovery of molecular and crystal structures


53. Political Bias Audits of LLMs Capture Sycophancy to the Inferred Auditor


54. WaferSAGE: Large Language Model-Powered Wafer Defect Analysis via Synthetic Data Generation and Rubric-Guided Reinforcement Learning


55. Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs


56. Trace-Level Analysis of Information Contamination in Multi-Agent Systems


57. SpatialGrammar: A Domain-Specific Language for LLM-Based 3D Indoor Scene Generation


58. In-Context Examples Suppress Scientific Knowledge Recall in LLMs


59. Belief-Guided Inference Control for Large Language Model Services via Verifiable Observations


60. PRTS: A Primitive Reasoning and Tasking System via Contrastive Representations


61. InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?


62. Leading Across the Spectrum of Human-AI Relationships: A Conceptual Framework for Increasingly Heterogeneous Teams


63. Robust Learning on Heterogeneous Graphs with Heterophily: A Graph Structure Learning Approach


64. Measurement Risk in Supervised Financial NLP: Rubric and Metric Sensitivity on JF-ICR


65. TIO-SHACL: Comprehensive SHACL validation for TMF Intent Ontologies


66. Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems


67. CoAX: Cognitive-Oriented Attribution eXplanation User Model of Human Understanding of AI Explanations


68. Heterogeneous Scientific Foundation Model Collaboration


69. Investigating More Explainable and Partition-Free Compositionality Estimation for LLMs: A Rule-Generation Perspective


70. End-to-End Evaluation and Governance of an EHR-Embedded AI Agent for Clinicians


71. METASYMBO: Multi-Agent Language-Guided Metamaterial Discovery via Symbolic Latent Evolution


72. Machine Collective Intelligence for Explainable Scientific Discovery


73. Learning Rate Engineering: From Coarse Single Parameter to Layered Evolution


74. The Two Boundaries: Why Behavioral AI Governance Fails Structurally


75. Mechanized Foundations of Structural Governance: Machine-Checked Proofs for Governed Intelligence


76. The Inverse-Wisdom Law: Architectural Tribalism and the Consensus Paradox in Agentic Swarms


77. OptimusKG: Unifying biomedical knowledge in a modern multimodal graph


78. AutoSurfer – Teaching Web Agents through Comprehensive Surfing, Learning, and Modeling


79. Reinforced Agent: Inference-Time Feedback for Tool-Calling Agents


80. When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis


81. Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction


82. Toward Personalized Digital Twins for Cognitive Decline Assessment: A Multimodal, Uncertainty-Aware Framework


83. Evaluating TabPFN for Mild Cognitive Impairment to Alzheimer’s Disease Conversion in Data Limited Settings


84. Interval Orders, Biorders and Credibility-limited Belief Revision


85. Step-level Optimization for Efficient Computer-use Agents


86. Optimal Stop-Loss and Take-Profit Parameterization for Autonomous Trading Agent Swarm


87. Unpacking Vibe Coding: Help-Seeking Processes in Student-AI Interactions While Programming


88. TRUST: A Framework for Decentralized AI Service v.0.1


89. Unsupervised Electrofacies Classification and Porosity Characterization in the Offshore Keta Basin Using Wireline Logs


90. Think it, Run it: Autonomous ML pipeline generation via self-healing multi-agent AI


91. End-to-end autonomous scientific discovery on a real optical platform


92. When Your LLM Reaches End-of-Life: A Framework for Confident Model Migration in Production Systems


93. Binary Spiking Neural Networks as Causal Models


94. Compositional Meta-Learning for Mitigating Task Heterogeneity in Physics-Informed Neural Networks


95. Computing Equilibrium beyond Unilateral Deviation


96. PhyCo: Learning Controllable Physical Priors for Generative Motion


97. FlexiTac: A Low-Cost, Open-Source, Scalable Tactile Sensing Solution for Robotic Systems


98. Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows


99. Crab: A Semantics-Aware Checkpoint/Restore Runtime for Agent Sandboxes


100. Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection


101. AdvDMD: Adversarial Reward Meets DMD For High-Quality Few-Step Generation


102. PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement Learning


103. Do Sparse Autoencoders Capture Concept Manifolds?


104. DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures



106. TopBench: A Benchmark for Implicit Prediction and Reasoning over Tabular Question Answering


107. Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling


108. PROMISE-AD: Progression-aware Multi-horizon Survival Estimation for Alzheimer’s Disease Progression and Dynamic Tracking


109. To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems


110. MIFair: A Mutual-Information Framework for Intersectionality and Multiclass Fairness


111. Reliable Answers for Recurring Questions: Boosting Text-to-SQL Accuracy with Template Constrained Decoding


112. Design Structure Matrix Modularization with Large Language Models


113. Learning from Disagreement: Clinician Overrides as Implicit Preference Signals for Clinical AI in Value-Based Care


114. ITS-Mina: A Harris Hawks Optimization-Based All-MLP Framework with Iterative Refinement and External Attention for Multivariate Time Series Forecasting


115. TransVLM: A Vision-Language Framework and Benchmark for Detecting Any Shot Transitions


116. From Mirage to Grounding: Towards Reliable Multimodal Circuit-to-Verilog Code Generation


117. Attractor FCM


118. Training-Free Tunnel Defect Inspection and Engineering Interpretation via Visual Recalibration and Entity Reconstruction


119. Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future


120. Beyond Semantics: Measuring Fine-Grained Emotion Preservation in Small Language Model-Based Machine Translation


121. AI Inference as Relocatable Electricity Demand: A Latency-Constrained Energy-Geography Framework


122. NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains


123. CastFlow: Learning Role-Specialized Agentic Workflows for Time Series Forecasting


124. How Generative AI Disrupts Search: An Empirical Study of Google Search, Gemini, and AI Overviews


125. Test Before You Deploy: Governing Updates in the LLM Supply Chain


126. RuC: HDL-Agnostic Rule Completion Benchmark Generation


127. Instruction-Guided Poetry Generation in Arabic and Its Dialects


128. Learning to Reason: Targeted Knowledge Discovery and Fuzzy Logic Update for Robust Image Recognition


129. Position-Aware Drafting for Inference Acceleration in LLM-Based Generative List-Wise Recommendation


130. Why Self-Supervised Encoders Want to Be Normal


131. AgentEconomist: An End-to-end Agentic System Translating Economic Intuitions into Executable Computational Experiments


132. Deep Learning-Based Segmentation of Peritoneal Cancer Index Regions from CT Imaging


133. VibroML: an automated toolkit for high-throughput vibrational analysis and dynamic instability remediation of crystalline materials using machine-learned potentials


134. One Single Hub Text Breaks CLIP: Identifying Vulnerabilities in Cross-Modal Encoders via Hubness


135. When Does Structure Matter in Continual Learning? Dimensionality Controls When Modularity Shapes Representational Geometry


136. ANCORA: Learning to Question via Manifold-Anchored Self-Play for Verifiable Reasoning


137. HAVEN: Hybrid Automated Verification ENgine for UVM Testbench Synthesis with LLMs


138. Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior


139. Robust Lightweight Crack Classification for Real-Time UAV Bridge Inspection


140. ZAYAN: Disentangled Contrastive Transformer for Tabular Remote Sensing Data


141. ClipTBP: Clip-Pair based Temporal Boundary Prediction with Boundary-Aware Learning for Moment Retrieval


142. Statistical Channel Fingerprint Construction for Massive MIMO: A Unified Tensor Learning Framework


143. RIHA: Report-Image Hierarchical Alignment for Radiology Report Generation


144. Beyond the Training Distribution: Mapping Generalization Boundaries in Neural Program Synthesis


145. APPSI-139: A Parallel Corpus of English Application Privacy Policy Summarization and Interpretation


146. Knowledge Affordances for Hybrid Human-AI Information Seeking


147. Debiasing Reward Models via Causally Motivated Inference-Time Intervention


148. Security Attack and Defense Strategies for Autonomous Agent Frameworks: A Layered Review with OpenClaw as a Case Study


149. Improving Graph Few-shot Learning with Hyperbolic Space and Denoising Diffusion


150. RAY-TOLD: Ray-Based Latent Dynamics for Dense Dynamic Obstacle Avoidance with TDMPC


151. Sampler-Robust Optimization under Generative Models


152. ABC: Any-Subset Autoregression via Non-Markovian Diffusion Bridges in Continuous Time and Space


153. AdaBFL: Multi-Layer Defensive Adaptive Aggregation for Byzantine-Robust Federated Learning


154. Secret Stealing Attacks on Local LLM Fine-Tuning through Supply-Chain Model Code Backdoors


155. Beyond the Mean: Within-Model Reliable Change Detection for LLM Evaluation


156. COHERENCE: Benchmarking Fine-Grained Image-Text Alignment in Interleaved Multimodal Contexts


157. TypeBandit: Type-Level Context Allocation and Reweighting for Effective Attribute Completion in Heterogeneous Graph Neural Networks


158. Profiles of AI Dependency: A Latent Class Analysis of Filipino Students’ Academic Competencies


159. Exploring the Adoption Intention in Using AI-Enabled Educational Tools Among Preservice Teachers in the Philippines: A Partial-Least Square Modeling


160. Toward Autonomous SOC Operations: End-to-End LLM Framework for Threat Detection, Query Generation, and Resolution in Security Operations


161. Pragmos: A Process Agentic Modeling System


162. BoostLoRA: Growing Effective Rank by Boosting Adapters


163. Learning When to Remember: Risk-Sensitive Contextual Bandits for Abstention-Aware Memory Retrieval in LLM-Based Coding Agents


164. BrainDINO: A Brain MRI Foundation Model for Generalizable Clinical Representation Learning


165. Evaluating Epistemic Guardrails in AI Reading Assistants: A Behavioral Audit of a Minimal Prototype


166. When 2D Tasks Meet 1D Serialization: On Serialization Friction in Structured Tasks


167. From Prompt to Physical Actuation: Holistic Threat Modeling of LLM-Enabled Robotic Systems


168. Self-Evolving Software Agents


169. Towards Accelerated SCF Workflows with Equivariant Density-Matrix Learning and Analytic Refinement


170. Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models


171. Instruction Complexity Induces Positional Collapse in Adversarial LLM Evaluation


172. Addressing the Reality Gap: A Three-Tension Framework for Agentic AI Adoption


173. Upskilling with Generative AI: Practices and Challenges for Freelance Knowledge Workers


174. Theory Under Construction: Orchestrating Language Models for Research Software Where the Specification Evolves


175. Path-Lock Expert: Separating Reasoning Mode in Hybrid Thinking via Architecture-Level Separation


176. Learning to Spend: Model Predictive Control for Budgeting under Non-Stationary Returns


177. Preserving Temporal Dynamics in Time Series Generation


178. What Suppresses Nash Equilibrium Play in Large Language Models? Mechanistic Evidence and Causal Control


179. ConformaDecompose: Explaining Uncertainty via Calibration Localization


180. How to Guide Your Flow: Few-Step Alignment via Flow Map Reward Guidance


181. Enhancing Linux Privilege Escalation Attack Capabilities of Local LLM Agents


182. Lightweight Distillation of SAM 3 and DINOv3 for Edge-Deployable Individual-Level Livestock Monitoring and Longitudinal Visual Analytics


183. PALCAS: A Priority-Aware Intelligent Lane Change Advisory System for Autonomous Vehicles using Federated Reinforcement Learning


184. A Gated Hybrid Contrastive Collaborative Filtering Recommendation


185. Reconstruction by Generation: 3D Multi-Object Scene Reconstruction from Sparse Observations


186. Anomaly Detection in Soil Heavy Metal Contamination Using Unsupervised Learning for Environmental Risk Assessment


187. Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations


188. Efficient Training on Multiple Consumer GPUs with RoundPipe


189. Learning Rate Transfer in Normalized Transformers


190. Detecting Clinical Discrepancies in Health Coaching Agents: A Dual-Stream Memory and Reconciliation Architecture


191. NORACL: Neurogenesis for Oracle-free Resource-Adaptive Continual Learning


192. Automatic Causal Fairness Analysis with LLM-Generated Reporting


193. Beyond Accuracy: LLM Variability in Evidence Screening for Software Engineering SLRs


194. When Continual Learning Moves to Memory: A Study of Experience Reuse in LLM Agents


195. Entropy-Dominated Temporal Vocal Dynamics as Digital Biomarkers for Depression Detection


196. Agent Name Service (ANS): A Proof-of-Concept Trust Layer for Secure AI Agent Discovery, Identity, and Governance in Kubernetes


197. People-Centred Medical Image Analysis


198. Simple Self-Conditioning Adaptation for Masked Diffusion Models


199. Multibit neural inference in a N-ary crossbar architecture


200. Defeasible Conditional Obligation in a Two-tiered Preference-based Semantics (Extended Version)


201. Fitting Horn DL Ontologies to ABox and Query Examples: A Tale of Simulation Quantifiers and Finite Models


202. Not All Memories Age the Same: Autodiscovery of Adaptive Decay in Knowledge Graphs


203. AgenticRecTune: Multi-Agent with Self-Evolving Skillhub for Recommendation System Optimization


204. Predictive Multi-Tier Memory Management for KV Cache in Large-Scale GPU Inference


205. The Impact of AI-Generated Text on the Internet


206. Learning-to-Explain through 20Q Gaming: An Explainable Recommender for Cybersecurity Education


207. DeepTutor: Towards Agentic Personalized Tutoring


208. Static Program Slicing Using Language Models With Dataflow-Aware Pretraining and Constrained Decoding


209. LLM Biases


210. CareGuardAI: Context-Aware Multi-Agent Guardrails for Clinical Safety & Hallucination Mitigation in Patient-Facing LLMs


211. Designing Ethical Learning for Agentic AI: Toegye Yi Hwang’s Ethical Emotion Regulation Framework


212. Simulating Validity: Modal Decoupling in MLLM Generated Feedback on Science Drawings


213. Can AI be a moral victim? The role of moral patiency and ownership perceptions in ethical judgments of using AI-generated content


214. Policy-Governed LLM Routing with Intent Matching for Instrument Laboratories


215. The Impact of LLM Self-Consistency and Reasoning Effort on Automated Scoring Accuracy and Cost


216. Agentic Compilation: Mitigating the LLM Rerun Crisis for Minimized-Inference-Cost Web Automation


217. Culture-inspired Multi-modal Color Palette Generation and Colorization: A Chinese Youth Subculture Case