전체 AI 논문 - 2026-04-29

1. Recursive Multi-Agent Systems


2. ADEMA: A Knowledge-State Orchestration Architecture for Long-Horizon Knowledge Synthesis with LLMAgents


3. Semi-Markov Reinforcement Learning for City-Scale EV Ride-Hailing with Feasibility-Guaranteed Actions


4. Action-Aware Generative Sequence Modeling for Short Video Recommendation


5. TrialCalibre: A Fully Automated Causal Engine for RCT Benchmarking and Observational Trial Calibration


6. StratFormer: Adaptive Opponent Modeling and Exploitation in Imperfect-Information Games


7. QAROO: AI-Driven Online Task Offloading for Energy-Efficient and Sustainable MEC Networks


8. Toward Scalable Terminal Task Synthesis via Skill Graphs


9. Scalable Inference Architectures for Compound AI Systems: A Production Deployment Study


10. RADD: Retrieval-Augmented Discrete Diffusion for Multi-Modal Knowledge Graph Completion


11. Think Before You Act – A Neurocognitive Governance Model for Autonomous AI Agents


12. HotComment: A Benchmark for Evaluating Popularity of Online Comments


13. The Nonverbal Syntax Framework: An Evidence-Based Tiered System for Inferring Learner States from Observable Behavioral Cues


14. OxyGent: Making Multi-Agent Systems Modular, Observable, and Evolvable via Oxy Abstraction


15. DualFact+: A Multimodal Fact Verification Framework for Procedural Video Understanding


16. Sample-efficient Neuro-symbolic Proximal Policy Optimization


17. Automated Adversarial Collaboration for Advancing Theory Building in the Cognitive Sciences


18. PHISHREV: A Hybrid Machine Learning and Post-Hoc Non-monotonic Reasoning Framework for Context-Aware Phishing Website Classification


19. Improving Zero-Shot Offline RL via Behavioral Task Sampling


20. SciEval: A Benchmark for Automatic Evaluation of K-12 Science Instructional Materials


21. PI-TTA: Physics-Informed Source-Free Test-Time Adaptation for Robust Human Activity Recognition on Mobile Devices


22. JURY-RL: Votes Propose, Proofs Dispose for Label-Free RLVR


23. Multi-action Tangled Program Graphs for Multi-task Reinforcement Learning with Continuous Control


24. Plausible but Wrong: A case study on Agentic Failures in Astrophysical Workflows


25. AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery


26. ValueAlpha: Agreement-Gated Stress Testing of LLM-Judged Investment Rationales Before Returns Are Observable


27. DATAREEL: Automated Data-Driven Video Story Generation with Animations


28. From Insight to Action: A Novel Framework for Interpretability-Guided Data Selection in Large Language Models


29. Training Transformers as a Universal Computer


30. Semantic Layers for Reliable LLM-Powered Data Analytics: A Paired Benchmark of Accuracy and Hallucination Across Three Frontier Models


31. Doing More With Less: Revisiting the Effectiveness of LLM Pruning for Test-Time Scaling


32. Cooperate to Compete: Strategic Coordination in Multi-Agent Conquest


33. Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization


34. Evaluating Risks in Weak-to-Strong Alignment: A Bias-Variance Perspective


35. Leverage Laws: A Per-Task Framework for Human-Agent Collaboration


36. Toward a Science of Intent: Closure Gaps and Delegation Envelopes for Open-World AI Agents


37. Sparse Personalized Text Generation with Multi-Trajectory Reasoning


38. Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation


39. Adaptive Prompt Embedding Optimization for LLM Jailbreaking


40. S-SONDO: Self-Supervised Knowledge Distillation for General Audio Foundation Models


41. Latent Agents: A Post-Training Procedure for Internalized Multi-Agent Debate


42. Co-Director: Agentic Generative Video Storytelling


43. How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum


44. Toward a Functional Geometric Algebra for Natural Language Semantics


45. TSN-Affinity: Similarity-Driven Parameter Reuse for Continual Offline Reinforcement Learning


46. Three Models of RLHF Annotation: Extension, Evidence, and Authority


47. Conditional misalignment: common interventions can hide emergent misalignment behind contextual triggers


48. No Pedestrian Left Behind: Real-Time Detection and Tracking of Vulnerable Road Users for Adaptive Traffic Signal Control


49. When Errors Can Be Beneficial: A Categorization of Imperfect Rewards for Policy Gradient


50. RESTestBench: A Benchmark for Evaluating the Effectiveness of LLM-Generated REST API Test Cases from NL Requirements


51. Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling


52. Investigation into In-Context Learning Capabilities of Transformers


53. SIEVES: Selective Prediction Generalizes through Visual Evidence Scoring


54. G-Loss: Graph-Guided Fine-Tuning of Language Models


55. From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling


56. Towards Agentic Investigation of Security Alerts


57. PSI-Bench: Towards Clinically Grounded and Interpretable Evaluation of Depression Patient Simulators


58. MAIC-UI: Making Interactive Courseware with Generative UI


59. At the Edge of the Heart: ULP FPGA-Based CNN for On-Device Cardiac Feature Extraction in Smart Health Sensors for Astronauts


60. Sustained Gradient Alignment Mediates Subliminal Learning in a Multi-Step Setting: Evidence from MNIST Auxiliary Logit Distillation Experiment


61. Can Code Evaluation Metrics Detect Code Plagiarism?


62. CGU-ILALab at FoodBench-QA 2026: Comparing Traditional and LLM-based Approaches for Recipe Nutrient Estimation


63. Measuring the Sensitivity of Classification Models with the Error Sensitivity Profile


64. Threat-Oriented Digital Twinning for Security Evaluation of Autonomous Platforms


65. SAFEdit: Does Multi-Agent Decomposition Resolve the Reliability Challenges of Instructed Code Editing?


66. Verification of Neural Networks (Lecture Notes)


67. Cross-Lingual Jailbreak Detection via Semantic Codebooks


68. Learning Generalizable Multimodal Representations for Software Vulnerability Detection


69. Spreadsheet Modeling Experiments Using GPTs on Small Problem Statements and the Wall Task


70. CORAL: Adaptive Retrieval Loop for Culturally-Aligned Multilingual RAG


71. LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation


72. Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models


73. Large language models eroding science understanding: an experimental study


74. Health System Scale Semantic Search Across Unstructured Clinical Notes


75. Emotive Architectures: The Role of LLMs in Adjusting Work Environments


76. Walking Through Uncertainty: An Empirical Study of Uncertainty Estimation for Audio-Aware Large Language Models


77. Marco-MoE: Open Multilingual Mixture-of-Expert Language Models with Efficient Upcycling


78. Benchmarking bandgap prediction in semiconductors under experimental and realistic evaluation settings


79. SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents


80. From CRUD to Autonomous Agents: Formal Validation and Zero-Trust Security for Semantic Gateways in AI-Native Enterprise Systems


81. On Halting vs Converging in Recurrent Graph Neural Networks


82. Medoid Prototype Alignment for Cross-Plant Unknown Attack Detection in Industrial Control Systems


83. The Surprising Effectiveness of Canonical Knowledge Distillation for Semantic Segmentation


84. AI as Consumer and Participant: A Co-Design Agenda for MBSE Substrates and Methodology


85. Assistants, Not Architects: The Role of LLMs in Networked Systems Design


86. SymphonyGen: 3D Hierarchical Orchestral Generation with Controllable Harmony Skeleton


87. The Forensic Cost of Watermark Removal


88. From World-Gen to Quest-Line: A Dependency-Driven Prompt Pipeline for Coherent RPG Generation


89. DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing


90. An Investigation of Linguistic Biases in LLM-Based Recommendations


91. Generative UI as an Accessibility Bridge: Lessons from C2C E-Commerce


92. Do LLMs Capture Embodied Cognition and Cultural Variation? Cross-Linguistic Evidence from Demonstratives


93. FED-FSTQ: Fisher-Guided Token Quantization for Communication-Efficient Federated Fine-Tuning of LLMs on Edge Devices


94. One-shot emergency psychiatric triage across 15 frontier AI chatbots


95. Co-Writing with AI: An Empirical Study of Diverse Academic Writing Workflows


96. ML-SAN: Multi-Level Speaker-Adaptive Network for Emotion Recognition in Conversations


97. Safe-Support Q-Learning: Learning without Unsafe Exploration


98. CoRE: Concept-Reasoning Expansion for Continual Brain Lesion Segmentation


99. Language corpora for the Dutch medical domain


100. GPT-Image-2 in the Wild: A Twitter Dataset of Self-Reported AI-Generated Images from the First Week of Deployment


101. The Structured Output Benchmark: A Multi-Source Benchmark for Evaluating Structured Output Quality in Large Language Models


102. GraphPL: Leveraging GNN for Efficient and Robust Modalities Imputation in Patchwork Learning


103. A Faceted Proposal for Transparent Attribution of AI-Assisted Text Production


104. VAE-Inf: A statistically interpretable generative paradigm for imbalanced classification


105. AHASD: Asynchronous Heterogeneous Architecture for LLM Adaptive Drafting Speculative Decoding on Mobile Devices


106. R$^3$-SQL: Ranking Reward and Resampling for Text-to-SQL


107. Cutscene Agent: An LLM Agent Framework for Automated 3D Cutscene Generation


108. Faithfulness-QA: A Counterfactual Entity Substitution Dataset for Training Context-Faithful RAG Models


109. QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention


110. The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents



112. Spectral bandits


113. Dynamic UGV-UAV Cooperative Path Planning in Uncertain Environments


114. Below-Chance Blindness: Prompted Underperformance in Small LLMs Produces Positional Bias Rather than Answer Avoidance


115. DRAGON: A Benchmark for Evidence-Grounded Visual Reasoning over Diagrams


116. Value-Sensitive AI for Prayer: Balancing the Agencies Between Human and AI Agents in Spiritual Context


117. DiRe-RAPIDS: Topology-faithful dimensionality reduction at scale


118. BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate


119. Making AI-Assisted Grant Evaluation Auditable without Exposing the Model


120. Kohn-Sham Hamiltonian from Effective Field Theory: Quasiparticle Band Narrowing from Frozen Core Dynamics


121. How Can Reinforcement Learning Achieve Expert-level Placement?


122. Where Did It Go Wrong? Capability-Oriented Failure Attribution for Vision-and-Language Navigation Agents


123. The Role of Symmetry in Optimizing Overparameterized Networks


124. Gradient-Direction Sensitivity Reveals Linear-Centroid Coupling Hidden by Optimizer Trajectories


125. UnIte: Uncertainty-based Iterative Document Sampling for Domain Adaptation in Information Retrieval


126. Frictive Policy Optimization for LLMs: Epistemic Intervention, Risk-Sensitive Control, and Reflective Alignment


127. Towards Unified Multi-task EEG Analysis with Low-Rank Adaptation


128. M$^3$-VQA: A Benchmark for Multimodal, Multi-Entity, Multi-Hop Visual Question Answering


129. Knowledge Distillation Must Account for What It Loses


130. Structured Security Auditing and Robustness Enhancement for Untrusted Agent Skills


131. Optimally Auditing Adversarial Agents


132. Scalable Secure Biometric Authentication without Auxiliary Identifiers


133. Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver


134. Analyzing LLM Reasoning to Uncover Mental Health Stigma


135. Barriers and Enablers of Online Instruction in Hospitality Education in the Philippines: An Exploratory Study


136. Dual-Track CoT: Budget-Aware Stepwise Guidance for Small LMs


137. Faithful Autoformalization via Roundtrip Verification and Repair


138. Internet of Everything in the 6G Era: Paradigms, Enablers, Potentials and Future Directions


139. EVT-Based Generative AI for Tail-Aware Channel Estimation


140. BifDet: A 3D Bifurcation Detection Dataset for Airway-Tree Modeling


141. Compute Aligned Training: Optimizing for Test Time Inference


142. BenchGuard: Who Guards the Benchmarks? Automated Auditing of LLM Agent Benchmarks


143. Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence


144. ViPO: Visual Preference Optimization at Scale


145. Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization


146. ADE: Adaptive Dictionary Embeddings – Scaling Multi-Anchor Representations to Large Language Models


147. Rethinking Layer Redundancy in Large Language Models: Calibration Objectives and Search for Depth Pruning


148. GAIA-v2-LILT: Multilingual Adaptation of Agent Benchmark beyond Translation


149. Large Language Models Explore by Latent Distilling


150. Libra-VLA: Achieving Learning Equilibrium via Asynchronous Coarse-to-Fine Dual-System


151. SUDP: Secret-Use Delegation Protocol for Agentic Systems


152. asRoBallet: Closing the Sim2Real Gap via Friction-Aware Reinforcement Learning for Underactuated Spherical Dynamics


153. Learning with Embedded Linear Equality Constraints via Variational Bayesian Inference


154. MultiHedge: Adaptive Coordination via Retrieval-Augmented Control


155. Transformer Approximations from ReLUs


156. Learning Illumination Control in Diffusion Models


157. spectroxide: A code package for computing cosmic microwave background spectral distortions


158. MotionBricks: Scalable Real-Time Motions with Modular Latent Generative Model and Smart Primitives


159. On the Trainability of Masked Diffusion Language Models via Blockwise Locality


160. Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity


161. A Comparative Evaluation of AI Agent Security Guardrails


162. Salca: A Sparsity-Aware Hardware Accelerator for Efficient Long-Context Attention Decoding


163. Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora


164. SWE-QA: A Dataset and Benchmark for Complex Code Understanding


165. Time-varying Interaction Graph ODE for Dynamic Graph Representation Learning


166. A Comparative Analysis on the Performance of Upper Confidence Bound Algorithms in Adaptive Deep Neural Networks


167. Nautile-370M: Spectral Memory Meets Attention in a Small Reasoning Model


168. ITAS: A Multi-Agent Architecture for LLM-Based Intelligent Tutoring


169. From Prototype to Classroom: An Intelligent Tutoring System for Quantum Education


170. Versioned Late Materialization for Ultra-Long Sequence Training in Recommendation Systems at Scale


171. Architecture Determines Observability in Transformers


172. V.O.I.C.E (Voice, Ownership, Identity, Control, Expression): Risk Taxonomy of Synthetic Voice Generation From Empirical Data


173. Semantic Denial of Service in LLM-controlled robots


174. Liquid Neural Network Models for Natural Gas Spot Price Time-Series Forecasting


175. Cloud to Edge: Benchmarking LLM Inference On Hardware-Accelerated Single-Board Computers


176. Comparative Study of Bending Analysis using Physics-Informed Neural Networks and Numerical Dynamic Deflection in Perforated nanobeam


177. GCA-BULF: A Bottom-Up Framework for Short-Term Load Forecasting Using Grouped Critical Appliances


178. Back to Repair: A Minimal Denoising Network for Time Series Anomaly Detection