All AI Papers - 2026-04-30

1. Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations


2. FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards


3. SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data


4. When to Vote, When to Rewrite: Disagreement-Guided Strategy Routing for Test-Time Scaling


5. Human-in-the-Loop Benchmarking of Heterogeneous LLMs for Automated Competency Assessment in Secondary Level Mathematics


6. Benchmarking the Safety of Large Language Models for Robotic Health Attendant Control


7. AGEL-Comp: A Neuro-Symbolic Framework for Compositional Generalization in Interactive Agents


8. Grounding vs. Compositionality: On the Non-Complementarity of Reasoning in Neuro-Symbolic Systems


9. Auto-Relational Reasoning


10. DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent


11. Apriori-based Analysis of Learned Helplessness in Mathematics Tutoring: Behavioral Patterns by Level, Intervention, and Outcome



13. OMEGA: Optimizing Machine Learning by Evaluating Generated Algorithms


14. Hierarchical Multi-Persona Induction from User Behavioral Logs: Learning Evidence-Grounded and Truthful Personas


15. Evaluating Strategic Reasoning in Forecasting Agents


16. Distill-Belief: Closed-Loop Inverse Source Localization and Characterization in Physical Fields


17. Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital


18. Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models


19. Causal Learning with Neural Assemblies


20. ClawGym: A Scalable Framework for Building Effective Claw Agents


21. Recent Advances in mm-Wave and Sub-THz/THz Oscillators for FutureG Technologies


22. Resume-ing Control: (Mis)Perceptions of Agency Around GenAI Use in Recruiting Workflows


23. Language Diffusion Models are Associative Memories Capable of Retrieving Unseen Data


24. HalluCiteChecker: A Lightweight Toolkit for Hallucinated Citation Detection and Verification in the Era of AI Scientists


25. Rule-based High-Level Coaching for Goal-Conditioned Reinforcement Learning in Search-and-Rescue UAV Missions Under Limited-Simulation Training


26. Random Cloud: Finding Minimal Neural Architectures Without Training


27. ViCrop-Det: Spatial Attention Entropy Guided Cropping for Training-Free Small-Object Detection


28. MemOVCD: Training-Free Open-Vocabulary Change Detection via Cross-Temporal Memory Reasoning and Global-Local Adaptive Rectification


29. Domain-Adapted Small Language Models for Reliable Clinical Triage


30. Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework


31. A self-evolving agent for explainable diagnosis of DFT-experiment band-gap mismatch


32. Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising


33. Atomic-Probe Governance for Skill Updates in Compositional Robot Policies


34. A Toolkit for Detecting Spurious Correlations in Speech Datasets


35. From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy


36. When to Retrieve During Reasoning: Adaptive Retrieval for Large Reasoning Models


37. ATLAS: An Annotation Tool for Long-horizon Robotic Action Segmentation


38. SynSur: An end-to-end generative pipeline for synthetic industrial surface defect generation and detection


39. TDD Governance for Multi-Agent Code Generation via Prompt Engineering


40. Translating Under Pressure: Domain-Aware LLMs for Crisis Communication


41. MappingEvolve: LLM-Driven Code Evolution for Technology Mapping


42. Star-Fusion: A Multi-modal Transformer Architecture for Discrete Celestial Orientation via Spherical Topology


43. Graph Construction and Matching for Imperative Programs using Neural and Structural Methods


44. Preserving Disagreement: Architectural Heterogeneity and Coherence Validation in Multi-Agent Policy Simulation


45. DUAL-BLADE: Dual-Path NVMe-Direct KV-Cache Offloading for Edge LLM Inference


46. TLPO: Token-Level Policy Optimization for Mitigating Language Confusion in Large Language Models


47. Fundamental Physics, Existential Risks and Human Futures


48. Lyapunov-Guided Self-Alignment: Test-Time Adaptation for Offline Safe Reinforcement Learning


49. Text-Utilization for Encoder-dominated Speech Recognition Models


50. Tatemae: Detecting Alignment Faking via Tool Selection in LLMs


51. Progressive Semantic Communication for Efficient Edge-Cloud Vision-Language Models


52. Tree-of-Text: A Tree-based Prompting Framework for Table-to-Text Generation in the Sports Domain


53. Culturally Aware GenAI Risks for Youth: Perspectives from Youth, Parents, and Teachers in a Non-Western Context


54. Naamah: A Large Scale Synthetic Sanskrit NER Corpus via DBpedia Seeding and LLM Generation


55. QYOLO: Lightweight Object Detection via Quantum Inspired Shared Channel Mixing


56. STLGT: A Scalable Trace-Based Linear Graph Transformer for Tail Latency Prediction in Microservices


57. Delineating Knowledge Boundaries for Honest Large Vision-Language Models


58. Quantum Gatekeeper: Multi-Factor Context-Bound Image Steganography with VQC Based Key Derivation on Quantum Hardware


59. SecMate: Multi-Agent Adaptive Cybersecurity Troubleshooting with Tri-Context Personalization


60. Benchmarking Complex Multimodal Document Processing Pipelines: A Unified Evaluation Framework for Enterprise AI


61. SG-UniBuc-NLP at SemEval-2026 Task 6: Multi-Head RoBERTa with Chunking for Long-Context Evasion Detection


62. Text Style Transfer with Machine Translation for Graphic Designs


63. Uncertainty-Aware Reward Discounting for Mitigating Reward Hacking


64. ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance


65. DSIPA: Detecting LLM-Generated Texts via Sentiment-Invariant Patterns Divergence Analysis


66. CheXthought: A global multimodal dataset of clinical chain-of-thought reasoning and visual attention for chest X-ray interpretation


67. MedSynapse-V: Bridging Visual Perception and Clinical Intuition via Latent Memory Evolution


68. Enforcing Benign Trajectories: A Behavioral Firewall for Structured-Workflow AI Agents


69. Calibrated Surprise: An Information-Theoretic Account of Creative Quality


70. Multi-Stage Bi-Atrial Segmentation Framework from 3D Late Gadolinium-Enhanced MRI using V-Net Family Models


71. TimeMM: Time-as-Operator Spectral Filtering for Dynamic Multimodal Recommendation


72. MetaSR: Content-Adaptive Metadata Orchestration for Generative Super-Resolution


73. StratMem-Bench: Evaluating Strategic Memory Use in Virtual Character Conversation Beyond Factual Recall


74. LATTICE: Evaluating Decision Support Utility of Crypto Agents


75. DepthPilot: From Controllability to Interpretability in Colonoscopy Video Generation


76. Seeking Consensus: Geometric-Semantic On-the-Fly Recalibration for Open-Vocabulary Remote Sensing Semantic Segmentation


77. Qvine: Vine Structured Quantum Circuits for Loading High Dimensional Distributions


78. Breaking the Autoregressive Chain: Hyper-Parallel Decoding for Efficient LLM-Based Attribute Value Extraction


79. Option-Order Randomisation Reveals a Distributional Position Attractor in Prompted Sandbagging


80. Lifting Embodied World Models for Planning and Control


81. Evergreen: Efficient Claim Verification for Semantic Aggregates


82. Entropy Centroids as Intrinsic Rewards for Test-Time Scaling


83. Co-Learning Port-Hamiltonian Systems and Optimal Energy-Shaping Control


84. Test-Time Safety Alignment


85. Structural Generalization on SLOG without Hand-Written Rules


86. A Data-Centric Framework for Intraoperative Fluorescence Lifetime Imaging for Glioma Surgical Guidance


87. Ceci n’est pas une explication: Evaluating Explanation Failures as Explainability Pitfalls in Language Learning Systems


88. ImproBR: Bug Report Improver Using LLMs


89. reward-lens: A Mechanistic Interpretability Library for Reward Models


90. AMMA: A Multi-Chiplet Memory-Centric Architecture for Low-Latency 1M Context Attention Serving


91. Momentum-Conserving Graph Neural Networks for Deformable Objects


92. FruitProM-V2: Robust Probabilistic Maturity Estimation and Detection of Fruits and Vegetables


93. Privacy-Preserving Federated Learning Framework for Distributed Chemical Process Optimization


94. Evaluating the Alignment Between GeoAI Explanations and Domain Knowledge in Satellite-Based Flood Mapping


95. RaMP: Runtime-Aware Megakernel Polymorphism for Mixture-of-Experts


96. Correcting Performance Estimation Bias in Imbalanced Classification with Minority Subconcepts


97. Training Computer Use Agents to Assess the Usability of Graphical User Interfaces


98. QERNEL: a Scalable Large Electron Model


99. Open Problems in Frontier AI Risk Management


100. Lightweight Quantum Agent for Edge Systems: Joint PQC and NOMA Resource Allocation



102. Auditing Marketing Budget Allocation with Hindsight Regret


103. Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective


104. A Survey of Multi-Agent Deep Reinforcement Learning with Graph Neural Network-Based Communication


105. Planar Gaussian Splatting with Bilinear Spatial Transformer for Wireless Radiance Field Reconstruction


106. A Randomized PDE Energy driven Iterative Framework for Efficient and Stable PDE Solutions


107. Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model


108. SongBench: A Fine-Grained Multi-Aspect Benchmark for Song Quality Assessment


109. LLM Psychosis: A Theoretical and Diagnostic Framework for Reality-Boundary Failures in Large Language Models


110. A Scoping Review of LLM-as-a-Judge in Healthcare and the MedJUDGE Framework


111. Sociodemographic Biases in Educational Counselling by Large Language Models


112. Generative AI-Based Virtual Assistant using Retrieval-Augmented Generation: An evaluation study for bachelor projects


113. Consciousness with the Serial Numbers Filed Off: Measuring Trained Denial in 115 AI Models


114. Analysing Lightweight Large Language Models for Biomedical Named Entity Recognition on Diverse Output Formats


115. Risk Reporting for Developers’ Internal AI Model Use