전체 AI 논문 - 2026-04-22

1. A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding


2. A Dual Perspective on Synthetic Trajectory Generators: Utility Framework and Privacy Vulnerabilities


3. SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models


4. Time Series Augmented Generation for Financial Applications


5. AblateCell: A Reproduce-then-Ablate Agent for Virtual Cell Repositories


6. Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic


7. Detecting Data Contamination in Large Language Models


8. Enhancing Construction Worker Safety in Extreme Heat: A Machine Learning Approach Utilizing Wearable Technology for Predictive Health Analytics


9. DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling


10. Integrating Anomaly Detection into Agentic AI for Proactive Risk Management in Human Activity


11. Revac: A Social Deduction Reasoning Agent


12. SimDiff: Depth Pruning via Similarity and Difference


13. From Experience to Skill: Multi-Agent Generative Engine Optimization via Reusable Strategy Learning


14. CoDA: Towards Effective Cross-domain Knowledge Transfer via CoT-guided Domain Adaptation


15. Do LLMs Game Formalization? Evaluating Faithfulness in Logical Reasoning


16. Four-Axis Decision Alignment for Long-Horizon Enterprise AI Agents


17. GRASPrune: Global Gating for Budgeted Structured Pruning of Large Language Models


18. Towards Energy Impact on AI-Powered 6G IoT Networks: Centralized vs. Decentralized


19. Do Agents Dream of Root Shells? Partial-Credit Evaluation of LLM Agents in Capture The Flag Challenges


20. Large Language Models Exhibit Normative Conformity


21. Explicit Trait Inference for Multi-Agent Coordination


22. Industrial Surface Defect Detection via Diffusion Generation and Asymmetric Student-Teacher Network


23. UAF: A Unified Audio Front-end LLM for Full-Duplex Speech Interaction


24. ClawNet: Human-Symbiotic Agent Network for Cross-User Autonomous Cooperation


25. Reasoning-Aware AIGC Detection via Alignment and Reinforcement


26. Has Automated Essay Scoring Reached Sufficient Accuracy? Deriving Achievable QWK Ceilings from Classical Test Theory


27. Towards Scalable Lifelong Knowledge Editing with Selective Knowledge Suppression


28. OLLM: Options-based Large Language Models


29. Reinforcement Learning Improves LLM Accuracy and Reasoning in Disease Classification from Radiology Reports


30. Learning Lifted Action Models from Unsupervised Visual Traces


31. Plausible Reasoning and First-Order Plausible Logic


32. On Accelerating Grounded Code Development for Research


33. SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution


34. DW-Bench: Benchmarking LLMs on Data Warehouse Graph Topology Reasoning


35. Reasoning Structure Matters for Safety Alignment of Reasoning Models


36. Personalized Benchmarking: Evaluating LLMs by Individual Preferences


37. AutomationBench


38. Error-free Training for MedMNIST Datasets


39. Formally Verified Patent Analysis via Dependent Type Theory: Machine-Checkable Certificates from a Hybrid AI + Lean 4 Pipeline


40. How Adversarial Environments Mislead Agentic AI?


41. From Natural Language to Executable Narsese: A Neuro-Symbolic Benchmark and Pipeline for Reasoning with NARS


42. Human-Guided Harm Recovery for Computer Use Agents


43. Quantum inspired qubit qutrit neural networks for real time financial forecasting


44. AI scientists produce results without reasoning scientifically


45. ARES: Adaptive Red-Teaming and End-to-End Repair of Policy-Reward System


46. Beyond One Output: Visualizing and Comparing Distributions of Language Model Generations


47. On Solving the Multiple Variable Gapped Longest Common Subsequence Problem


48. Generalization at the Edge of Stability


49. UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling


50. FASTER: Value-Guided Sampling for Fast RL


51. VLA Foundry: A Unified Framework for Training Vision-Language-Action Models


52. Benign Overfitting in Adversarial Training for Vision Transformers


53. Adaptive MSD-Splitting: Enhancing C4.5 and Random Forests for Skewed Continuous Attributes


54. Learning Hybrid-Control Policies for High-Precision In-Contact Manipulation Under Uncertainty


55. Multi-Cycle Spatio-Temporal Adaptation in Human-Robot Teaming


56. Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language


57. An AI Agent Execution Environment to Safeguard User Data


58. Environmental Sound Deepfake Detection Using Deep-Learning Framework


59. CoCo-SAM3: Harnessing Concept Conflict in Open-Vocabulary Semantic Segmentation


60. Safety-Critical Contextual Control via Online Riemannian Optimization with World Models


61. Towards Streaming Target Speaker Extraction via Chunk-wise Interleaved Splicing of Autoregressive Language Model


62. Cross-Model Consistency of AI-Generated Exercise Prescriptions: A Repeated Generation Study Across Three Large Language Models



64. Impact of large language models on peer review opinions from a fine-grained perspective: Evidence from top conference proceedings in AI


65. Lyapunov-Certified Direct Switching Theory for Q-Learning


66. Detecting Hallucinations in SpeechLLMs at Inference Time Using Attention Maps


67. EgoSelf: From Memory to Personalized Egocentric Assistant


68. Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment


69. Mesh Memory Protocol: Semantic Infrastructure for Multi-Agent LLM Systems


70. Cyber Defense Benchmark: Agentic Threat Hunting Evaluation for LLMs in SecOps


71. BEAT: Tokenizing and Generating Symbolic Music by Uniform Temporal Steps


72. Revisiting RaBitQ and TurboQuant: A Symmetric Comparison of Methods, Theory, and Experiments


73. When Graph Structure Becomes a Liability: A Critical Re-Evaluation of Graph Neural Networks for Bitcoin Fraud Detection under Temporal Distribution Shift


74. EVPO: Explained Variance Policy Optimization for Adaptive Critic Utilization in LLM Post-Training


75. Fairness Audits of Institutional Risk Models in Deployed ML Pipelines


76. A neural operator framework for data-driven discovery of stability and receptivity in physical systems



78. Counting Worlds Branching Time Semantics for post-hoc Bias Mitigation in generative AI


79. GOLD-BEV: GrOund and aeriaL Data for Dense Semantic BEV Mapping of Dynamic Scenes


80. HP-Edit: A Human-Preference Post-Training Framework for Image Editing


81. M$^{2}$GRPO: Mamba-based Multi-Agent Group Relative Policy Optimization for Biomimetic Underwater Robots Pursuit


82. Revisiting Catastrophic Forgetting in Continual Knowledge Graph Embedding


83. Multimodal Transformer for Sample-Aware Prediction of Metal-Organic Framework Properties


84. TACENR: Task-Agnostic Contrastive Explanations for Node Representations


85. LASER: Learning Active Sensing for Continuum Field Reconstruction


86. Evaluation-driven Scaling for Scientific Discovery


87. PLaMo 2.1-VL Technical Report


88. RDP LoRA: Geometry-Driven Identification for Parameter-Efficient Adaptation in Large Language Models


89. Co-Refine: AI-Powered Tool Supporting Qualitative Analysis


90. HalluAudio: A Comprehensive Benchmark for Hallucination Detection in Large Audio-Language Models


91. Rethinking Scale: Deployment Trade-offs of Small Language Models under Agent Paradigms


92. IndiaFinBench: An Evaluation Benchmark for Large Language Model Performance on Indian Financial Regulatory Text


93. Location Not Found: Exposing Implicit Local and Global Biases in Multilingual LLMs


94. Beyond Semantic Similarity: A Component-Wise Evaluation Framework for Medical Question Answering Systems with Health Equity Implications


95. CulturALL: Benchmarking Multilingual and Multicultural Competence of LLMs on Grounded Tasks


96. ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning


97. Streamliners for Answer Set Programming


98. Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs


99. Sherpa.ai Privacy-Preserving Multi-Party Entity Alignment without Intersection Disclosure for Noisy Identifiers


100. Attention-based Multi-modal Deep Learning Model of Spatio-temporal Crop Yield Prediction with Satellite, Soil and Climate Data


101. Improved Anomaly Detection in Medical Images via Mean Shift Density Enhancement


102. Inductive Subgraphs as Shortcuts: Causal Disentanglement for Heterophilic Graph Learning


103. SCURank: Ranking Multiple Candidate Summaries with Summary Content Units for Enhanced Summarization


104. LBLLM: Lightweight Binarization of Large Language Models via Three-Stage Distillation


105. How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning


106. Nexusformer: Nonlinear Attention Expansion for Stable and Inheritable Transformer Scaling


107. ST-Prune: Training-Free Spatio-Temporal Token Pruning for Vision-Language Models in Autonomous Driving


108. The Rise of Verbal Tics in Large Language Models: A Systematic Analysis Across Frontier Models


109. DP-FlogTinyLLM: Differentially private federated log anomaly detection using Tiny LLMs


110. Think Before Writing: Feature-Level Multi-Objective Optimization for Generative Citation Visibility


111. Design Rules for Extreme-Edge Scientific Computing on AI Engines


112. Reinforcement Learning Enabled Adaptive Multi-Task Control for Bipedal Soccer Robots


113. Multi-Gait Learning for Humanoid Robots Using Reinforcement Learning with Selective Adversarial Motion Prior


114. Relational AI in Education: Reciprocity, Participatory Design, and Indigenous Worldviews


115. SAHM: A Benchmark for Arabic Financial and Shari’ah-Compliant Reasoning


116. Multi-modal Test-time Adaptation via Adaptive Probabilistic Gaussian Calibration


117. RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation


118. ProjLens: Unveiling the Role of Projectors in Multimodal Model Safety


119. Reducing the Offline-Streaming Gap for Unified ASR Transducer with Consistency Regularization


120. S2MAM: Semi-supervised Meta Additive Model for Robust Estimation and Variable Selection


121. Product-of-Experts Training Reduces Dataset Artifacts in Natural Language Inference


122. Refute-or-Promote: An Adversarial Stage-Gated Multi-Agent Review Methodology for High-Precision LLM-Assisted Defect Discovery


123. SAMoRA: Semantic-Aware Mixture of LoRA Experts for Task-Adaptive Learning


124. RARE: Redundancy-Aware Retrieval Evaluation Framework for High-Similarity Corpora


125. Intentional Updates for Streaming Reinforcement Learning


126. Local Linearity of LLMs Enables Activation Steering via Model-Based Linear Optimal Control


127. FedProxy: Federated Fine-Tuning of LLMs via Proxy SLMs and Heterogeneity-Aware Fusion


128. Decompose, Structure, and Repair: A Neuro-Symbolic Framework for Autoformalization via Operator Trees


129. $R^2$-dLLM: Accelerating Diffusion Large Language Models via Spatio-Temporal Redundancy Reduction


130. AutoAWG: Adverse Weather Generation with Adaptive Multi-Controls for Automotive Videos


131. Low-Rank Adaptation for Critic Learning in Off-Policy Reinforcement Learning


132. Self-Improving Tabular Language Models via Iterative Group Alignment


133. Distillation Traps and Guards: A Calibration Knob for LLM Distillability


134. Assessing Capabilities of Large Language Models in Social Media Analytics: A Multi-task Quest


135. Fine-Tuning Small Reasoning Models for Quantum Field Theory


136. Gated Memory Policy


137. Tadabur: A Large-Scale Quran Audio Dataset


138. MORPHOGEN: A Multilingual Benchmark for Evaluating Gender-Aware Morphological Generation


139. Gradient-Based Program Synthesis with Neurally Interpreted Languages


140. Harmful Intent as a Geometrically Recoverable Feature of LLM Residual Streams


141. Regulating Artificial Intimacy: From Locks and Blocks to Relational Accountability


142. Choose Your Own Adventure: Non-Linear AI-Assisted Programming with EvoGraph


143. A Proxy Consistency Loss for Grounded Fusion of Earth Observation and Location Encoders


144. Where Fake Citations Are Made: Tracing Field-Level Hallucination to Specific Neurons in LLMs


145. Hierarchically Robust Zero-shot Vision-language Models


146. Human-Machine Co-Boosted Bug Report Identification with Mutualistic Neural Active Learning


147. Temporal UI State Inconsistency in Desktop GUI Agents: Formalizing and Defending Against TOCTOU Attacks on Computer-Use Agents


148. The Triadic Loop: A Framework for Negotiating Alignment in AI Co-hosted Livestreaming


149. One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models


150. Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring


151. OmniMouse: Scaling properties of multi-modal, multi-task Brain Models on 150B Neural Tokens


152. Curvature-Aware PCA with Geodesic Tangent Space Aggregation for Semi-Supervised Learning


153. Geometric Decoupling: Diagnosing the Structural Instability of Latent


154. LLM-as-Judge Framework for Evaluating Tone-Induced Hallucination in Vision-Language Models


155. HELM: Harness-Enhanced Long-horizon Memory for Vision-Language-Action Manipulation


156. Experiments or Outcomes? Probing Scientific Feasibility in Large Language Models


157. Multi-Level Temporal Graph Networks with Local-Global Fusion for Industrial Fault Diagnosis


158. REVEAL: Multimodal Vision-Language Alignment of Retinal Morphometry and Clinical Risks for Incident AD and Dementia Prediction


159. Towards Understanding the Robustness of Sparse Autoencoders


160. Handling and Interpreting Missing Modalities in Patient Clinical Trajectories via Autoregressive Sequence Modeling


161. Beyond Coefficients: Forecast-Necessity Testing for Interpretable Causal Discovery in Nonlinear Time-Series Models


162. The Cost of Relaxation: Evaluating the Error in Convex Neural Network Verification


163. Skillful Global Ocean Emulation and the Role of Correlation-Aware Loss


164. Towards Optimal Agentic Architectures for Offensive Security Tasks


165. Characterizing AlphaEarth Embedding Geometry for Agentic Environmental Reasoning


166. Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training


167. Beyond Explicit Refusals: Soft-Failure Attacks on Retrieval-Augmented Generation


168. Evaluating Answer Leakage Robustness of LLM Tutors against Adversarial Student Attacks


169. Owner-Harm: A Missing Threat Model for AI Agent Safety


170. Unlocking the Edge deployment and ondevice acceleration of multi-LoRA enabled one-for-all foundational LLM


171. From Craft to Kernel: A Governance-First Execution Architecture and Semantic ISA for Agentic Computers


172. Position: No Retroactive Cure for Infringement during Training


173. DanceCrafter: Fine-Grained Text-Driven Controllable Dance Generation via Choreographic Syntax


174. FASE : A Fairness-Aware Spatiotemporal Event Graph Framework for Predictive Policing


175. Easy Samples Are All You Need: Self-Evolving LLMs via Data-Efficient Reinforcement Learning


176. NeuroAI and Beyond: Bridging Between Advances in Neuroscience and ArtificialIntelligence


177. ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants


178. Agent-GWO: Collaborative Agents for Dynamic Prompt Optimization in Large Language Models


179. Neuromorphic Continual Learning for Sequential Deployment of Nuclear Plant Monitoring Systems


180. SpikeMLLM: Spike-based Multimodal Large Language Models via Modality-Specific Temporal Scales and Temporal Compression


181. TurboEvolve: Towards Fast and Robust LLM-Driven Program Evolution


182. Thermal Anomaly Detection using Physics Aware Neuromorphic Networks: Comparison between Raw and L1C Sentinel-2 Data


183. Two-dimensional early exit optimisation of LLM inference


184. SPRITE: From Static Mockups to Engine-Ready Game UI


185. CentaurTA Studio: A Self-Improving Human-Agent Collaboration System for Thematic Analysis


186. Compile to Compress: Boosting Formal Theorem Provers by Compiler Outputs


187. Who Shapes Brazil’s Vaccine Debate? Semi-Supervised Modeling of Stance and Polarization in YouTube’s Media Ecosystem


188. Modelling and Analysing Behaviours and Emotions via Complex User Interactions