All AI Papers - 2026-03-16

1. Semantic Invariance in Agentic AI


2. Developing and evaluating a chatbot to support maternal health care


3. When Right Meets Wrong: Bilateral Context Conditioning with Reward-Confidence Correction for GRPO


4. Steve-Evolving: Open-World Embodied Self-Evolution via Fine-Grained Diagnosis and Dual-Track Knowledge Distillation


5. Beyond Final Answers: CRYSTAL Benchmark for Transparent Multimodal Reasoning Evaluation


6. Structured Distillation for Personalized Agent Memory: 11x Token Reduction with Retrieval Preservation


7. Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization


8. ODRL Policy Comparison Through Normalisation


9. Context is all you need: Towards autonomous model-based process design using agentic AI in flowsheet simulations


10. AI Model Modulation with Logits Redistribution


11. ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning


12. On Using Machine Learning to Early Detect Catastrophic Failures in Marine Diesel Engines


13. AI Planning Framework for LLM-Based Web Agents


14. Generating Expressive and Customizable Evals for Timeseries Data Analysis Agents with AgentFuel


15. Efficient Reasoning with Balanced Thinking


16. Context-Enriched Natural Language Descriptions of Vessel Trajectories


17. PhysMoDPO: Physically-Plausible Humanoid Motion with Preference Optimization


18. Visual-ERM: Reward Modeling for Visual Equivalence


19. From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research


20. LLM Constitutional Multi-Agent Governance


21. Learnability and Privacy Vulnerability are Entangled in a Few Critical Weights


22. MXNorm: Reusing MXFP block scales for efficient tensor normalisation


23. Clustering Astronomical Orbital Synthetic Data Using Advanced Feature Extraction and Dimensionality Reduction Techniques


24. ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation


25. Developing the PsyCogMetrics AI Lab to Evaluate Large Language Models and Advance Cognitive Science – A Three-Cycle Action Design Science Study


26. Geometry-Guided Camera Motion Understanding in VideoLLMs


27. BoSS: A Best-of-Strategies Selector as an Oracle for Deep Active Learning


28. Evaluating VLMs’ Spatial Reasoning Over Robot Motion: A Step Towards Robot Planning with Motion Preferences


29. Human-in-the-Loop LLM Grading for Handwritten Mathematics Assessments


30. GeoChemAD: Benchmarking Unsupervised Geochemical Anomaly Detection for Mineral Exploration


31. L2GTX: From Local to Global Time Series Explanations


32. Competition-Aware CPC Forecasting with Near-Market Coverage


33. Team RAS in 10th ABAW Competition: Multimodal Valence and Arousal Estimation Approach


34. Are General-Purpose Vision Models All We Need for 2D Medical Image Segmentation? A Cross-Dataset Empirical Study


35. Interrogating Design Homogenization in Web Vibe Coding


36. Purify Once, Edit Freely: Breaking Image Protections under Model Mismatch


37. SortScrews: A Dataset and Baseline for Real-time Screw Classification


38. SAW: Toward a Surgical Action World Model via Controllable and Scalable Video Generation


39. daVinci-Env: Open SWE Environment Synthesis at Scale


40. ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning


41. Fair Lung Disease Diagnosis from Chest CT via Gender-Adversarial Attention Multiple Instance Learning


42. Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation


43. Efficient Real-World Autonomous Racing via Attenuated Residual Policy Optimization


44. Delta1 with LLM: symbolic and neural integration for credible and explainable reasoning


45. Thinking in Streaming Video


46. Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection


47. Stake the Points: Structure-Faithful Instance Unlearning


48. FedBPrompt: Federated Domain Generalization Person Re-Identification via Body Distribution Aware Visual Prompts


49. Learning from Child-Directed Speech in Two-Language Scenarios: A French-English Case Study


50. Human-Centered Evaluation of an LLM-Based Process Modeling Copilot: A Mixed-Methods Study with Domain Experts


51. Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models


52. Team LEYA in 10th ABAW Competition: Multimodal Ambivalence/Hesitancy Recognition Approach


53. Hierarchical Reference Sets for Robust Unsupervised Detection of Scattered and Clustered Outliers


54. DAST: A Dual-Stream Voice Anonymization Attacker with Staged Training


55. Mask2Flow-TSE: Two-Stage Target Speaker Extraction with Masking and Flow Matching


56. Hierarchical Dual-Change Collaborative Learning for UAV Scene Change Captioning


57. Residual SODAP: Residual Self-Organizing Domain-Adaptive Prompting with Structural Knowledge Preservation for Continual Learning


58. Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation


59. The RIGID Framework: Research-Integrated, Generative AI-Mediated Instructional Design


60. Empowering Semantic-Sensitive Underwater Image Enhancement with VLM


61. FC-Track: Overlap-Aware Post-Association Correction for Online Multi-Object Tracking


62. TaoBench: Do Automated Theorem Prover LLMs Generalize Beyond MathLib?


63. MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization


64. SRAM-Based Compute-in-Memory Accelerator for Linear-decay Spiking Neural Networks


65. Graph In-Context Operator Networks for Generalizable Spatiotemporal Prediction


66. CognitionCapturerPro: Towards High-Fidelity Visual Decoding from EEG/MEG via Multi-modal Information and Asymmetric Alignment


67. CMHANet: A Cross-Modal Hybrid Attention Network for Point Cloud Registration


68. IGASA: Integrated Geometry-Aware and Skip-Attention Modules for Enhanced Point Cloud Registration


69. Altered Thoughts, Altered Actions: Probing Chain-of-Thought Vulnerabilities in VLA Robotic Manipulation


70. Cost-Efficient Multimodal LLM Inference via Cross-Tier GPU Heterogeneity


71. Seeing Eye to Eye: Enabling Cognitive Alignment Through Shared First-Person Perspective in Human-AI Collaboration


72. HSEmotion Team at ABAW-10 Competition: Facial Expression Recognition, Valence-Arousal Estimation, Action Unit Detection and Fine-Grained Violence Classification


73. Federated Hierarchical Clustering with Automatic Selection of Optimal Cluster Numbers


74. Experimental evidence of progressive ChatGPT models self-convergence


75. MetaKE: Meta-learning Aligned Knowledge Editing via Bi-level Optimization


76. Marker-Based 3D Reconstruction of Aggregates with a Comparative Analysis of 2D and 3D Morphologies


77. RetroReasoner: A Reasoning LLM for Strategic Retrosynthesis Prediction


78. From Text to Forecasts: Bridging Modality Gap with Temporal Evolution Semantic Space


79. Continual Learning in Large Language Models: Methods, Challenges, and Opportunities


80. LR-SGS: Robust LiDAR-Reflectance-Guided Salient Gaussian Splatting for Self-Driving Scene Reconstruction


81. LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing


82. Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents


83. The Economics of AI Supply Chain Regulation


84. Towards unified brain-to-text decoding across speech production and perception


85. VLM4Rec: Multimodal Semantic Representation for Recommendation with Large Vision-Language Models


86. When Drafts Evolve: Speculative Decoding Meets Online Learning


87. Literary Narrative as Moral Probe: A Cross-System Framework for Evaluating AI Ethical Reasoning and Refusal Behavior


88. FastDSAC: Unlocking the Potential of Maximum Entropy RL in High-Dimensional Humanoid Control


89. CarPLAN: Context-Adaptive and Robust Planning with Dynamic Scene Awareness for Autonomous Driving


90. Mastering Negation: Boosting Grounding Models via Grouped Opposition-Based Learning


91. Feynman: Knowledge-Infused Diagramming Agent for Scalable Visual Designs


92. Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization


93. Swap-guided Preference Learning for Personalized Reinforcement Learning from Human Feedback


94. Early Pruning for Public Transport Routing


95. CA-HFP: Curvature-Aware Heterogeneous Federated Pruning with Model Reconstruction


96. Multiscale Structure-Guided Latent Diffusion for Multimodal MRI Translation


97. AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents


98. Reinforcement Learning for Diffusion LLMs with Entropy-Guided Step Selection and Stepwise Advantages


99. CALF: Communication-Aware Learning Framework for Distributed Reinforcement Learning


100. Embedded Quantum Machine Learning in Embedded Systems: Feasibility, Hybrid Architectures, and Quantum Co-Processors


101. Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation


102. TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning


103. LLM BiasScope: A Real-Time Bias Analysis Platform for Comparative LLM Evaluation


104. When LLM Judge Scores Look Good but Best-of-N Decisions Fail


105. Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies


106. ELLA: Generative AI-Powered Social Robots for Early Language Development at Home


107. Naïve PAINE: Lightweight Text-to-Image Generation Improvement with Prompt Evaluation


108. TRACE: Temporal Rule-Anchored Chain-of-Evidence on Knowledge Graphs for Interpretable Stock Movement Prediction


109. One-Step Flow Policy: Self-Distillation for Fast Visuomotor Policies


110. The Perfection Paradox: From Architect to Curator in AI-Assisted API Design


111. CLARE: Classification-based Regression for Electron Temperature Prediction


112. Shattering the Shortcut: A Topology-Regularized Benchmark for Multi-hop Medical Reasoning in LLMs


113. Operationalising Cyber Risk Management Using AI: Connecting Cyber Incidents to MITRE ATT&CK Techniques, Security Controls, and Metrics


114. Unmasking Biases and Reliability Concerns in Convolutional Neural Networks Analysis of Cancer Pathology Images


115. Revisiting Model Stitching In the Foundation Model Era


116. Test-Time Strategies for More Efficient and Accurate Agentic RAG


117. SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs


118. Budget-Sensitive Discovery Scoring: A Formally Verified Framework for Evaluating AI-Guided Scientific Selection


119. Optimizing Task Completion Time Updates Using POMDPs


120. Maximum Entropy Exploration Without the Rollouts


121. Thermodynamics of Reinforcement Learning Curricula


122. VQQA: An Agentic Approach for Video Evaluation and Quality Improvement


123. HCP-DCNet: A Hierarchical Causal Primitive Dynamic Composition Network for Self-Improving Causal Understanding


124. A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning


125. Global Evolutionary Steering: Refining Activation Steering Control via Cross-Layer Consistency


126. Synthetic Data Generation for Brain-Computer Interfaces: Overview, Benchmarking, and Future Directions


127. Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning


128. From Garbage to Gold: A Data-Architectural Theory of Predictive Robustness


129. The DIME Architecture: A Unified Operational Algorithm for Neural Representation, Dynamics, Control and Integration


130. Predictive Analytics for Foot Ulcers Using Time-Series Temperature and Pressure Data


131. Prompt Injection as Role Confusion


132. Aligning Language Models from User Interactions


133. Diagnosing Retrieval Bias Under Multiple In-Context Knowledge Updates in Large Language Models


134. Task-Specific Knowledge Distillation via Intermediate Probes


135. DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs