All AI Papers - 2026-04-09

1. How Much LLM Does a Self-Revising Agent Actually Need?


2. Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization


3. EVGeoQA: Benchmarking LLMs on Dynamic, Multi-Objective Geo-Spatial Exploration


4. Planning Task Shielding: Detecting and Repairing Flaws in Planning Tasks through Turning them Unsolvable


5. A-MBER: Affective Memory Benchmark for Emotion Recognition


6. CAFP: A Post-Processing Framework for Group Fairness via Counterfactual Model Averaging


7. EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayesian Orchestration


8. What’s Missing in Screen-to-Action? Towards a UI-in-the-Loop Paradigm for Multimodal GUI Reasoning


9. Explaining Neural Networks in Preference Learning: a Post-hoc Inductive Logic Programming Approach


10. Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation


11. Riemann-Bench: A Benchmark for Moonshot Mathematics


12. FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling


13. TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design


14. Steering the Verifiability of Multimodal AI Hallucinations


15. ATANT: An Evaluation Framework for AI Continuity


16. AgentGate: A Lightweight Structured Routing Engine for the Internet of Agents


17. Reasoning Fails Where Step Flow Breaks


18. KD-MARL: Resource-Aware Knowledge Distillation in Multi-Agent Reinforcement Learning


19. Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability


20. On Emotion-Sensitive Decision Making of Small Language Model Agents


21. BDI-Kit Demo: A Toolkit for Programmable and Conversational Data Harmonization


22. ProofSketcher: Hybrid LLM + Lightweight Proof Checker for Reliable Math/Logic Reasoning


23. Qualixar OS: A Universal Operating System for AI Agent Orchestration


24. SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio


25. SymptomWise: A Deterministic Reasoning Layer for Reliable and Efficient AI Systems


26. Weakly Supervised Distillation of Hallucination Signals into Transformer Representations


27. Toward Reducing Unproductive Container Moves: Predicting Service Requirements and Dwell Times


28. Blind Refusal: Language Models Refuse to Help Users Evade Unjust, Absurd, and Illegitimate Rules


29. High-Precision Estimation of the State-Space Complexity of Shogi via the Monte Carlo Method


30. Toward a Tractability Frontier for Exact Relevance Certification


31. MoRight: Motion Control Done Right


32. RoSHI: A Versatile Robot-oriented Suit for Human Data In-the-Wild


33. Syntax Is Easy, Semantics Is Hard: Evaluating LLMs for LTL Translation


34. Evaluating In-Context Translation with Synchronous Context-Free Grammar Transduction


35. Chatbot-Based Assessment of Code Understanding in Automated Programming Assessment Systems


36. Region-Graph Optimal Transport Routing for Mixture-of-Experts Whole-Slide Image Classification


37. CADENCE: Context-Adaptive Depth Estimation for Navigation and Computational Efficiency


38. Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions


39. Making Room for AI: Multi-GPU Molecular Dynamics with Deep Potentials in GROMACS


40. A Systematic Study of Retrieval Pipeline Design for Retrieval-Augmented Medical Question Answering


41. Validated Intent Compilation for Constrained Routing in LEO Mega-Constellations


42. Designing Safe and Accountable GenAI as a Learning Companion with Women Banned from Formal Education


43. $k$-server-bench: Automating Potential Discovery for the $k$-Server Conjecture


44. TraceSafe: A Systematic Assessment of LLM Guardrails on Multi-Step Tool-Calling Trajectories


45. Mixture Proportion Estimation and Weakly-supervised Kernel Test for Conditional Independence


46. The ATOM Report: Measuring the Open Language Model Ecosystem


47. TeaLeafVision: An Explainable and Robust Deep Learning Framework for Tea Leaf Disease Classification


48. Energy-based Tissue Manifolds for Longitudinal Multiparametric MRI Analysis


49. Bridging MRI and PET physiology: Untangling complementarity through orthogonal representations


50. Dynamic Context Evolution for Scalable Synthetic Data Generation


51. Energy Saving for Cell-Free Massive MIMO Networks: A Multi-Agent Deep Reinforcement Learning Approach


52. CSA-Graphs: A Privacy-Preserving Structural Dataset for Child Sexual Abuse Research


53. Self-Discovered Intention-aware Transformer for Multi-modal Vehicle Trajectory Prediction


54. Mixed-Initiative Context: Structuring and Managing Context for Human-AI Collaboration


55. Assessing the Added Value of Onboard Earth Observation Processing with the IRIDE HEO Service Segment


56. Information as Structural Alignment: A Dynamical Theory of Continual Learning


57. The Impact of Steering Large Language Models with Persona Vectors in Educational Applications


58. SurFITR: A Dataset for Surveillance Image Forgery Detection and Localisation


59. STRIDE-ED: A Strategy-Grounded Stepwise Reasoning Framework for Empathetic Dialogue Systems


60. Flow Motion Policy: Manipulator Motion Planning with Flow Matching Models


61. AV-SQL: Decomposing Complex Text-to-SQL Queries with Agentic Views


62. AEROS: A Single-Agent Operating Architecture with Embodied Capability Modules


63. KITE: Keyframe-Indexed Tokenized Evidence for VLM-Based Robot Failure Analysis



65. ConceptTracer: Interactive Analysis of Concept Saliency and Selectivity in Neural Representations


66. AgentCity: Constitutional Governance for Autonomous Agent Economies via Separation of Power


67. Self-Preference Bias in Rubric-Based Evaluation of Large Language Models


68. Stress Estimation in Elderly Oncology Patients Using Visual Wearable Representations and Multi-Instance Learning


69. Generative Photomosaic with Structure-Aligned and Personalized Diffusion


70. CAAP: Capture-Aware Adversarial Patch Attacks on Palmprint Recognition Models


71. Frailty Estimation in Elderly Oncology Patients Using Multimodal Wearable Data and Multi-Instance Learning


72. An empirical study of LoRA-based fine-tuning of large language models for automated test case generation


73. A First Guess is Rarely the Final Answer: Learning to Search in the Travelling Salesperson Problem


74. Multi-modal user interface control detection using cross-attention


75. FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling


76. Q-Zoom: Query-Aware Adaptive Perception for Efficient Multimodal Large Language Models


77. The AI Skills Shift: Mapping Skill Obsolescence, Emergence, and Transition Pathways in the LLM Era


78. XR-CareerAssist: An Immersive Platform for Personalised Career Guidance Leveraging Extended Reality and Multimodal AI


79. SentinelSphere: Integrating AI-Powered Real-Time Threat Detection with Cybersecurity Awareness Training


80. Do We Need Distinct Representations for Every Speech Token? Unveiling and Exploiting Redundancy in Large Speech Language Models


81. Physical Adversarial Attacks on AI Surveillance Systems: Detection, Tracking, and Visible–Infrared Evasion


82. Digital Skin, Digital Bias: Uncovering Tone-Based Biases in LLMs and Emoji Embeddings


83. MedDialBench: Benchmarking LLM Diagnostic Robustness under Parametric Adversarial Patient Behaviors


84. HingeMem: Boundary Guided Long-Term Memory with Query Adaptive Retrieval for Scalable Dialogues


85. On the Step Length Confounding in LLM Reasoning Data Selection


86. Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation


87. WRAP++: Web discoveRy Amplified Pretraining


88. Environmental, Social and Governance Sentiment Analysis on Slovene News: A Novel Dataset and Models


89. OmniTabBench: Mapping the Empirical Frontiers of GBDTs, Neural Networks, and Foundation Models for Tabular Data at Scale


90. SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems


91. MoBiE: Efficient Inference of Mixture of Binary Experts under Post-Training Quantization


92. Instance-Adaptive Parametrization for Amortized Variational Inference


93. FedDAP: Domain-Aware Prototype Learning for Federated Learning under Domain Shift


94. Evaluating Repository-level Software Documentation via Question Answering and Feature-Driven Development


95. Sparse-Aware Neural Networks for Nonlinear Functionals: Mitigating the Exponential Dependence on Dimension



97. FlowExtract: Procedural Knowledge Extraction from Maintenance Flowcharts


98. TeamLLM: A Human-Like Team-Oriented Collaboration Framework for Multi-Step Contextualized Tasks


99. Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios


100. Luwen Technical Report


101. URMF: Uncertainty-aware Robust Multimodal Fusion for Multimodal Sarcasm Detection


102. The Traveling Thief Problem with Time Windows: Benchmarks and Heuristics


103. Fine-grained Approaches for Confidence Calibration of LLMs in Automated Code Revision


104. HQF-Net: A Hybrid Quantum-Classical Multi-Scale Fusion Network for Remote Sensing Image Segmentation


105. ChemVLR: Prioritizing Reasoning in Perception for Chemical Vision-Language Understanding


106. Between Century and Poet: Graph-Based Lexical Semantic Change in Persian Poetry


107. A Graph-Enhanced Defense Framework for Explainable Fake News Detection with LLM


108. Restoring Heterogeneity in LLM-based Social Simulation: An Audience Segmentation Approach


109. A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP


110. RPM-Net: Reciprocal Point MLP Network for Unknown Network Security Threat Detection


111. SHAPE: Stage-aware Hierarchical Advantage via Potential Estimation for LLM Reasoning


112. SubFLOT: Submodel Extraction for Efficient and Personalized Federated Learning via Optimal Transport


113. Logical Robots: Declarative Multi-Agent Programming in Logica


114. CubeGraph: Efficient Retrieval-Augmented Generation for Spatial and Temporal Data


115. The Detection–Extraction Gap: Models Know the Answer Before They Can Say It


116. TwinLoop: Simulation-in-the-Loop Digital Twins for Online Multi-Agent Reinforcement Learning


117. Scientific Knowledge-driven Decoding Constraints Improving the Reliability of LLMs


118. LLM-based Schema-Guided Extraction and Validation of Missing-Person Intelligence from Heterogeneous Data Sources


119. AI-Driven Research for Databases


120. SkillSieve: A Hierarchical Triage Framework for Detecting Malicious AI Agent Skills


121. Soft-Quantum Algorithms


122. Database Querying under Missing Values Governed by Missingness Mechanisms


123. Adaptive Differential Privacy for Federated Medical Image Segmentation Across Diverse Modalities


124. Efficient Quantization of Mixture-of-Experts with Theoretical Generalization Guarantees


125. MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts


126. Improving Robustness In Sparse Autoencoders via Masked Regularization


127. Discrete Flow Matching Policy Optimization


128. Inference-Time Code Selection via Symbolic Equivalence Partitioning


129. Distributed Interpretability and Control for Large Language Models


130. Hybrid ResNet-1D-BiGRU with Multi-Head Attention for Cyberattack Detection in Industrial IoT Environments


131. Multi-objective Evolutionary Merging Enables Efficient Reasoning Models


132. From Load Tests to Live Streams: Graph Embedding-Based Anomaly Detection in Microservice Architectures


133. The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?


134. Continual Visual Anomaly Detection on the Edge: Benchmark and Efficient Solutions


135. The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning


136. Neural Computers


137. Team Fusion@SU @ BC8 SympTEMIST track: transformer-based approach for symptom recognition and linking


138. When to Call an Apple Red: Humans Follow Introspective Rules, VLMs Don’t


139. Attention Flows: Tracing LLM Conceptual Engagement via Story Summaries


140. Towards Resilient Intrusion Detection in CubeSats: Challenges, TinyML Solutions, and Future Directions


141. Say Something Else: Rethinking Contextual Privacy as Information Sufficiency


142. FMI@SU ToxHabits: Evaluating LLMs Performance on Toxic Habit Extraction in Spanish Clinical Texts


143. Toward a universal foundation model for graph-structured data


144. MorphDistill: Distilling Unified Morphological Knowledge from Pathology Foundation Models for Colorectal Cancer Survival Prediction


145. Uncertainty Estimation for Deep Reconstruction in Actuatic Disaster Scenarios with Autonomous Vehicles


146. The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment


147. Beyond Functional Correctness: Design Issues in AI IDE-Generated Large-Scale Projects


148. WebSP-Eval: Evaluating Web Agents on Website Security and Privacy Tasks


149. A Severity-Based Curriculum Learning Strategy for Arabic Medical Text Generation


150. GS-Surrogate: Deformable Gaussian Splatting for Parameter Space Exploration of Ensemble Simulations


151. In-Context Learning in Speech Language Models: Analyzing the Role of Acoustic Features, Linguistic Structure, and Induction Heads


152. DietDelta: A Vision-Language Approach for Dietary Assessment via Before-and-After Images


153. Bi-Level Optimization for Single Domain Generalization


154. Severity-Aware Weighted Loss for Arabic Medical Text Generation


155. “Don’t Be Afraid, Just Learn”: Insights from Industry Practitioners to Prepare Software Engineers in the Age of Generative AI


156. BiScale-GTR: Fragment-Aware Graph Transformers for Multi-Scale Molecular Representation Learning


157. A Novel Automatic Framework for Speaker Drift Detection in Synthesized Speech


158. Blockchain and AI: Securing Intelligent Networks for the Future


159. AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent


160. TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models


161. Harnessing Hyperbolic Geometry for Harmful Prompt Detection and Sanitization


162. ClawLess: A Security Model of AI Agents


163. DosimeTron: Automating Personalized Monte Carlo Radiation Dosimetry in PET/CT with Agentic AI


164. Plasma GraphRAG: Physics-Grounded Parameter Selection for Gyrokinetic Simulations


165. Towards the Development of an LLM-Based Methodology for Automated Security Profiling in Compliance with Ukrainian Cybersecurity Regulations


166. MAT-Cell: A Multi-Agent Tree-Structured Reasoning Framework for Batch-Level Single-Cell Annotation


167. MO-RiskVAE: A Multi-Omics Variational Autoencoder for Survival Risk Modeling in Multiple Myeloma


168. Attribution-Driven Explainable Intrusion Detection with Encoder-Based Large Language Models


169. ToxReason: A Benchmark for Mechanistic Chemical Toxicity Reasoning via Adverse Outcome Pathway


170. Incentive-Aware Multi-Fidelity Optimization for Generative Advertising in Large Language Models


171. From Exposure to Internalization: Dual-Stream Calibration for In-context Clinical Reasoning


172. $S^3$: Stratified Scaling Search for Test-Time in Diffusion Language Models


173. Spectral Edge Dynamics Reveal Functional Modes of Learning


174. Learning the Stellar Structure Equations via Self-supervised Physics-Informed Neural Networks


175. SE-Enhanced ViT and BiLSTM-Based Intrusion Detection for Secure IIoT and IoMT Environments


176. FLeX: Fourier-based Low-rank EXpansion for multilingual transfer


177. DISSECT: Diagnosing Where Vision Ends and Language Priors Begin in Scientific VLMs


178. SALLIE: Safeguarding Against Latent Language & Image Exploits


179. The Art of Building Verifiers for Computer Use Agents


180. Negotiating Privacy with Smart Voice Assistants: Risk-Benefit and Control-Acceptance Tensions


181. Automating Database-Native Function Code Synthesis with LLMs


182. Ontology-based knowledge graph infrastructure for interoperable atomistic simulation data


183. Probabilistic Language Tries: A Unified Framework for Compression, Decision Policies, and Execution Reuse


184. The Geometry of Forgetting


185. Development of ML model for triboelectric nanogenerator based sign language detection system


186. From experimentation to engagement: on the paradox of participatory AI and power in contexts of forced displacement and humanitarian crises


187. The End of the Foundation Model Era: Open-Weight Models, Sovereign AI, and Inference as Infrastructure


188. Blending Human and LLM Expertise to Detect Hallucinations and Omissions in Mental Health Chatbot Responses


189. Governing frontier general-purpose AI in the public sector: adaptive risk management and policy capacity under uncertainty through 2030


190. Unsupervised Neural Network for Automated Classification of Surgical Urgency Levels in Medical Transcriptions


191. Invisible Influences: Investigating Implicit Intersectional Biases through Persona Engineering in Large Language Models


192. Code Sharing In Prediction Model Research: A Scoping Review


193. Illocutionary Explanation Planning for Source-Faithful Explanations in Retrieval-Augmented Language Models


194. Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook


195. Extracting Breast Cancer Phenotypes from Clinical Notes: Comparing LLMs with Classical Ontology Methods


196. A Comparative Study of Demonstration Selection for Practical Large Language Models-based Next POI Prediction


197. The Human Condition as Reflected in Contemporary Large Language Models


198. Tool-MCoT: Tool Augmented Multimodal Chain-of-Thought for Content Safety Moderation


199. SensorPersona: An LLM-Empowered System for Continual Persona Extraction from Longitudinal Mobile Sensor Streams


200. Front-End Ethics for Sensor-Fused Health Conversational Agents: An Ethical Design Space for Biometrics


201. Cross-Lingual Transfer and Parameter-Efficient Adaptation in the Turkic Language Family: A Theoretical Framework for Low-Resource Language Models


202. Beyond Facts: Benchmarking Distributional Reading Comprehension in Large Language Models


203. Thinking in Graphs with CoMAP: A Shared Visual Workspace for Designing Project-Based Learning


204. Concentrated siting of AI data centers drives regional power-system stress under rising global compute demand


205. Temporally Phenotyping GLP-1RA Case Reports with Large Language Models: A Textual Time Series Corpus and Risk Modeling


206. Consistency-Guided Decoding with Proof-Driven Disambiguation for Three-Way Logical Question Answering


207. Hallucination as output-boundary misclassification: a composite abstention architecture for language models


208. Depression Detection at the Point of Care: Automated Analysis of Linguistic Signals from Routine Primary Care Encounters


209. The Stepwise Informativeness Assumption: Why are Entropy Dynamics and Reasoning Correlated in LLMs?


210. Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment


211. LLM Spirals of Delusion: A Benchmarking Audit Study of AI Chatbot Interfaces


212. Full State-Space Visualisation of the 8-Puzzle: Feasibility, Design, and Educational Use


213. Benchmarking LLM Tool-Use in the Wild


214. A Goal-Oriented Chatbot for Engaging the Elderly Through Family Photo Conversations


215. VenusBench-Mobile: A Challenging and User-Centric Benchmark for Mobile GUI Agents with Capability Diagnostics



217. Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qwen3-Embedding Model



219. EviSnap: Faithful Evidence-Cited Explanations for Cold-Start Cross-Domain Recommendation


220. LLM-Augmented Knowledge Base Construction For Root Cause Analysis


221. Fighting AI with AI: AI-Agent Augmented DNS Blocking of LLM Services during Student Evaluations


222. Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engineering for Scholarly Debates


223. Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segmentation