All AI Papers - 2025-11-04

1. MolChord: Structure-Sequence Alignment for Protein-Guided Drug Design


2. Interaction as Intelligence Part II: Asynchronous Human-Agent Rollout for Long-Horizon Task Training


3. Validity Is What You Need


4. Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning


5. VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation


6. InnovatorBench: Evaluating Agents’ Ability to Conduct Innovative LLM Research


7. SIGMA: Search-Augmented On-Demand Knowledge Integration for Agentic Mathematical Reasoning


8. Mechanics of Learned Reasoning 1: TempoBench, A Benchmark for Interpretable Deconstruction of Reasoning System Performance


9. GeoFM: Enhancing Geometric Reasoning of MLLMs via Synthetic Data Generation through Formal Language


10. DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains


11. Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry


12. Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints


13. ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use


14. An In-depth Study of LLM Contributions to the Bin Packing Problem


15. Discriminative Rule Learning for Outcome-Guided Process Model Discovery


16. Reinforcement Learning for Long-Horizon Unordered Tasks: From Boolean to Coupled Reward Machines


17. GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation


18. Fints: Efficient Inference-Time Personalization for LLMs with Fine-Grained Instance-Tailored Steering


19. From product to system network challenges in system of systems lifecycle management


20. Glia: A Human-Inspired AI for Automated Systems Design and Optimization


21. CombiGraph-Vis: A Curated Multimodal Olympiad Benchmark for Discrete Mathematical Reasoning


22. Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement


23. e1: Learning Adaptive Control of Reasoning Effort


24. Causal Masking on Spatial Data: An Information-Theoretic Case for Learning Spatial Datasets with Unimodal Language Models


25. SUSTAINABLE Platform: Seamless Smart Farming Integration Towards Agronomy Automation


26. Cognition Envelopes for Bounded AI Reasoning in Autonomous UAS Operations


27. The Denario project: Deep knowledge AI agents for scientific discovery


28. Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Scientific Encyclopedia from a Long Chains-of-Thought Knowledge Base


29. CATArena: Evaluation of LLM Agents through Iterative Tournament Competitions


30. Continuous Autoregressive Language Models


31. PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting


32. Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems


33. Community Detection on Model Explanation Graphs for Explainable AI


34. Information-Theoretic Greedy Layer-wise Training for Traffic Sign Recognition


35. VessShape: Few-shot 2D blood vessel segmentation by leveraging shape priors from synthetic images


36. Sketch-to-Layout: Sketch-Guided Multimodal Layout Generation


37. Best Practices for Biorisk Evaluations on Open-Weight Bio-Foundation Models


38. Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning


39. Towards Universal Video Retrieval: Generalizing Video Embedding via Synthesized Multimodal Pyramid Curriculum


40. CodeAlignBench: Assessing Code Generation Models on Developer-Preferred Code Adjustments


41. Toward Accurate Long-Horizon Robotic Manipulation: Language-to-Action with Foundation Models via Scene Graphs


42. Sybil-Resistant Service Discovery for Agent Economies


43. EBT-Policy: Energy Unlocks Emergent Physical Reasoning Capabilities


44. DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models


45. TetraJet-v2: Accurate NVFP4 Training for Large Language Models with Oscillation Suppression and Outlier Control


46. Leveraging Generic Time Series Foundation Models for EEG Classification


47. Context-Gated Cross-Modal Perception with Visual Mamba for PET-CT Lung Tumor Segmentation


48. DP-FedPGN: Finding Global Flat Minima for Differentially Private Federated Learning via Penalizing Gradient Norm


49. InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames


50. FedAdamW: A Communication-Efficient Optimizer with Convergence and Generalization Guarantees for Federated Large Models


51. Thought Branches: Interpreting LLM Reasoning Requires Resampling


52. VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision


53. CoMViT: An Efficient Vision Backbone for Supervised Classification in Medical Imaging


54. Mitigating Semantic Collapse in Partially Relevant Video Retrieval


55. Learning Soft Robotic Dynamics with Active Exploration


56. Who Does Your Algorithm Fail? Investigating Age and Ethnic Bias in the MAMA-MIA Dataset


57. Atlas-Alignment: Making Interpretability Transferable Across Language Models


58. FedMuon: Accelerating Federated Learning with Matrix Orthogonalization


59. Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs


60. Spiking Neural Networks: The Future of Brain-Inspired Computing


61. Measuring Chain-of-Thought Monitorability Through Faithfulness and Verbosity


62. Fine-Tuning Open Video Generators for Cinematic Scene Synthesis: A Small-Data Pipeline with LoRA and Wan2.1 I2V


63. Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis


64. CASR-Net: An Image Processing-focused Deep Learning-based Coronary Artery Segmentation and Refinement Network for X-ray Coronary Angiogram


65. Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity


66. Can LLMs Help You at Work? A Sandbox for Evaluating LLM Agents in Enterprise Environments


67. HiF-DTA: Hierarchical Feature Learning Network for Drug-Target Affinity Prediction


68. FOCUS: Efficient Keyframe Selection for Long Video Understanding


69. Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?


70. MedCalc-Eval and MedCalc-Env: Advancing Medical Calculation Capabilities of Large Language Models


71. Higher-order Linear Attention


72. Languages are Modalities: Cross-Lingual Alignment via Encoder Injection


73. Not All Instances Are Equally Valuable: Towards Influence-Weighted Dataset Distillation



75. Beyond a Million Tokens: Benchmarking and Enhancing Long-Term Memory in LLMs


76. Vintage Code, Modern Judges: Meta-Validation in Low Data Regimes


77. DRAMA: Unifying Data Retrieval and Analysis for Open-Domain Analytic Queries


78. Soft Task-Aware Routing of Experts for Equivariant Representation Learning


79. Privacy-Aware Continual Self-Supervised Learning on Multi-Window Chest Computed Tomography for Domain-Shift Robustness


80. Multi-Modal Feature Fusion for Spatial Morphology Analysis of Traditional Villages via Hierarchical Graph Neural Networks


81. Feature-Function Curvature Analysis: A Geometric Framework for Explaining Differentiable Models


82. MemeArena: Automating Context-Aware Unbiased Evaluation of Harmfulness Understanding for Multimodal Large Language Models


83. Vectorized Online POMDP Planning


84. Unvalidated Trust: Cross-Stage Vulnerabilities in Large Language Model Architectures


85. Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications


86. Dual-level Progressive Hardness-Aware Reweighting for Cross-View Geo-Localization


87. FMint-SDE: A Multimodal Foundation Model for Accelerating Numerical Simulation of SDEs via Error Correction


88. Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler


89. H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models


90. Generating Accurate and Detailed Captions for High-Resolution Images


91. MARIA: A Framework for Marginal Risk Assessment without Ground Truth in AI Systems


92. Exploring Landscapes for Better Minima along Valleys


93. ZEBRA: Towards Zero-Shot Cross-Subject Generalization for Universal Brain Visual Decoding


94. AURA: A Reinforcement Learning Framework for AI-Driven Adaptive Conversational Surveys


95. Expressive Range Characterization of Open Text-to-Audio Models


96. QiNN-QJ: A Quantum-inspired Neural Network with Quantum Jump for Multimodal Sentiment Analysis


97. Adapting Large Language Models to Emerging Cybersecurity using Retrieval Augmented Generation


98. Towards a Measure of Algorithm Similarity


99. Consistency Training Helps Stop Sycophancy and Jailbreaks


100. Detecting Data Contamination in LLMs via In-Context Learning


101. Dataset Creation and Baseline Models for Sexism Detection in Hausa


102. Elastic Architecture Search for Efficient Language Models


103. A Multi-Modal Neuro-Symbolic Approach for Spatial Reasoning-Based Visual Grounding in Robotics


104. Jasmine: A Simple, Performant and Scalable JAX-based World Modeling Codebase


105. A Framework for Fair Evaluation of Variance-Aware Bandit Algorithms


106. AIOT based Smart Education System: A Dual Layer Authentication and Context-Aware Tutoring Framework for Learning Environments


107. LLMs are Overconfident: Evaluating Confidence Interval Calibration with FermiEval


108. Fine-Grained Iterative Adversarial Attacks with Limited Computation Budget


109. Overview of the MEDIQA-OE 2025 Shared Task on Medical Order Extraction from Doctor-Patient Consultations


110. Frame Semantic Patterns for Identifying Underreporting of Notifiable Events in Healthcare: The Case of Gender-Based Violence



112. Can machines think efficiently?


113. LLM-based Multi-class Attack Analysis and Mitigation Framework in IoT/IIoT Networks


114. Mind the Gaps: Auditing and Reducing Group Inequity in Large-Scale Mobility Prediction


115. RepV: Safety-Separable Latent Spaces for Scalable Neurosymbolic Plan Verification


116. Scale-Aware Curriculum Learning for Data-Efficient Lung Nodule Detection with YOLOv11


117. Heterogeneous Robot Collaboration in Unstructured Environments with Grounded Generative Intelligence


118. How Similar Are Grokipedia and Wikipedia? A Multi-Dimensional Textual and Structural Comparison


119. BI-DCGAN: A Theoretically Grounded Bayesian Framework for Efficient and Diverse GANs


120. Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench


121. Leveraging Foundation Models for Enhancing Robot Perception and Action


122. Broken-Token: Filtering Obfuscated Prompts by Counting Characters-Per-Token


123. CAS-Spec: Cascade Adaptive Self-Speculative Decoding for On-the-Fly Lossless Inference Acceleration of LLMs


124. Accurate Target Privacy Preserving Federated Learning Balancing Fairness and Utility


125. SpotIt: Evaluating Text-to-SQL Evaluation with Formal Verification


126. Category-Aware Semantic Caching for Heterogeneous LLM Workloads


127. Diffusion-Driven Generation of Minimally Preprocessed Brain MRI


128. VISAT: Benchmarking Adversarial and Distribution Shift Robustness in Traffic Sign Recognition with Visual Attributes


129. R3GAN-based Optimal Strategy for Augmenting Small Medical Dataset


130. LeMat-Synth: a multi-modal toolbox to curate broad synthesis procedure databases from scientific literature


131. Cross-Corpus Validation of Speech Emotion Recognition in Urdu using Domain-Knowledge Acoustic Features


132. See the Speaker: Crafting High-Resolution Talking Faces from Speech with Prior Guidance and Region Refinement


133. GACA-DiT: Diffusion-based Dance-to-Music Generation with Genre-Adaptive Rhythm and Context-Aware Alignment


134. Systematic Absence of Low-Confidence Nighttime Fire Detections in VIIRS Active Fire Product: Evidence of Undocumented Algorithmic Filtering


135. Impact of clinical decision support systems (CDSS) on clinical outcomes and healthcare delivery in low- and middle-income countries: protocol for a systematic review and meta-analysis


136. Reinforcement Learning for Accelerator Beamline Control: a simulation-based approach


137. EARS-UDE: Evaluating Auditory Response in Sensory Overload with Universal Differential Equations


138. VeriStruct: AI-assisted Automated Verification of Data-Structure Modules in Verus


139. Detecting Prefix Bias in LLM-based Reward Models


140. A Transformer-based Neural Architecture Search Method


141. A Neural Architecture Search Method using Auxiliary Evaluation Metric based on ResNet Architecture