전체 AI 논문 - 2025-11-07

1. Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning


2. Explaining Decisions in ML Models: a Parameterized Complexity Analysis (Part I)


3. Towards Scalable Web Accessibility Audit with MLLMs as Copilots


4. From Five Dimensions to Many: Large Language Models as Precise and Interpretable Psychological Profilers


5. Adobe Summit Concierge Evaluation with Human in the Loop


6. Toward Autonomous Engineering Design: A Knowledge-Guided Multi-Agent Framework


7. Uncovering Bugs in Formal Explainers: A Case Study with PyXAI


8. A Proprietary Model-Based Safety Response Framework for AI Agents


9. Using Multi-modal Large Language Model to Boost Fireworks Algorithm’s Ability in Settling Challenging Optimization Tasks


10. miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward


11. Large language models require a new form of oversight: capability-based monitoring


12. SnapStream: Efficient Long Sequence Decoding on Dataflow Accelerators


13. Epidemiology of Large Language Models: A Benchmark for Observational Distribution Knowledge


14. No-Human in the Loop: Agentic Evaluation at Scale for Recommendation


15. PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework


16. Evaluating Control Protocols for Untrusted AI Agents


17. Grounded Misunderstandings in Asymmetric Dialogue: A Perspectivist Annotation Scheme for MapTask


18. AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing


19. The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents


20. Structured Matrix Scaling for Multi-Class Calibration


21. Whisper Leak: a side-channel attack on Large Language Models


22. DQN Performance with Epsilon Greedy Policies and Prioritized Experience Replay


23. ChiMDQA: Towards Comprehensive Chinese Document QA with Fine-grained Evaluation


24. Explaining Human Choice Probabilities with Simple Vector Representations


25. Watermarking Large Language Models in Europe: Interpreting the AI Act in Light of Technology


26. LiveTradeBench: Seeking Real-World Alpha with Large Language Models


27. Visualization Biases MLLM’s Decision Making in Network Data Tasks


28. Step-Audio-EditX Technical Report


29. PerfDojo: Automated ML Library Generation for Heterogeneous Architectures


30. Learning Under Laws: A Constraint-Projected Neural PDE Solver that Eliminates Hallucinations


31. Multi-User Personalisation in Human-Robot Interaction: Using Quantitative Bipolar Argumentation Frameworks for Preferences Conflict Resolution


32. Imitation Learning in the Deep Learning Era: A Novel Taxonomy and Recent Advances


33. AILA–First Experiments with Localist Language Models


34. MultiZebraLogic: A Multilingual Logical Reasoning Benchmark


35. Uncovering Code Insights: Leveraging GitHub Artifacts for Deeper Code Understanding


36. SOLVE-Med: Specialized Orchestration for Leading Vertical Experts across Medical Specialties


37. Efficient Neural Networks with Discrete Cosine Transform Activations


38. A Theoretical Framework for Environmental Similarity and Vessel Mobility as Coupled Predictors of Marine Invasive Species Pathways


39. ROSBag MCP Server: Analyzing Robot Data with LLMs for Agentic Embodied AI Applications


40. Development of the Bioinspired Tendon-Driven DexHand 021 with Proprioceptive Compliance Control


41. CareMedEval dataset: Evaluating Critical Appraisal and Reasoning in the Biomedical Field


42. Inter-Agent Trust Models: A Comparative Study of Brief, Claim, Proof, Stake, Reputation and Constraint in Agentic Web Protocol Design-A2A, AP2, ERC-8004, and Beyond


43. Light over Heavy: Automated Performance Requirements Quantification with Linguistic Inducement


44. Adaptable Hindsight Experience Replay for Search-Based Learning


45. Computational Imaging Meets LLMs: Zero-Shot IDH Mutation Prediction in Brain Gliomas


46. Decoupling Augmentation Bias in Prompt Learning for Vision-Language Models


47. Open Source State-Of-the-Art Solution for Romanian Speech Recognition


48. Generative Artificial Intelligence in Bioinformatics: A Systematic Review of Models, Applications, and Methodological Advances


49. Discourse-Aware Scientific Paper Recommendation via QA-Style Summarization and Multi-Level Contrastive Learning


50. Benchmarking the Thinking Mode of Multimodal Large Language Models in Clinical Tasks


51. Extending Fair Null-Space Projections for Continuous Attributes to Kernel Methods


52. How to Evaluate Speech Translation with Source-Aware Neural MT Metrics


53. When Generative Artificial Intelligence meets Extended Reality: A Systematic Review


54. Comparing the Performance of LLMs in RAG-based Question-Answering: A Case Study in Computer Science Literature


55. Generative deep learning for foundational video translation in ultrasound


56. GMoPE:A Prompt-Expert Mixture Framework for Graph Foundation Models


57. Node-Based Editing for Multimodal Generation of Text, Audio, Image, and Vide


58. Hybrid Fact-Checking that Integrates Knowledge Graphs, Large Language Models, and Search-Based Retrieval Agents Improves Interpretable Claim Verification


59. LGM: Enhancing Large Language Models with Conceptual Meta-Relations and Iterative Retrieval


60. Retrofitters, pragmatists and activists: Public interest litigation for accountable automated decision-making


61. QG-CoC: Question-Guided Chain-of-Captions for Large Multimodal Models


62. A Quantized VAE-MLP Botnet Detection Model: A Systematic Evaluation of Quantization-Aware Training and Post-Training Quantization Strategies


63. Efficient Linear Attention for Multivariate Time Series Modeling via Entropy Equality


64. Optimizing Earth-Moon Transfer and Cislunar Navigation: Integrating Low-Energy Trajectories, AI Techniques and GNSS-R Technologies


65. GraphCliff: Short-Long Range Gating for Subtle Differences but Critical Changes


66. RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring


67. Who Sees the Risk? Stakeholder Conflicts and Explanatory Policies in LLM-based Risk Assessment


68. Forecast2Anomaly (F2A): Adapting Multivariate Time Series Foundation Models for Anomaly Prediction


69. From Measurement to Expertise: Empathetic Expert Adapters for Context-Based Empathy in Conversational AI Agents


70. Deploying Rapid Damage Assessments from sUAS Imagery for Disaster Response


71. Optimal Boundary Control of Diffusion on Graphs via Linear Programming


72. EGMOF: Efficient Generation of Metal-Organic Frameworks Using a Hybrid Diffusion-Transformer Architecture


73. Control Barrier Function for Aligning Large Language Models


74. Image-Intrinsic Priors for Integrated Circuit Defect Detection and Novel Class Discovery via Self-Supervised Learning


75. An Augmentation Overlap Theory of Contrastive Learning


76. FP-AbDiff: Improving Score-based Antibody Design by Capturing Nonequilibrium Dynamics through the Underlying Fokker-Planck Equation


77. Adaptive Detection of Software Aging under Workload Shift


78. CARMA: Comprehensive Automatically-annotated Reddit Mental Health Dataset for Arabic


79. Scaling Multi-Agent Environment Co-Design with Diffusion Models


80. Sparse, self-organizing ensembles of local kernels detect rare statistical anomalies


81. Reading Between the Lines: The One-Sided Conversation Problem


82. Adaptive-Sensorless Monitoring of Shipping Containers


83. SLIP: Structural-aware Language-Image Pretraining for Vision-Language Alignment


84. Systematizing LLM Persona Design: A Four-Quadrant Technical Taxonomy for AI Companion Applications


85. Value of Information-Enhanced Exploration in Bootstrapped DQN


86. EvtSlowTV - A Large and Diverse Dataset for Event-Based Depth Estimation


87. Power Constrained Nonstationary Bandits with Habituation and Recovery Dynamics


88. From Narrow to Wide: Autoencoding Transformers for Ultrasound Bandwidth Recovery


89. Zero-shot data citation function classification using transformer-based large language models (LLMs)


90. Generative Hints


91. Performance Evaluation of Bitstring Representations in a Linear Genetic Programming Framework


92. A Criminology of Machines


93. NABench: Large-Scale Benchmarks of Nucleotide Foundation Models for Fitness Prediction


94. Predicting Weekly Fishing Concentration Zones through Deep Learning Integration of Heterogeneous Environmental Spatial Datasets


95. Test-time Adaptation of Tiny Recursive Models


96. AgentSLA : Towards a Service Level Agreement for AI Agents


97. NEF-NET+: Adapting Electrocardio panorama in the wild


98. Stochastic Deep Graph Clustering for Practical Group Formation


99. A Novel Reservoir Computing Framework for Chaotic Time Series Prediction Using Time Delay Embedding and Random Fourier Features


100. Academics and Generative AI: Empirical and Epistemic Indicators of Policy-Practice Voids


101. FATE: A Formal Benchmark Series for Frontier Algebra of Multiple Difficulty Levels


102. Analysis of AdvFusion: Adapter-based Multilingual Learning for Code Large Language Models


103. Proof-of-Spiking-Neurons(PoSN): Neuromorphic Consensus for Next-Generation Blockchains


104. LM-Fix: Lightweight Bit-Flip Detection and Rapid Recovery Framework for Language Models


105. Mathematical exploration and discovery at scale


106. Digitizing Spermatogenesis Lineage at Nanoscale Resolution In Tissue-Level Electron Microscopy


107. SELF-REDRAFT: Eliciting Intrinsic Exploration-Exploitation Balance in Test-Time Scaling for Code Generation


108. Consciousness-ECG Transformer for Conscious State Estimation System with Real-Time Monitoring


109. Approaching Low-Cost Cardiac Intelligence with Semi-Supervised Knowledge Distillation


110. EEGReXferNet: A Lightweight Gen-AI Framework for EEG Subspace Reconstruction via Cross-Subject Transfer Learning and Channel-Aware Embedding


111. Spatio-Temporal Attention Network for Epileptic Seizure Prediction


112. AI-Enhanced Wi-Fi Sensing Through Single Transceiver Pair


113. Digital Transformation Chatbot (DTchatbot): Integrating Large Language Model-based Chatbot in Acquiring Digital Transformation Needs


114. Evaluating Generative AI as an Educational Tool for Radiology Resident Report Drafting


115. An extended reality-based framework for user risk training in urban built environment