전체 AI 논문 - 2025-10-20

1. PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold


2. Demo: Guide-RAG: Evidence-Driven Corpus Curation for Retrieval-Augmented Generation in Long COVID


3. Self-evolving expertise in complex non-verifiable subject domains: dialogue as implicit meta-RL


4. Preliminary Quantitative Study on Explainability and Trust in AI Systems


5. Towards Relaxed Multimodal Inputs for Gait-based Parkinson’s Disease Assessment


6. AURA: An Agent Autonomy Risk Assessment Framework


7. Invoice Information Extraction: Methods and Performance Evaluation


8. Direct Preference Optimization with Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences


9. Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation


10. Unleashing Scientific Reasoning for Bio-experimental Protocol Generation via Structured Component-based Reward Mechanism


11. Context-aware deep learning using individualized prior information reduces false positives in disease risk prediction and longitudinal health assessment


12. JudgeSQL: Reasoning over SQL Candidates with Weighted Consensus Tournament


13. Hypergraph Contrastive Sensor Fusion for Multimodal Fault Diagnosis in Induction Motors


14. Taming the Judge: Deconflicting AI Feedback for Stable Reinforcement Learning


15. Adaptive Minds: Empowering Agents with LoRA-as-Tools


16. MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games


17. Corrigibility Transformation: Constructing Goals That Accept Updates


18. Advancing Routing-Awareness in Analog ICs Floorplanning


19. Towards Flash Thinking via Decoupled Advantage Policy Optimization


20. VERITAS: Leveraging Vision Priors and Expert Fusion to Improve Multimodal Data


21. WebGen-V Bench: Structured Representation for Enhancing Visual Design in LLM-based Web Generation and Evaluation


22. AUGUSTUS: An LLM-Driven Multimodal Agent System with Contextualized User Memory


23. Experience-Driven Exploration for Efficient API-Free AI Agents


24. Multi-dimensional Data Analysis and Applications Basing on LLM Agents and Knowledge Graph Interactions


25. From Checklists to Clusters: A Homeostatic Account of AGI Evaluation


26. WELD: A Large-Scale Longitudinal Dataset of Emotional Dynamics for Ubiquitous Affective Computing


27. HugAgent: Evaluating LLMs in Simulating Human-Like Individual Reasoning on Open-Ended Tasks


28. Towards Error Centric Intelligence I, Beyond Observational Learning


29. Procedural Game Level Design with Deep Reinforcement Learning


30. OpenEstimate: Evaluating LLMs on Reasoning Under Uncertainty with Real-World Data


31. OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM


32. PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction


33. InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training


34. Self-Certifying Primal-Dual Optimization Proxies for Large-Scale Batch Economic Dispatch


35. Enhanced Sentiment Interpretation via a Lexicon-Fuzzy-Transformer Framework


36. SNOO: Step-K Nesterov Outer Optimizer - The Surprising Effectiveness of Nesterov Momentum Applied to Pseudo-Gradients


37. GENESIS: A Generative Model of Episodic-Semantic Interaction


38. Chronos-2: From Univariate to Universal Forecasting


39. AB-UPT for Automotive and Aerospace Applications


40. Controlling the image generation process with parametric activation functions


41. Semantic segmentation with coarse annotations


42. NDM: A Noise-driven Detection and Mitigation Framework against Implicit Sexual Intentions in Text-to-Image Generation


43. LLMs Judge Themselves: A Game-Theoretic Framework for Human-Aligned Evaluation


44. Attention Sinks in Diffusion Language Models


45. RLAF: Reinforcement Learning from Automaton Feedback


46. DGME-T: Directional Grid Motion Encoding for Transformer-Based Historical Camera Movement Classification


47. ProSh: Probabilistic Shielding for Model-free Reinforcement Learning


48. Beyond-Diagonal RIS Under Non-Idealities: Learning-Based Architecture Discovery and Optimization


49. ProofOptimizer: Training Language Models to Simplify Proofs without Human Demonstrations


50. Exploring the Synergy of Quantitative Factors and Newsflow Representations from Large Language Models for Stock Return Prediction


51. KS-Net: Multi-layer network model for determining the rotor type from motor parameters in interior PMSMs


52. Towards Label-Free Brain Tumor Segmentation: Unsupervised Learning with Multimodal MRI


53. Mixture of Experts Approaches in Dense Retrieval Tasks


54. ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings


55. CarBoN: Calibrated Best-of-N Sampling Improves Test-time Reasoning


56. Valeo Near-Field: a novel dataset for pedestrian intent detection


57. Enhance Large Language Models as Recommendation Systems with Collaborative Filtering


58. CQD-SHAP: Explainable Complex Query Answering via Shapley Values


59. Lightweight CycleGAN Models for Cross-Modality Image Transformation and Experimental Quality Assessment in Fluorescence Microscopy


60. The Spark Effect: On Engineering Creative Diversity in Multi-Agent AI Systems


61. SpikeVox: Towards Energy-Efficient Speech Therapy Framework with Spike-driven Generative Language Models


62. KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models


63. ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents


64. Think Parallax: Solving Multi-Hop Problems via Multi-View Knowledge-Graph-Based Retrieval-Augmented Generation


65. Rethinking Cross-lingual Gaps from a Statistical Viewpoint


66. TokenTiming: A Dynamic Alignment Method for Universal Speculative Decoding Model Pairs


67. MCA: Modality Composition Awareness for Robust Composed Multimodal Retrieval


68. Revisiting Knowledge Distillation: The Hidden Role of Dataset Size


69. Language Models are Injective and Hence Invertible


70. AI Adoption in NGOs: A Systematic Literature Review


71. The Road Less Traveled: Enhancing Exploration in LLMs via Sequential Sampling


72. DeceptionBench: A Comprehensive Benchmark for AI Deception Behaviors in Real-world Scenarios


73. OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning


74. An Experimental Study of Real-Life LLM-Proposed Performance Improvements


75. Selecting and Combining Large Language Models for Scalable Code Clone Detection


76. SoK: Taxonomy and Evaluation of Prompt Security in Large Language Models


77. Learning to Answer from Correct Demonstrations


78. Robust Optimization in Causal Models and G-Causal Normalizing Flows


79. Expediting Reinforcement Learning by Incorporating Knowledge About Temporal Causality in the Environment


80. A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning


81. Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning


82. Learning to Detect Unknown Jailbreak Attacks in Large Vision-Language Models


83. Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs


84. Robust High-Resolution Multi-Organ Diffusion MRI Using Synthetic-Data-Tuned Prompt Learning


85. MARIS: Marine Open-Vocabulary Instance Segmentation with Geometric Enhancement and Semantic Alignment


86. DroneAudioset: An Audio Dataset for Drone-based Search and Rescue


87. Towards Robust Zero-Shot Reinforcement Learning


88. Cortical-SSM: A Deep State Space Model for EEG and ECoG Motor Imagery Decoding


89. Kernel Regression in Structured Non-IID Settings: Theory and Implications for Denoising Score Learning


90. GaussGym: An open-source real-to-sim framework for learning locomotion from pixels


91. When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling


92. Readability Reconsidered: A Cross-Dataset Analysis of Reference-Free Metrics


93. ASBI: Leveraging Informative Real-World Data for Active Black-Box Simulator Tuning


94. BeLLMan: Controlling LLM Congestion


95. DSSmoothing: Toward Certified Dataset Ownership Verification for Pre-trained Language Models via Dual-Space Smoothing


96. Latent Diffusion Model without Variational Autoencoder


97. VERA-MH Concept Paper


98. Identifying internal patterns in (1+1)-dimensional directed percolation using neural networks


99. MTmixAtt: Integrating Mixture-of-Experts with Multi-Mix Attention for Large-Scale Recommendation


100. Exemplar-Guided Planing: Enhanced LLM Agent for KGQA


101. Post-Processing Methods for Improving Accuracy in MRI Inpainting


102. Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition


103. TACL: Threshold-Adaptive Curriculum Learning Strategy for Enhancing Medical Text Understanding


104. TraceCoder: Towards Traceable ICD Coding via Multi-Source Knowledge Integration


105. Robust Layerwise Scaling Rules by Proper Weight Decay Tuning


106. DRO-InstructZero: Distributionally Robust Prompt Optimization for Large Language Models


107. Planner and Executor: Collaboration between Discrete Diffusion And Autoregressive Models in Reasoning


108. Adaptive Individual Uncertainty under Out-Of-Distribution Shift with Expert-Routed Conformal Prediction


109. Extending Audio Context for Long-Form Understanding in Large Audio-Language Models


110. ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning


111. Automotive Crash Dynamics Modeling Accelerated with Machine Learning


112. The Economics of AI Foundation Models: Openness, Competition, and Governance


113. Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning


114. XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models


115. FarsiMCQGen: a Persian Multiple-choice Question Generation Framework


116. Latent Topic Synthesis: Leveraging LLMs for Electoral Ad Analysis


117. DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning


118. Targeted Attacks and Defenses for Distributed Federated Learning in Vehicular Networks


119. Continual Learning via Sparse Memory Finetuning


120. Operator Flow Matching for Timeseries Forecasting


121. Beyond Outcome-Based Imperfect-Recall: Higher-Resolution Abstractions for Imperfect-Information Games


122. DMRetriever: A Family of Models for Improved Text Retrieval in Disaster Management


123. Sequential Comics for Jailbreaking Multimodal Large Language Models via Structured Visual Storytelling


124. The Coverage Principle: How Pre-training Enables Post-Training


125. UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos


126. Active Honeypot Guardrail System: Probing and Confirming Multi-Turn LLM Jailbreaks


127. DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models


128. From Universal Approximation Theorem to Tropical Geometry of Multi-Layer Perceptrons


129. Hybrid Autoencoder-Based Framework for Early Fault Detection in Wind Turbines


130. Can generative AI figure out figurative language? The influence of idioms on essay scoring by ChatGPT, Gemini, and Deepseek


131. Rethinking Toxicity Evaluation in Large Language Models: A Multi-Label Perspective


132. TangledFeatures: Robust Feature Selection in Highly Correlated Spaces


133. Automated Snippet-Alignment Data Augmentation for Code Translation


134. VaultGemma: A Differentially Private Gemma Model


135. Evaluation and Implementation of Machine Learning Algorithms to Predict Early Detection of Kidney and Heart Disease in Diabetic Patients


136. PC-UNet: An Enforcing Poisson Statistics U-Net for Positron Emission Tomography Denoising


137. GAZE:Governance-Aware pre-annotation for Zero-shot World Model Environments


138. The Role of Federated Learning in Improving Financial Security: A Survey


139. Constrained Diffusion for Protein Design with Hard Structural Constraints


140. RegimeFolio: A Regime Aware ML System for Sectoral Portfolio Optimization in Dynamic Markets


141. DeepAries: Adaptive Rebalancing Interval Selection for Enhanced Portfolio Selection


142. Design and Analysis of Parallel Artificial Protozoa Optimizer (P-APO) using CUDA Architecture


143. Reinforcement Learning with Stochastic Reward Machines


144. End-to-End Multi-Modal Diffusion Mamba


145. Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models