전체 AI 논문 - 2026-02-04

1. AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations


2. Conformal Thinking: Risk Control for Reasoning on a Compute Budget


3. Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity


4. AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration


5. TodyComm: Task-Oriented Dynamic Communication for Multi-Round LLM-based Multi-Agent System


6. Mitigating Conversational Inertia in Multi-Turn Agents


7. Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration


8. Can LLMs Do Rocket Science? Exploring the Limits of Complex Reasoning with GTOC 12


9. EHRWorld: A Patient-Centric Medical World Model for Long-Horizon Clinical Trajectories


10. Persona Generators: Generating Diverse Synthetic Personas at Scale


11. Group Selection as a Safeguard Against AI Substitution


12. When Routing Collapses: On the Degenerate Convergence of LLM Routers


13. IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning


14. The Dual Role of Abstracting over the Irrelevant in Symbolic Explanations: Cognitive Effort vs. Understanding


15. CRL-VLA: Continual Vision-Language-Action Learning


16. Ontology-to-tools compilation for executable semantic constraint enforcement in LLM agents


17. DiscoverLLM: From Executing Intents to Discovering Them


18. Feasible strategies for conflict resolution within intuitionistic fuzzy preference-based conflict situations


19. Risk Awareness Injection: Calibrating Vision-Language Models for Safety without Compromising Utility


20. GFlowPO: Generative Flow Network as a Language Model Prompt Optimizer


21. Building Interpretable Models for Moral Decision-Making


22. MentalSeek-Dx: Towards Progressive Hypothetico-Deductive Reasoning for Real-world Psychiatric Diagnosis


23. Memora: A Harmonic Memory Representation Balancing Abstraction and Specificity


24. Rejecting Arguments Based on Doubt in Structured Bipolar Argumentation


25. MeetBench-XL: Calibrated Multi-Dimensional Evaluation and Learned Dual-Policy Agents for Real-Time Meetings


26. Agentic Proposing: Enhancing Large Language Model Reasoning via Compositional Skill Synthesis


27. CSR-Bench: A Benchmark for Evaluating the Cross-modal Safety and Reliability of MLLMs


28. LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios


29. Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning


30. The Necessity of a Unified Framework for LLM-Based Agent Evaluation


31. TAME: A Trustworthy Test-Time Evolution of Agent Memory with Systematic Benchmarking


32. Beyond Quantity: Trajectory Diversity Scaling for Code Agents


33. VALUEFLOW: Toward Pluralistic and Steerable Value-based Alignment in Large Language Models


34. Enhancing Foundation VLM Robustness to Missing Modality: Scalable Diffusion for Bi-directional Feature Restoration


35. General Agents Contain World Models, even under Partial Observability and Stochasticity


36. Understanding Multi-Agent LLM Frameworks: A Unified Benchmark and Experimental Analysis


37. Risky-Bench: Probing Agentic Safety Risks under Real-World Deployment


38. De-conflating Preference and Qualification: Constrained Dual-Perspective Reasoning for Job Recommendation with Large Language Models


39. MAS-ProVe: Understanding the Process Verification of Multi-Agent Systems


40. KANFIS A Neuro-Symbolic Framework for Interpretable and Uncertainty-Aware Learning


41. Visual Reasoning over Time Series via Multi-Agent System


42. RC-GRPO: Reward-Conditioned Group Relative Policy Optimization for Multi-Turn Tool Calling Agents


43. STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models


44. Distilling LLM Reasoning into Graph of Concept Predictors


45. Methods and Open Problems in Differentiable Social Choice: Learning Mechanisms, Decisions, and Alignment


46. Agent Alpha: Tree Search Unifying Generation, Exploration and Evaluation for Computer-Use Agents


47. Large Language Models Can Take False First Steps at Inference-time Planning


48. Are LLMs Biased Like Humans? Causal Reasoning as a Function of Prior Knowledge, Irrelevant Information, and Reasoning Budget


49. Structuring Value Representations via Geometric Coherence in Markov Decision Processes


50. Generative Engine Optimization: A VLM and Agent Framework for Pinterest Acquisition Growth


51. UAT-LITE: Inference-Time Uncertainty-Aware Attention for Pretrained Transformers


52. DeltaEvolve: Accelerating Scientific Discovery through Momentum-Driven Evolution


53. Reasoning about Reasoning: BAPO Bounds on Chain-of-Thought Token Complexity in LLMs


54. FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights


55. Minimal Computational Preconditions for Subjective Perspective in Artificial Agents


56. Aligning Language Model Benchmarks with Pairwise Preferences


57. “I May Not Have Articulated Myself Clearly”: Diagnosing Dynamic Instability in LLM Reasoning at Inference Time



59. AutoSizer: Automatic Sizing of Analog and Mixed-Signal Circuits via Large Language Model (LLM) Agents


60. Chain of Simulation: A Dual-Mode Reasoning Framework for Large Language Models with Dynamic Problem Routing


61. Scaling-Aware Adapter for Structure-Grounded LLM Reasoning


62. Dynamic Mix Precision Routing for Efficient Multi-step LLM Interaction


63. ATLAS : Adaptive Self-Evolutionary Research Agent with Task-Distributed Multi-LLM Supporters


64. MARS: Modular Agent with Reflective Search for Automated AI Research


65. A Positive Case for Faithfulness: LLM Self-Explanations Help Predict Model Behavior


66. PeerRank: Autonomous LLM Evaluation Through Web-Grounded, Bias-Controlled Peer Review


67. Uncertainty and Fairness Awareness in LLM-Based Recommendation Systems


68. Experience-Driven Multi-Agent Systems Are Training-free Context-aware Earth Observers


69. CreditAudit: 2D Auditing for LLM Evaluation and Selection


70. PLATE: Plasticity-Tunable Efficient Adapters for Geometry-Aware Continual Learning


71. PrevizWhiz: Combining Rough 3D Scenes and 2D Video to Guide Generative Video Previsualization


72. Accelerating Scientific Research with Gemini: Case Studies and Common Techniques


73. Adaptive Evidence Weighting for Audio-Spatiotemporal Fusion


74. Antidistillation Fingerprinting


75. Enhancing Imbalanced Node Classification via Curriculum-Guided Feature Learning and Three-Stage Attention Network


76. Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation


77. Do We Need Asynchronous SGD? On the Near-Optimality of Synchronous Methods


78. Conformal Reachability for Safe Control in Unknown Environments


79. WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents


80. Fast Sampling for Flows and Diffusions with Lazy and Point Mass Stochastic Interpolants


81. Efficient Estimation of Kernel Surrogate Models for Task Attribution


82. Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity


83. DiffLOB: Diffusion Models for Counterfactual Generation in Limit Order Books


84. An Empirical Study of Collective Behaviors and Social Dynamics in Large Language Model Agents


85. UniGeM: Unifying Data Mixing and Selection via Geometric Exploration and Mining


86. Decision-oriented benchmarking to transform AI weather forecast access: Application to the Indian monsoon


87. Zero-shot large vision-language model prompting for automated bone identification in paleoradiology x-ray archives


88. Cognitively Diverse Multiple-Choice Question Generation: A Hybrid Multi-Agent Framework with Large Language Models


89. Anytime Pretraining: Horizon-Free Learning-Rate Schedules with Weight Averaging


90. Agent Primitives: Reusable Latent Building Blocks for Multi-Agent Systems


91. OCRTurk: A Comprehensive OCR Benchmark for Turkish


92. LLM-Inspired Pretrain-Then-Finetune for Small-Data, Large-Scale Optimization


93. Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation


94. QuAIL: Quality-Aware Inertial Learning for Robust Training under Data Corruption


95. Universal One-third Time Scaling in Learning Peaked Distributions


96. ContraLog: Log File Anomaly Detection with Contrastive Learning and Masked Language Modeling


97. Equilibrium Propagation for Non-Conservative Systems


98. Efficient Sequential Neural Network with Spatial-Temporal Attention and Linear LSTM for Robust Lane Detection Using Multi-Frame Images


99. RAGTurk: Best Practices for Retrieval Augmented Generation in Turkish


100. Tutorial on Reasoning for IR & IR for Reasoning


101. BIRDTurk: Adaptation of the BIRD Text-to-SQL Dataset to Turkish



103. A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures


104. APEX: Probing Neural Networks via Activation Perturbation


105. $V_0$: A Generalist Value Model for Any Policy at State Zero


106. Don’t believe everything you read: Understanding and Measuring MCP Behavior under Misleading Tool Descriptions


107. Use Graph When It Needs: Efficiently and Adaptively Integrating Retrieval-Augmented Generation with Graphs


108. EVE: Efficient Verification of Data Erasure through Customized Perturbation in Approximate Unlearning


109. HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing


110. ELIQ: A Label-Free Framework for Quality Assessment of Evolving AI-Generated Images


111. When Single Answer Is Not Enough: Rethinking Single-Step Retrosynthesis Benchmarks for LLMs


112. Morphe: High-Fidelity Generative Video Streaming with Vision Foundation Model


113. D3PIA: A Discrete Denoising Diffusion Model for Piano Accompaniment Generation From Lead sheet


114. Live or Lie: Action-Aware Capsule Multiple Instance Learning for Risk Assessment in Live Streaming Platforms


115. Not All Negative Samples Are Equal: LLMs Learn Better from Plausible Reasoning


116. Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation


117. CMR: Contractive Mapping Embeddings for Robust Humanoid Locomotion on Unstructured Terrains


118. Explaining the Explainer: Understanding the Inner Workings of Transformer-based Symbolic Regression Models


119. Generative Decompression: Optimal Lossy Decoding Against Distribution Mismatch


120. Reparameterization Flow Policy Optimization


121. DeepDFA: Injecting Temporal Logic in Deep Learning for Sequential Subsymbolic Applications


122. Self-Verification Dilemma: Experience-Driven Suppression of Overused Checking in LLM Reasoning


123. ScDiVa: Masked Discrete Diffusion for Joint Modeling of Single-Cell Identity and Expression


124. Beyond Variance: Prompt-Efficient RLVR via Rare-Event Amplification and Bidirectional Pairing


125. Hierarchical Concept-to-Appearance Guidance for Multi-Subject Image Generation


126. Socratic-Geo: Synthetic Data Generation and Geometric Reasoning via Multi-Agent Interaction


127. Precision in Practice: Knowledge Guided Code Summarizing Grounded in Industrial Expectations


128. On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models


129. Chain-of-Goals Hierarchical Policy for Long-Horizon Offline Goal-Conditioned RL


130. Toward a Sustainable Federated Learning Ecosystem: A Practical Least Core Mechanism for Payoff Allocation


131. An Approximate Ascent Approach To Prove Convergence of PPO


132. Rethinking Benign Relearning: Syntax as the Hidden Driver of Unlearning Failures


133. SLIM-Diff: Shared Latent Image-Mask Diffusion with Lp loss for Data-Scarce Epilepsy FLAIR MRI


134. MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling


135. Causal Graph Learning via Distributional Invariance of Cause-Effect Relationship


136. Robustness as an Emergent Property of Task Performance


137. Tiled Prompts: Overcoming Prompt Underspecification in Image and Video Super-Resolution


138. MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning


139. Multiparameter Uncertainty Mapping in Quantitative Molecular MRI using a Physics-Structured Variational Autoencoder (PS-VAE)


140. RDT2: Exploring the Scaling Limit of UMI Data Towards Zero-Shot Cross-Embodiment Generalization


141. Entropy-Gated Selective Policy Optimization:Token-Level Gradient Allocation for Hybrid Training of Large Language Models


142. Learning to Select: Query-Aware Adaptive Dimension Selection for Dense Retrieval


143. Full end-to-end diagnostic workflow automation of 3D OCT via foundation model-driven AI for retinal diseases


144. Periodic Regularized Q-Learning


145. R1-SyntheticVL: Is Synthetic Data from Generative Models Ready for Multimodal Large Language Model?


146. POP: Prefill-Only Pruning for Efficient Large Model Inference


147. Global Geometry Is Not Enough for Vision Representations


148. Unveiling Covert Toxicity in Multimodal Data via Toxicity Association Graphs: A Graph-Based Metric and Interpretable Detection Framework


149. GraDE: A Graph Diffusion Estimator for Frequent Subgraph Discovery in Neural Architectures


150. ATACompressor: Adaptive Task-Aware Compression for Efficient Long-Context Processing in LLMs


151. Distribution-Aware End-to-End Embedding for Streaming Numerical Features in Click-Through Rate Prediction


152. Topology Matters: A Cautionary Case Study of Graph SSL on Neuro-Inspired Benchmarks


153. Latent Neural-ODE for Model-Informed Precision Dosing: Overcoming Structural Assumptions in Pharmacokinetics


154. Lookahead Sample Reward Guidance for Test-Time Scaling of Diffusion Models


155. Hand3R: Online 4D Hand-Scene Reconstruction in the Wild


156. Reinforcement Learning with Promising Tokens for Large Language Models


157. Prompt Augmentation Scales up GRPO Training on Mathematical Reasoning


158. Privasis: Synthesizing the Largest “Public” Private Dataset from Scratch


159. MemCast: Memory-Driven Time Series Forecasting with Experience-Conditioned Reasoning


160. Intelligent Front-End Personalization: AI-Driven UI Adaptation


161. Internet of Agentic AI: Incentive-Compatible Distributed Teaming and Workflow


162. Self-Hinting Language Models Enhance Reinforcement Learning


163. SwiftVLM: Efficient Vision-Language Model Inference via Cross-Layer Token Bypass


164. Contrastive Concept-Tree Search for LLM-Assisted Algorithm Discovery


165. Beyond Cropping and Rotation: Automated Evolution of Powerful Task-Specific Augmentations with Generative Models


166. Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost



168. “I’m happy even though it’s not real”: GenAI Photo Editing as a Remembering Experience


169. Task–Specificity Score: Measuring How Much Instructions Really Matter for Supervision


170. TextME: Bridging Unseen Modalities Through Text Descriptions


171. PRISM: Structured Optimization via Anisotropic Spectral Shaping


172. Training and Simulation of Quadrupedal Robot in Adaptive Stair Climbing for Indoor Firefighting: An End-to-End Reinforcement Learning Approach


173. The Trigger in the Haystack: Extracting and Reconstructing LLM Backdoor Triggers


174. FlashSinkhorn: IO-Aware Entropic Optimal Transport


175. Shortcut Features as Top Eigenfunctions of NTK: A Linear Neural Network Case and More


176. JRDB-Pose3D: A Multi-person 3D Human Pose and Shape Estimation Dataset for Robotics


177. Evaluating LLMs When They Do Not Know the Answer: Statistical Evaluation of Mathematical Reasoning via Comparative Signals


178. Towards Considerate Embodied AI: Co-Designing Situated Multi-Site Healthcare Robots from Abstract Concepts to High-Fidelity Prototypes


179. CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs


180. SAFE-KD: Risk-Controlled Early-Exit Distillation for Vision Backbones


181. Bongards at the Boundary of Perception and Reasoning: Programs or Language?


182. Consistency Deep Equilibrium Models


183. FedKRSO: Communication and Memory Efficient Federated Fine-Tuning of Large Language Models


184. CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability


185. VOILA: Value-of-Information Guided Fidelity Selection for Cost-Aware Multimodal Question Answering


186. Causal Graph Spatial-Temporal Autoencoder for Reliable and Interpretable Process Monitoring


187. Adaptive Batch Sizes Using Non-Euclidean Gradient Noise Scales for Stochastic Sign and Spectral Descent


188. NLI:Non-uniform Linear Interpolation Approximation of Nonlinear Operations for Efficient LLMs Inference


189. Aligning Forest and Trees in Images and Long Captions for Visually Grounded Understanding


190. Where Norms and References Collide: Evaluating LLMs on Normative Reasoning


191. Embodiment-Aware Generalist Specialist Distillation for Unified Humanoid Whole-Body Control


192. Synthetic Data Augmentation for Medical Audio Classification: A Preliminary Evaluation


193. Nüwa: Mending the Spatial Integrity Torn by VLM Token Pruning


194. Equal Access, Unequal Interaction: A Counterfactual Audit of LLM Fairness


195. RPG-AE: Neuro-Symbolic Graph Autoencoders with Rare Pattern Mining for Provenance-Based Anomaly Detection


196. Refining Decision Boundaries In Anomaly Detection Using Similarity Search Within the Feature Space


197. A Multi-scale Linear-time Encoder for Whole-Slide Image Analysis


198. Notes on the Reward Representation of Posterior Updates


199. A Random Matrix Theory Perspective on the Consistency of Diffusion Models


200. Spatiotemporal Decision Transformer for Traffic Coordination


201. Manifold-Constrained Energy-Based Transition Models for Offline Reinforcement Learning


202. Moving On, Even When You’re Broken: Fail-Active Trajectory Generation via Diffusion Policies Conditioned on Embodiment and Task


203. HALT: Hallucination Assessment via Log-probs as Time series


204. Mixture of Concept Bottleneck Experts


205. Learning-Infused Formal Reasoning: From Contract Synthesis to Artifact Reuse and Formal Semantics


206. Causal Flow Q-Learning for Robust Offline Reinforcement Learning


207. Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains


208. Tabula RASA: Exposing and Breaking the Relational Bottleneck in Transformers


209. From Tokens to Numbers: Continuous Number Modeling for SVG Generation


210. LmPT: Conditional Point Transformer for Anatomical Landmark Detection on 3D Point Clouds


211. Joint Learning of Hierarchical Neural Options and Abstract World Model


212. Causality–Δ: Jacobian-Based Dependency Analysis in Flow Matching Models


213. Simulating Human Audiovisual Search Behavior


214. Structure-Preserving Learning Improves Geometry Generalization in Neural PDEs


215. Cross-Temporal Attention Fusion (CTAF) for Multimodal Physiological Signals in Self-Supervised Learning


216. Evaluating False Alarm and Missing Attacks in CAN IDS


217. Provable Effects of Data Replay in Continual Learning: A Feature Learning Perspective


218. Scaling Small Agents Through Strategy Auctions


219. Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding


220. TopoPrune: Robust Data Pruning via Unified Latent Space Topology


221. When Noise Lowers The Loss: Rethinking Likelihood-Based Evaluation in Music Large Language Models


222. WAXAL: A Large-Scale Multilingual African Language Speech Corpus


223. Predicting first-episode homelessness among US Veterans using longitudinal EHR data: time-varying models and social risk factors


224. CAPS: Unifying Attention, Recurrence, and Alignment in Transformer-based Time Series Forecasting


225. Search-Augmented Masked Diffusion Models for Constrained Generation


226. BinaryPPO: Efficient Policy Optimization for Binary Classification


227. Every Bit Counts: A Theoretical Study of Precision-Expressivity Tradeoffs in Quantized Transformers


228. Sparsely Supervised Diffusion


229. Eidolon: A Practical Post-Quantum Signature Scheme Based on k-Colorability in the Age of Graph Neural Networks


230. Monotonicity as an Architectural Bias for Robust Language Models


231. MARA: Continuous SE(3)-Equivariant Attention for Molecular Force Fields


232. Benchmarking Large Language Models for Zero-shot and Few-shot Phishing URL Detection


233. WideSeek: Advancing Wide Research via Multi-Agent Scaling


234. Performance of Small Language Model Pretraining on FABRIC: An Empirical Study


235. Trailer Reimagined: An Innovative, Llm-DRiven, Expressive Automated Movie Summary framework (TRAILDREAMS)


236. Trustworthy Blockchain-based Federated Learning for Electronic Health Records: Securing Participant Identity with Decentralized Identifiers and Verifiable Credentials


237. Recommender system in X inadvertently profiles ideological positions of users


238. Learning Consistent Causal Abstraction Networks


239. CryoLVM: Self-supervised Learning from Cryo-EM Density Maps with Large Vision Models


240. daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently


241. A Semi-Supervised Pipeline for Generalized Behavior Discovery from Animal-Borne Motion Time Series


242. TinyGuard:A lightweight Byzantine Defense for Resource-Constrained Federated Learning via Statistical Update Fingerprints


243. Testing Storage-System Correctness: Challenges, Fuzzing Limitations, and AI-Augmented Opportunities


244. Exploring Silicon-Based Societies: An Early Study of the Moltbook Agent Community


245. Discovering Data Manifold Geometry via Non-Contracting Flows


246. Gender Dynamics and Homophily in a Social Network of LLM Agents


247. Fine-Tuning Language Models to Know What They Know


248. AI Assisted Economics Measurement From Survey: Evidence from Public Employee Pension Choice


249. CaST: Causal Discovery via Spatio-Temporal Graphs in Disaster Tweets


250. Step-Wise Refusal Dynamics in Autoregressive and Diffusion Language Models


251. RAP: KV-Cache Compression via RoPE-Aligned Pruning


252. Social Catalysts, Not Moral Agents: The Illusion of Alignment in LLM Societies


253. ContextEvolve: Multi-Agent Context Compression for Systems Code Optimization


254. To Defend Against Cyber Attacks, We Must Teach AI Agents to Hack


255. Effective Frontiers: A Unification of Neural Scaling Laws


256. Learnable Koopman-Enhanced Transformer-Based Time Series Forecasting with Spectral Control


257. VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis


258. Agentic Observability: Automated Alert Triage for Adobe E-Commerce


259. Constitutional Spec-Driven Development: Enforcing Security by Construction in AI-Assisted Code Generation


260. QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals


261. ProphetKV: User-Query-Driven Selective Recomputation for Efficient KV Cache Reuse in Retrieval-Augmented Generation


262. Product Interaction: An Algebraic Formalism for Deep Learning Architectures


263. Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective


264. Trajectory Consistency for One-Step Generation on Euler Mean Flows


265. DECEIVE-AFC: Adversarial Claim Attacks against Search-Enabled LLM-based Fact-Checking Systems


266. IceBench-S2S: A Benchmark of Deep Learning for Challenging Subseasonal-to-Seasonal Daily Arctic Sea Ice Forecasting in Deep Latent Space


267. A Comparative Simulation Study of the Fairness and Accuracy of Predictive Policing Systems in Baltimore City


268. High Rank Matrix Completion via Grassmannian Proxy Fusion


269. MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics


270. Auditing Sybil: Explaining Deep Lung Cancer Risk Prediction Through Generative Interventional Attributions


271. PA-MIL: Phenotype-Aware Multiple Instance Learning Guided by Language Prompting and Genotype-to-Phenotype Relationships


272. The Alignment Curse: Cross-Modality Jailbreak Transfer in Omni-Models


273. Beyond Experience Retrieval: Learning to Generate Utility-Optimized Structured Experience for Frozen LLMs


274. Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards


275. BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation


276. EEO-TFV: Escape-Explore Optimizer for Web-Scale Time-Series Forecasting and Vision Analysis


277. HyPAC: Cost-Efficient LLMs-Human Hybrid Annotation with PAC Error Guarantees


278. ToolTok: Tool Tokenization for Efficient and Generalizable GUI Agents


279. naPINN: Noise-Adaptive Physics-Informed Neural Networks for Recovering Physics from Corrupted Measurement


280. D$^2$Quant: Accurate Low-bit Post-Training Weight Quantization for LLMs


281. Beyond Alignment: Expanding Reasoning Capacity via Manifold-Reshaping Policy Optimization


282. SPA-Cache: Singular Proxies for Adaptive Caching in Diffusion Language Models


283. Toward Ultra-Long-Horizon Sequential Model Editing


284. Auto-Augmentation Contrastive Learning for Wearable-based Human Activity Recognition


285. From Sparse Decisions to Dense Reasoning: A Multi-attribute Trajectory Paradigm for Multimodal Moderation


286. Enhancing Psychologists’ Understanding through Explainable Deep Learning Framework for ADHD Diagnosis


287. CADENT: Gated Hybrid Distillation for Sample-Efficient Transfer in Reinforcement Learning


288. Incident-Guided Spatiotemporal Traffic Forecasting


289. The “Robert Boulton” Singularity: Semantic Tunneling and Manifold Unfolding in Recursive AI


290. Community Norms in the Spotlight: Enabling Task-Agnostic Unsupervised Pre-Training to Benefit Online Social Media


291. GASTON: Graph-Aware Social Transformer for Online Networks


292. TabularMath: Evaluating Computational Extrapolation in Tabular Learning via Program-Verified Synthesis


293. IMU-1: Sample-Efficient Pre-training of Small Language Models


294. Scaled Dot-Product Attention implements projection of inputs onto a common surface


295. Artificial Intelligence for Inclusive Engineering Education: Advancing Equality, Diversity, and Ethical Leadership


296. Evaluation of Large Language Models’ educational feedback in Higher Education: potential, limitations and implications for educational practice


297. GraphDancer: Training LLMs to Explore and Reason over Graphs via Curriculum Reinforcement Learning


298. What Drives Length of Stay After Elective Spine Surgery? Insights from a Decade of Predictive Modeling


299. Measuring Individual User Fairness with User Similarity and Effectiveness Disparity


300. Efficient Edge Rewiring Strategies for Enhancing PageRank Fairness


301. Training Data Governance for Brain Foundation Models


302. Beyond Translation: Cross-Cultural Meme Transcreation with Vision-Language Models


303. CodeGuard: Improving LLM Guardrails in CS Education


304. Precoding-Oriented CSI Feedback Design with Mutual Information Regularized VQ-VAE


305. Learning-augmented smooth integer programs with PAC-learnable oracles


306. Joint single-shot ToA and DoA estimation for VAA-based BLE ranging with phase ambiguity: A deep learning-based approach


307. Sparse Adapter Fusion for Continual Learning in NLP


308. UNSO: Unified Newton Schulz Orthogonalization


309. Test-Time Detoxification without Training or Learning Anything


310. STEMVerse: A Dual-Axis Diagnostic Framework for STEM Reasoning in Large Language Models


311. RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System


312. Kimi K2.5: Visual Agentic Intelligence