전체 AI 논문 - 2025-10-07

1. Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner


2. CoDA: Agentic Systems for Collaborative Data Visualization


3. Improving Cooperation in Collaborative Embodied AI


4. A Study of Rule Omission in Raven’s Progressive Matrices


5. From Facts to Foils: Designing and Evaluating Counterfactual Explanations for Smart Environments


6. Onto-Epistemological Analysis of AI Explanations


7. Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models


8. Reward Model Routing in Alignment


9. Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization


10. Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents


11. NCV: A Node-Wise Consistency Verification Approach for Low-Cost Structured Error Localization in LLM Reasoning


12. Automated Constraint Specification for Job Scheduling by Regulating Generative Model with Domain-Specific Representation


13. ARMs: Adaptive Red-Teaming Agent against Multimodal Models with Plug-and-Play Attacks


14. AutoMaAS: Self-Evolving Multi-Agent Architecture Search for Large Language Models


15. A Concept of Possibility for Real-World Events


16. Geolog-IA: Conversational System for Academic Theses


17. On the Role of Temperature Sampling in Test-Time Scaling


18. Mitigating Modal Imbalance in Multimodal Reasoning


19. Multimodal Large Language Model Framework for Safe and Interpretable Grid-Integrated EVs


20. A Benchmark Study of Deep Reinforcement Learning Algorithms for the Container Stowage Planning Problem


21. Agentic Additive Manufacturing Alloy Discovery


22. Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge


23. Multimodal Function Vectors for Spatial Relations


24. Safe and Efficient In-Context Learning via Risk Control


25. RefineShot: Rethinking Cinematography Understanding with Foundational Skill Evaluation


26. BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks


27. Reward Models are Metrics in a Trench Coat


28. Improving GUI Grounding with Explicit Position-to-Coordinate Mapping


29. Test-Time Defense Against Adversarial Attacks via Stochastic Resonance of Latent Ensembles


30. Self-Anchor: Large Language Model Reasoning via Step-by-step Attention Alignment


31. Abstain and Validate: A Dual-LLM Policy for Reducing Noise in Agentic Program Repair


32. Wave-GMS: Lightweight Multi-Scale Generative Model for Medical Image Segmentation


33. Simulation to Rules: A Dual-VLM Framework for Formal Visual Planning


34. Topic Modeling as Long-Form Generation: Can Long-Context LLMs revolutionize NTM via Zero-Shot Prompting?


35. UniShield: An Adaptive Multi-Agent Framework for Unified Forgery Image Detection and Localization


36. SpineBench: A Clinically Salient, Level-Aware Benchmark Powered by the SpineMed-450k Corpus


37. Stimulus-Voltage-Based Prediction of Action Potential Onset Timing: Classical vs. Quantum-Inspired Approaches


38. Signature-Informed Transformer for Asset Allocation


39. HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion


40. Distilled Protein Backbone Generation


41. What Drives Compositional Generalization in Visual Generative Models?


42. A Study of Neural Polar Decoders for Communication


43. A Unified Deep Reinforcement Learning Approach for Close Enough Traveling Salesman Problem


44. Comparative Analysis of Parameterized Action Actor-Critic Reinforcement Learning Algorithms for Web Search Match Plan Generation


45. Semantic Differentiation in Speech Emotion Recognition: Insights from Descriptive and Expressive Speech Roles


46. ZeroShotOpt: Towards Zero-Shot Pretrained Models for Efficient Black-Box Optimization


47. When and Where do Events Switch in Multi-Event Video Generation?


48. CHORD: Customizing Hybrid-precision On-device Model for Sequential Recommendation with Device-cloud Collaboration


49. Investigating The Smells of LLM Generated Code


50. Learning Robust Diffusion Models from Imprecise Supervision


51. BrainIB++: Leveraging Graph Neural Networks and Information Bottleneck for Functional Brain Biomarkers in Schizophrenia


52. From high-frequency sensors to noon reports: Using transfer learning for shaft power prediction in maritime


53. Untargeted Jailbreak Attack


54. AI Generated Child Sexual Abuse Material - What’s the Harm?


55. Corrosion Risk Estimation for Heritage Preservation: An Internet of Things and Machine Learning Approach Using Temperature and Humidity


56. Grounding Large Language Models in Clinical Evidence: A Retrieval-Augmented Generation System for Querying UK NICE Clinical Guidelines


57. Ergodic Risk Measures: Towards a Risk-Aware Foundation for Continual Reinforcement Learning


58. Multimodal Carotid Risk Stratification with Large Vision-Language Models: Benchmarking, Fine-Tuning, and Clinical Insights


59. WavInWav: Time-domain Speech Hiding via Invertible Neural Network


60. FeDABoost: Fairness Aware Federated Learning with Adaptive Boosting


61. FinReflectKG - MultiHop: Financial QA Benchmark for Reasoning with Knowledge Graph Evidence


62. DMark: Order-Agnostic Watermarking for Diffusion Large Language Models


63. Global Convergence of Policy Gradient for Entropy Regularized Linear-Quadratic Control with multiplicative noise


64. Representing Beauty: Towards a Participatory but Objective Latent Aesthetics


65. Constraint Satisfaction Approaches to Wordle: Novel Heuristics and Cross-Lexicon Validation


66. Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech


67. Knowledge-Aware Modeling with Frequency Adaptive Learning for Battery Health Prognostics


68. Evaluating Large Language Models for IUCN Red List Species Information


69. A Computational Framework for Interpretable Text-Based Personality Assessment from Social Media


70. Dissecting Transformers: A CLEAR Perspective towards Green AI


71. Relevance-Aware Thresholding in Online Conformal Prediction for Time Series


72. Work Zones challenge VLM Trajectory Planning: Toward Mitigation and Robust Autonomous Driving


73. OptunaHub: A Platform for Black-Box Optimization


74. Pareto-optimal Non-uniform Language Generation


75. MaskCD: Mitigating LVLM Hallucinations by Image Head Masked Contrastive Decoding


76. Align Your Query: Representation Alignment for Multimodality Medical Object Detection


77. Fusing Multi- and Hyperspectral Satellite Data for Harmful Algal Bloom Monitoring with Self-Supervised and Hierarchical Deep Learning


78. Hierarchical Generalized Category Discovery for Brain Tumor Classification in Digital Pathology


79. Prototyping Digital Social Spaces through Metaphor-Driven Design: Translating Spatial Concepts into an Interactive Social Simulation


80. SAE-RNA: A Sparse Autoencoder Model for Interpreting RNA Language Model Representations


81. TravelBench : Exploring LLM Performance in Low-Resource Domains


82. CST-AFNet: A dual attention-based deep learning framework for intrusion detection in IoT networks


83. A $1000\times$ Faster LLM-enhanced Algorithm For Path Planning in Large-scale Grid Maps


84. Fully automated inverse co-optimization of templates and block copolymer blending recipes for DSA lithography


85. Time-To-Inconsistency: A Survival Analysis of Large Language Model Robustness to Adversarial Attacks


86. A Novel Unified Lightweight Temporal-Spatial Transformer Approach for Intrusion Detection in Drone Networks


87. RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization


88. Fine-Tuning Diffusion Models via Intermediate Distribution Shaping


89. Can Data-Driven Dynamics Reveal Hidden Physics? There Is A Need for Interpretable Neural Operators


90. To Compress or Not? Pushing the Frontier of Lossless GenAI Model Weights Compression with Exponent Concentration


91. HALO: Memory-Centric Heterogeneous Accelerator with 2.5D Integration for Low-Batch LLM Inference


92. AgenticRAG: Tool-Augmented Foundation Models for Zero-Shot Explainable Recommender Systems


93. TutorBench: A Benchmark To Assess Tutoring Capabilities Of Large Language Models


94. When Researchers Say Mental Model/Theory of Mind of AI, What Are They Really Talking About?


95. Automatic Building Code Review: A Case Study


96. A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios


97. MINERVA: Mutual Information Neural Estimation for Supervised Feature Selection


98. How Confident are Video Models? Empowering Video Models to Express their Uncertainty


99. Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Ranking Feedback


100. ToolTweak: An Attack on Tool Selection in LLM-based Agents


101. Knowledge-Graph Based RAG System Evaluation Framework


102. PHORECAST: Enabling AI Understanding of Public Health Outreach Across Populations


103. From Pixels to Factors: Learning Independently Controllable State Variables for Reinforcement Learning


104. Litespark Technical Report: High-Throughput, Energy-Efficient LLM Training Framework


105. SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting


106. CLARITY: Clinical Assistant for Routing, Inference, and Triage


107. Market-Based Data Subset Selection – Principled Aggregation of Multi-Criteria Example Utility


108. How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models


109. Dynamic Target Attack


110. NEURODNAAI: Neural pipeline approaches for the advancing dna-based information storage as a sustainable digital medium using deep learning framework


111. Cross-Platform DNA Methylation Classifier for the Eight Molecular Subtypes of Group 3 & 4 Medulloblastoma


112. RainSeer: Fine-Grained Rainfall Reconstruction via Physics-Guided Modeling


113. Extreme value forecasting using relevance-based data augmentation with deep learning models


114. Glaucoma Detection and Structured OCT Report Generation via a Fine-tuned Multimodal Large Language Model


115. Linear RNNs for autoregressive generation of long music samples


116. Hyperparameters are all you need: Using five-step inference for an original diffusion model to generate images comparable to the latest distillation model


117. CWM: An Open-Weights LLM for Research on Code Generation with World Models


118. On The Fragility of Benchmark Contamination Detection in Reasoning Models


119. Scaling Homomorphic Applications in Deployment


120. Pretraining with hierarchical memories: separating long-tail and common knowledge


121. A Hybrid CAPTCHA Combining Generative AI with Keystroke Dynamics for Enhanced Bot Detection


122. A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory


123. Federated Spatiotemporal Graph Learning for Passive Attack Detection in Smart Grids


124. Training Dynamics of Parametric and In-Context Knowledge Utilization in Language Models


125. Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents


126. A Cross-Lingual Analysis of Bias in Large Language Models Using Romanian History


127. ChunkLLM: A Lightweight Pluggable Framework for Accelerating LLMs Inference


128. Spiral of Silence in Large Language Model Agents


129. Emission-GPT: A domain-specific language model agent for knowledge retrieval, emission inventory and data analysis


130. DiffuSpec: Unlocking Diffusion Language Models for Speculative Decoding


131. Privacy in the Age of AI: A Taxonomy of Data Risks


132. Measuring Physical-World Privacy Awareness of Large Language Models: An Evaluation Benchmark


133. Evaluating Bias in Spoken Dialogue LLMs for Real-World Decisions and Recommendations


134. Language, Culture, and Ideology: Personalizing Offensiveness Detection in Political Tweets with Reasoning LLMs


135. LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL


136. An Investigation into the Performance of Non-Contrastive Self-Supervised Learning Methods for Network Intrusion Detection


137. mini-vec2vec: Scaling Universal Geometry Alignment with Linear Transformations


138. Small Language Models for Curriculum-based Guidance


139. Breaking the MoE LLM Trilemma: Dynamic Expert Clustering with Structured Compression


140. $\texttt{BluePrint}$: A Social Media User Dataset for LLM Persona Evaluation and Training


141. CATMark: A Context-Aware Thresholding Framework for Robust Cross-Task Watermarking in Large Language Models


142. DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning


143. Evaluating Uncertainty Quantification Methods in Argumentative Large Language Models


144. Optimizing Long-Form Clinical Text Generation with Claim-Based Rewards


145. CRACQ: A Multi-Dimensional Approach To Automated Document Assessment


146. KurdSTS: The Kurdish Semantic Textual Similarity


147. FormalML: A Benchmark for Evaluating Formal Subgoal Completion in Machine Learning Theory


148. Where Did It Go Wrong? Attributing Undesirable LLM Behaviors via Representation Gradient Tracing


149. Human Mobility Datasets Enriched With Contextual and Social Dimensions


150. A High-Capacity and Secure Disambiguation Algorithm for Neural Linguistic Steganography


151. Synthetic Dialogue Generation for Interactive Conversational Elicitation & Recommendation (ICER)


152. EntropyLong: Effective Long-Context Training via Predictive Uncertainty


153. SelfJudge: Faster Speculative Decoding via Self-Supervised Judge Verification


154. AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering


155. KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI


156. Hallucination-Resistant, Domain-Specific Research Assistant with Self-Evaluation and Vector-Grounded Retrieval


157. Agentic-AI Healthcare: Multilingual, Privacy-First Framework with MCP Agents


158. Hallucination reduction with CASAL: Contrastive Activation Steering For Amortized Learning


159. Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations


160. Multiplicative-Additive Constrained Models:Toward Joint Visualization of Interactive and Independent Effects


161. Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement