전체 AI 논문 - 2026-01-08

1. MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents


2. InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents


3. Automatic Prompt Engineering with No Task Cues and No Tuning


4. A framework for assuring the accuracy and fidelity of an AI-enabled Digital Twin of en route UK airspace


5. Explainable Fuzzy GNNs for Leak Detection in Water Distribution Networks


6. Rationale-Grounded In-Context Learning for Time Series Reasoning with Multimodal Large Language Models


7. Batch-of-Thought: Cross-Instance Learning for Enhanced LLM Reasoning


8. Logical Phase Transitions: Understanding Collapse in LLM Logical Reasoning


9. ReTreVal: Reasoning Tree with Validation – A Hybrid Framework for Enhanced LLM Multi-Step Reasoning


10. SimRPD: Optimizing Recruitment Proactive Dialogue Agents through Simulator-Based Data Evaluation and Selection


11. M3MAD-Bench: Are Multi-Agent Debates Really Effective Across Domains and Modalities?


12. Sample-Efficient Neurosymbolic Deep Reinforcement Learning


13. Quantum-enhanced long short-term memory with attention for spatial permeability prediction in oilfield reservoirs


14. Causal-Enhanced AI Agents for Medical Research Screening


15. HAL: Inducing Human-likeness in LLMs with Alignment


16. LLM Agent Framework for Intelligent Change Analysis in Urban Environment using Remote Sensing Imagery


17. The Path Ahead for Agentic AI: Challenges and Opportunities


18. Time-Scaling Is What Agents Need Now


19. Learning User Preferences Through Interaction for Long-Term Collaboration


20. Learning from Prompt itself: the Hierarchical Attribution Prompt Optimization


21. Inferring Causal Graph Temporal Logic Formulas to Expedite Reinforcement Learning in Temporally Extended Tasks


22. AWARE-US: Benchmark for Preference-Aware Resolution in Tool-Calling Agents


23. An Empirical Study of On-Device Translation for Real-Time Live-Stream Chat on Mobile Devices


24. Orchestral AI: A Framework for Agent Orchestration


25. SimpleMem: Efficient Lifelong Memory for LLM Agents


26. Textual Explanations and Their Evaluations for Reinforcement Learning Policy


27. Multi-RADS Synthetic Radiology Report Dataset and Head-to-Head Benchmarking of 41 Open-Weight and Proprietary Language Models


28. The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization


29. The Fake Friend Dilemma: Trust and the Political Economy of Conversational AI


30. Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers


31. UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward


32. Counterfactual Fairness with Graph Uncertainty


33. Recursive querying of neural networks via weighted structures


34. DIP: Dynamic In-Context Planner For Diffusion Language Models


35. UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision


36. AnatomiX, an Anatomy-Aware Grounded Multimodal Large Language Model for Chest X-Ray Interpretation


37. Decentralized Autoregressive Generation


38. Multi-Modal Data-Enhanced Foundation Models for Prediction and Control in Wireless Networks: A Survey


39. Rapid Augmentations for Time Series (RATS): A High-Performance Library for Time Series Augmentation


40. Prompt-Counterfactual Explanations for Generative AI System Behavior


41. Self-Verification is All You Need To Pass The Japanese Bar Examination


42. Limited Linguistic Diversity in Embodied AI Datasets


43. Unified Thinker: A General Reasoning Modular Core for Image Generation


44. LeafLife: An Explainable Deep Learning Framework with Robustness for Grape Leaf Disease Recognition


45. ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation


46. Transformers self-organize like newborn visual systems when trained in prenatal worlds


47. Who Laughs with Whom? Disentangling Influential Factors in Humor Preferences across User Clusters and LLMs


48. Text-Guided Layer Fusion Mitigates Hallucination in Multimodal LLMs


49. Grad-ELLM: Gradient-based Explanations for Decoder-only LLMs


50. Joint Encoding of KV-Cache Blocks for Scalable LLM Serving


51. Do LLMs Encode Functional Importance of Reasoning Tokens?


52. IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation


53. On the Intrinsic Limits of Transformer Image Embeddings in Non-Solvable Spatial Reasoning


54. Motion Blur Robust Wheat Pest Damage Detection with Dynamic Fuzzy Feature Fusion


55. Lil: Less is Less When Applying Post-Training Sparse-Attention Algorithms in Long-Decode Stage


56. PiDR: Physics-Informed Inertial Dead Reckoning for Autonomous Platforms


57. Validating Generalist Robots with Situation Calculus and STL Falsification


58. Causal Manifold Fairness: Enforcing Geometric Invariance in Representation Learning


59. Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis


60. In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior


61. SentGraph: Hierarchical Sentence Graph for Multi-hop Retrieval-Augmented Question Answering


62. JPU: Bridging Jailbreak Defense and Unlearning via On-Policy Path Rectification


63. Learning to Act Robustly with View-Invariant Latent Actions


64. Towards Faithful Reasoning in Comics for Small MLLMs


65. ULS+: Data-driven Model Adaptation Enhances Lesion Segmentation


66. LAMS-Edit: Latent and Attention Mixing with Schedulers for Improved Content Preservation in Diffusion-Based Image and Style Editing


67. Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning


68. Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders


69. Correct, Concise and Complete: Multi-stage Training For Adaptive Reasoning


70. MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free


71. The World is Not Mono: Enabling Spatial Understanding in Large Audio-Language Models


72. SastBench: A Benchmark for Testing Agentic SAST Triage


73. PrismVAU: Prompt-Refined Inference System for Multimodal Video Anomaly Understanding


74. DCG ReID: Disentangling Collaboration and Guidance Fusion Representations for Multi-modal Vehicle Re-Identification


75. RAL2M: Retrieval Augmented Learning-To-Match Against Hallucination in Compliance-Guaranteed Service Systems


76. TA-Prompting: Enhancing Video Large Language Models for Dense Video Captioning via Temporal Anchors


77. LOST-3DSG: Lightweight Open-Vocabulary 3D Scene Graphs with Semantic Tracking in Dynamic Environments


78. LongBench Pro: A More Realistic and Comprehensive Bilingual Long-Context Evaluation Benchmark


79. TiMem: Temporal-Hierarchical Memory Consolidation for Long-Horizon Conversational Agents


80. Breaking Self-Attention Failure: Rethinking Query Initialization for Infrared Small Target Detection


81. MiMo-V2-Flash Technical Report


82. Closing the Reality Gap: Zero-Shot Sim-to-Real Deployment for Dexterous Force-Based Grasping and Manipulation


83. UniSRCodec: Unified and Low-Bitrate Single Codebook Codec with Sub-Band Reconstruction


84. Netflix Artwork Personalization via LLM Post-training


85. Q-Regularized Generative Auto-Bidding: From Suboptimal Trajectories to Optimal Policies


86. Window-based Membership Inference Attacks Against Fine-tuned Large Language Models


87. Hypothesize-Then-Verify: Speculative Root Cause Analysis for Microservices with Pathwise Parallelism


88. Agentic Memory Enhanced Recursive Reasoning for Root Cause Localization in Microservices


89. Foreground-Aware Dataset Distillation via Dynamic Patch Selection


90. Privacy-Preserving AI-Enabled Decentralized Learning and Employment Records System


91. CREAM: Continual Retrieval on Dynamic Streaming Corpora with Adaptive Soft Memory


92. Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study


93. Multi-channel multi-speaker transformer for speech recognition


94. Topology-Independent Robustness of the Weighted Mean under Label Poisoning Attacks in Heterogeneous Decentralized Learning


95. Extracting books from production language models


96. When Do Tools and Planning Help LLMs Think? A Cost- and Latency-Aware Benchmark



98. Prioritized Replay for RL Post-training


99. DreamLoop: Controllable Cinemagraph Generation from a Single Photograph


100. Credit Assignment via Neural Manifold Noise Correlation


101. TAAF: A Trace Abstraction and Analysis Framework Synergizing Knowledge Graphs and LLMs


102. Improved Evidence Extraction for Document Inconsistency Detection with LLMs


103. LAsset: An LLM-assisted Security Asset Identification Framework for System-on-Chip (SoC) Verification


104. Hierarchical temporal receptive windows and zero-shot timescale generalization in biologically constrained scale-invariant deep networks


105. Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over Unsloth


106. LongDA: Benchmarking LLM Agents for Long-Document Data Analysis


107. Annealed Langevin Posterior Sampling (ALPS): A Rapid Algorithm for Image Restoration with Multiscale Energy Models


108. FlowPlan-G2P: A Structured Generation Framework for Transforming Scientific Papers into Patent Descriptions


109. Reconstructing Item Characteristic Curves using Fine-Tuned Large Language Models


110. Fact-Checking with Large Language Models via Probabilistic Certainty and Consistency


111. LendNova: Towards Automated Credit Risk Assessment with Language Models


112. AI-exposed jobs deteriorated before ChatGPT


113. Normalized Conditional Mutual Information Surrogate Loss for Deep Neural Classifiers


114. ModeX: Evaluator-Free Best-of-N Selection for Open-Ended Generation


115. Losses that Cook: Topological Optimal Transport for Structured Recipe Generation


116. Enhancing Debugging Skills with AI-Powered Assistance: A Real-Time Tool for Debugging Support


117. GEM-Style Constraints for PEFT with Dual Gradient Projection in LoRA


118. The Rise of Agentic Testing: Multi-Agent Systems for Robust Software Quality Assurance


119. mHC-GNN: Manifold-Constrained Hyper-Connections for Graph Neural Networks


120. VocalBridge: Latent Diffusion-Bridge Purification for Defeating Perturbation-Based Voiceprint Defenses


121. Evaluating the Diagnostic Classification Ability of Multimodal Large Language Models: Insights from the Osteoarthritis Initiative


122. Understanding Pure Textual Reasoning for Blind Image Quality Assessment


123. Mitigating Long-Tailed Anomaly Score Distributions with Importance-Weighted Loss


124. Focus on What Matters: Fisher-Guided Adaptive Multimodal Fusion for Vulnerability Detection


125. TAP-ViTs: Task-Adaptive Pruning for On-Device Deployment of Vision Transformers


126. WebCoderBench: Benchmarking Web Application Generation with Comprehensive and Interpretable Evaluation Metrics


127. A Dynamic Retrieval-Augmented Generation System with Selective Memory and Remembrance


128. NitroGen: An Open Foundation Model for Generalist Gaming Agents


129. A large-scale nanocrystal database with aligned synthesis and properties enabling generative inverse design


130. Watch Wider and Think Deeper: Collaborative Cross-modal Chain-of-Thought for Complex Visual Reasoning


131. Multimodal Sentiment Analysis based on Multi-channel and Symmetric Mutual Promotion Feature Fusion


132. MIAR: Modality Interaction and Alignment Representation Fuison for Multimodal Emotion


133. Socially-Aware Recommender Systems Mitigate Opinion Clusterization


134. SpikySpace: A Spiking State Space Model for Energy-Efficient Time Series Forecasting


135. The Vibe-Check Protocol: Quantifying Cognitive Offloading in AI Programming


136. Expert-Guided Explainable Few-Shot Learning with Active Sample Selection for Medical Image Analysis


137. PCEval: A Benchmark for Evaluating Physical Computing Capabilities of Large Language Models


138. ProSoftArena: Benchmarking Hierarchical Capabilities of Multimodal Agents in Professional Software Environments


139. AI-Native Integrated Sensing and Communications for Self-Organizing Wireless Networks: Architectures, Learning Paradigms, and System-Level Design


140. Self-Supervised Masked Autoencoders with Dense-Unet for Coronary Calcium Removal in limited CT Data


141. Tree of Preferences for Diversified Recommendation


142. Base Station Deployment under EMF constrain by Deep Reinforcement learning


143. How to Discover Knowledge for FutureG: Contextual RAG and LLM Prompting for O-RAN


144. The Refutability Gap: Challenges in Validating Reasoning by Large Language Models


145. Movement Primitives in Robotics: A Comprehensive Survey


146. LeafTutor: An AI Agent for Programming Assignment Tutoring


147. Permission Manifests for Web Agents


148. Distillation-based Scenario-Adaptive Mixture-of-Experts for the Matching Stage of Multi-scenario Recommendation


149. Cross-Platform Digital Discourse Analysis of the Israel-Hamas Conflict: Sentiment, Topics, and Event Dynamics


150. TextBridgeGNN: Pre-training Graph Neural Network for Cross-Domain Recommendation via Text-Guided Transfer


151. MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents


152. InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents


153. Automatic Prompt Engineering with No Task Cues and No Tuning


154. A framework for assuring the accuracy and fidelity of an AI-enabled Digital Twin of en route UK airspace


155. Explainable Fuzzy GNNs for Leak Detection in Water Distribution Networks


156. Rationale-Grounded In-Context Learning for Time Series Reasoning with Multimodal Large Language Models


157. Batch-of-Thought: Cross-Instance Learning for Enhanced LLM Reasoning


158. Logical Phase Transitions: Understanding Collapse in LLM Logical Reasoning


159. ReTreVal: Reasoning Tree with Validation – A Hybrid Framework for Enhanced LLM Multi-Step Reasoning


160. SimRPD: Optimizing Recruitment Proactive Dialogue Agents through Simulator-Based Data Evaluation and Selection


161. M3MAD-Bench: Are Multi-Agent Debates Really Effective Across Domains and Modalities?


162. Sample-Efficient Neurosymbolic Deep Reinforcement Learning


163. Quantum-enhanced long short-term memory with attention for spatial permeability prediction in oilfield reservoirs


164. Causal-Enhanced AI Agents for Medical Research Screening


165. HAL: Inducing Human-likeness in LLMs with Alignment


166. LLM Agent Framework for Intelligent Change Analysis in Urban Environment using Remote Sensing Imagery


167. The Path Ahead for Agentic AI: Challenges and Opportunities


168. Time-Scaling Is What Agents Need Now


169. Learning User Preferences Through Interaction for Long-Term Collaboration


170. Learning from Prompt itself: the Hierarchical Attribution Prompt Optimization


171. Inferring Causal Graph Temporal Logic Formulas to Expedite Reinforcement Learning in Temporally Extended Tasks


172. AWARE-US: Benchmark for Preference-Aware Resolution in Tool-Calling Agents


173. An Empirical Study of On-Device Translation for Real-Time Live-Stream Chat on Mobile Devices


174. Orchestral AI: A Framework for Agent Orchestration


175. SimpleMem: Efficient Lifelong Memory for LLM Agents


176. Textual Explanations and Their Evaluations for Reinforcement Learning Policy


177. Multi-RADS Synthetic Radiology Report Dataset and Head-to-Head Benchmarking of 41 Open-Weight and Proprietary Language Models


178. The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization


179. The Fake Friend Dilemma: Trust and the Political Economy of Conversational AI


180. Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers


181. UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward


182. Counterfactual Fairness with Graph Uncertainty


183. Recursive querying of neural networks via weighted structures


184. DIP: Dynamic In-Context Planner For Diffusion Language Models


185. UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision


186. AnatomiX, an Anatomy-Aware Grounded Multimodal Large Language Model for Chest X-Ray Interpretation


187. Decentralized Autoregressive Generation


188. Multi-Modal Data-Enhanced Foundation Models for Prediction and Control in Wireless Networks: A Survey


189. Rapid Augmentations for Time Series (RATS): A High-Performance Library for Time Series Augmentation


190. Prompt-Counterfactual Explanations for Generative AI System Behavior


191. Self-Verification is All You Need To Pass The Japanese Bar Examination


192. Limited Linguistic Diversity in Embodied AI Datasets


193. Unified Thinker: A General Reasoning Modular Core for Image Generation


194. LeafLife: An Explainable Deep Learning Framework with Robustness for Grape Leaf Disease Recognition


195. ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation


196. Transformers self-organize like newborn visual systems when trained in prenatal worlds


197. Who Laughs with Whom? Disentangling Influential Factors in Humor Preferences across User Clusters and LLMs


198. Text-Guided Layer Fusion Mitigates Hallucination in Multimodal LLMs


199. Grad-ELLM: Gradient-based Explanations for Decoder-only LLMs


200. Joint Encoding of KV-Cache Blocks for Scalable LLM Serving


201. Do LLMs Encode Functional Importance of Reasoning Tokens?


202. IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation


203. On the Intrinsic Limits of Transformer Image Embeddings in Non-Solvable Spatial Reasoning


204. Motion Blur Robust Wheat Pest Damage Detection with Dynamic Fuzzy Feature Fusion


205. Lil: Less is Less When Applying Post-Training Sparse-Attention Algorithms in Long-Decode Stage


206. PiDR: Physics-Informed Inertial Dead Reckoning for Autonomous Platforms


207. Validating Generalist Robots with Situation Calculus and STL Falsification


208. Causal Manifold Fairness: Enforcing Geometric Invariance in Representation Learning


209. Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis


210. In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior


211. SentGraph: Hierarchical Sentence Graph for Multi-hop Retrieval-Augmented Question Answering


212. JPU: Bridging Jailbreak Defense and Unlearning via On-Policy Path Rectification


213. Learning to Act Robustly with View-Invariant Latent Actions


214. Towards Faithful Reasoning in Comics for Small MLLMs


215. ULS+: Data-driven Model Adaptation Enhances Lesion Segmentation


216. LAMS-Edit: Latent and Attention Mixing with Schedulers for Improved Content Preservation in Diffusion-Based Image and Style Editing


217. Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning


218. Mechanistic Knobs in LLMs: Retrieving and Steering High-Order Semantic Features via Sparse Autoencoders


219. Correct, Concise and Complete: Multi-stage Training For Adaptive Reasoning


220. MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free


221. The World is Not Mono: Enabling Spatial Understanding in Large Audio-Language Models


222. SastBench: A Benchmark for Testing Agentic SAST Triage


223. PrismVAU: Prompt-Refined Inference System for Multimodal Video Anomaly Understanding


224. DCG ReID: Disentangling Collaboration and Guidance Fusion Representations for Multi-modal Vehicle Re-Identification


225. RAL2M: Retrieval Augmented Learning-To-Match Against Hallucination in Compliance-Guaranteed Service Systems


226. TA-Prompting: Enhancing Video Large Language Models for Dense Video Captioning via Temporal Anchors


227. LOST-3DSG: Lightweight Open-Vocabulary 3D Scene Graphs with Semantic Tracking in Dynamic Environments


228. LongBench Pro: A More Realistic and Comprehensive Bilingual Long-Context Evaluation Benchmark


229. TiMem: Temporal-Hierarchical Memory Consolidation for Long-Horizon Conversational Agents


230. Breaking Self-Attention Failure: Rethinking Query Initialization for Infrared Small Target Detection


231. MiMo-V2-Flash Technical Report


232. Closing the Reality Gap: Zero-Shot Sim-to-Real Deployment for Dexterous Force-Based Grasping and Manipulation


233. UniSRCodec: Unified and Low-Bitrate Single Codebook Codec with Sub-Band Reconstruction


234. Netflix Artwork Personalization via LLM Post-training


235. Q-Regularized Generative Auto-Bidding: From Suboptimal Trajectories to Optimal Policies


236. Window-based Membership Inference Attacks Against Fine-tuned Large Language Models


237. Hypothesize-Then-Verify: Speculative Root Cause Analysis for Microservices with Pathwise Parallelism


238. Agentic Memory Enhanced Recursive Reasoning for Root Cause Localization in Microservices


239. Foreground-Aware Dataset Distillation via Dynamic Patch Selection


240. Privacy-Preserving AI-Enabled Decentralized Learning and Employment Records System


241. CREAM: Continual Retrieval on Dynamic Streaming Corpora with Adaptive Soft Memory


242. Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study


243. Multi-channel multi-speaker transformer for speech recognition


244. Topology-Independent Robustness of the Weighted Mean under Label Poisoning Attacks in Heterogeneous Decentralized Learning


245. Extracting books from production language models


246. When Do Tools and Planning Help LLMs Think? A Cost- and Latency-Aware Benchmark



248. Prioritized Replay for RL Post-training


249. DreamLoop: Controllable Cinemagraph Generation from a Single Photograph


250. Credit Assignment via Neural Manifold Noise Correlation


251. TAAF: A Trace Abstraction and Analysis Framework Synergizing Knowledge Graphs and LLMs


252. Improved Evidence Extraction for Document Inconsistency Detection with LLMs


253. LAsset: An LLM-assisted Security Asset Identification Framework for System-on-Chip (SoC) Verification


254. Hierarchical temporal receptive windows and zero-shot timescale generalization in biologically constrained scale-invariant deep networks


255. Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over Unsloth


256. LongDA: Benchmarking LLM Agents for Long-Document Data Analysis


257. Annealed Langevin Posterior Sampling (ALPS): A Rapid Algorithm for Image Restoration with Multiscale Energy Models


258. FlowPlan-G2P: A Structured Generation Framework for Transforming Scientific Papers into Patent Descriptions


259. Reconstructing Item Characteristic Curves using Fine-Tuned Large Language Models


260. Fact-Checking with Large Language Models via Probabilistic Certainty and Consistency


261. LendNova: Towards Automated Credit Risk Assessment with Language Models


262. AI-exposed jobs deteriorated before ChatGPT


263. Normalized Conditional Mutual Information Surrogate Loss for Deep Neural Classifiers


264. ModeX: Evaluator-Free Best-of-N Selection for Open-Ended Generation


265. Losses that Cook: Topological Optimal Transport for Structured Recipe Generation


266. Enhancing Debugging Skills with AI-Powered Assistance: A Real-Time Tool for Debugging Support


267. GEM-Style Constraints for PEFT with Dual Gradient Projection in LoRA


268. The Rise of Agentic Testing: Multi-Agent Systems for Robust Software Quality Assurance


269. mHC-GNN: Manifold-Constrained Hyper-Connections for Graph Neural Networks


270. VocalBridge: Latent Diffusion-Bridge Purification for Defeating Perturbation-Based Voiceprint Defenses


271. Evaluating the Diagnostic Classification Ability of Multimodal Large Language Models: Insights from the Osteoarthritis Initiative


272. Understanding Pure Textual Reasoning for Blind Image Quality Assessment


273. Mitigating Long-Tailed Anomaly Score Distributions with Importance-Weighted Loss


274. Focus on What Matters: Fisher-Guided Adaptive Multimodal Fusion for Vulnerability Detection


275. TAP-ViTs: Task-Adaptive Pruning for On-Device Deployment of Vision Transformers


276. WebCoderBench: Benchmarking Web Application Generation with Comprehensive and Interpretable Evaluation Metrics


277. A Dynamic Retrieval-Augmented Generation System with Selective Memory and Remembrance


278. NitroGen: An Open Foundation Model for Generalist Gaming Agents


279. A large-scale nanocrystal database with aligned synthesis and properties enabling generative inverse design


280. Watch Wider and Think Deeper: Collaborative Cross-modal Chain-of-Thought for Complex Visual Reasoning


281. Multimodal Sentiment Analysis based on Multi-channel and Symmetric Mutual Promotion Feature Fusion


282. MIAR: Modality Interaction and Alignment Representation Fuison for Multimodal Emotion


283. Socially-Aware Recommender Systems Mitigate Opinion Clusterization


284. SpikySpace: A Spiking State Space Model for Energy-Efficient Time Series Forecasting


285. The Vibe-Check Protocol: Quantifying Cognitive Offloading in AI Programming


286. Expert-Guided Explainable Few-Shot Learning with Active Sample Selection for Medical Image Analysis


287. PCEval: A Benchmark for Evaluating Physical Computing Capabilities of Large Language Models


288. ProSoftArena: Benchmarking Hierarchical Capabilities of Multimodal Agents in Professional Software Environments


289. AI-Native Integrated Sensing and Communications for Self-Organizing Wireless Networks: Architectures, Learning Paradigms, and System-Level Design


290. Self-Supervised Masked Autoencoders with Dense-Unet for Coronary Calcium Removal in limited CT Data


291. Tree of Preferences for Diversified Recommendation


292. Base Station Deployment under EMF constrain by Deep Reinforcement learning


293. How to Discover Knowledge for FutureG: Contextual RAG and LLM Prompting for O-RAN


294. The Refutability Gap: Challenges in Validating Reasoning by Large Language Models


295. Movement Primitives in Robotics: A Comprehensive Survey


296. LeafTutor: An AI Agent for Programming Assignment Tutoring


297. Permission Manifests for Web Agents


298. Distillation-based Scenario-Adaptive Mixture-of-Experts for the Matching Stage of Multi-scenario Recommendation


299. Cross-Platform Digital Discourse Analysis of the Israel-Hamas Conflict: Sentiment, Topics, and Event Dynamics


300. TextBridgeGNN: Pre-training Graph Neural Network for Cross-Domain Recommendation via Text-Guided Transfer


301. FUSE : Failure-aware Usage of Subagent Evidence for MultiModal Search and Recommendation


302. Towards Trustworthy LLM-Based Recommendation via Rationale Integration


303. The Impact of LLM-Generated Reviews on Recommender Systems: Textual Shifts, Performance Effects, and Strategic Platform Control


304. TWIST: Training-free and Label-free Short Text Clustering through Iterative Vector Updating with LLMs