All AI Papers - 2025-09-09

1. LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation


2. Evaluation and Comparison Semantics for ODRL


3. ProToM: Promoting Prosocial Behaviour via Theory of Mind-Informed Feedback


4. Finding your MUSE: Mining Unexpected Solutions Engine


5. Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework


6. Internet 3.0: Architecture for a Web-of-Agents with it’s Algorithm for Ranking Agents


7. Towards Ontology-Based Descriptions of Conversations with Qualitatively-Defined Concepts


8. SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing


9. OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration


10. Cloning a Conversational Voice AI Agent from Call Recording Datasets for Telesales


11. Collaboration and Conflict between Humans and Language Models through the Lens of Game Theory


12. TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models


13. What-If Analysis of Large Language Models: Explore the Game World Using Proactive Thinking


14. Language-Driven Hierarchical Task Structures as Explicit World Models for Multi-Agent Learning


15. An Approach to Grounding AI Model Evaluations in Human-derived Criteria


16. Towards Personalized Explanations for Health Simulations: A Mixed-Methods Framework for Stakeholder-Centric Summarization


17. Maestro: Joint Graph & Config Optimization for Reliable AI Agents


18. The Ethical Compass of the Machine: Evaluating Large Language Models for Decision Support in Construction Project Management


19. WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool


20. Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining


21. SpikingBrain Technical Report: Spiking Brain-inspired Large Models


22. Scaling Performance of Large Language Model Pretraining


23. Recomposer: Event-roll-guided generative audio editing


24. COGITAO: A Visual Reasoning Framework To Study Compositionality & Generalization


25. Uncertain but Useful: Leveraging CNN Variability into Data Augmentation


26. CURE: Controlled Unlearning for Robust Embeddings – Mitigating Conceptual Shortcuts in Pre-Trained Language Models


27. HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models


28. RapidGNN: Energy and Communication-Efficient Distributed Training on Large-Scale Graph Neural Networks


29. Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet


30. AI Agents for Web Testing: A Case Study in the Wild


31. Accuracy-Constrained CNN Pruning for Efficient and Reliable EEG-Based Seizure Detection


32. Exploring Situated Stabilities of a Rhythm Generation System through Variational Cross-Examination


33. GenAI-based test case generation and execution in SDV platform



35. ToM-SSI: Evaluating Theory of Mind in Situated Social Interactions


36. Towards Efficient Pixel Labeling for Industrial Anomaly Detection and Localization


37. Pointing-Guided Target Estimation via Transformer-Based Attention


38. Adversarial Augmentation and Active Sampling for Robust Cyber Anomaly Detection


39. LLM Enabled Multi-Agent System for 6G Networks: Framework and Method of Dual-Loop Edge-Terminal Collaboration


40. High-Resolution Global Land Surface Temperature Retrieval via a Coupled Mechanism-Machine Learning Framework


41. Exploring an implementation of quantum learning pipeline for support vector machines


42. DeGuV: Depth-Guided Visual Reinforcement Learning for Generalization and Interpretability in Manipulation


43. Artificial intelligence for representing and characterizing quantum systems


44. PLaMo 2 Technical Report



46. The Paradox of Doom: Acknowledging Extinction Risk Reduces the Incentive to Prevent It


47. A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing


48. REMOTE: A Unified Multimodal Relation Extraction Framework with Multilevel Optimal Transport and Mixture-of-Experts


49. PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination


50. Exploring Non-Local Spatial-Angular Correlations with a Hybrid Mamba-Transformer Framework for Light Field Super-Resolution



52. Toward Accessible Dermatology: Skin Lesion Classification Using Deep Learning Models on Mobile-Acquired Images


53. Graph Unlearning: Efficient Node Removal in Graph Neural Networks


54. Enhancing Diversity in Large Language Models via Determinantal Point Processes


55. VARMA-Enhanced Transformer for Time Series Forecasting


56. The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models


57. Decoders Laugh as Loud as Encoders


58. FloodVision: Urban Flood Depth Estimation Using Foundation Vision-Language Models and Domain Knowledge Graph


59. MCANet: A Multi-Scale Class-Specific Attention Network for Multi-Label Post-Hurricane Damage Assessment using UAV Imagery


60. A Study of Large Language Models for Patient Information Extraction: Model Architecture, Fine-Tuning Strategy, and Multi-task Instruction Tuning


61. SePA: A Search-enhanced Predictive Agent for Personalized Health Coaching


62. Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization


63. Beyond I-Con: Exploring New Dimension of Distance Measures in Representation Learning


64. CoVeR: Conformal Calibration for Versatile and Reliable Autoregressive Next-Token Prediction


65. KERAG: Knowledge-Enhanced Retrieval-Augmented Generation for Advanced Question Answering


66. Bootstrapping Reinforcement Learning with Sub-optimal Policies for Autonomous Driving


67. ODKE+: Ontology-Guided Open-Domain Knowledge Extraction with LLMs


68. Ecologically Valid Benchmarking and Adaptive Attention: Scalable Marine Bioacoustic Monitoring


69. VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation


70. Evaluating NL2SQL via SQL2NL


71. Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs


72. Interpreting Transformer Architectures as Implicit Multinomial Regression


73. Comparative Analysis of Transformer Models in Disaster Tweet Classification for Public Safety


74. Scaling Environments for Organoid Intelligence with LLM-Automated Design and Plasticity-Based Evaluation


75. Schema Inference for Tabular Data Repositories Using Large Language Models


76. Action Chunking with Transformers for Image-Based Spacecraft Guidance and Control


77. Measuring the Measures: Discriminative Capacity of Representational Similarity Metrics Across Model Families


78. Sample-efficient Integration of New Modalities into Large Language Models


79. Quantum-Enhanced Multi-Task Learning with Learnable Weighting for Pharmacokinetic and Toxicity Prediction


80. Toward Faithfulness-guided Ensemble Interpretation of Neural Network


81. Manipulating Transformer-Based Models: Controllability, Steerability, and Robust Interventions


82. i-Mask: An Intelligent Mask for Breath-Driven Activity Recognition


83. Emergent Social Dynamics of LLM Agents in the El Farol Bar Problem


84. In-Context Policy Adaptation via Cross-Domain Skill Diffusion


85. Quantized Large Language Models in Biomedical Natural Language Processing: Evaluation and Recommendation


86. Mitigation of Gender and Ethnicity Bias in AI-Generated Stories through Model Explanations


87. From Silent Signals to Natural Language: A Dual-Stage Transformer-LLM Approach


88. Memristor-Based Neural Network Accelerators for Space Applications: Enhancing Performance with Temporal Averaging and SIRENs


89. Behavioral Fingerprinting of Large Language Models


90. VaccineRAG: Boosting Multimodal Large Language Models’ Immunity to Harmful RAG Samples


91. Understanding Reinforcement Learning for Model Training, and future directions with GRAPE


92. Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts


93. DeepTRACE: Auditing Deep Research AI Systems for Tracking Reliability Across Citations and Evidence


94. Where Should I Study? Biased Language Models Decide! Evaluating Fairness in LMs for Academic Recommendations


95. A Narrative-Driven Computational Framework for Clinician Burnout Surveillance


96. Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate


97. Refining Transcripts With TV Subtitles by Prompt-Based Weakly Supervised Training of ASR


98. Serialized Output Prompting for Large Language Model-based Multi-Talker Speech Recognition


99. ASCENDgpt: A Phenotype-Aware Transformer Model for Cardiovascular Risk Prediction from Electronic Health Records


100. The Good, the Bad and the Constructive: Automatically Measuring Peer Review’s Utility for Authors


101. DecMetrics: Structured Claim Decomposition Scoring for Factually Consistent LLM Outputs


102. Energy Landscapes Enable Reliable Abstention in Retrieval-Augmented Large Language Models for Healthcare


103. Narrative-to-Scene Generation: An LLM-Driven Pipeline for 2D Game Environments


104. No Clustering, No Routing: How Transformers Actually Process Rare Tokens


105. Training Text-to-Molecule Models with Context-Aware Tokenization


106. ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute


107. Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling


108. SpeechLLM: Unified Speech and Language Model for Enhanced Multi-Task Understanding in Low Resource Settings


109. RECAP: REwriting Conversations for Intent Understanding in Agentic Planning


110. MOSAIC: A Multilingual, Taxonomy-Agnostic, and Computationally Efficient Approach for Radiological Report Classification


111. COCORELI: Cooperative, Compositional Reconstitution & Execution of Language Instructions


112. Multi-Modal Vision vs. Text-Based Parsing: Benchmarking LLM Strategies for Invoice Processing


113. Evaluating Large Language Models for Financial Reasoning: A CFA-Based Benchmark Study


114. Enhancing LLM Efficiency: Targeted Pruning for Prefill-Decode Disaggregation in Inference


115. Just-in-time and distributed task representations in language models


116. Emotionally-Aware Agents for Dispute Resolution


117. Can Multiple Responses from an LLM Reveal the Sources of Its Uncertainty?


118. Multiscale Graph Neural Network for Turbulent Flow-Thermal Prediction Around a Complex-Shaped Pin-Fin


119. Benchmarking GPT-5 for biomedical natural language processing


120. CoCoNUTS: Concentrating on Content while Neglecting Uninformative Textual Styles for AI-Generated Peer Review Detection


121. Teacher-Student Model for Detecting and Classifying Mitosis in the MIDOG 2025 Challenge


122. Efficient Training-Free Online Routing for High-Volume Multi-LLM Serving


123. MLP-SRGAN: A Single-Dimension Super Resolution GAN using MLP-Mixer