전체 AI 논문 - 2026-01-14

1. Uncovering Political Bias in Large Language Models using Parliamentary Voting Records


2. Pervasive Annotation Errors Break Text-to-SQL Benchmarks and Leaderboards


3. AI as Entertainment


4. Learning from Demonstrations via Capability-Aware Goal Sampling


5. Evaluating the Ability of Explanations to Disambiguate Models in a Rashomon Set


6. All Required, In Order: Phase-Level Evaluation for AI-Human Dialogue in Healthcare and Beyond


7. MEMEWEAVER: Inter-Meme Graph Reasoning for Sexism and Misogyny Detection


8. PersonaDual: Balancing Personalization and Objectivity via Adaptive Reasoning


9. Advancing ESG Intelligence: An Expert-level Agent and Comprehensive Benchmark for Sustainable Finance


10. Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock


11. Parallel Context-of-Experts Decoding for Retrieval Augmented Generation


12. From Classical to Quantum Reinforcement Learning and Its Applications in Quantum Control: A Beginner’s Tutorial


13. Prism: Towards Lowering User Cognitive Load in LLMs via Complex Intent Understanding


14. Resisting Manipulative Bots in Memecoin Copy Trading: A Multi-Agent Approach with Chain-of-Thought Reasoning


15. ViDoRe V3: A Comprehensive Evaluation of Retrieval Augmented Generation in Complex Real-World Scenarios


16. WaterCopilot: An AI-Driven Virtual Assistant for Water Management


17. Learner-Tailored Program Repair: A Solution Generator with Iterative Edit-Driven Retrieval Enhancement


18. Sketch-Based Facade Renovation With Generative AI: A Streamlined Framework for Bypassing As-Built Modelling in Industrial Adaptive Reuse


19. What If TSF: A Benchmark for Reframing Forecasting as Scenario-Guided Multimodal Forecasting


20. SUMMPILOT: Bridging Efficiency and Customization for Interactive Summarization System


21. M3-BENCH: Process-Aware Evaluation of LLM Agents Social Behaviors in Mixed-Motive Games


22. An Under-Explored Application for Explainable Multimodal Misogyny Detection in code-mixed Hindi-English


23. Beyond Linearization: Attributed Table Graphs for Table Reasoning


24. YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation


25. RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation


26. Hybrid Distillation with CoT Guidance for Edge-Drone Control Code Generation


27. WebTrap Park: An Automated Platform for Systematic Security Evaluation of Web Agents


28. Owen-Shapley Policy Optimization (OSPO): A Principled RL Algorithm for Generative Search LLMs


29. Creativity in AI as Emergence from Domain-Limited Generative Models


30. Deconstructing Pre-training: Knowledge Attribution Analysis in MoE and Dense Models


31. A Qualitative Model to Reason about Object Rotations (QOR) applied to solve the Cube Comparison Test (CCT)


32. Thematic Working Group 5 – Artificial Intelligence (AI) literacy for teaching and learning: design and implementation


33. Semantic Laundering in AI Agent Architectures: Why Tool Boundaries Do Not Confer Epistemic Warrant


34. AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation


35. OpenMic: A Multi-Agent-Based Stand-Up Comedy Generation System


36. Greedy Is Enough: Sparse Action Discovery in Agentic LLMs


37. ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web


38. Sparsity Is Necessary: Polynomial-Time Stability for Agentic LLMs in Large Action Spaces


39. VGG Induced Deep Hand Sign Language Detection


40. T3: Benchmarking Sycophancy and Skepticism in Causal Judgment


41. Large Artificial Intelligence Model Guided Deep Reinforcement Learning for Resource Allocation in Non Terrestrial Networks


42. The End of Reward Engineering: How LLMs Are Redefining Multi-Agent Coordination


43. MPCI-Bench: A Benchmark for Multimodal Pairwise Contextual Integrity Evaluation of Language Model Agents


44. An Axiomatic Approach to General Intelligence: SANC(E3) – Self-organizing Active Network of Concepts with Energy E3


45. Adapting Rules of Official International Mahjong for Online Players


46. Improving LLM Reasoning with Homophily-aware Structural and Semantic Text-Attributed Graph Compression


47. The Agent’s First Day: Benchmarking Learning, Exploration, and Scheduling in the Workplace Scenarios


48. ZeroDVFS: Zero-Shot LLM-Guided Core and Frequency Allocation for Embedded Platforms


49. Project Synapse: A Hierarchical Multi-Agent Framework with Hybrid Memory for Autonomous Resolution of Last-Mile Delivery Disruptions


50. Embedded AI Companion System on Edge Devices


51. How vehicles change lanes after encountering crashes: Empirical analysis and modeling


52. MirrorBench: An Extensible Framework to Evaluate User-Proxy Agents for Human-Likeness


53. MemoBrain: Executive Memory as an Agentic Brain for Reasoning


54. Semantic Gravity Wells: Why Negative Constraints Backfire


55. A New Strategy for Verifying Reach-Avoid Specifications in Neural Feedback Systems


56. Forecast Aware Deep Reinforcement Learning for Efficient Electricity Load Scheduling in Dairy Farms


57. Integrating Attendance Tracking and Emotion Detection for Enhanced Student Engagement in Smart Classrooms


58. Internal Deployment Gaps in AI Regulation


59. Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety


60. When Models Know When They Do Not Know: Calibration, Cascading, and Cleaning


61. Executable Ontologies in Game Development: From Algorithmic Control to Semantic World Modeling


62. Bridging the Trust Gap: Clinician-Validated Hybrid Explainable AI for Maternal Health Risk Assessment in Bangladesh


63. Modeling LLM Agent Reviewer Dynamics in Elo-Ranked Review System


64. Motion Attribution for Video Generation


65. MemRec: Collaborative Memory-Augmented Agentic Recommender System


66. Reasoning Matters for 3D Visual Grounding


67. Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge


68. S3-CLIP: Video Super Resolution for Person-ReID


69. APEX-SWE


70. Asymptotic Universal Alignment: A New Alignment Framework via Test-Time Scaling


71. Translating Light-Sheet Microscopy Images to Virtual H&E Using CycleGAN


72. Reliable Graph-RAG for Codebases: AST-Derived Graphs vs LLM-Extracted Knowledge Graphs


73. Grid-Aware Charging and Operational Optimization for Mixed-Fleet Public Transit


74. UR-Bench: A Benchmark for Multi-Hop Reasoning over Ultra-High-Resolution Images


75. To Retrieve or To Think? An Agentic Approach for Context Evolution


76. TableCache: Primary Foreign Key Guided KV Cache Precomputation for Low Latency Text-to-SQL


77. TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback


78. ISLA: A U-Net for MRI-based acute ischemic stroke lesion segmentation with deep supervision, attention, domain adaptation, and ensemble learning


79. Real-Time Localization Framework for Autonomous Basketball Robots


80. Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students


81. Region of interest detection for efficient aortic segmentation


82. Lessons from the Field: An Adaptable Lifecycle Approach to Applied Dialogue Summarization


83. TRACE: Reconstruction-Based Anomaly Detection in Ensemble and Time-Dependent Simulations


84. RULERS: Locked Rubrics and Evidence-Anchored Scoring for Robust LLM Evaluation


85. Moral Lenses, Political Coordinates: Towards Ideological Positioning of Morally Conditioned LLMs


86. M$^2$FMoE: Multi-Resolution Multi-View Frequency Mixture-of-Experts for Extreme-Adaptive Time Series Forecasting


87. SafeRedir: Prompt Embedding Redirection for Robust Unlearning in Image Generation Models


88. VeriTaS: The First Dynamic Benchmark for Multimodal Automated Fact-Checking


89. ExpSeek: Self-Triggered Experience Seeking for Web Agents


90. WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation


91. Rewriting Video: Text-Driven Reauthoring of Video Footage


92. VideoHEDGE: Entropy-Based Hallucination Detection for Video-VLMs via Semantic Clustering and Spatiotemporal Perturbations


93. Contrastive and Multi-Task Learning on Noisy Brain Signals with Nonlinear Dynamical Signatures


94. CD^2: Constrained Dataset Distillation for Few-Shot Class-Incremental Learning


95. STAGE: A Benchmark for Knowledge Graph Construction, Question Answering, and In-Script Role-Playing over Movie Screenplays


96. Temporal Fusion Nexus: A task-agnostic multi-modal embedding model for clinical narratives and irregular time series in post-kidney transplant care


97. EfficientFSL: Enhancing Few-Shot Classification via Query-Only Tuning in Vision Transformers


98. PKI: Prior Knowledge-Infused Neural Network for Few-Shot Class-Incremental Learning


99. BenchOverflow: Measuring Overflow in Large Language Models via Plain-Text Prompts


100. sui-1: Grounded and Verifiable Long-Form Summarization


101. JudgeRLVR: Judge First, Generate Second for Efficient Reasoning


102. CoMa: Contextual Massing Generation with Vision-Language Models


103. A Formal Proof of a Continued Fraction Conjecture for $π$ Originating from the Ramanujan Machine


104. Decoding Order Matters in Autoregressive Speech Synthesis


105. Divide and Conquer: Static-Dynamic Collaboration for Few-Shot Class-Incremental Learning


106. Large Multimodal Models for Embodied Intelligent Driving: The Next Frontier in Self-Driving?


107. Taxon: Hierarchical Tax Code Prediction with Semantically Aligned LLM Expert Guidance


108. Regulatory gray areas of LLM Terms


109. PATS: Personality-Aware Teaching Strategies with Large Language Model Tutors


110. An Explainable Two Stage Deep Learning Framework for Pericoronitis Assessment in Panoramic Radiographs Using YOLOv8 and ResNet-50


111. Controlled LLM Training on Spectral Sphere


112. Training-Free Distribution Adaptation for Diffusion Models via Maximum Mean Discrepancy Guidance


113. Geo-NVS-w: Geometry-Aware Novel View Synthesis In-the-Wild with an SDF Renderer


114. Scalable Sequential Recommendation under Latency and Memory Constraints


115. IGAN: A New Inception-based Model for Stable and High-Fidelity Image Synthesis Using Generative Adversarial Networks


116. Safe Heterogeneous Multi-Agent RL with Communication Regularization for Coordinated Target Acquisition


117. Enhancing Image Quality Assessment Ability of LMMs via Retrieval-Augmented Generation


118. ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning


119. Enhancing Sentiment Classification and Irony Detection in Large Language Models through Advanced Prompt Engineering Techniques


120. Demystifying the Slash Pattern in Attention: The Role of RoPE


121. HIPPO: Accelerating Video Large Language Models Inference via Holistic-aware Parallel Speculative Decoding


122. On Evaluation of Unsupervised Feature Selection for Pattern Classification


123. Hyperbolic Heterogeneous Graph Transformer


124. GADPN: Graph Adaptive Denoising and Perturbation Networks via Singular Value Decomposition


125. Knowledge-based learning in Text-RAG and Image-RAG


126. DNF: Dual-Layer Nested Fingerprinting for Large Language Model Intellectual Property Protection


127. Evaluating Implicit Regulatory Compliance in LLM Tool Invocation via Logic-Guided Synthesis


128. ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models


129. Autonomous Materials Exploration by Integrating Automated Phase Identification and AI-Assisted Human Reasoning


130. GI-Bench: A Panoramic Benchmark Revealing the Knowledge-Experience Dissociation of Multimodal Large Language Models in Gastrointestinal Endoscopy Against Clinical Standards


131. Instruction-Driven 3D Facial Expression Generation and Transition


132. Prompt-Based Clarity Evaluation and Topic Detection in Political Question Answering


133. SwiftMem: Fast Agentic Memory via Query-aware Indexing


134. Dynamic Graph Structure Learning via Resistance Curvature Flow


135. Enriching Semantic Profiles into Knowledge Graph for Recommender Systems Using Large Language Models


136. Mechanisms are Transferable: Data-Efficient Low-Resource Adaptation via Circuit-Targeted Supervised Fine-Tuning


137. Qalb: Largest State-of-the-Art Urdu Large Language Model for 230M Speakers with Systematic Continued Pre-training


138. Subspace Alignment for Vision-Language Model Test-time Adaptation


139. How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation?


140. PathoGen: Diffusion-Based Synthesis of Realistic Lesions in Histopathology Images


141. CSQL: Mapping Documents into Causal Databases


142. Debiasing Large Language Models via Adaptive Causal Prompting with Sketch-of-Thought


143. STO-RL: Offline RL under Sparse Rewards via LLM-Guided Subgoal Temporal Order


144. High-Fidelity Modeling of Stochastic Chemical Dynamics on Complex Manifolds: A Multi-Scale SIREN-PINN Framework for the Curvature-Perturbed Ginzburg-Landau Equation


145. Local-Global Feature Fusion for Subject-Independent EEG Emotion Recognition


146. Q-realign: Piggybacking Realignment on Quantization for Safe and Efficient LLM Deployment


147. Reasoning Beyond Chain-of-Thought: A Latent Computational Mode in Large Language Models


148. The Role of Noisy Data in Improving CNN Robustness for Image Classification


149. FigEx2: Visual-Conditioned Panel Detection and Captioning for Scientific Compound Figures


150. Representations of Text and Images Align From Layer One


151. TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models


152. LLM Review: Enhancing Creative Writing via Blind Peer Review Feedback


153. DYCP: Dynamic Context Pruning for Long-Form Dialogue with LLMs


154. From Word Sequences to Behavioral Sequences: Adapting Modeling and Evaluation Paradigms for Longitudinal NLP


155. Cultural Compass: A Framework for Organizing Societal Norms to Detect Violations in Human-AI Conversations


156. Tuberculosis Screening from Cough Audio: Baseline Models, Clinical Variables, and Uncertainty Quantification


157. LJ-Spoof: A Generatively Varied Corpus for Audio Anti-Spoofing and Synthesis Source Tracing


158. LWMSCNN-SE: A Lightweight Multi-Scale Network for Efficient Maize Disease Classification on Edge Devices


159. Quantum automated theorem proving


160. Hybrid SARIMA LSTM Model for Local Weather Forecasting: A Residual Learning Approach for Data Driven Meteorological Prediction



162. Coupled Diffusion-Encoder Models for Reconstruction of Flow Fields


163. Moonworks Lunara Aesthetic Dataset


164. SECite: Analyzing and Summarizing Citations in Software Engineering Literature


165. Towards Specialized Generalists: A Multi-Task MoE-LoRA Framework for Domain-Specific LLM Adaptation


166. Enhancing Large Language Models for Time-Series Forecasting via Vector-Injected In-Context Learning


167. Decentralized Online Convex Optimization with Unknown Feedback Delays


168. Large Language Models and Algorithm Execution: Application to an Arithmetic Function


169. Revealing the Attention Floating Mechanism in Masked Diffusion Models


170. Sherry: Hardware-Efficient 1.25-Bit Ternary Quantization via Fine-grained Sparsification


171. KVzap: Fast, Adaptive, and Faithful KV Cache Pruning


172. Small Symbols, Big Risks: Exploring Emoticon Semantic Confusion in Large Language Models


173. Ideological Isolation in Online Social Networks: A Survey of Computational Definitions, Metrics, and Mitigation Strategies


174. Tackling Heterogeneity in Quantum Federated Learning: An Integrated Sporadic-Personalized Approach


175. Sola-Visibility-ISPM: Benchmarking Agentic AI for Identity Security Posture Management Visibility


176. Sliced-Wasserstein Distribution Alignment Loss Improves the Ultra-Low-Bit Quantization of Large Language Models


177. E^2-LLM: Bridging Neural Signals and Interpretable Affective Analysis


178. NOVAK: Unified adaptive optimizer for deep neural networks


179. Multiplicative Orthogonal Sequential Editing for Language Models


180. Imaging-anchored Multiomics in Cardiovascular Disease: Integrating Cardiac Imaging, Bulk, Single-cell, and Spatial Transcriptomics


181. RewriteNets: End-to-End Trainable String-Rewriting for Generative Sequence Modeling


182. Affect and Effect: Limitations of regularisation-based continual learning in EEG-based emotion classification


183. Feature Entanglement-based Quantum Multimodal Fusion Neural Network


184. An Empirical Study on Knowledge Transfer under Domain and Label Shifts in 3D LiDAR Point Clouds


185. Immunological Density Shapes Recovery Trajectories in Long COVID


186. FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments


187. Hierarchical Sparse Plus Low Rank Compression of LLM


188. A survey: Information search time optimization based on RAG (Retrieval Augmentation Generation) chatbot


189. Photometric Redshift Estimation Using Scaled Ensemble Learning