All AI Papers - 2025-10-01

1. Branching Out: Broadening AI Measurement and Evaluation with Measurement Trees


2. TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance


3. Fine-tuning Behavioral Cloning Policies with Preference-Based Reinforcement Learning


4. Fairness Testing in Retrieval-Augmented Generation: How Small Perturbations Reveal Bias in Small Language Models


5. Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark


6. HilbertA: Hilbert Attention for Image Generation with Diffusion Models


7. Rearchitecting Datacenter Lifecycle for AI: A TCO-Driven Framework


8. SCUBA: Salesforce Computer Use Benchmark


9. OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!


10. Combining Knowledge Graphs and NLP to Analyze Instant Messaging Data in Criminal Investigations


11. TVS Sidekick: Challenges and Practical Insights from Deploying Large Language Models in the Enterprise


12. The Average Patient Fallacy


13. STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models


14. Extreme Self-Preference in Language Models


15. Zero-Shot Decentralized Federated Learning


16. Transformer Classification of Breast Lesions: The BreastDCEDL_AMBL Benchmark Dataset and 0.92 AUC Baseline


17. OntoAligner Meets Knowledge Graph Embedding Aligners


18. Communication-Efficient and Accurate Approach for Aggregation in Federated Low-Rank Adaptation


19. MC-GNNAS-Dock: Multi-criteria GNN-based Algorithm Selection for Molecular Docking


20. Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents


21. How Far Do Time Series Foundation Models Paint the Landscape of Real-World Benchmarks?


22. SafeBehavior: Simulating Human-Like Multistage Reasoning to Mitigate Jailbreak Attacks in Large Language Models


23. AI Playing Business Games: Benchmarking Large Language Models on Managerial Decision-Making in Dynamic Simulations


24. Interactive Learning for LLM Reasoning


25. ExoPredicator: Learning Abstract Models of Dynamic Worlds for Robot Planning


26. SlimPack: Fine-Grained Asymmetric Packing for Balanced and Efficient Variable-Length LLM Training


27. Benchmarking Deep Learning Convolutions on Energy-constrained CPUs


28. Diversity-Incentivized Exploration for Versatile Reasoning


29. Human-Centered Evaluation of RAG outputs: a framework and questionnaire for human-AI collaboration


30. LLM Agents for Knowledge Discovery in Atomic Layer Processing


31. ‘Too much alignment; not enough culture’: Re-balancing cultural alignment practices in LLMs


32. 90% Faster, 100% Code-Free: MLLM-Driven Zero-Code 3D Game Development


33. Beyond the Algorithm: A Field Guide to Deploying AI Agents in Clinical Practice


34. LMILAtt: A Deep Learning Model for Depression Detection from Social Media Users Enhanced by Multi-Instance Learning Based on Attention Mechanism


35. MEDAKA: Construction of Biomedical Knowledge Graphs Using Large Language Models


36. SafeEvalAgent: Toward Agentic and Self-Evolving Safety Evaluation of LLMs


37. Evaluating the Use of Large Language Models as Synthetic Social Agents in Social Science Research



39. Towards Human Engagement with Realistic AI Combat Pilots


40. Towards Unified Multimodal Misinformation Detection in Social Media: A Benchmark Dataset and Baseline


41. Scalable and Robust LLM Unlearning by Correcting Responses with Retrieved Exclusions


42. RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning


43. Automated Model Discovery via Multi-modal & Multi-step Pipeline


44. NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving


45. Boosting Process-Correct CoT Reasoning by Modeling Solvability of Multiple-Choice QA


46. Quantitative Evaluation of KIRETT Wearable Demonstrator for Rescue Operations


47. KIRETT: Smart Integration of Vital Signs Data for Intelligent Decision Support in Rescue Scenarios


48. DeepJSONEval: Benchmarking Complex Nested JSON Data Mining for Large Language Models


49. SafeMind: Benchmarking and Mitigating Safety Risks in Embodied LLM Agents


50. Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs



52. Aging Decline in Basketball Career Trend Prediction Based on Machine Learning and LSTM Model


53. ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack


54. HiStyle: Hierarchical Style Embedding Predictor for Text-Prompt-Guided Controllable Speech Synthesis



56. PUREVQ-GAN: Defending Data Poisoning Attacks through Vector-Quantized Bottlenecks


57. Deontic Argumentation


58. Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs


59. Galton’s Law of Mediocrity: Why Large Language Models Regress to the Mean and Fail at Creativity in Advertising


60. Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training


61. NePTune: A Neuro-Pythonic Framework for Tunable Compositional Reasoning on Vision-Language


62. Cooperative Autonomous Driving in Diverse Behavioral Traffic: A Heterogeneous Graph Reinforcement Learning Approach


63. ScheduleMe: Multi-Agent Calendar Assistant


64. Collaborative Compression for Large-Scale MoE Deployment on Edge


65. SING-SQL: A Synthetic Data Generation Framework for In-Domain Text-to-SQL Translation


66. GroundSight: Augmenting Vision-Language Models with Grounding Information and De-hallucination


67. On Explaining Proxy Discrimination and Unfairness in Individual Decisions Made by AI Systems


68. Landmark-Guided Knowledge for Vision-and-Language Navigation


69. Iterative Residual Cross-Attention Mechanism: An Integrated Approach for Audio-Visual Navigation Tasks


70. AutoLabs: Cognitive Multi-Agent Systems with Self-Correction for Autonomous Chemical Experimentation


71. SOCK: A Benchmark for Measuring Self-Replication in Large Language Models


72. SMS: Self-supervised Model Seeding for Verification of Machine Unlearning


73. A Framework for Studying AI Agent Behavior: Evidence from Consumer Choice Experiments


74. Echoes of Humanity: Exploring the Perceived Humanness of AI Music


75. Hybrid Reward Normalization for Process-supervised Non-verifiable Agentic Tasks


76. Causal Autoencoder-like Generation of Feedback Fuzzy Cognitive Maps with an LLM Agent


77. Building the EHR Foundation Model via Next Event Prediction


78. ATLAS: Constraints-Aware Multi-Agent Collaboration for Real-World Travel Planning


79. Skip-It? Theoretical Conditions for Layer Skipping in Vision-Language Models


80. IRIS: Intrinsic Reward Image Synthesis


81. Radiology’s Last Exam (RadLE): Benchmarking Frontier Multimodal AI Against Human Experts and a Taxonomy of Visual Reasoning Errors in Radiology


82. A(I)nimism: Re-enchanting the World Through AI-Mediated Object Interaction


83. Evaluating Foundation Models with Pathological Concept Learning for Kidney Cancer


84. Learning to Interact in World Latent for Team Coordination


85. RadOnc-GPT: An Autonomous LLM Agent for Real-Time Patient Outcomes Labeling at Scale


86. Beyond Static Retrieval: Opportunities and Pitfalls of Iterative Retrieval in GraphRAG


87. Understanding Generative Recommendation with Semantic IDs from a Model-scaling View


88. Message passing-based inference in an autoregressive active inference agent


89. TDHook: A Lightweight Framework for Interpretability


90. Plug-and-Play Emotion Graphs for Compositional Prompting in Zero-Shot Speech Emotion Recognition



92. GESA: Graph-Enhanced Semantic Allocation for Generalized, Fair, and Explainable Candidate-Role Matching


93. The Open Syndrome Definition


94. RADAR: Reasoning-Ability and Difficulty-Aware Routing for Reasoning LLMs



96. Boolean Satisfiability via Imitation Learning


97. Saliency Guided Longitudinal Medical Visual Question Answering


98. From Perception to Cognition: A Survey of Vision-Language Interactive Reasoning in Multimodal Large Language Models


99. Where LLM Agents Fail and How They can Learn From Failures


100. Structural Reward Model: Enhancing Interpretability, Efficiency, and Scalability in Reward Modeling


101. SynthPert: Enhancing LLM Biological Reasoning via Synthetic Reasoning Traces for Cellular Perturbation Prediction


102. Spontaneous High-Order Generalization in Neural Theory-of-Mind Networks


103. Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents


104. Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution


105. ID-RAG: Identity Retrieval-Augmented Generation for Long-Horizon Persona Coherence in Generative Agents


106. Toward Causal-Visual Programming: Enhancing Agentic Reasoning in Low-Code Environments


107. RL in the Wild: Characterizing RLVR Training in LLM Deployment


108. RADAR: A Risk-Aware Dynamic Multi-Agent Framework for LLM Safety Evaluation via Role-Specialized Collaboration


109. Language Model Planning from an Information Theoretic Perspective


110. Fact Grounded Attention: Eliminating Hallucination in Large Language Models Through Attention Level Knowledge Integration


111. Memory Management and Contextual Consistency for Long-Running Low-Code Agents


112. Neo-Grounded Theory: A Methodological Innovation Integrating High-Dimensional Vector Clustering and Multi-Agent Collaboration for Qualitative Research


113. A Formal Comparison Between Chain-of-Thought and Latent Thought


114. The Causal Abstraction Network: Theory and Learning


115. Blueprint-Bench: Comparing spatial intelligence of LLMs, agents and image models


116. Stitch: Training-Free Position Control in Multimodal Diffusion Transformers


117. OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction


118. Learning Generalizable Shape Completion with SIM(3) Equivariance


119. Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training


120. Searching for Difficult-to-Translate Test Examples at Scale


121. MENLO: From Preferences to Proficiency - Evaluating and Modeling Native-like Quality Across 47 Languages


122. Deconstructing Self-Bias in LLM-generated Translation Benchmarks


123. Are Robust LLM Fingerprints Adversarially Robust?


124. AI-assisted Advanced Propellant Development for Electric Propulsion


125. Parametric Neural Amp Modeling with Active Learning


126. The Unheard Alternative: Contrastive Explanations for Speech-to-Text Models


127. OceanGym: A Benchmark Environment for Underwater Embodied Agents


128. TAP: Two-Stage Adaptive Personalization of Multi-task and Multi-Modal Foundation Models in Federated Learning


129. MUSE-Explainer: Counterfactual Explanations for Symbolic Music Graph Classification Models



131. Indoor/Outdoor Spectrum Sharing Enabled by GNSS-based Classifiers


132. VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications


133. Regression Language Models for Code


134. On Deepfake Voice Detection - It’s All in the Presentation


135. Attention over Scene Graphs: Indoor Scene Representations Toward CSAI Classification



137. ACT: Agentic Classification Tree


138. AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size


139. Ascent Fails to Forget


140. SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From


141. Game-Time: Evaluating Temporal Dynamics in Spoken Language Models


142. Efficient and Transferable Agentic Knowledge Graph RAG via Reinforcement Learning


143. SDA-PLANNER: State-Dependency Aware Adaptive Planner for Embodied Task Planning


144. Vector-Valued Reproducing Kernel Banach Spaces for Neural Networks and Operators


145. TimeScope: Towards Task-Oriented Temporal Grounding In Long Videos


146. SoK: Systematic analysis of adversarial threats against deep learning approaches for autonomous anomaly detection systems in SDN-IoT networks


147. EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing



149. Feedback Forensics: A Toolkit to Measure AI Personality


150. QUARTZ: QA-based Unsupervised Abstractive Refinement for Task-oriented Dialogue Summarization


151. Noise-Guided Transport for Imitation Learning


152. Representation-Based Data Quality Audits for Audio


153. Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization


154. Finetune Once: Decoupling General & Domain Learning with Dynamic Boosted Annealing


155. Sandbagging in a Simple Survival Bandit Problem


156. 3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation


157. An Experimental Study on Generating Plausible Textual Explanations for Video Summarization



159. Beyond Pixels: Efficient Dataset Distillation via Sparse Gaussian Representation


160. Comparative Analysis of Ant Colony Optimization and Google OR-Tools for Solving the Open Capacitated Vehicle Routing Problem in Logistics


161. Toward an Unbiased Collective Memory for Efficient LLM-Based Agentic 6G Cross-Domain Management


162. Optimizing Indoor Environmental Quality in Smart Buildings Using Deep Learning


163. AttriGen: Automated Multi-Attribute Annotation for Blood Cell Datasets


164. Auto-ARGUE: LLM-Based Report Generation Evaluation


165. Towards Continual Expansion of Data Coverage: Automatic Text-guided Edge-case Synthesis


166. EntroPE: Entropy-Guided Dynamic Patch Encoder for Time Series Forecasting


167. Bubble, Bubble, AI’s Rumble: Why Global Financial Regulatory Incident Reporting is Our Shield Against Systemic Stumbles


168. OWL: Geometry-Aware Spatial Reasoning for Audio Large Language Models


169. Leveraging AI modelling for FDS with Simvue: monitor and optimise for more sustainable simulations


170. AGOCS – Accurate Google Cloud Simulator Framework


171. Enhancing PINN Performance Through Lie Symmetry Group


172. End-to-End Aspect-Guided Review Summarization at Scale


173. On Computing Top-$k$ Simple Shortest Paths from a Single Source


174. Real-time Noise Detection and Classification in Single-Channel EEG: A Lightweight Machine Learning Approach for EMG, White Noise, and EOG Artifacts


175. CEAID: Benchmark of Multilingual Machine-Generated Text Detection Methods for Central European Languages


176. SeMoBridge: Semantic Modality Bridge for Efficient Few-Shot Adaptation of CLIP


177. Muon Outperforms Adam in Tail-End Associative Memory Learning


178. Indirect Attention: Turning Context Misalignment into a Feature


179. PFDepth: Heterogeneous Pinhole-Fisheye Joint Depth Estimation via Distortion-aware Gaussian-Splatted Volumetric Fusion


180. Learning Egocentric In-Hand Object Segmentation through Weak Supervision from Human Narrations


181. VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing


182. MHINDR - a DSM5 based mental health diagnosis and recommendation framework using LLM


183. R-Log: Incentivizing Log Analysis Capability in LLMs via Reasoning-based Reinforcement Learning


184. Reconcile Certified Robustness and Accuracy for DNN-based Smoothed Majority Vote Classifier


185. Data-Free Continual Learning of Server Models in Model-Heterogeneous Federated learning


186. AIM: Adaptive Intervention for Deep Multi-task Learning of Molecular Properties


187. From MNIST to ImageNet: Understanding the Scalability Boundaries of Differentiable Logic Gate Networks


188. The Impact of Scaling Training Data on Adversarial Robustness


189. Accelerating LLM Inference with Precomputed Query Storage


190. User-Centric Communication Service Provision for Edge-Assisted Mobile Augmented Reality


191. PerQ: Efficient Evaluation of Multilingual Text Personalization Quality


192. RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs’ Contextual Sensitivity


193. scUnified: An AI-Ready Standardized Resource for Single-Cell RNA Sequencing Analysis


194. Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space


195. Vector sketch animation generation with differentiable motion trajectories


196. Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation


197. More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models


198. Training-Free Reward-Guided Image Editing via Trajectory Optimal Control


199. S$^2$FS: Spatially-Aware Separability-Driven Feature Selection in Fuzzy Decision Systems



201. Distillation of Large Language Models via Concrete Score Matching


202. Supporting Creative Ownership through Deep Learning-Based Music Variation


203. Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling


204. VELA: An LLM-Hybrid-as-a-Judge Approach for Evaluating Long Image Captions


205. Learning to Reason as Action Abstractions with Scalable Mid-Training RL


206. CardioForest: An Explainable Ensemble Learning Model for Automatic Wide QRS Complex Tachycardia Diagnosis from ECG


207. Better with Less: Small Proprietary Models Surpass Large Language Models in Financial Transaction Understanding


208. Point-It-Out: Benchmarking Embodied Reasoning for Vision Language Models in Multi-Stage Visual Grounding


209. Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation


210. Autonomy-Aware Clustering: When Local Decisions Supersede Global Prescriptions


211. V-HUB: A Visual-Centric Humor Understanding Benchmark for Video LLMs


212. Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs


213. TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning


214. Dolphin v1.0 Technical Report


215. Think Less, Label Better: Multi-Stage Domain-Grounded Synthetic Data Generation for Fine-Tuning Large Language Models in Telecommunications


216. Controlled Generation for Private Synthetic Text


217. Boundary-to-Region Supervision for Offline Safe Reinforcement Learning


218. Towards A Universally Transferable Acceleration Method for Density Functional Theory


219. The AI Productivity Index (APEX)


220. DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation


221. HNote: Extending YNote with Hexadecimal Encoding for Fine-Tuning LLMs in Music Modeling


222. Annotation-Efficient Active Test-Time Adaptation with Conformal Prediction


223. LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts


224. EEG-based AI-BCI Wheelchair Advancement: Hybrid Deep Learning with Motor Imagery for Brain Computer Interface



226. Capacity-Net-Based RIS Precoding Design without Channel Estimation for mmWave MIMO System


227. YOLO-Based Defect Detection for Metal Sheets


228. BaB-prob: Branch and Bound with Preactivation Splitting for Probabilistic Verification of Neural Networks


229. STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents


230. Quadratic Programming Approach for Nash Equilibrium Computation in Multiplayer Imperfect-Information Games


231. Unsupervised Detection of Spatiotemporal Anomalies in PMU Data Using Transformer-Based BiGAN


232. K-Prism: A Knowledge-Guided and Prompt Integrated Universal Medical Image Segmentation Model


233. AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs


234. Probing the Limits of Stylistic Alignment in Vision-Language Models


235. Hybrid Approach for Enhancing Lesion Segmentation in Fundus Images


236. Aligning Multilingual Reasoning with Verifiable Semantics from a High-Resource Expert Model


237. Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play


238. Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future Directions


239. Steering an Active Learning Workflow Towards Novel Materials Discovery via Queue Prioritization


240. Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning


241. VISOR++: Universal Visual Inputs based Steering for Large Vision Language Models


242. Calibrating Verbalized Confidence with Self-Generated Distractors


243. MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources


244. LLM-RG: Referential Grounding in Outdoor Scenarios using Large Language Models


245. Economic Competition, EU Regulation, and Executive Orders: A Framework for Discussing AI Policy Implications in CS Courses


246. XR Blocks: Accelerating Human-centered AI + XR Innovation


247. DeepFake Detection in Dyadic Video Calls using Point of Gaze Tracking


248. Not Wrong, But Untrue: LLM Overconfidence in Document-Based Queries


249. EMO-TTA: Improving Test-Time Adaptation of Audio-Language Models for Speech Emotion Recognition


250. Translation from Wearable PPG to 12-Lead ECG


251. Discontinuous Epitope Fragments as Sufficient Target Templates for Efficient Binder Design


252. Data-Efficient Multitask DAgger


253. PIPer: On-Device Environment Setup via Online Reinforcement Learning


254. Multi-patch isogeometric neural solver for partial differential equations on computer-aided design domains


255. Joint Embeddings Go Temporal


256. Beyond Noisy-TVs: Noise-Robust Exploration Via Learning Progress Monitoring


257. Polychromic Objectives for Reinforcement Learning


258. Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization


259. Rethinking Parameter Sharing for LLM Fine-Tuning with Multiple LoRAs


260. From Faithfulness to Correctness: Generative Reward Models that Think Critically


261. FlashOmni: A Unified Sparse Attention Engine for Diffusion Transformers


262. A Cartography of Open Collaboration in Open Source AI: Mapping Practices, Motivations, and Governance in 14 Open Large Language Model Projects


263. A Deep Learning Approach for Spatio-Temporal Forecasting of InSAR Ground Deformation in Eastern Ireland


264. SpinBench: Perspective and Rotation as a Lens on Spatial Reasoning in VLMs


265. Predicting Training Re-evaluation Curves Enables Effective Data Curriculums for LLMs


266. Let Physics Guide Your Protein Flows: Topology-aware Unfolding and Generation


267. Cold-Start Active Correlation Clustering


268. Generative Value Conflicts Reveal LLM Priorities


269. From Internal Representations to Text Quality: A Geometric Approach to LLM Evaluation


270. VisualOverload: Probing Visual Understanding of VLMs in Really Dense Scenes


271. Uncertainty-Aware Generative Oversampling Using an Entropy-Guided Conditional Variational Autoencoder


272. Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning


273. Automatically Generating Web Applications from Requirements Via Multi-Agent Test-Driven Development


274. Learning Relationships Between Separate Audio Tracks for Creative Applications


275. AI in Pakistani Schools: Adoption, Usage, and Perceived Impact among Educators


276. A Measurement Study of Model Context Protocol


277. ClustRecNet: A Novel End-to-End Deep Learning Framework for Clustering Algorithm Recommendation


278. Artificial Authority: From Machine Minds to Political Alignments. An Experimental Analysis of Democratic and Autocratic Biases in Large-Language Models


279. Effectiveness of Large Language Models in Simulating Regional Psychological Structures: An Empirical Examination of Personality and Subjective Well-being


280. VoiceBridge: Designing Latent Bridge Models for General Speech Restoration at Scale


281. DNABERT-2: Fine-Tuning a Genomic Language Model for Colorectal Gene Enhancer Classification


282. InfMasking: Unleashing Synergistic Information by Contrastive Multimodal Interactions


283. A Weather Foundation Model for the Power Grid


284. Dynamic Policy Induction for Adaptive Prompt Optimization: Bridging the Efficiency-Accuracy Gap via Lightweight Reinforcement Learning


285. Cognifying Education: Mapping AI’s transformative role in emotional, creative, and collaborative learning


286. From NL2SQL to NL2GeoSQL: GeoSQL-Eval for automated evaluation of LLMs on PostGIS queries


287. How Effective Are Time-Series Models for Rainfall Nowcasting? A Comprehensive Benchmark for Rainfall Nowcasting Incorporating PWV Data


288. Artificial Intelligence-Powered Assessment Framework for Skill-Oriented Engineering Lab Education


289. The Sandbox Configurator: A Framework to Support Technical Assessment in AI Regulatory Sandboxes


290. Knowledge distillation through geometry-aware representational alignment


291. BEV-VLM: Trajectory Planning via Unified BEV Abstraction


292. BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software


293. Protocode: Prototype-Driven Interpretability for Code Generation in LLMs


294. Comprehensive Analysis of VQC for Financial Fraud Detection: A Comparative Study of Quantum Encoding Techniques and Architectural Optimizations


295. Reinforcement Learning-Guided Chain-of-Draft for Token-Efficient Code Generation


296. A Benchmark for Localizing Code and Non-Code Issues in Software Projects


297. HAMMER: Hamiltonian Curiosity Augmented Large Language Model Reinforcement


298. PALADIN: Self-Correcting Language Model Agents to Cure Tool-Failure Cases


299. Quantum est in Libris: Navigating Archives with GenAI, Uncovering Tension Between Preservation and Innovation


300. Machine Learning for Pattern Detection in Printhead Nozzle Logging


301. FedCLF - Towards Efficient Participant Selection for Federated Learning in Heterogeneous IoV Networks


302. Energy Guided Geometric Flow Matching


303. Enhancing Linear Attention with Residual Learning


304. Learning to Condition: A Neural Heuristic for Scalable MPE Inference


305. On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs


306. Six Sigma For Neural Networks: Taguchi-based optimization


307. STCast: Adaptive Boundary Alignment for Global and Regional Weather Forecasting


308. Multi-level Diagnosis and Evaluation for Robust Tabular Feature Engineering with Large Language Models


309. Spectral Logit Sculpting: Adaptive Low-Rank Logit Transformation for Controlled Text Generation


310. Generating High-Quality Datasets for Code Editing via Open-Source Language Models


311. Towards Repository-Level Program Verification with Large Language Models


312. APRIL: API Synthesis with Automatic Prompt Optimization and Reinforcement Learning


313. Devstral: Fine-tuning Language Models for Coding Agent Applications


314. Perceptual Influence: Improving the Perceptual Loss Design for Low-Dose CT Enhancement


315. UML-CoT: Structured Reasoning and Planning with Unified Modeling Language for Robotic Room Cleaning


316. AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving


317. FMIP: Joint Continuous-Integer Flow For Mixed-Integer Linear Programming


318. Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking