전체 AI 논문 - 2026-01-30

1. Exploring Reasoning Reward Model for Agents


2. Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data


3. World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems


4. The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR


5. Defining Operational Conditions for Safety-Critical AI-Based Systems from Data


6. Optimizing Agentic Workflows using Meta-tools


7. CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty


8. Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference


9. Liquid Interfaces: A Dynamic Ontology for the Interoperability of Autonomous Systems


10. VERSA: Verified Event Data Format for Reliable Soccer Analytics


11. Mind the Gap: How Elicitation Protocols Shape the Stated-Revealed Preference Gap in Language Models


12. Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic


13. The Energy Impact of Domain Model Design in Classical Planning


14. How do Visual Attributes Influence Web Agents? A Comprehensive Evaluation of User Interface Design Factors


15. ToolWeaver: Weaving Collaborative Semantics for Scalable Tool Use in Large Language Models


16. Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities


17. AgenticSimLaw: A Juvenile Courtroom Multi-Agent Debate Simulation for Explainable High-Stakes Tabular Decision Making


18. Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning


19. JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG


20. ProRAG: Process-Supervised Reinforcement Learning for Retrieval-Augmented Generation


21. From Meta-Thought to Execution: Cognitively Aligned Post-Training for Generalizable and Reliable LLM Reasoning


22. Making Models Unmergeable via Scaling-Sensitive Loss Landscape


23. astra-langchain4j: Experiences Combining LLMs and Agent Programming


24. WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents


25. KnowBias: Mitigating Social Bias in LLMs via Know-Bias Neuron Enhancement


26. Bridging Forecast Accuracy and Inventory KPIs: A Simulation-Based Software Framework


27. Looking Beyond Accuracy: A Holistic Benchmark of ECG Foundation Models


28. CORE:Toward Ubiquitous 6G Intelligence Through Collaborative Orchestration of Large Language Model Agents Over Hierarchical Edge


29. A Unified XAI-LLM Approach for EndotrachealSuctioning Activity Recognition


30. BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics


31. Abstract Concept Modelling in Conceptual Spaces: A Study on Chess Strategies


32. Zero-Shot Statistical Downscaling via Diffusion Posterior Sampling


33. Language-based Trial and Error Falls Behind in the Era of Experience


34. Epistemic Context Learning: Building Trust the Right Way in LLM-Based Multi-Agent Systems


35. DropoutTS: Sample-Adaptive Dropout for Robust Time Series Forecasting


36. E-mem: Multi-agent based Episodic Context Reconstruction for LLM Agent Memory


37. FBS: Modeling Native Parallel Reading inside a Transformer


38. TCAP: Tri-Component Attention Profiling for Unsupervised Backdoor Detection in MLLM Fine-Tuning


39. SONIC-O1: A Real-World Benchmark for Evaluating Multimodal Large Language Models on Audio-Video Understanding


40. ScholarGym: Benchmarking Deep Research Workflows on Academic Literature Retrieval


41. Semantic Content Determines Algorithmic Performance


42. RecNet: Self-Evolving Preference Propagation for Agentic Recommender Systems


43. Search-Based Risk Feature Discovery in Document Structure Spaces under a Constrained Budget


44. CORE: Collaborative Reasoning via Cross Teaching


45. Beyond Imitation: Reinforcement Learning for Active Latent Planning


46. Depth-Recurrent Attention Mixtures: Giving Latent Reasoning the Attention it Deserves


47. Chain Of Thought Compression: A Theoritical Analysis


48. EmboCoach-Bench: Benchmarking AI Agents on Developing Embodied Robots


49. Meta Context Engineering via Agentic Skill Evolution


50. ShardMemo: Masked MoE Routing for Sharded Agentic LLM Memory


51. ARGORA: Orchestrated Argumentation for Causally Grounded LLM Reasoning and Decision Making


52. KAPSO: A Knowledge-grounded framework for Autonomous Program Synthesis and Optimization


53. LLaMEA-SAGE: Guiding Automated Algorithm Design with Structural Feedback from Explainable AI


54. The Effectiveness of Style Vectors for Steering Large Language Models: A Human Evaluation


55. MAR: Efficient Large Language Models via Module-aware Architecture Refinement


56. The Path of Least Resistance: Guiding LLM Reasining Trajectories with Prefix Consensus


57. ScaleSim: Serving Large-Scale Multi-Agent Simulation with Invocation Distance-Based Memory Management


58. MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning


59. Topeax – An Improved Clustering Topic Model with Density Peak Detection and Lexical-Semantic Term Importance


60. LION: A Clifford Neural Paradigm for Multimodal-Attributed Graph Learning


61. ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip Design


62. The Paradox of Robustness: Decoupling Rule-Based Logic from Affective Noise in High-Stakes Decision-Making


63. When Prohibitions Become Permissions: Auditing Negation Sensitivity in Language Models


64. System 1&2 Synergy via Dynamic Model Interpolation


65. DataCross: A Unified Benchmark and Agent Framework for Cross-Modal Heterogeneous Data Analysis


66. TeachBench: A Syllabus-Grounded Framework for Evaluating Teaching Ability in Large Language Models


67. NEMO: Execution-Aware Optimization Modeling via Autonomous Coding Agents


68. Hebbian Learning with Global Direction


69. Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization


70. BEAP-Agent: Backtrackable Execution and Adaptive Planning for GUI Agents


71. Dynamic Framework for Collaborative Learning: Leveraging Advanced LLM with Adaptive Feedback Mechanisms


72. Ostrakon-VL: Towards Domain-Expert MLLM for Food-Service and Retail Stores


73. EHR-RAG: Bridging Long-Horizon Structured Electronic Health Records and Large Language Models via Enhanced Retrieval-Augmented Generation


74. Within-Model vs Between-Prompt Variability in Large Language Models for Creative Tasks


75. Modeling Endogenous Logic: Causal Neuro-Symbolic Reasoning Model for Explainable Multi-Behavior Recommendation


76. White-Box Op-Amp Design via Human-Mimicking Reasoning


77. Drive-KD: Multi-Teacher Distillation for VLMs in Autonomous Driving


78. Position: Certifiable State Integrity in Cyber-Physical Systems – Why Modular Sovereignty Solves the Plasticity-Stability Paradox


79. TIDE: Tuning-Integrated Dynamic Evolution for LLM-Based Automated Heuristic Design


80. Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs


81. Delegation Without Living Governance


82. Causal Discovery for Explainable AI: A Dual-Encoding Approach


83. Intelli-Planner: Towards Customized Urban Planning via Large Language Model Empowered Reinforcement Learning


84. Uncovering Hidden Correctness in LLM Causal Reasoning via Symbolic Verification


85. When should I search more: Adaptive Complex Query Optimization with Reinforcement Learning


86. Do Reasoning Models Enhance Embedding Models?


87. Sycophantic Anchors: Localizing and Quantifying User Agreement in Reasoning Models


88. MAD: Modality-Adaptive Decoding for Mitigating Cross-Modal Hallucinations in Multimodal Large Language Models


89. FrontierScience: Evaluating AI’s Ability to Perform Expert-Level Scientific Tasks


90. Concise Geometric Description as a Bridge: Unleashing the Potential of LLM for Plane Geometry Problem Solving


91. Bridging the Arithmetic Gap: The Cognitive Complexity Benchmark and Financial-PoT for Robust Financial Reasoning


92. BrainStack: Neuro-MoE with Functionally Guided Expert Routing for EEG-Based Language Decoding


93. What You Feel Is Not What They See: On Predicting Self-Reported Emotion from Third-Party Observer Labels


94. Beyond a Single Reference: Training and Evaluation with Paraphrases in Sign Language Translation


95. CUA-Skill: Develop Skills for Computer Using Agent


96. Planner-Auditor Twin: Agentic Discharge Planning with FHIR-Based LLM Planning, Guideline Recall, Optional Caching and Self-Improvement


97. How does information access affect LLM monitors’ ability to detect sabotage?


98. Magellan: Autonomous Discovery of Novel Compiler Optimization Heuristics with AlphaEvolve


99. Responsible AI: The Good, The Bad, The AI


100. OpenSec: Measuring Incident Response Agent Calibration Under Adversarial Evidence


101. Multi-modal Imputation for Alzheimer’s Disease Classification


102. Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report


103. QUARK: Robust Retrieval under Non-Faithful Queries via Query-Anchored Aggregation


104. Unplugging a Seemingly Sentient Machine Is the Rational Choice – A Metaphysical Perspective


105. Bayesian-LoRA: Probabilistic Low-Rank Adaptation of Large Language Models


106. The Epistemic Planning Domain Definition Language: Official Guideline


107. Do LLMs Favor LLMs? Quantifying Interaction Effects in Peer Review


108. RedSage: A Cybersecurity Generalist LLM


109. Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts


110. DynaWeb: Model-Based Reinforcement Learning of Web Agents


111. Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers


112. PRISM: Distribution-free Adaptive Computation of Matrix Functions for Accelerating Neural Network Training


113. StepShield: When, Not Whether to Intervene on Rogue Agents


114. SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents



116. SINA: A Circuit Schematic Image-to-Netlist Generator Using Artificial Intelligence


117. Value-Based Pre-Training with Downstream Feedback


118. ECO: Quantized Training without Full-Precision Master Weights


119. Investigating Associational Biases in Inter-Model Communication of Large Generative Models


120. Latent Adversarial Regularization for Offline Preference Optimization


121. Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models


122. Unsupervised Decomposition and Recombination with Discriminator-Driven Diffusion Models


123. MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources


124. SIA: Symbolic Interpretability for Anticipatory Deep Reinforcement Learning in Network Control


125. Learning to Communicate Across Modalities: Perceptual Heterogeneity in Multi-Agent Systems


126. A Separable Architecture for Continuous Token Representation in Language Models


127. Thinking Out of Order: When Output Order Stops Reflecting Reasoning Order in Diffusion Language Models


128. When “Better” Prompts Hurt: Evaluation-Driven Iteration for LLM Applications


129. SymbXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks


130. Vidmento: Creating Video Stories Through Context-Aware Expansion With Generative Video


131. MEIDNet: Multimodal generative AI framework for inverse materials design


132. Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units


133. Geometry of Drifting MDPs with Path-Integral Stability Certificates


134. Generalized Information Gathering Under Dynamics Uncertainty


135. From Particles to Agents: Hallucination as a Metric for Cognitive Friction in Spatial Simulation


136. MoE-ACT: Improving Surgical Imitation Learning Policies through Supervised Mixture-of-Experts


137. Token-Guard: Towards Token-Level Hallucination Control via Self-Checking Decoding


138. Industrialized Deception: The Collateral Effects of LLM-Generated Misinformation on Digital Ecosystems


139. Robust Multimodal Representation Learning in Healthcare


140. From Future of Work to Future of Workers: Addressing Asymptomatic AI Harms for Dignified Human-AI Interaction


141. TraceRouter: Robust Safety for Large Foundation Models via Path-Level Intervention


142. Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text


143. Improving Classifier-Free Guidance of Flow Matching via Manifold Projection


144. MoHETS: Long-term Time Series Forecasting with Mixture-of-Heterogeneous-Experts


145. Test-Time Compute Games


146. Trustworthy Intelligent Education: A Systematic Perspective on Progress, Challenges, and Future Directions


147. Moral Outrage Shapes Commitments Beyond Attention: Multimodal Moral Emotions on YouTube in Korea and the US


148. A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting


149. KID: Knowledge-Injected Dual-Head Learning for Knowledge-Grounded Harmful Meme Detection


150. Effective LoRA Adapter Routing using Task Representations


151. ECSEL: Explainable Classification via Signomial Equation Learning


152. Assessing the Business Process Modeling Competences of Large Language Models


153. Synthetic-to-Real Domain Bridging for Single-View 3D Reconstruction of Ships for Maritime Monitoring


154. CoFrGeNet: Continued Fraction Architectures for Language Generation


155. EWSJF: An Adaptive Scheduler with Hybrid Partitioning for Mixed-Workload LLM Inference


156. Temporal Sepsis Modeling: a Fully Interpretable Relational Way


157. Why Adam Works Better with $β_1 = β_2$: The Missing Gradient Scale Invariance Principle


158. From Global to Granular: Revealing IQA Model Performance via Correlation Surface


159. Enhancing Language Models for Robust Greenwashing Detection


160. When does predictive inverse dynamics outperform behavior cloning?


161. DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning


162. Disentangling perception and reasoning for improving data efficiency in learning cloth manipulation without demonstrations


163. TACLer: Tailored Curriculum Reinforcement Learning for Efficient Reasoning


164. Toward Culturally Aligned LLMs through Ontology-Guided Multi-Agent Reasoning


165. Curriculum Learning for LLM Pretraining: An Analysis of Learning Dynamics


166. XFACTORS: Disentangled Information Bottleneck via Contrastive Supervision


167. FIT: Defying Catastrophic Forgetting in Continual LLM Unlearning


168. Expected Return Causes Outcome-Level Mode Collapse in Reinforcement Learning and How to Fix It with Inverse Probability Scaling


169. SENDAI: A Hierarchical Sparse-measurement, EfficieNt Data AssImilation Framework


170. Gauge-invariant representation holonomy


171. When Life Gives You AI, Will You Turn It Into A Market for Lemons? Understanding How Information Asymmetries About AI System Capabilities Affect Market Outcomes and Adoption


172. SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning


173. ILRR: Inference-Time Steering Method for Masked Diffusion Language Models


174. Seg-MoE: Multi-Resolution Segment-wise Mixture-of-Experts for Time Series Forecasting Transformers


175. HeRo-Q: A General Framework for Stable Low Bit Quantization via Hessian Conditioning


176. Breaking the Overscaling Curse: Thinking Parallelism Before Parallel Thinking


177. Beyond Parameter Finetuning: Test-Time Representation Refinement for Node Classification


178. Representation-Regularized Convolutional Audio Transformer for Audio Understanding


179. Thinking Broad, Acting Fast: Latent Reasoning Distillation from Multi-Perspective Chain-of-Thought for E-Commerce Relevance


180. Dynamics Reveals Structure: Challenging the Linear Propagation Assumption


181. Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening


182. Signal-Adaptive Trust Regions for Gradient-Free Optimization of Recurrent Spiking Neural Networks


183. Shaping capabilities with token-level data filtering


184. SAL: Selective Adaptive Learning for Backpropagation-Free Training with Sparsification


185. Training slow silicon neurons to control extremely fast robots with spiking reinforcement learning


186. Multi-Modal Time Series Prediction via Mixture of Modulated Experts


187. Bi-Anchor Interpolation Solver for Accelerating Generative Modeling


188. On the Adversarial Robustness of Large Vision-Language Models under Visual Token Compression


189. Sustainable Materials Discovery in the Era of Artificial Intelligence


190. More Bang for the Buck: Improving the Inference of Large Language Models at a Fixed Budget using Reset and Discard (ReD)


191. SimGraph: A Unified Framework for Scene Graph-Based Image Generation and Editing


192. Mean-Field Control on Sparse Graphs: From Local Limits to GNNs via Neighborhood Distributions


193. Task-free Adaptive Meta Black-box Optimization


194. Adaptive Confidence Gating in Multi-Agent Collaboration for Efficient and Optimized Code Generation


195. Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation


196. Unifying Speech Editing Detection and Content Localization via Prior-Enhanced Audio LLMs


197. L$^3$: Large Lookup Layers


198. HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing


199. SAGE: Sequence-level Adaptive Gradient Evolution for Generative Recommendation


200. Spava: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention


201. From Consistency to Complementarity: Aligned and Disentangled Multi-modal Learning for Time Series Understanding and Reasoning


202. Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning


203. Intrinsic Reward Policy Optimization for Sparse-Reward Environments


204. Understanding Frechet Speech Distance for Synthetic Speech Quality Evaluation


205. Sim-MSTNet: sim2real based Multi-task SpatioTemporal Network Traffic Forecasting


206. The Compliance Paradox: Semantic-Instruction Decoupling in Automated Academic Code Evaluation


207. Theoretically Optimal Attention/FFN Ratios in Disaggregated LLM Serving


208. L2R: Low-Rank and Lipschitz-Controlled Routing for Mixture-of-Experts


209. Memorization Control in Diffusion Models from Denoising-centric Perspective


210. Self-Improving Pretraining: using post-trained models to pretrain better models


211. Adversarial Vulnerability Transcends Computational Paradigms: Feature Engineering Provides No Defense Against Neural Adversarial Transfer


212. Heterogeneous Vertiport Selection Optimization for On-Demand Air Taxi Services: A Deep Reinforcement Learning Approach


213. Distributionally Robust Classification for Multi-source Unsupervised Domain Adaptation


214. The Surprising Difficulty of Search in Model-Based Reinforcement Learning


215. Grounding and Enhancing Informativeness and Utility in Dataset Distillation


216. Physics-Guided Tiny-Mamba Transformer for Reliability-Aware Early Fault Warning


217. Zenith: Scaling up Ranking Models for Billion-scale Livestreaming Recommendation


218. PILD: Physics-Informed Learning via Diffusion


219. DUET: Distilled LLM Unlearning from an Efficiently Contextualized Teacher


220. NEXUS: Bit-Exact ANN-to-SNN Equivalence via Neuromorphic Gate Circuits with Surrogate-Free Training


221. GeoRC: A Benchmark for Geolocation Reasoning Chains


222. More Code, Less Reuse: Investigating Code Quality and Reviewer Sentiment towards AI-generated Pull Requests


223. Lightweight High-Fidelity Low-Bitrate Talking Face Compression for 3D Video Conference


224. Music Plagiarism Detection: Problem Formulation and a Segment-based Solution


225. Hypersolid: Emergent Vision Representations via Short-Range Repulsion


226. Conditional Generative Framework with Peak-Aware Attention for Robust Chemical Detection under Interferences


227. Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification


228. Understanding Diffusion Models via Ratio-Based Function Approximation with SignReLU Networks


229. PTQ4ARVG: Post-Training Quantization for AutoRegressive Visual Generation Models


230. SHARP: Social Harm Analysis via Risk Profiles for Measuring Inequities in Large Language Models


231. MGSM-Pro: A Simple Strategy for Robust Multilingual Mathematical Reasoning Evaluation


232. Temporal Context and Architecture: A Benchmark for Naturalistic EEG Decoding


233. A Sheaf-Theoretic and Topological Perspective on Complex Network Modeling and Attention Mechanisms in Graph Neural Models


234. Scaling Embeddings Outperforms Scaling Experts in Language Models


235. Thinker: A vision-language foundation model for embodied intelligence


236. ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling


237. From Linear Input to Hierarchical Structure: Function Words as Statistical Cues for Language Learning


238. Adaptive and Robust Cost-Aware Proof of Quality for Decentralized LLM Inference Networks


239. Rethinking Refinement: Correcting Generative Bias without Noise Injection


240. AC2L-GAD: Active Counterfactual Contrastive Learning for Graph Anomaly Detection


241. Output-Space Search: Targeting LLM Generations in a Frozen Encoder-Defined Output Space


242. A2RAG: Adaptive Agentic Graph Retrieval for Cost-Aware and Reliable Reasoning


243. Can Neural Networks Learn Small Algebraic Worlds? An Investigation Into the Group-theoretic Structures Learned By Narrow Models Trained To Predict Group Operations


244. Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement


245. Optimization and Mobile Deployment for Anthropocene Neural Style Transfer


246. PhaseCoder: Microphone Geometry-Agnostic Spatial Audio Understanding for Multimodal LLMs


247. AI-Assisted Engineering Should Track the Epistemic Status and Temporal Validity of Architectural Decisions


248. Multi-task Code LLMs: Data Mix or Model Merge?


249. SteerEval: A Framework for Evaluating Steerability with Natural Language Profiles for Recommendation


250. Safety Generalization Under Distribution Shift in Safe Reinforcement Learning: A Diabetes Testbed


251. Deep Reinforcement Learning for Fault-Adaptive Routing in Eisenstein-Jacobi Interconnection Topologies


252. LOCUS: Low-Dimensional Model Embeddings for Efficient Model Exploration, Comparison, and Selection


253. Towards Comprehensive Benchmarking Infrastructure for LLMs In Software Engineering


254. Textual Equilibrium Propagation for Deep Compound AI Systems


255. SMKC: Sketch Based Kernel Correlation Images for Variable Cardinality Time Series Anomaly Detection


256. Log2Motion: Biomechanical Motion Synthesis from Touch Logs


257. Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning


258. SIGMA-PPG: Statistical-prior Informed Generative Masking Architecture for PPG Foundation Model


259. “Unlimited Realm of Exploration and Experimentation”: Methods and Motivations of AI-Generated Sexual Content Creators


260. Conditional Denoising Model as a Physical Surrogate Model


261. Solver-in-the-Loop: MDP-Based Benchmarks for Self-Correction and Behavioral Rationality in Operations Research


262. UrduBench: An Urdu Reasoning Benchmark using Contextually Ensembled Translations with Human-in-the-Loop


263. The Depth Delusion: Why Transformers Should Be Wider, Not Deeper


264. Noisy but Valid: Robust Statistical Evaluation of LLMs with Imperfect Judges


265. Non-Markov Multi-Round Conversational Image Generation with History-Conditioned MLLMs


266. Denoising and Baseline Correction of Low-Scan FTIR Spectra: A Benchmark of Deep Learning Models Against Traditional Signal Processing


267. ICON: Intent-Context Coupling for Efficient Multi-Turn Jailbreak Attack


268. Finetune-Informed Pretraining Boosts Downstream Performance


269. DevOps-Gym: Benchmarking AI Agents in Software DevOps Cycle


270. STAER: Temporal Aligned Rehearsal for Continual Spiking Neural Network


271. Integrating Color Histogram Analysis and Convolutional Neural Network for Skin Lesion Classification


272. Rethinking LLM-Driven Heuristic Design: Generating Efficient and Specialized Solvers via Dynamics-Aware Optimization


273. Generalizable Prompt Tuning for Audio-Language Models via Semantic Expansion


274. LSR-Net: A Lightweight and Strong Robustness Network for Bearing Fault Diagnosis in Noise Environment


275. Implementing AI Bill of Materials (AI BOM) with SPDX 3.0: A Comprehensive Guide to Creating AI and Dataset Bill of Materials