[arXiv Digest] 2025-07-24

1. Online Submission and Evaluation System Design for Competition Operations

Authors: Zhe Chen, Daniel Harabor, Ryan Hechnenberger, Nathan R. Sturtevant
URL: https://arxiv.org/abs/2507.17730
요약 (영문): research communities have developed benchmark datasets across domains to compare the performance of algorithms and techniques . many of them claim to represent the state-of-the-art . to address this, research communities often organise periodic competitions .
요약 (한글): 연구 커뮤니티는 알고리즘과 기술의 성능을 비교하기 위해 여러 도메인에 걸쳐 벤치마크 데이터 세트를 개발했습니다. 이들 중 다수는 최신 기술을 대표한다고 주장합니다. 이를 해결하기 위해 연구 커뮤니티는 종종 주기적으로 대회를 개최합니다.

2. Thinking Isn’t an Illusion: Overcoming the Limitations of Reasoning Models via Tool Augmentations

Authors: Zhao Song, Song Yue, Jiahao Zhang
URL: https://arxiv.org/abs/2507.17699
요약 (영문): large language models are a central focus in today’s large language model (LLM) research . models are designed to output a step-by-step thinking process before arriving at a final answer to handle complex reasoning tasks . this thinking process may not actually enhance reasoning ability .
요약 (한글): 대규모 언어 모델은 오늘날 대규모 언어 모델(LLM) 연구의 중심입니다. 모델은 복잡한 추론 작업을 처리하기 위해 최종 답변에 도달하기 전에 단계별 사고 과정을 출력하도록 설계되었습니다. 이 사고 과정은 실제로 추론 능력을 향상시키지 않을 수 있습니다.

3. Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks

Authors: Ilias Chatzistefanidis, Navid Nikaein
URL: https://arxiv.org/abs/2507.17695
요약 (영문): autonomous agents are expected to play a vital role in the evolution of 6G networks . this shift facilitates the transition from a specialized intelligence approach where artificial intelligence algorithms handle isolated tasks . agents possess broader reasoning capabilities and can manage diverse network fun .
요약 (한글): 자율 에이전트는 6G 네트워크의 진화에 중요한 역할을 할 것으로 예상됩니다. 이러한 변화는 인공지능 알고리즘이 고립된 작업을 처리하는 전문화된 지능 접근 방식에서 전환을 촉진합니다. 에이전트는 더 광범위한 추론 능력을 보유하고 다양한 네트워크 재미를 관리할 수 있습니다.

4. Simulating multiple human perspectives in socio-ecological systems using large language models

Authors: Yongchao Zeng, Calum Brown, Ioannis Kyriakou, Ronja Hotz, Mark Rounsevell
URL: https://arxiv.org/abs/2507.17680
요약 (영문): to enable alternative, simulation-based exploration of different stakeholder perspectives, we develop the HoPeS (Human-Oriented Perspective Shifting) modelling framework . users can step into the agent roles to experience perspectival differences .
요약 (한글): 다양한 이해관계자의 관점에 대한 시뮬레이션 기반의 대안적 탐색을 지원하기 위해 유니티는 인간 중심의 관점 전환(HoPeS) 모델링 프레임워크를 개발했습니다. 사용자는 에이전트 역할에 들어가 관점의 차이를 경험할 수 있습니다.

5. Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning

Authors: Xinyao Liu, Diping Song
URL: https://arxiv.org/abs/2507.17539
요약 (영문): multimodal large language models (MLLMs) demonstrate significant potential in the field of medical diagnosis . however, they face critical challenges in specialized domains such as ophthalmology, particularly the fragmentation of annotation granularity .
요약 (한글): 다중 모드 대규모 언어 모델(MLLM)은 의료 진단 분야에서 상당한 잠재력을 보여주지만 안과와 같은 전문 영역, 특히 주석 세분화의 파편화라는 중요한 과제에 직면해 있습니다.

6. TAI Scan Tool: A RAG-Based Tool With Minimalistic Input for Trustworthy AI Self-Assessment

Authors: Athanasios Davvetas, Xenia Ziouvelou, Ypatia Dami, Alexis Kaponis, Konstantina Giouvanopoulou, Michael Papademas
URL: https://arxiv.org/abs/2507.17514
요약 (영문): the current version of the tool supports the legal TAI assessment . it involves a two-step approach with a pre-screening and an assessment phase . the tool includes insight regarding the risk-level of the AI system according to the AI Act .
요약 (한글): 현재 버전의 도구는 법적 TAI 평가를 지원합니다. 사전 심사 및 평가 단계로 구성된 2단계 접근 방식이 포함됩니다. 이 도구에는 AI 법에 따른 AI 시스템의 위험 수준에 대한 통찰력이 포함되어 있습니다.

7. Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Authors: Yu Li, Zhuoshi Pan, Honglin Lin, Mengyuan Sun, Conghui He, Lijun Wu
URL: https://arxiv.org/abs/2507.17512
요약 (영문): reinforcement learning with verifiable rewards (RLVR) has emerged as a powerful paradigm for enhancing the reasoning capabilities of LLMs . existing research has concentrated on isolated reasoning domains such as mathematical problem-solving, coding tasks, or logical reasoning .
요약 (한글): 검증 가능한 보상을 통한 강화 학습(RLVR)은 LLM의 추론 능력을 향상시키는 강력한 패러다임으로 부상했습니다. 기존 연구는 수학적 문제 해결, 코딩 작업 또는 논리적 추론과 같은 고립된 추론 영역에 집중되어 있습니다.

8. Automated Hybrid Grounding Using Structural and Data-Driven Heuristics

Authors: Alexander Beiser, Markus Hecher, Stefan Woltran
URL: https://arxiv.org/abs/2507.17493
요약 (영문): hybrid grounding is a step in alleviating the bottleneck by combining the strength of standard bottom-up grounding with recently proposed techniques where rule bodies are decoupled during grounding . however, it has remained unclear when to use body-decoupled grounding and when . to use standard bottom up grounding.
요약 (한글): 하이브리드 접지는 표준 상향식 접지의 강점과 접지 중에 룰 바디를 분리하는 최근 제안된 기술을 결합하여 병목 현상을 완화하는 단계이지만, 언제 바디 분리 접지를 사용하고 언제 표준 상향식 접지를 사용해야 하는지는 아직 명확하지 않은 상태입니다.

9. CQE under Epistemic Dependencies: Algorithms and Experiments (extended version)

Authors: Lorenzo Marconi, Flavia Ricci, Riccardo Rosati
URL: https://arxiv.org/abs/2507.17487
요약 (영문): controlled Query Evaluation (CQE) over ontologies is regulated by epistemic dependencies . we combine EDs with the notion of optimal GA censors .
요약 (한글): 온톨로지에 대한 제어 쿼리 평가(CQE)는 인식 의존성에 의해 규제됩니다. 우리는 ED를 최적의 GA 검열이라는 개념과 결합합니다.

10. LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning

Authors: Luca Salvatore Lorello, Nikolaos Manginas, Marco Lippi, Stefano Melacci
URL: https://arxiv.org/abs/2507.17482
요약 (영문): neuro-symbolic artificial intelligence aims to combine neural architectures with symbolic approaches that can represent knowledge in a human-interpretable formalism . most of the existing approaches are applied to static scenarios only, and the challenging setting where reasoning along the temporal dimension is nas .
요약 (한글): 신경 기호 인공지능은 신경 아키텍처와 인간이 해석 가능한 형식주의로 지식을 표현할 수 있는 기호적 접근 방식을 결합하는 것을 목표로 합니다. 기존의 접근 방식은 대부분 정적 시나리오에만 적용되며, 시간적 차원을 따라 추론하는 어려운 환경은 nas .

11. An Uncertainty-Driven Adaptive Self-Alignment Framework for Large Language Models

Authors: Haoran Sun, Zekun Zhang, Shaoning Zeng
URL: https://arxiv.org/abs/2507.17477
요약 (영문): large language models have demonstrated remarkable progress in instruction following and general-purpose reasoning . but achieving high-quality alignment with human intent and safety norms without human annotations remains a fundamental challenge .
요약 (한글): 대규모 언어 모델은 인스트럭션 추종 및 범용 추론에서 괄목할 만한 진전을 보였지만, 사람의 주석 없이 사람의 의도 및 안전 규범과 고품질로 일치시키는 것은 여전히 근본적인 과제로 남아 있습니다.

12. Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning

Authors: Joobin Jin, Seokjun Hong, Gyeongseon Baek, Yeeun Kim, Byeongjoon Noh
URL: https://arxiv.org/abs/2507.17418
요약 (영문): we propose a context-aware trajectory generation framework that synthesizes realistic urban driving behaviors using GAIL . the model addresses nonlinear interdependencies and training instability inherent in microscopic settings .
요약 (한글): 이 모델은 미시적 환경에 내재된 비선형 상호 의존성과 훈련 불안정성을 해결하기 위해 GAIL을 사용하여 현실적인 도시 주행 행동을 합성하는 상황 인식 궤적 생성 프레임워크를 제안합니다.

13. Compliance Brain Assistant: Conversational Agentic AI for Assisting Compliance Tasks in Enterprise Environments

Authors: Shitong Zhu, Chenhao Fang, Derek Larson, Neel Reddy Pochareddy, Rajeev Rao, Sophie Zeng, Yanqing Peng, Wendy Summer, Alex Goncalves, Arya Pudota, Herve Robert
URL: https://arxiv.org/abs/2507.17289
요약 (영문): compliance brain assistant (CBA) is a conversational, agentic AI assistant designed to boost the efficiency of compliance tasks for personnel in enterprise environments . we design a user query router that can choose between (i) FastTrack mode: to handle simple requests that only need additional relevant context retrieved from knowledge corpora .
요약 (한글): 컴플라이언스 브레인 어시스턴트(CBA)는 기업 환경의 직원을 위한 컴플라이언스 업무의 효율성을 높이기 위해 설계된 대화형 AI 어시스턴트로, (i) 패스트트랙 모드: 지식 코퍼라에서 검색된 추가 관련 맥락만 필요한 간단한 요청을 처리하는 사용자 쿼리 라우터를 설계합니다.

14. Students’ Feedback Requests and Interactions with the SCRIPT Chatbot: Do They Get What They Ask For?

Authors: Andreas Scholl, Natalie Kiesler
URL: https://arxiv.org/abs/2507.17258
요약 (영문): a chatbot based on ChatGPT-4o-mini supports novice learners . the tool allows for open-ended interactions and structured guidance through predefined prompts .
요약 (한글): ChatGPT-4o-mini 기반의 챗봇이 초보 학습자를 지원합니다. 이 도구는 사전 정의된 프롬프트를 통해 개방형 상호 작용과 구조화된 안내를 허용합니다.

15. Agent Identity Evals: Measuring Agentic Identity

Authors: Elija Perrier, Michael Timothy Bennett
URL: https://arxiv.org/abs/2507.17257
요약 (영문): central to agentic capability and trustworthiness of language model agents is the extent they maintain stable, reliable identity over time . however, LMAs inherit pathologies from large language models (LLMs) which can undermine their identifiability, continuity, persistence and consistency by interfering with their agentic capab .
요약 (한글): 언어 모델 에이전트의 에이전트 역량과 신뢰성의 핵심은 시간이 지나도 안정적이고 신뢰할 수 있는 정체성을 유지하는 정도입니다. 그러나 LMA는 에이전트 역량을 방해하여 식별성, 연속성, 지속성 및 일관성을 약화시킬 수 있는 대규모 언어 모델(LLM)의 병리 현상을 상속받습니다.

16. Our Cars Can Talk: How IoT Brings AI to Vehicles

Authors: Amod Kant Agrawal
URL: https://arxiv.org/abs/2507.17214
요약 (영문): bringing AI to vehicles and enabling them as sensing platforms is key to transforming maintenance from reactive to proactive . this article offers a conceptual and technical perspective intended to spark interdisciplinary dialogue .
요약 (한글): 차량에 AI를 도입하고 이를 감지 플랫폼으로 활용하는 것은 유지보수를 사후 대응에서 사전 예방으로 전환하는 데 있어 핵심입니다. 이 글에서는 학제 간 대화를 촉발하기 위한 개념적, 기술적 관점을 제공합니다.

17. Improving LLMs’ Generalized Reasoning Abilities by Graph Problems

Authors: Qifan Zhang, Nuo Chen, Zehua Li, Miao Peng, Jing Tang, Jia Li
URL: https://arxiv.org/abs/2507.17168
요약 (영문): large language models have made remarkable strides in reasoning tasks . but their performance often falters on novel and complex problems . we pioneer the use of Graph Problem Reasoning (GPR) to enhance the general reasoning capabilities of LLMs.
요약 (한글): 대규모 언어 모델은 추론 작업에서 괄목할 만한 발전을 이루었지만, 새롭고 복잡한 문제에서는 종종 그 성능이 흔들리는 경우가 많습니다. 저희는 LLM의 일반적인 추론 능력을 향상시키기 위해 그래프 문제 추론(GPR)을 선도적으로 사용하고 있습니다.

18. HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study

Authors: Mandar Pitale, Jelena Frtunikj, Abhinaw Priyadershi, Vasu Singh, Maria Spence
URL: https://arxiv.org/abs/2507.17118
요약 (영문): the architecture of recent autonomous systems is trending toward end-to-end (E2E) monolithic architectures such as large language models (LLMs) and vision language models .
요약 (한글): 최근 자율 시스템의 아키텍처는 대규모 언어 모델(LLM) 및 비전 언어 모델과 같은 엔드투엔드(E2E) 모놀리식 아키텍처를 지향하는 추세입니다.

19. Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility

Authors: Melih Barsbey, Lucas Prieto, Stefanos Zafeiriou, Tolga Birdal
URL: https://arxiv.org/abs/2507.17748
요약 (영문): we position high learning rates as a facilitator for achieving robustness to spurious correlations and network compressibility . large learning rates also produce desirable representation properties such as invariant feature utilization, class separation, and activation sparsity .
요약 (한글): 높은 학습 속도는 가짜 상관관계에 대한 견고성과 네트워크 압축성을 달성하기 위한 촉진제로서, 큰 학습 속도는 불변 특징 활용, 클래스 분리, 활성화 희소성과 같은 바람직한 표현 속성을 생성합니다.

20. Pretraining on the Test Set Is No Longer All You Need: A Debate-Driven Approach to QA Benchmarks

Authors: Linbo Cao, Jinman Zhao
URL: https://arxiv.org/abs/2507.17747
요약 (영문): we propose a debate-driven evaluation paradigm that transforms any existing QA dataset into structured adversarial debates . one model is given the official answer to defend, and another constructs and defends an alternative answer .
요약 (한글): 기존의 모든 QA 데이터 세트를 구조화된 적대적 토론으로 변환하는 토론 중심 평가 패러다임을 제안합니다. 한 모델에는 방어할 공식 답변이 주어지고 다른 모델은 대안 답변을 구성하고 방어합니다.

21. Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

Authors: Anisha Gunjal, Anthony Wang, Elaine Lau, Vaskar Nath, Bing Liu, Sean Hendryx
URL: https://arxiv.org/abs/2507.17746
요약 (영문): many tasks lack a single, unambiguous ground truth-making it difficult to define reliable reward signals . traditional preference-based methods offer a workaround, but they rely on opaque reward functions that are difficult to interpret and prone to spurious correlations .
요약 (한글): 많은 작업에는 명확하고 단일한 근거가 없어 신뢰할 수 있는 보상 신호를 정의하기 어렵습니다. 기존의 선호도 기반 방법은 해결책을 제시하지만, 해석하기 어렵고 잘못된 상관관계가 발생하기 쉬운 불투명한 보상 함수에 의존합니다.

22. Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

Authors: Yiwen Chen, Zhihao Li, Yikai Wang, Hu Zhang, Qin Li, Chi Zhang, Guosheng Lin
URL: https://arxiv.org/abs/2507.17745
요약 (영문): new advances in sparse voxel representations have significantly improved the quality of 3D content generation . existing frameworks suffer from severe computational inefficiencies due to the quadratic complexity of attention mechanisms in their two-stage diffusion pipelines .
요약 (한글): 스파스 복셀 표현의 새로운 발전으로 3D 콘텐츠 생성 품질이 크게 향상되었습니다. 기존 프레임워크는 2단계 확산 파이프라인에서 주의 메커니즘의 4차적 복잡성으로 인해 심각한 계산 비효율성을 겪고 있습니다.

23. Yume: An Interactive World Generation Model

Authors: Xiaofeng Mao, Shaoheng Lin, Zhen Li, Chuanhao Li, Wenshuo Peng, Tong He, Jiangmiao Pang, Mingmin Chi, Yu Qiao, Kaipeng Zhang
URL: https://arxiv.org/abs/2507.17744
요약 (영문): method aims to create an interactive, realistic, and dynamic world . it allows exploration and control using peripheral devices or neural signals . the framework consists of four main components .
요약 (한글): 방법은 상호작용적이고 사실적이며 역동적인 세계를 만드는 것을 목표로 합니다. 주변 장치 또는 신경 신호를 사용하여 탐색하고 제어할 수 있습니다. 프레임워크는 네 가지 주요 구성 요소로 구성됩니다.

24. Flow Matching Meets Biology and Life Science: A Survey

Authors: Zihao Li, Zhichen Zeng, Xiao Lin, Feihao Fang, Yanru Qu, Zhe Xu, Zhining Liu, Xuying Ning, Tianxin Wei, Ge Liu, Hanghang Tong, Jingrui He
URL: https://arxiv.org/abs/2507.17731
요약 (영문): advances in generative modeling, such as generative adversarial networks, have transformed biological research and discovery . flow matching has emerged as a powerful and efficient alternative to diffusio .
요약 (한글): 생성적 적대적 네트워크와 같은 생성적 모델링의 발전은 생물학적 연구와 발견을 변화시켰습니다. 흐름 매칭은 확산에 대한 강력하고 효율적인 대안으로 떠올랐습니다.

25. On the Interaction of Compressibility and Adversarial Robustness

Authors: Melih Barsbey, Antônio H. Ribeiro, Umut Şimşekli, Tolga Birdal
URL: https://arxiv.org/abs/2507.17725
요약 (영문): neural networks are expected to simultaneously satisfy a host of desirable properties: accurate fitting to training data, generalization to unseen inputs, parameter and computational efficiency, and robustness to adversarial perturbations . a unified understanding of their interaction remains elusive .
요약 (한글): 신경망은 훈련 데이터에 대한 정확한 피팅, 보이지 않는 입력에 대한 일반화, 파라미터 및 계산 효율성, 적대적 섭동에 대한 견고성 등 여러 가지 바람직한 특성을 동시에 만족해야 하지만, 이들의 상호작용에 대한 통합적인 이해는 여전히 어려운 과제입니다.

26. AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer

Authors: Danny D. Leybzon, Shreyas Tirumala, Nishant Jain, Summer Gillen, Michael Jackson, Cameron McPhee, Jennifer Schmidt
URL: https://arxiv.org/abs/2507.17718
요약 (영문): quantitative survey researchers can scale quantitative studies by using AI to conduct phone interviews . voice AI enables a more natural and adaptive respondent experience as iVR .
요약 (한글): 정량적 설문조사 연구자는 AI를 사용하여 전화 인터뷰를 수행함으로써 정량적 연구를 확장할 수 있습니다. 음성 AI는 iVR처럼 보다 자연스럽고 적응력 있는 응답자 경험을 가능하게 합니다.

27. From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes

Authors: Karen Zhou, John Giorgi, Pranav Mani, Peng Xu, Davis Liang, Chenhao Tan
URL: https://arxiv.org/abs/2507.17717
요약 (영문): existing automated metrics often fail to align with real-world physician preferences . to address this, we propose a pipeline that distills user feedback into structured checklists for note evaluation .
요약 (한글): 기존의 자동화된 지표는 실제 의사의 선호도와 일치하지 않는 경우가 많습니다. 이를 해결하기 위해 사용자 피드백을 구조화된 체크리스트로 추출하여 노트 평가를 위한 파이프라인을 제안합니다.

28. CASCADE: LLM-Powered JavaScript Deobfuscator at Google

Authors: Shan Jiang, Pranoy Kovuri, David Tao, Zhixun Tan
URL: https://arxiv.org/abs/2507.17691
요약 (영문): this paper introduces CASCADE, a novel hybrid approach that integrates the advanced coding capabilities of Gemini with the deterministic transformation capabilities of a compiler . by employing Gemini to identify critical prelude functions, the foundatia IR (JSIR) .
요약 (한글): 이 백서에서는 Gemini의 고급 코딩 기능과 컴파일러의 결정론적 변환 기능을 통합하는 새로운 하이브리드 접근 방식인 CASCADE를 소개합니다. 중요한 전주곡 기능인 파운데이션 IR(JSIR)을 식별하는 데 Gemini를 사용함으로써.

29. How Should We Meta-Learn Reinforcement Learning Algorithms?

Authors: Alexander David Goldie, Zilin Wang, Jakob Nicolaus Foerster, Shimon Whiteson
URL: https://arxiv.org/abs/2507.17668
요약 (영문): meta-learning algorithms are often adapted from supervised or unsupervised learning despite their suboptimality for RL . however, until now there has been a severe lack of comparison between different algorithms, such as evolution to optimise performance .
요약 (한글): 메타러닝 알고리즘은 RL에 최적이 아님에도 불구하고 감독 또는 비지도 학습을 적용하는 경우가 많지만, 지금까지는 성능 최적화를 위한 진화와 같은 서로 다른 알고리즘 간의 비교가 심각하게 부족했습니다.

30. Vision Transformer attention alignment with human visual perception in aesthetic object evaluation

Authors: Miguel Carrasco, César González-Martín, José Aranda, Luis Oliveros
URL: https://arxiv.org/abs/2507.17616
요약 (영문): visual attention mechanisms play a crucial role in human perception and aesthetic evaluation . recent advances in vision Transformers (ViTs) have demonstrated remarkable capabilities in computer vision tasks .
요약 (한글): 시각적 주의 메커니즘은 인간의 지각과 미적 평가에 중요한 역할을 합니다. 최근 시각 트랜스포머(ViT)의 발전으로 컴퓨터 비전 작업에서 놀라운 능력을 입증했습니다.

31. PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving

Authors: Maciej K. Wozniak, Lianhang Liu, Yixi Cai, Patric Jensfelt
URL: https://arxiv.org/abs/2507.17596
요약 (영문): our novel and efficient end-to-end driving architecture operates using only camera data, without explicit BEV represe . to address these challenges, we propose PRIX (Plan from Raw Pixels)
요약 (한글): 당사의 새롭고 효율적인 엔드투엔드 주행 아키텍처는 명시적인 BEV 표현 없이 카메라 데이터만을 사용하여 작동합니다. 이러한 문제를 해결하기 위해 PRIX(Plan from Raw Pixels)를 제안합니다.

32. Enhancing Quantum Federated Learning with Fisher Information-Based Optimization

Authors: Amandeep Singh Bhatia, Sabre Kais
URL: https://arxiv.org/abs/2507.17580
요약 (영문): it involves multiple rounds of communication between the global model and participating clients . it introduces several challenges like high communication costs, heterogeneous client data and increased vulnerability to privacy threats .
요약 (한글): 글로벌 모델과 참여 클라이언트 간에 여러 차례의 커뮤니케이션이 필요하며, 높은 통신 비용, 이질적인 클라이언트 데이터, 개인 정보 위협에 대한 취약성 증가 등 여러 가지 문제가 발생합니다.

33. Federated Majorize-Minimization: Beyond Parameter Aggregation

Authors: Aymeric Dieuleveut, Gersende Fort, Mahmoud Hegazy, Hoi-To Wai
URL: https://arxiv.org/abs/2507.17534
요약 (영문): a class of majorize-minimization problems possesses a linearly parameterized family of majorizing surrogate functions . this framework encompasses (proximal) gradient-based algorithms for (regularized) smooth objectives, the Expectation Maximization algorithm, and many variational surrogates .
요약 (한글): 최대화-최소화 문제의 클래스는 선형적으로 매개변수화된 최대화 대리 함수군을 가지고 있습니다. 이 프레임워크는 (정규화된) 평활 목적에 대한 (근위) 기울기 기반 알고리즘, 기대 최대화 알고리즘 및 많은 가변 대리 함수를 포함합니다.

34. Integrating Physics-Based and Data-Driven Approaches for Probabilistic Building Energy Modeling

Authors: Leandro Von Krannichfeldt, Kristina Orehounig, Olga Fink
URL: https://arxiv.org/abs/2507.17526
요약 (영문): building energy modeling is a key tool for optimizing the performance of building energy systems . hybrid approaches combine the strengths of both paradigms . a wide spectrum of methods has been explored .
요약 (한글): 건물 에너지 모델링은 건물 에너지 시스템의 성능을 최적화하는 핵심 도구이며, 하이브리드 접근 방식은 두 패러다임의 강점을 결합하여 다양한 방법을 모색해 왔습니다.

35. Enabling Cyber Security Education through Digital Twins and Generative AI

Authors: Vita Santa Barletta, Vito Bavaro, Miriana Calvano, Antonio Curci, Antonio Piccinno, Davide Pio Posa
URL: https://arxiv.org/abs/2507.17518
요약 (영문): digital twins (DTs) are gaining prominence in cybersecurity for their ability to replicate complex IT (Information Technology), OT (Operational Technology) and IoT (Internet of Things) infrastructures . integrating DTs with penetration testing tools and Large Language Models can enhance cybersecurity education and operational readiness .
요약 (한글): 디지털 트윈(DT)은 복잡한 IT(정보 기술), OT(운영 기술), IoT(사물 인터넷) 인프라를 복제할 수 있는 능력으로 사이버 보안 분야에서 각광받고 있습니다. DT를 모의 침투 테스트 도구 및 대규모 언어 모델과 통합하면 사이버 보안 교육과 운영 준비성을 강화할 수 있습니다.

36. HOTA: Hamiltonian framework for Optimal Transport Advection

Authors: Nazar Buzun, Daniil Shlenskii, Maxim Bobrin, Dmitry V. Dylov
URL: https://arxiv.org/abs/2507.17513
요약 (영문): the majority of recent models assume trivial geometry (e.g., Euclidean) and rely on strong density-estimation assumptions . we present Hamiltonian Optimal Transport Advection (HOTA), a Hamilton-Jacobi-Bellman based method that tackles the dual dynamical OT problem explicitly through Kantorovich potentia .
요약 (한글): 최근 대부분의 모델은 사소한 기하학(예: 유클리드)을 가정하고 강력한 밀도 추정 가정에 의존합니다. 본 논문에서는 칸토로비치 포텐시아를 통해 이중 역학적 OT 문제를 명시적으로 다루는 해밀턴-자코비-벨만 기반 방법인 해밀턴 최적 수송 전진(HOTA)을 소개합니다.

37. To Trust or Not to Trust: On Calibration in ML-based Resource Allocation for Wireless Networks

Authors: Rashika Raina, Nidhi Simmons, David E. Simmons, Michel Daoud Yacoub, Trung Q. Duong
URL: https://arxiv.org/abs/2507.17494
요약 (영문): this paper studies the calibration performance of an ML-based outage predictor . we first establish key theoretical properties of this system’s outage probability under perfect calibration .
요약 (한글): 이 백서에서는 ML 기반 정전 예측기의 보정 성능을 연구합니다. 먼저 완벽한 보정 하에서 이 시스템의 정전 확률에 대한 주요 이론적 특성을 확립합니다.

38. Unsupervised anomaly detection using Bayesian flow networks: application to brain FDG PET in the context of Alzheimer’s disease

Authors: Hugues Roy, Reuben Dorent, Ninon Burgos
URL: https://arxiv.org/abs/2507.17486
요약 (영문): unsupervised anomaly detection (UAD) plays a crucial role in neuroimaging . we introduce anoBFN, an extension of BFNs for UAD .
요약 (한글): 신경 영상에서 비지도 이상 탐지(UAD)는 중요한 역할을 합니다. UAD를 위한 BFN의 확장인 anoBFN을 소개합니다.

39. MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs

Authors: Alexander R. Fabbri, Diego Mares, Jorge Flores, Meher Mankikar, Ernesto Hernandez, Dean Lee, Bing Liu, Chen Xing
URL: https://arxiv.org/abs/2507.17476
요약 (영문): the evaluation of such LLMs’ multilingual reasoning capability across diverse languages and cultural contexts remains limited . existing multilingual benchmarks are typically constructed by translating existing English reasoning benchmarks . in this work, we introduce the Multilingual Native Reason .
요약 (한글): 다양한 언어와 문화적 맥락에서 이러한 LLM의 다국어 추론 능력에 대한 평가는 여전히 제한적입니다 . 기존의 다국어 벤치마크는 일반적으로 기존의 영어 추론 벤치마크를 번역하여 구성됩니다 . 이 작업에서는 다국어 네이티브 추론 을 소개합니다 .

40. BGM-HAN: A Hierarchical Attention Network for Accurate and Fair Decision Assessment on Semi-Structured Profiles

Authors: Junhua Liu, Roy Ka-Wei Lee, Kwan Hui Lim
URL: https://arxiv.org/abs/2507.17472
요약 (영문): this work presents a novel approach to enhancing complex decision-making workflows through the integration of hierarchical learning alongside various enhancements . we propose an enhanced Byte-Pair Encoded, Gated Multi-head Hierarchical Attent .
요약 (한글): 이 작업은 다양한 개선 사항과 함께 계층적 학습의 통합을 통해 복잡한 의사 결정 워크플로우를 개선하는 새로운 접근 방식을 제시합니다. 우리는 향상된 바이트 쌍 인코딩, 게이트 멀티 헤드 계층적 주의력을 제안합니다.

41. Demonstration of Efficient Predictive Surrogates for Large-scale Quantum Processors

Authors: Wei-You Liao, Yuxuan Du, Xinbiao Wang, Tian-Ci Tian, Yong Luo, Bo Du, Dacheng Tao, He-Liang Huang
URL: https://arxiv.org/abs/2507.17470
요약 (영문): quantum processors will remain rare for the foreseeable future, limiting their widespread application . the concept of predictive surrogates is designed to emulate the mean-value behavior of a given quantum processor .
요약 (한글): 예측 대리자 개념은 주어진 양자 프로세서의 평균값 동작을 에뮬레이트하도록 설계되었으며, 당분간 양자 프로세서는 희소성이 유지되어 광범위한 적용이 제한될 것 입니다.

42. Probing Vision-Language Understanding through the Visual Entailment Task: promises and pitfalls

Authors: Elena Pitta, Tom Kouwenhoven, Tessa Verhoef
URL: https://arxiv.org/abs/2507.17467
요약 (영문): this study investigates the extent to which the Visual Entailment task serves as a reliable probe of vision-language understanding in multimodal language models . we conduct a series of experiments across zero-shot, few-shot and fine-tuning settings .
요약 (한글): 이 연구는 시각적 수반 과제가 다중 모드 언어 모델에서 시각-언어 이해의 신뢰할 수 있는 프로브 역할을 하는 정도를 조사합니다. 우리는 제로 샷, 소수 샷 및 미세 조정 설정에서 일련의 실험을 수행합니다.

43. Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning

Authors: Situo Zhang, Hanqi Li, Lu Chen, Zihan Zhao, Xuanze Lin, Zichen Zhu, Bo Chen, Xin Chen, Kai Yu
URL: https://arxiv.org/abs/2507.17448
요약 (영문): traditional graph-based and sequence-to-sequence models often lack generalized chemical knowledge, leading to predictions that are neither consistently accurate nor easily explainable . to address these challenges, we introduce retroDFM-R, a reasoning-based large language model .
요약 (한글): 기존의 그래프 기반 및 시퀀스 간 모델에는 일반화된 화학 지식이 부족하여 일관되게 정확하지 않거나 쉽게 설명할 수 없는 예측을 초래하는 경우가 많습니다. 이러한 문제를 해결하기 위해 추론 기반 대규모 언어 모델인 retroDFM-R을 소개합니다.

44. IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception

Authors: Haichuan Li, Changda Tian, Panos Trahanias, Tomi Westerlund
URL: https://arxiv.org/abs/2507.17445
요약 (영문): indoorBEV is a novel mask-based Bird’s-Eye View (BEV) method for indoor mobile robots . a 3D scene is projected into a 2D BEV grid which handles naturally occlusions and provides a consisted scene .
요약 (한글): 실내BEV는 실내 이동 로봇을 위한 새로운 마스크 기반 조감도(BEV) 방식으로, 3D 장면을 2D BEV 그리드에 투사하여 자연스럽게 오클루전을 처리하고 일관된 장면을 제공합니다.

45. Each to Their Own: Exploring the Optimal Embedding in RAG

Authors: Shiting Chen, Zijian Zhao, Jinsong Chen
URL: https://arxiv.org/abs/2507.17442
요약 (영문): the methods for incorporating up-to-date information into LLMs or adding external knowledge to construct domain-specific models have garnered wide attention . the variant embedding models used in RAG exhibit heterogeneous training data and model architecture .
요약 (한글): 최신 정보를 LLM에 통합하거나 외부 지식을 추가하여 도메인별 모델을 구축하는 방법이 많은 관심을 받고 있으며, RAG에 사용되는 변형 임베딩 모델은 이질적인 학습 데이터와 모델 아키텍처를 보여줍니다.

46. Fair Compromises in Participatory Budgeting: a Multi-Agent Deep Reinforcement Learning Approach

Authors: Hugh Adams, Srijoni Majumdar, Evangelos Pournaras
URL: https://arxiv.org/abs/2507.17433
요약 (영문): participation budgeting is a method of collectively understanding and addressing spending priorities where citizens vote on how a budget is spent . a multi-agent reinforcement learning approach can make decision making easier for voters by identifying voting strategies that increase the winning proporti .
요약 (한글): 참여 예산은 시민들이 예산의 지출 방법에 대해 투표하는 지출 우선 순위를 집단적으로 이해하고 해결하는 방법입니다. 다중 에이전트 강화 학습 접근 방식은 승리 비율을 높이는 투표 전략을 식별하여 유권자의 의사 결정을 더 쉽게 만들 수 있습니다.

47. Content-based 3D Image Retrieval and a ColBERT-inspired Re-ranking for Tumor Flagging and Staging

Authors: Farnaz Khun Jush, Steffen Vogler, Matthias Lenga
URL: https://arxiv.org/abs/2507.17412
요약 (영문): the increasing volume of medical images poses challenges for radiologists in retrieving relevant cases . content-based image retrieval systems offer potential for efficient access to similar cases, yet lack standardized evaluation and comprehensive studies .
요약 (한글): 의료 이미지의 양이 증가함에 따라 방사선 전문의는 관련 사례를 검색하는 데 어려움을 겪고 있습니다. 콘텐츠 기반 이미지 검색 시스템은 유사한 사례에 효율적으로 액세스할 수 있는 잠재력을 제공하지만 표준화된 평가 및 종합적인 연구가 부족합니다.

48. Millions of $\text{GeAR}$-s: Extending GraphRAG to Millions of Documents

Authors: Zhili Shen, Chenxin Diao, Pascual Merita, Pavlos Vougiouklis, Jeff Z. Pan
URL: https://arxiv.org/abs/2507.17399
요약 (영문): recent studies have explored graph-based approaches to retrieval-augmented generation . this paper aims to adapt a state-of-the-art approach across broader datasets .
요약 (한글): 최근 연구에서는 검색 증강 생성에 대한 그래프 기반 접근 방식을 탐구했습니다. 이 백서에서는 더 광범위한 데이터 세트에 최신 접근 방식을 적용하는 것을 목표로 합니다.

49. HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs

Authors: Zhaolin Cai, Fan Li, Ziwei Zheng, Yanjun Qin
URL: https://arxiv.org/abs/2507.17394
요약 (영문): video Anomaly Detection (VAD) aims to identify and locate deviations from normal patterns in video sequences . traditional methods often struggle with substantial computational demands and a reliance on extensive labeled datasets, thereby restricting their practical applicability .
요약 (한글): 비디오 이상 탐지(VAD)는 비디오 시퀀스에서 정상 패턴에서 벗어난 부분을 식별하고 위치를 찾는 것을 목표로 합니다. 기존 방법은 상당한 계산 요구와 광범위한 레이블이 지정된 데이터 세트에 의존하는 경우가 많아 실제 적용이 제한되는 경우가 많습니다.

50. Investigating Training Data Detection in AI Coders

Authors: Tianlin Li, Yunxiang Wei, Zhiming Li, Aishan Liu, Qing Guo, Xianglong Liu, Dongning Sun, Yang Liu
URL: https://arxiv.org/abs/2507.17389
요약 (영문): recent advances in code large language models (CodeLLMs) have made them indispensable tools in modern software engineering . however, these models occasionally produce outputs that contain proprietary or sensitive code snippets . training data detection (TDD) has become a critical task .
요약 (한글): 최근 코드 대용량 언어 모델(CodeLLM)의 발전으로 현대 소프트웨어 엔지니어링에서 없어서는 안 될 도구가 되었지만, 이러한 모델은 때때로 독점적이거나 민감한 코드 스니펫이 포함된 출력을 생성하며, 학습 데이터 탐지(TDD)는 중요한 작업이 되었습니다.

51. SFUOD: Source-Free Unknown Object Detection

Authors: Keon-Hee Park, Seun-An Choe, Gyeong-Moon Park
URL: https://arxiv.org/abs/2507.17373
요약 (영문): source-free object detection adapts a detector pre-trained on a source domain to an unlabeled target domain without requiring access to labeled source data . this setting prevents the detector from detecting undefined objects from the source domain .
요약 (한글): 소스 없는 개체 감지는 레이블이 지정된 소스 데이터에 액세스할 필요 없이 소스 도메인에서 사전 학습된 탐지기를 레이블이 지정되지 않은 대상 도메인에 맞게 조정합니다. 이 설정은 탐지기가 소스 도메인에서 정의되지 않은 개체를 탐지하지 못하도록 합니다.

52. DynaSearcher: Dynamic Knowledge Graph Augmented Search Agent via Multi-Reward Reinforcement Learning

Authors: Chuzhan Hao, Wenfeng Feng, Yuewei Zhang, Hao Wang
URL: https://arxiv.org/abs/2507.17365
요약 (영문): multi-step agentic retrieval systems based on large language models (LLMs) have demonstrated remarkable performance in complex information search tasks . however, these systems still face significant challenges in practical applications, particularly in generating factually inconsistent intermediate queries .
요약 (한글): 대규모 언어 모델(LLM)을 기반으로 하는 다단계 에이전트 검색 시스템은 복잡한 정보 검색 작업에서 놀라운 성능을 보여 왔지만, 이러한 시스템은 실제 적용에서 특히 사실과 일치하지 않는 중간 쿼리를 생성하는 데 있어 여전히 상당한 어려움에 직면해 있습니다.

53. Swin-TUNA : A Novel PEFT Approach for Accurate Food Image Segmentation

Authors: Haotian Chen, Zhiyong Xiao
URL: https://arxiv.org/abs/2507.17347
요약 (영문): existing large-scale Transformer-based models face challenges due to their massive parameter counts and high computational resource demands . this paper introduces TUNable Adapter module (Swin-TUNA) that integrates trainable adapters into the Swin Transformer .
요약 (한글): 기존의 대규모 트랜스포머 기반 모델은 방대한 파라미터 수와 높은 연산 자원 요구로 인해 어려움을 겪고 있습니다. 본 백서에서는 트레이너블 어댑터를 스윈 트랜스포머에 통합한 튜너블 어댑터 모듈(Swin-TUNA)을 소개합니다.

54. Temporal Point-Supervised Signal Reconstruction: A Human-Annotation-Free Framework for Weak Moving Target Detection

Authors: Weihua Gao, Chunxu Ren, Wenlong Niu, Xiaodong Peng
URL: https://arxiv.org/abs/2507.17334
요약 (영문): detecting weak moving targets remains a challenge due to low signal energy, small spatial extent, and complex background clutter . existing methods struggle with extracting robust features and suffer from the lack of reliable annotations .
요약 (한글): 낮은 신호 에너지, 작은 공간 범위, 복잡한 배경 혼잡으로 인해 움직이는 약한 표적을 탐지하는 것은 여전히 어려운 과제입니다. 기존 방법들은 강력한 특징을 추출하는 데 어려움을 겪고 있으며 신뢰할 수 있는 주석이 부족합니다.

55. EarthLink: Interpreting Climate Signals with Self-Evolving AI Agents

Authors: Zijie Guo, Jiong Wang, Xiaoyu Yue, Wangxu Wei, Zhe Jiang, Wanghan Xu, Ben Fei, Wenlong Zhang, Xinyu Gu, Lijing Cheng, Jing-Jia Luo, Chao Li, Yaqiang Wang, Tao Chen, Wanli Ouyang, Fenghua Ling, Lei Bai
URL: https://arxiv.org/abs/2507.17311
요약 (영문): EarthLink is the first AI agent designed as an interactive copilot for Earth scientists . it automates the end-to-end research workflow, from planning and code generation to multi-scenario analysis .
요약 (한글): EarthLink는 지구 과학자를 위한 대화형 부조종사로 설계된 최초의 AI 에이전트로, 계획 및 코드 생성부터 다중 시나리오 분석에 이르기까지 엔드투엔드 연구 워크플로우를 자동화합니다.

56. Confounded Causal Imitation Learning with Instrumental Variables

Authors: Yan Zeng, Shenglan Nie, Feng Xie, Libo Huang, Peng Wu, Zhi Geng
URL: https://arxiv.org/abs/2507.17309
요약 (영문): we propose a Confounded Causal Imitation Learning model . this model accommodates confounders that influence actions across multiple timesteps . a biased estimation of the policy would be entailed .
요약 (한글): 이 모델은 여러 시간대에 걸쳐 행동에 영향을 미치는 교란 요인을 수용하며, 정책에 대한 편향된 추정이 수반될 수 있습니다.

57. A Versatile Pathology Co-pilot via Reasoning Enhanced Multimodal Large Language Model

Authors: Zhe Xu, Ziyi Liu, Junlin Hou, Jiabo Ma, Cheng Jin, Yihui Wang, Zhixuan Chen, Zhengyu Zhang, Zhengrui Guo, Fengtao Zhou, Yingxue Xu, Xi Wang, Ronald Cheong Kin Chan, Li Liang, Hao Chen
URL: https://arxiv.org/abs/2507.17303
요약 (영문): multimodal large language models have emerged as powerful tools for computational pathology . they offer unprecedented opportunities to integrate pathological images with language context for comprehensive diagnostic analysis . current MLLM approaches in pathology demonstrate significantly constrained reasoning capabilities .
요약 (한글): 다중 모드 대규모 언어 모델은 전산 병리학을 위한 강력한 도구로 부상했습니다. 병리학 이미지를 언어 컨텍스트와 통합하여 포괄적인 진단 분석을 할 수 있는 전례 없는 기회를 제공합니다. 병리학의 현재 MLLM 접근 방식은 상당히 제한된 추론 능력을 보여줍니다.

Authors: Tobias Morocutti, Jonathan Greif, Paul Primus, Florian Schmid, Gerhard Widmer
URL: https://arxiv.org/abs/2507.17297
요약 (영문): spatial semantic segmentation of sound scenes (S5) involves the precise identification of active sound classes and the precise separation of their sources from complex acoustic mixtures . conventional systems rely on a two-stage pipeline . but are often constrained by the absence of fine-grained temporal information critical for effective separation .
요약 (한글): 사운드 장면의 공간 시맨틱 분할(S5)에는 활성 사운드 클래스를 정확하게 식별하고 복잡한 음향 혼합물에서 해당 소스를 정확하게 분리하는 작업이 포함되는데, 기존 시스템은 2단계 파이프라인에 의존하지만 효과적인 분리에 중요한 세분화된 시간 정보가 없다는 제약이 있습니다.

59. Integrating Belief Domains into Probabilistic Logic Programs

Authors: Damiano Azzolini, Fabrizio Riguzzi, Theresa Swift
URL: https://arxiv.org/abs/2507.17291
요약 (영문): distribution semantics is a leading approach to practical reasoning under uncertainty . current formulations use point-probabilities, making it difficult to express epistemic uncertainty, such as arises from hiera .
요약 (한글): 분포 의미론은 불확실성 하에서 실용적인 추론에 대한 선도적인 접근 방식입니다. 현재 공식은 점 확률을 사용하므로 히에라에서 발생하는 것과 같은 인식론적 불확실성을 표현하기 어렵습니다.

60. Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance

Authors: Rishi Parekh, Saisubramaniam Gopalakrishnan, Zishan Ahmad, Anirudh Deodhar
URL: https://arxiv.org/abs/2507.17273
요약 (영문): our framework integrates Knowledge Graphs and Large Language Model (LLM)-based agents to analyze complex DES output data from warehouse operations . it transforms raw DES data into a semantically rich KG, capturing relatio .
요약 (한글): 우리의 프레임워크는 지식 그래프와 LLM(대규모 언어 모델) 기반 에이전트를 통합하여 웨어하우스 작업의 복잡한 DES 출력 데이터를 분석하고, 원시 DES 데이터를 의미적으로 풍부한 KG로 변환하여 관계성을 포착합니다.

61. Understanding Prompt Programming Tasks and Questions

Authors: Jenny T. Liang, Chenyang Yang, Agnia Sergeyuk, Travis D. Breaux, Brad A. Myers
URL: https://arxiv.org/abs/2507.17264
요약 (영문): developers are embedding prompts in software known as prompt programs . prompt programming requires the developer to make many changes to their prompt . the questions developers ask to update their prompt are unknown .
요약 (한글): 개발자는 프롬프트 프로그램으로 알려진 소프트웨어에 프롬프트를 내장하고 있습니다. 프롬프트 프로그래밍은 개발자가 프롬프트를 많이 변경해야 합니다. 개발자가 프롬프트를 업데이트하기 위해 묻는 질문은 알 수 없습니다.

62. Reality Proxy: Fluid Interactions with Real-World Objects in MR via Abstract Representations

Authors: Xiaoan Liu, Difan Jia, Xianhao Carton Liu, Mar Gonzalez-Franco, Chen Zhu-Tian
URL: https://arxiv.org/abs/2507.17248
요약 (영문): interacting with real-world objects often proves difficult when they are crowded, distant, or partially occluded . we observe that these difficulties stem from performing interaction directly on physical objects, where input is tightly coupled to their physical constraints .
요약 (한글): 실제 오브젝트가 붐비거나 멀리 있거나 부분적으로 가려져 있을 때 상호작용이 어려운 경우가 많습니다. 이러한 어려움은 입력이 물리적 제약과 밀접하게 연결된 물리적 오브젝트에 직접 상호작용을 수행하기 때문에 발생하는 것으로 관찰됩니다.

63. DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs

Authors: Haolin Jin, Mengbai Xiao, Yuan Yuan, Xiao Zhang, Dongxiao Yu, Guanghui Zhang, Haoliang Wang
URL: https://arxiv.org/abs/2507.17245
요약 (영문): the Transformer architecture has revolutionized deep learning, delivering the state-of-the-art performance in areas such as natural language processing, computer vision, and time series prediction . but its core component, self-attention, has the quadratic time complexity relative to input sequence length, which hinders the scalability of Transformers .
요약 (한글): 트랜스포머 아키텍처는 자연어 처리, 컴퓨터 비전, 시계열 예측과 같은 분야에서 최첨단 성능을 제공하며 딥 러닝에 혁신을 가져왔지만, 핵심 구성 요소인 자기 주의는 입력 시퀀스 길이에 비해 이차적 시간 복잡성을 가지고 있어 트랜스포머의 확장성을 저해합니다.

64. Eco-Friendly AI: Unleashing Data Power for Green Federated Learning

Authors: Mattia Sabella, Monica Vitali
URL: https://arxiv.org/abs/2507.17241
요약 (영문): the widespread adoption of artificial intelligence and machine learning comes with a significant environmental impact . this pressing issue highlights the need for innovative solutions to mitigate AI’s ecological footprint .
요약 (한글): 인공지능과 머신러닝의 광범위한 채택은 환경에 미치는 영향이 매우 크며, 이 시급한 문제는 인공지능의 생태 발자국을 완화하기 위한 혁신적인 솔루션의 필요성을 강조합니다.

65. A Highly Clean Recipe Dataset with Ingredient States Annotation for State Probing Task

Authors: Mashiro Toyooka, Kiyoharu Aizawa, Yoko Yamakata
URL: https://arxiv.org/abs/2507.17232
요약 (영문): large language models (LLMs) are trained on a vast amount of procedural texts . but they do not directly observe real-world phenomena . this poses a challenge, as intermediate states of ingredients are often omitted .
요약 (한글): 대규모 언어 모델(LLM)은 방대한 양의 절차적 텍스트로 학습되지만 실제 현상을 직접 관찰하지는 못합니다. 이는 재료의 중간 상태가 생략되는 경우가 많기 때문에 문제가 됩니다.

66. P3SL: Personalized Privacy-Preserving Split Learning on Heterogeneous Edge Devices

Authors: Wei Fan, JinYi Yoon, Xiaochang Li, Huajie Shao, Bo Ji
URL: https://arxiv.org/abs/2507.17228
요약 (영문): SL enables resource constrained edge devices to participate in model training by partitioning a model into client-side and server-side sub-models . SL reduces computational overhead on edge devices, but encounters significant challenges in heterogeneous environments .
요약 (한글): SL은 모델을 클라이언트 측 및 서버 측 하위 모델로 분할하여 리소스가 제한된 엣지 디바이스가 모델 학습에 참여할 수 있도록 합니다. SL은 엣지 디바이스의 컴퓨팅 오버헤드를 줄여주지만 이기종 환경에서는 상당한 문제가 발생합니다.

67. HuiduRep: A Robust Self-Supervised Framework for Learning Neural Representations from Extracellular Spikes

Authors: Feng Cao, Zishuo Feng
URL: https://arxiv.org/abs/2507.17224
요약 (영문): extracellular recordings are brief voltage fluctuations recorded near neurons . this is widely used in neuroscience as the basis for decoding brain activity at single-neuron resolution . but it remains challenging under low signal-to-noise ratio (SNR), electrode drift and cross-session variability .
요약 (한글): 세포 외 기록은 뉴런 근처에서 기록된 짧은 전압 변동으로, 신경과학에서 단일 뉴런 해상도로 뇌 활동을 해독하는 기초로 널리 사용되지만 낮은 신호 대 잡음비(SNR), 전극 드리프트 및 세션 간 변동성에서는 여전히 어려운 작업입니다.

68. The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models

Authors: Giuseppe Russo, Debora Nozza, Paul Röttger, Dirk Hovy
URL: https://arxiv.org/abs/2507.17216
요약 (영문): a benchmark of 1,618 real-world moral dilemmas paired with a distribution of human moral judgments consisting of a binary evaluation and a free-text rationale . we treat this problem as a pluralistic distributional alignment task .
요약 (한글): 1,618개의 실제 도덕적 딜레마에 대한 벤치마크와 이분법적 평가와 자유 텍스트 근거로 구성된 인간의 도덕적 판단 분포를 결합하여 이 문제를 다원적 분포 정렬 과제로 처리합니다.

69. DesignLab: Designing Slides Through Iterative Detection and Correction

Authors: Jooyeol Yun, Heng Wang, Yotaro Shimose, Jaegul Choo, Shingo Takamatsu
URL: https://arxiv.org/abs/2507.17202
요약 (영문): design-related issues can be challenging for non-experts due to the complexity involved in navigating various design choices . design designers often lack the ability to refine their output, which is key aspect in real-world workflows .
요약 (한글): 디자인 관련 문제는 다양한 디자인 선택과 관련된 복잡성으로 인해 비전문가에게는 어려울 수 있습니다. 디자인 디자이너는 종종 실제 워크플로우의 핵심 요소인 결과물을 다듬을 수 있는 능력이 부족합니다.

70. Dispatch-Aware Deep Neural Network for Optimal Transmission Switching: Toward Real-Time and Feasibility Guaranteed Operation

Authors: Minsoo Kim, Jip Kim
URL: https://arxiv.org/abs/2507.17194
요약 (영문): Optimal transmission switching (OTS) improves optimal power flow (OPF) by selectively opening transmission lines . but its mixed-integer formulation increases computational complexity . we propose a dispatch-aware deep neural network that accelerates DC-OTS .
요약 (한글): 최적 전송 스위칭(OTS)은 송전선을 선택적으로 개방하여 최적 전력 흐름(OPF)을 개선하지만 혼합 정수 공식으로 인해 계산 복잡성이 증가합니다. 우리는 DC-OTS를 가속화하는 파견 인식 심층 신경망을 제안합니다.

71. LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks

Authors: Lijie Zheng, Ji He, Shih Yu Chang, Yulong Shen, Dusit Niyato
URL: https://arxiv.org/abs/2507.17188
요약 (영문): this work tackles the physical layer security problem of maximizing secrecy rate in heterogeneous UAV networks . we consider a realistic scenario where UAVs with diverse payloads and computation resources collaborate to serve ground terminals in presence of eavesdroppers .
요약 (한글): 본 연구는 이기종 무인항공기 네트워크에서 기밀성을 극대화하는 물리 계층 보안 문제를 다루며, 다양한 페이로드와 연산 자원을 가진 무인항공기가 도청자가 있는 상황에서 지상 단말기에 서비스를 제공하기 위해 협업하는 현실적인 시나리오를 고려합니다.

72. Asymmetric Lesion Detection with Geometric Patterns and CNN-SVM Classification

Authors: M. A. Rasel, Sameem Abdul Kareem, Zhenli Kwan, Nik Aimee Azizah Faheem, Winn Hui Han, Rebecca Kai Jan Choong, Shin Shen Yong, Unaizah Obaidellah
URL: https://arxiv.org/abs/2507.17185
요약 (영문): dermoscopic images allow visualization of surface skin structures not visible to the naked eye . asymmetric lesion shape is one of the criteria for diagnosing melanoma .
요약 (한글): 피부확대경 영상은 육안으로 볼 수 없는 표면 피부 구조를 시각화할 수 있으며 비대칭 병변 모양은 흑색종 진단의 기준 중 하나입니다 .

73. Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria

Authors: Die Hu, Shuyue Hu, Chunjiang Mu, Shiqi Fan, Chen Chu, Jinzhuo Liu, Zhen Wang
URL: https://arxiv.org/abs/2507.17183
요약 (영문): this paper examines the role of heterogeneity in equilibrium formation . it examines how smooth regret-matching drives a large number of agents with diverse initial policies toward unified behavior.
요약 (한글): 이 논문에서는 균형 형성에서 이질성의 역할을 살펴보고, 원활한 후회 매칭이 다양한 초기 정책을 가진 수많은 에이전트를 어떻게 통일된 행동으로 유도하는지 살펴봅니다.

74. SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs

Authors: Zhiqiang Liu, Enpei Niu, Yin Hua, Mengshu Sun, Lei Liang, Huajun Chen, Wen Zhang
URL: https://arxiv.org/abs/2507.17178
요약 (영문): large language models have made significant progress in understanding Structured Knowledge (SK) like KG and Table . existing evaluations for SK understanding are non-rigorous and focus on a single type of SK .
요약 (한글): 대규모 언어 모델은 KG 및 Table과 같은 구조화된 지식(SK)을 이해하는 데 상당한 진전을 이루었습니다. 기존 SK 이해도 평가는 엄격하지 않고 단일 유형의 SK에 초점을 맞추고 있습니다.

75. Tabular Diffusion based Actionable Counterfactual Explanations for Network Intrusion Detection

Authors: Vinura Galwaduge, Jagath Samarabandu
URL: https://arxiv.org/abs/2507.17161
요약 (영문): the “black-box” nature of such deep learning methods adds a layer of opaqueness . the majority of the existing NIDS met with XAI .
요약 (한글): 이러한 딥러닝 방법의 ‘블랙박스’ 특성은 불투명성을 더합니다. 기존 NIDS의 대부분은 XAI 를 만났습니다.

76. JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction

Authors: Fangze Lin, Ying He, Fei Yu, Hong Zhang
URL: https://arxiv.org/abs/2507.17152
요약 (영문): we propose a two-stage multi-agent joint prediction framework . the first stage is modeled as a marginal prediction process .
요약 (한글): 우리는 2단계 다중 에이전트 공동 예측 프레임워크를 제안합니다. 첫 번째 단계는 한계 예측 프로세스로 모델링됩니다.

77. ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

Authors: Bo Fang, Jianan Fan, Dongnan Liu, Hang Chang, Gerald J.Shami, Filip Braet, Weidong Cai
URL: https://arxiv.org/abs/2507.17149
요약 (영문): morphological and distributional variability among subcellular components poses a long-standing challenge for learning-based organelle segmentation models . existing methods often rely on single mapping relationships, overlooking feature diversity and inducing biased training .
요약 (한글): 하위 세포 구성 요소 간의 형태학적 및 분포적 가변성은 학습 기반 소기관 세분화 모델에 오랜 과제를 제기합니다. 기존 방법은 종종 단일 매핑 관계에 의존하여 특징 다양성을 간과하고 편향된 학습을 유도합니다.

78. Towards Human-level Intelligence via Human-like Whole-Body Manipulation

Authors: Guang Gao, Jianan Wang, Jinbo Zuo, Junnan Jiang, Jingfan Zhang, Xianwen Zeng, Yuejiang Zhu, Lianyang Ma, Ke Chen, Minhua Sheng, Ruirui Zhang, Zhaohui An
URL: https://arxiv.org/abs/2507.17141
요약 (영문): a promising approach is to mirror the evolutionary trajectory of humans . designing safe robotic hardware with human-level physical capabilities . developing an intuitive and scalable whole-body teleoperation interface for data collection .
요약 (한글): 인간 수준의 물리적 기능을 갖춘 안전한 로봇 하드웨어 설계, 데이터 수집을 위한 직관적이고 확장 가능한 전신 원격 조작 인터페이스 개발, 인간의 진화 궤적을 반영하는 것이 유망한 접근 방식입니다.

79. SADA: Stability-guided Adaptive Diffusion Acceleration

Authors: Ting Jiang, Yixiao Wang, Hancheng Ye, Zishan Shao, Jingwei Sun, Jingyang Zhang, Zekai Chen, Jianyi Zhang, Yiran Chen, Hai Li
URL: https://arxiv.org/abs/2507.17135
요약 (영문): fidelity gap arises because different prompts correspond to varying denoising trajectory . fidelity gaps arise due to iterative sampling process and quadratic attention costs . models have achieved remarkable success in generative tasks .
요약 (한글): 다양한 프롬프트가 다양한 노이즈 제거 궤적에 대응하기 때문에 충실도 격차가 발생합니다. 반복 샘플링 프로세스와 이차 주의 비용으로 인해 충실도 격차가 발생합니다. 모델이 생성 작업에서 괄목할 만한 성공을 거두었습니다.

80. Resilient Multi-Agent Negotiation for Medical Supply Chains:Integrating LLMs and Blockchain for Transparent Coordination

Authors: Mariam ALMutairi, Hyungmin Kim
URL: https://arxiv.org/abs/2507.17134
요약 (영문): this paper presents a novel hybrid framework that integrates blockchain technology with a decentralized, large language model (LLM) powered multi-agent negotiation system to enhance the resilience and accountability of medical supply chains during crises .
요약 (한글): 이 백서에서는 블록체인 기술을 탈중앙화된 대규모 언어 모델(LLM) 기반 다중 에이전트 협상 시스템과 통합하여 위기 시 의료 공급망의 복원력과 책임성을 강화하는 새로운 하이브리드 프레임워크를 소개합니다.

81. Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance

Authors: Yufei He, Ruoyu Li, Alex Chen, Yue Liu, Yulin Chen, Yuan Sui, Cheng Chen, Yi Zhu, Luca Luo, Frank Yang, Bryan Hooi
URL: https://arxiv.org/abs/2507.17131
요약 (영문): agents often struggle in environments where rules and required domain knowledge frequently change . current approaches, like offline fine-tuning and standard prompting, are insufficient because they cannot adapt to new knowledge during actual operation .
요약 (한글): 상담원은 규칙과 필요한 도메인 지식이 자주 바뀌는 환경에서 종종 어려움을 겪습니다. 오프라인 미세 조정 및 표준 프롬프트와 같은 현재의 접근 방식은 실제 운영 중에 새로운 지식에 적응할 수 없기 때문에 불충분합니다.

82. BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference Serving

Authors: Wanyi Zheng, Minxian Xu, Shengye Song, Kejiang Ye
URL: https://arxiv.org/abs/2507.17120
요약 (영문): large language models (LLMs) have become increasingly popular in various areas . traditional business gradually shifting from rule-based systems to LLM-based solutions . existing LLM serving systems often use static or continuous batching strategies .
요약 (한글): 다양한 분야에서 대규모 언어 모델(LLM)이 점점 더 대중화되고 있습니다. 전통적인 비즈니스는 점차 규칙 기반 시스템에서 LLM 기반 솔루션으로 전환하고 있습니다. 기존 LLM 서빙 시스템은 종종 정적 또는 연속 배치 전략을 사용합니다.

83. Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models

Authors: Andrii Balashov
URL: https://arxiv.org/abs/2507.17107
요약 (영문): RL fine-tuning consistently modifies only a small subnetwork (typically 5-30% of weights), leaving most parameters unchanged . we call this phenomenon RL-induced parameter update sparsity .
요약 (한글): RL 미세 조정은 지속적으로 작은 하위 네트워크(일반적으로 가중치의 5-30%)만 수정하고 대부분의 파라미터는 변경하지 않습니다. 이 현상을 RL에 의한 파라미터 업데이트 희소성이라고 부릅니다.

84. Weather-Aware AI Systems versus Route-Optimization AI: A Comprehensive Analysis of AI Applications in Transportation Productivity

Authors: Tatsuru Kikuchi
URL: https://arxiv.org/abs/2507.17099
요약 (영문): the study reveals that route-optimization systems improve taxi driver productivity by 14% . we compare their performance against traditional operations and route-only AI approaches .
요약 (한글): 연구에 따르면 경로 최적화 시스템은 택시 기사의 생산성을 14% 향상시키는 것으로 나타났습니다. 기존 운영 방식과 경로 전용 AI 접근 방식과 성능을 비교했습니다.

[arXiv Digest] 2025-07-24

1. Online Submission and Evaluation System Design for Competition Operations

2. Thinking Isn’t an Illusion: Overcoming the Limitations of Reasoning Models via Tool Augmentations

3. Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks

4. Simulating multiple human perspectives in socio-ecological systems using large language models

5. Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning

6. TAI Scan Tool: A RAG-Based Tool With Minimalistic Input for Trustworthy AI Self-Assessment

7. Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

8. Automated Hybrid Grounding Using Structural and Data-Driven Heuristics

9. CQE under Epistemic Dependencies: Algorithms and Experiments (extended version)

10. LTLZinc: a Benchmarking Framework for Continual Learning and Neuro-Symbolic Temporal Reasoning

11. An Uncertainty-Driven Adaptive Self-Alignment Framework for Large Language Models

12. Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning

13. Compliance Brain Assistant: Conversational Agentic AI for Assisting Compliance Tasks in Enterprise Environments

14. Students’ Feedback Requests and Interactions with the SCRIPT Chatbot: Do They Get What They Ask For?

15. Agent Identity Evals: Measuring Agentic Identity

16. Our Cars Can Talk: How IoT Brings AI to Vehicles

17. Improving LLMs’ Generalized Reasoning Abilities by Graph Problems

18. HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study

19. Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility

20. Pretraining on the Test Set Is No Longer All You Need: A Debate-Driven Approach to QA Benchmarks

21. Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

22. Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

23. Yume: An Interactive World Generation Model

24. Flow Matching Meets Biology and Life Science: A Survey

25. On the Interaction of Compressibility and Adversarial Robustness

26. AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer

27. From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes

28. CASCADE: LLM-Powered JavaScript Deobfuscator at Google

29. How Should We Meta-Learn Reinforcement Learning Algorithms?

30. Vision Transformer attention alignment with human visual perception in aesthetic object evaluation

31. PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving

32. Enhancing Quantum Federated Learning with Fisher Information-Based Optimization

33. Federated Majorize-Minimization: Beyond Parameter Aggregation

34. Integrating Physics-Based and Data-Driven Approaches for Probabilistic Building Energy Modeling

35. Enabling Cyber Security Education through Digital Twins and Generative AI

36. HOTA: Hamiltonian framework for Optimal Transport Advection

37. To Trust or Not to Trust: On Calibration in ML-based Resource Allocation for Wireless Networks

38. Unsupervised anomaly detection using Bayesian flow networks: application to brain FDG PET in the context of Alzheimer’s disease

39. MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs

40. BGM-HAN: A Hierarchical Attention Network for Accurate and Fair Decision Assessment on Semi-Structured Profiles

41. Demonstration of Efficient Predictive Surrogates for Large-scale Quantum Processors

42. Probing Vision-Language Understanding through the Visual Entailment Task: promises and pitfalls

43. Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning

44. IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception

45. Each to Their Own: Exploring the Optimal Embedding in RAG

46. Fair Compromises in Participatory Budgeting: a Multi-Agent Deep Reinforcement Learning Approach

47. Content-based 3D Image Retrieval and a ColBERT-inspired Re-ranking for Tumor Flagging and Staging

48. Millions of $\text{GeAR}$-s: Extending GraphRAG to Millions of Documents

49. HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs

50. Investigating Training Data Detection in AI Coders

51. SFUOD: Source-Free Unknown Object Detection

52. DynaSearcher: Dynamic Knowledge Graph Augmented Search Agent via Multi-Reward Reinforcement Learning

53. Swin-TUNA : A Novel PEFT Approach for Accurate Food Image Segmentation

54. Temporal Point-Supervised Signal Reconstruction: A Human-Annotation-Free Framework for Weak Moving Target Detection

55. EarthLink: Interpreting Climate Signals with Self-Evolving AI Agents

56. Confounded Causal Imitation Learning with Instrumental Variables

57. A Versatile Pathology Co-pilot via Reasoning Enhanced Multimodal Large Language Model

58. On Temporal Guidance and Iterative Refinement in Audio Source Separation

59. Integrating Belief Domains into Probabilistic Logic Programs

60. Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance

61. Understanding Prompt Programming Tasks and Questions

62. Reality Proxy: Fluid Interactions with Real-World Objects in MR via Abstract Representations

63. DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs

64. Eco-Friendly AI: Unleashing Data Power for Green Federated Learning

65. A Highly Clean Recipe Dataset with Ingredient States Annotation for State Probing Task

66. P3SL: Personalized Privacy-Preserving Split Learning on Heterogeneous Edge Devices

67. HuiduRep: A Robust Self-Supervised Framework for Learning Neural Representations from Extracellular Spikes

68. The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models

69. DesignLab: Designing Slides Through Iterative Detection and Correction

70. Dispatch-Aware Deep Neural Network for Optimal Transmission Switching: Toward Real-Time and Feasibility Guaranteed Operation

71. LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks

72. Asymmetric Lesion Detection with Geometric Patterns and CNN-SVM Classification

73. Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria

74. SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs

75. Tabular Diffusion based Actionable Counterfactual Explanations for Network Intrusion Detection

76. JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction

77. ScSAM: Debiasing Morphology and Distributional Variability in Subcellular Semantic Segmentation

78. Towards Human-level Intelligence via Human-like Whole-Body Manipulation

79. SADA: Stability-guided Adaptive Diffusion Acceleration