Personalized Daily ArXiv Papers 2025-08-28

[gpt-4o]	Prompt	Completion	Total
Token	27480	2842	30322
Cost	$0.07	$0.03	$0.10

Total arXiv papers: 432

Total scanned papers: 259

Total relevant papers: 19

Table of contents with paper titles:

Model Science: getting serious about verification, explanation and control of AI systems Authors: Przemyslaw Biecek, Wojciech Samek
Data-Efficient Symbolic Regression via Foundation Model Distillation Authors: Wangyang Ying, Jinghan Zhang, Haoyue Bai, Nanxu Gong, Xinyuan Wang, Kunpeng Liu, Chandan K. Reddy, Yanjie Fu
Safety Alignment Should Be Made More Than Just A Few Attention Heads Authors: Chao Huang, Zefeng Zhang, Juewei Yue, Quangang Li, Chuang Zhang, Tingwen Liu
Parameter-Free Structural-Diversity Message Passing for Graph Neural Networks Authors: Mingyue Kong, Yinglong Zhang, Chengda Xu, Xuewen Xia, Xing Xu
UNIFORM: Unifying Knowledge from Large-scale and Diverse Pre-trained Models Authors: Yimu Wang, Weiming Zhuang, Chen Chen, Jiabo Huang, Jingtao Li, Lingjuan Lyu
On Surjectivity of Neural Networks: Can you elicit any behavior from your model? Authors: Haozhe Jiang, Nika Haghtalab
Memorization in Graph Neural Networks Authors: Adarsh Jamadandi, Jing Xu, Adam Dziedzic, Franziska Boenisch
DeepAtlas: a tool for effective manifold learning Authors: Serena Hughes, Timothy Hamilton, Tom Kolokotrones, Eric J. Deeds
MultiPL-MoE: Multi-Programming-Lingual Extension of Large Language Models through Hybrid Mixture-of-Experts Authors: Qing Wang, Xue Han, Jiahui Wang, Lehao Xing, Qian Hu, Lianlian Zhang, Chao Deng, Junlan Feng
CORE: Lossless Compression for Retrieval-Augmented LLMs via Reinforcement Learning Authors: Ziqiang Cui, Yunpeng Weng, Xing Tang, Peiyang Liu, Shiwei Li, Bowei He, Jiamin Chen, Xiuqiang He, Chen Ma
Symplectic convolutional neural networks Authors: Süleyman Yıldız, Konrad Janik, Peter Benner
PSO-Merging: Merging Models Based on Particle Swarm Optimization Authors: Kehao Zhang, Shaolei Zhang, Yang Feng
Quantum-Classical Hybrid Molecular Autoencoder for Advancing Classical Decoding Authors: Afrar Jahin, Yi Pan, Yingfeng Wang, Tianming Liu, Wei Zhang
Tracking World States with Language Models: State-Based Evaluation Using Chess Authors: Romain Harang, Jason Naradowsky, Yaswitha Gujju, Yusuke Miyao
Diffusion Language Models Know the Answer Before Decoding Authors: Pengxiang Li, Yefan Zhou, Dilxat Muhtar, Lu Yin, Shilin Yan, Li Shen, Yi Liang, Soroush Vosoughi, Shiwei Liu
Kolmogorov-Arnold Representation for Symplectic Learning: Advancing Hamiltonian Neural Networks Authors: Zongyu Wu, Ruichen Xu, Luoyao Chen, Georgios Kementzidis, Siyao Wang, Yuefan Deng
Just Because You Can, Doesn't Mean You Should: LLMs for Data Fitting Authors: Hejia Liu, Mochen Yang, Gediminas Adomavicius
Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization Authors: Paimon Goulart, Shaan Pakala, Evangelos Papalexakis
LFD: Layer Fused Decoding to Exploit External Knowledge in Retrieval-Augmented Generation Authors: Yang Sun, Lixin Zou, Dan Luo, Zhiyong Xie, Long Zhang, Liming Dong, Yunwei Zhao, Xixun Lin, Yanxiong Lu, Chenliang Li

1. Model Science: getting serious about verification, explanation and control of AI systems

ArXiv ID: 2508.20040

Authors: Przemyslaw Biecek, Wojciech Samek

Abstract: The growing adoption of foundation models calls for a paradigm shift from Data Science to Model Science. Unlike data-centric approaches, Model Science places the trained model at the core of analysis, aiming to interact, verify, explain, and control its behavior across diverse operational contexts. This paper introduces a conceptual framework for a new discipline called Model Science, along with the proposal for its four key pillars: Verification, which requires strict, context-aware evaluation protocols; Explanation, which is understood as various approaches to exploring internal model operations; Control, which integrates alignment techniques to steer model behavior; and Interface, which develops interactive and visual explanation tools to improve human calibration and decision-making. The proposed framework aims to guide the development of credible, safe, and human-aligned AI systems.

Comment: The paper proposes a new discipline called Model Science, which focuses on verification, explanation, and control of AI systems, aligning with the emerging trends criterion by introducing a broad new paradigm.

Relevance: 9 Novelty: 9

2. Data-Efficient Symbolic Regression via Foundation Model Distillation

ArXiv ID: 2508.19487

Authors: Wangyang Ying, Jinghan Zhang, Haoyue Bai, Nanxu Gong, Xinyuan Wang, Kunpeng Liu, Chandan K. Reddy, Yanjie Fu

Abstract: Discovering interpretable mathematical equations from observed data (a.k.a. equation discovery or symbolic regression) is a cornerstone of scientific discovery, enabling transparent modeling of physical, biological, and economic systems. While foundation models pre-trained on large-scale equation datasets offer a promising starting point, they often suffer from negative transfer and poor generalization when applied to small, domain-specific datasets. In this paper, we introduce EQUATE (Equation Generation via QUality-Aligned Transfer Embeddings), a data-efficient fine-tuning framework that adapts foundation models for symbolic equation discovery in low-data regimes via distillation. EQUATE combines symbolic-numeric alignment with evaluator-guided embedding optimization, enabling a principled embedding-search-generation paradigm. Our approach reformulates discrete equation search as a continuous optimization task in a shared embedding space, guided by data-equation fitness and simplicity. Experiments across three standard public benchmarks (Feynman, Strogatz, and black-box datasets) demonstrate that EQUATE consistently outperforms state-of-the-art baselines in both accuracy and robustness, while preserving low complexity and fast inference. These results highlight EQUATE as a practical and generalizable solution for data-efficient symbolic regression in foundation model distillation settings.

Comment: The paper introduces a novel framework EQUATE for symbolic regression using foundation model distillation, aligning with foundational research in representation learning and model compression.