Personalized Daily Arxiv Papers 03/19/2025

[gpt-4o]	Prompt	Completion	Total
Token	39134	5372	44506
Cost	$0.09	$0.05	$0.15

Total arXiv papers: 559

Total scanned papers: 325

Total relevant papers: 29

Table of contents with paper titles:

Improved Scalable Lipschitz Bounds for Deep Neural Networks Authors: Usman Syed, Bin Hu
Higher-Order Graphon Neural Networks: Approximation and Cut Distance Authors: Daniel Herbst, Stefanie Jegelka
ROCK: A variational formulation for occupation kernel methods in Reproducing Kernel Hilbert Spaces Authors: Victor Rielly, Kamel Lahouel, Chau Nguyen, Bruno Jedynak
RWKV-7 "Goose" with Expressive Dynamic State Evolution Authors: Bo Peng, Ruichong Zhang, Daniel Goldstein, Eric Alcaide, Haowen Hou, Janna Lu, William Merrill, Guangyu Song, Kaifeng Tan, Saiteja Utpala, Nathan Wilce, Johan S. Wind, Tianyi Wu, Daniel Wuttke, Christian Zhou-Zheng
Cognitive Activation and Chaotic Dynamics in Large Language Models: A Quasi-Lyapunov Analysis of Reasoning Mechanisms Authors: Xiaojian Li, Yongkang Leng, Ruiqing Ding, Hangjie Mo, Shanlin Yang
Frac-Connections: Fractional Extension of Hyper-Connections Authors: Defa Zhu, Hongzhi Huang, Jundong Zhou, Zihao Huang, Yutao Zeng, Banggu Wu, Qiyang Min, Xun Zhou
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers Authors: Minglei Shi, Ziyang Yuan, Haotian Yang, Xintao Wang, Mingwu Zheng, Xin Tao, Wenliang Zhao, Wenzhao Zheng, Jie Zhou, Jiwen Lu, Pengfei Wan, Di Zhang, Kun Gai
Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels Authors: Maximilian Beck, Korbinian Pöppel, Phillip Lippe, Sepp Hochreiter
Learning on LLM Output Signatures for gray-box LLM Behavior Analysis Authors: Guy Bar-Shalom, Fabrizio Frasca, Derek Lim, Yoav Gelberg, Yftah Ziser, Ran El-Yaniv, Gal Chechik, Haggai Maron
Landscape Complexity for the Empirical Risk of Generalized Linear Models: Discrimination between Structured Data Authors: Theodoros G. Tsironis, Aris L. Moustakas
Fundamental Limits of Matrix Sensing: Exact Asymptotics, Universality, and Applications Authors: Yizhou Xu, Antoine Maillard, Lenka Zdeborová, Florent Krzakala
Revealing higher-order neural representations with generative artificial intelligence Authors: Hojjat Azimi Asrari, Megan A. K. Peters
Fuzzy Rule-based Differentiable Representation Learning Authors: Wei Zhang, Zhaohong Deng, Guanjin Wang, Kup-Sze Choi
Ensemble Knowledge Distillation for Machine Learning Interatomic Potentials Authors: Sakib Matin, Emily Shinkle, Yulia Pimonova, Galen T. Craven, Ying Wai Li, Kipton Barros, Nicholas Lubbers
Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model Authors: Kai Tong, Kang Pan, Xiao Zhang, Erli Meng, Run He, Yawen Cui, Nuoyan Guo, Huiping Zhuang
From Demonstrations to Rewards: Alignment Without Explicit Human Preferences Authors: Siliang Zeng, Yao Liu, Huzefa Rangwala, George Karypis, Mingyi Hong, Rasool Fakoor
PENCIL: Long Thoughts with Short Memory Authors: Chenxiao Yang, Nathan Srebro, David McAllester, Zhiyuan Li
Learning local neighborhoods of non-Gaussian graphical models: A measure transport approach Authors: Sarah Liaw, Rebecca Morrison, Youssef Marzouk, Ricardo Baptista
FeNeC: Enhancing Continual Learning via Feature Clustering with Neighbor- or Logit-Based Classification Authors: Kamil Książek, Hubert Jastrzębski, Bartosz Trojan, Krzysztof Pniaczek, Michał Karp, Jacek Tabor
Layer-wise Adaptive Gradient Norm Penalizing Method for Efficient and Accurate Deep Learning Authors: Sunwoo Lee
ML-SpecQD: Multi-Level Speculative Decoding with Quantized Drafts Authors: Evangelos Georganas, Dhiraj Kalamkar, Alexander Kozlov, Alexander Heinecke
Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models Authors: Siwei Zhang, Yun Xiong, Yateng Tang, Xi Chen, Zian Jia, Zehao Gu, Jiarong Xu, Jiawei Zhang
Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels Authors: Yujia Tong, Yuze Wang, Jingling Yuan, Chuang Hu
Unified Analysis of Decentralized Gradient Descent: a Contraction Mapping Framework Authors: Erik G. Larsson, Nicolo Michelusi
Positivity sets of hinge functions Authors: Josef Schicho, Ayush Kumar Tewari, Audie Warren
End-to-End Optimal Detector Design with Mutual Information Surrogates Authors: Kinga Anna Wozniak, Stephen Mulligan, Jan Kieseler, Markus Klute, Francois Fleuret, Tobias Golling
Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models Authors: Teng Wang, Zhangyi Jiang, Zhenqi He, Wenhan Yang, Yanan Zheng, Zeyu Li, Zifan He, Shenyang Tong, Hailei Gong
Quantification of Uncertainties in Probabilistic Deep Neural Network by Implementing Boosting of Variational Inference Authors: Pavia Bera, Sanjukta Bhanja
On the clustering behavior of sliding windows Authors: Boris Alexeev, Wenyan Luo, Dustin G. Mixon, Yan X Zhang

1. Improved Scalable Lipschitz Bounds for Deep Neural Networks

ArXiv ID: 2503.14297

Authors: Usman Syed, Bin Hu

Abstract: Computing tight Lipschitz bounds for deep neural networks is crucial for analyzing their robustness and stability, but existing approaches either produce relatively conservative estimates or rely on semidefinite programming (SDP) formulations (namely the LipSDP condition) that face scalability issues. Building upon ECLipsE-Fast, the state-of-the-art Lipschitz bound method that avoids SDP formulations, we derive a new family of improved scalable Lipschitz bounds that can be combined to outperform ECLipsE-Fast. Specifically, we leverage more general parameterizations of feasible points of LipSDP to derive various closed-form Lipschitz bounds, avoiding the use of SDP solvers. In addition, we show that our technique encompasses ECLipsE-Fast as a special case and leads to a much larger class of scalable Lipschitz bounds for deep neural networks. Our empirical study shows that our bounds improve ECLipsE-Fast, further advancing the scalability and precision of Lipschitz estimation for large neural networks.

Comment: The paper introduces improved scalable, closed-form Lipschitz bounds for deep neural networks that include ECLipsE-Fast as a special case. Tighter bounds support robustness and stability analysis, which aligns with representation learning.
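
For readers unfamiliar with the "relatively conservative estimates" mentioned in the abstract, the sketch below shows the textbook baseline: for a feedforward network with 1-Lipschitz activations such as ReLU, the product of per-layer operator-norm bounds is an upper bound on the network's Lipschitz constant. This is not the paper's method and not ECLipsE-Fast. The function names, the bias-free two-layer network, and the example weights are all assumptions made for illustration, and each layer's spectral norm is replaced by the looser but easy-to-compute bound sqrt(||W||_1 * ||W||_inf) so that only the standard library is needed.

```python
import math
import random


def spectral_norm_upper_bound(W):
    """Return sqrt(||W||_1 * ||W||_inf), a standard upper bound on ||W||_2."""
    max_col_sum = max(sum(abs(row[j]) for row in W) for j in range(len(W[0])))
    max_row_sum = max(sum(abs(v) for v in row) for row in W)
    return math.sqrt(max_col_sum * max_row_sum)


def naive_lipschitz_bound(weights):
    """Multiply per-layer norm bounds (valid when activations are 1-Lipschitz)."""
    bound = 1.0
    for W in weights:
        bound *= spectral_norm_upper_bound(W)
    return bound


def forward(weights, x):
    """Bias-free MLP (illustrative assumption): ReLU after every layer but the last."""
    for i, W in enumerate(weights):
        x = [sum(w * v for w, v in zip(row, x)) for row in W]
        if i < len(weights) - 1:
            x = [max(0.0, v) for v in x]
    return x


def l2_norm(u):
    return math.sqrt(sum(v * v for v in u))


def largest_observed_ratio(weights, input_dim, trials=200, seed=0):
    """Largest ||f(x) - f(y)|| / ||x - y|| over random pairs (a lower estimate)."""
    rng = random.Random(seed)
    largest = 0.0
    for _ in range(trials):
        x = [rng.gauss(0.0, 1.0) for _ in range(input_dim)]
        y = [rng.gauss(0.0, 1.0) for _ in range(input_dim)]
        gap = l2_norm([a - b for a, b in zip(x, y)])
        if gap == 0.0:
            continue
        fx, fy = forward(weights, x), forward(weights, y)
        out_gap = l2_norm([a - b for a, b in zip(fx, fy)])
        largest = max(largest, out_gap / gap)
    return largest


# Hypothetical weights, chosen only to make the example concrete.
EXAMPLE_WEIGHTS = [
    [[1.0, -2.0], [0.5, 1.5], [0.0, 1.0]],  # layer 1: 2 inputs -> 3 hidden units
    [[1.0, 0.0, -1.0], [2.0, 1.0, 0.5]],    # layer 2: 3 hidden units -> 2 outputs
]

if __name__ == "__main__":
    print("naive upper bound:", naive_lipschitz_bound(EXAMPLE_WEIGHTS))
    print("largest sampled ratio:", largest_observed_ratio(EXAMPLE_WEIGHTS, 2))
```

On these example weights the naive bound is about 11.9, and every sampled ratio stays below it. The gap between the two numbers illustrates why layer-by-layer products are considered conservative and why tighter closed-form bounds, such as those the abstract describes, are of interest.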