Personalized Daily ArXiv Papers 2025-04-14

[gpt-4o]	Prompt	Completion	Total
Token	27814	3407	31221
Cost	$0.07	$0.03	$0.10

Total arXiv papers: 362

Total scanned papers: 203

Total relevant papers: 16

Table of contents with paper titles:

Statistically guided deep learning Authors: Michael Kohler, Adam Krzyzak
Scaling Up On-Device LLMs via Active-Weight Swapping Between DRAM and Flash Authors: Fucheng Jia, Zewen Wu, Shiqi Jiang, Huiqiang Jiang, Qianxi Zhang, Yuqing Yang, Yunxin Liu, Ju Ren, Deyu Zhang, Ting Cao
Dimension reduction for derivative-informed operator learning: An analysis of approximation errors Authors: Dingcheng Luo, Thomas O'Leary-Roseberry, Peng Chen, Omar Ghattas
Large language models could be rote learners Authors: Yuyang Xu, Renjun Hu, Haochao Ying, Jian Wu, Xing Shi, Wei Lin
SAEs $\textit{Can}$ Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs Authors: Aashiq Muhamed, Jacopo Bonato, Mona Diab, Virginia Smith
Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks Authors: Chenyang Zhang, Peifeng Gao, Difan Zou, Yuan Cao
Steering CLIP's vision transformer with sparse autoencoders Authors: Sonia Joseph, Praneet Suresh, Ethan Goldfarb, Lorenz Hufe, Yossi Gandelsman, Robert Graham, Danilo Bzdok, Wojciech Samek, Blake Aaron Richards
Cellular Development Follows the Path of Minimum Action Authors: Rohola Zandie, Farhan Khodaee, Yufan Xia, Elazer R. Edelman
Entropic bounds for conditionally Gaussian vectors and applications to neural networks Authors: Lucia Celli, Giovanni Peccati
A Piecewise Lyapunov Analysis of sub-quadratic SGD: Applications to Robust and Quantile Regression Authors: Yixuan Zhang, Dongyan (Lucy) Huo, Yudong Chen, Qiaomin Xie
Enabling Automatic Differentiation with Mollified Graph Neural Operators Authors: Ryan Y. Lin, Julius Berner, Valentin Duruisseaux, David Pitt, Daniel Leibovici, Jean Kossaifi, Kamyar Azizzadenesheli, Anima Anandkumar
Compositional Flows for 3D Molecule and Synthesis Pathway Co-design Authors: Tony Shen, Seonghwan Seo, Ross Irwin, Kieran Didi, Simon Olsson, Woo Youn Kim, Martin Ester
Constrained Machine Learning Through Hyperspherical Representation Authors: Gaetano Signorelli, Michele Lombardi
Scaling Laws of Graph Neural Networks for Atomistic Materials Modeling Authors: Chaojian Li, Zhifan Ye, Massimiliano Lupo Pasini, Jong Youl Choi, Cheng Wan, Yingyan Celine Lin, Prasanna Balaprakash
Between Linear and Sinusoidal: Rethinking the Time Encoder in Dynamic Graph Learning Authors: Hsing-Huan Chung, Shravan Chaudhari, Xing Han, Yoav Wald, Suchi Saria, Joydeep Ghosh
Proofs as Explanations: Short Certificates for Reliable Predictions Authors: Avrim Blum, Steve Hanneke, Chirag Pabbaraju, Donya Saless

1. Statistically guided deep learning

ArXiv ID: 2504.08489

Authors: Michael Kohler, Adam Krzyzak

Abstract: We present a theoretically well-founded deep learning algorithm for nonparametric regression. It uses over-parametrized deep neural networks with logistic activation function, which are fitted to the given data via gradient descent. We propose a special topology of these networks, a special random initialization of the weights, and a data-dependent choice of the learning rate and the number of gradient descent steps. We prove a theoretical bound on the expected $L_2$ error of this estimate, and illustrate its finite sample size performance by applying it to simulated data. Our results show that a theoretical analysis of deep learning which takes into account simultaneously optimization, generalization and approximation can result in a new deep learning estimate which has an improved finite sample performance.

Comment: The paper proposes a theoretically grounded deep learning algorithm with a focus on optimization, generalization, and approximation, aligning with foundational research in representation learning.
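
Illustration: the abstract describes fitting a network with logistic activation to regression data by gradient descent. The sketch below shows only that generic idea, for a single hidden layer and one-dimensional inputs. It is not the authors' estimate: the hidden width, Gaussian initialization, fixed learning rate, step count, and the toy target sin(pi x) are all assumptions made for this example, whereas the paper proposes a special topology, a special random initialization, and data-dependent choices of learning rate and number of steps that are not reproduced here.

```python
import math
import random


def logistic(z):
    """Numerically stable logistic (sigmoid) activation."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict(params, x):
    """One hidden layer: c + sum_j v_j * logistic(w_j * x + b_j)."""
    w, b, v, c = params
    return c + sum(vj * logistic(wj * x + bj) for wj, bj, vj in zip(w, b, v))


def mse(params, xs, ys):
    """Empirical L2 risk (mean squared error) on the sample."""
    return sum((predict(params, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def gradient_step(params, xs, ys, lr):
    """One full-batch gradient descent step on the empirical L2 risk."""
    w, b, v, c = params
    n, k = len(xs), len(w)
    gw, gb, gv, gc = [0.0] * k, [0.0] * k, [0.0] * k, 0.0
    for x, y in zip(xs, ys):
        h = [logistic(wj * x + bj) for wj, bj in zip(w, b)]
        pred = c + sum(vj * hj for vj, hj in zip(v, h))
        r = 2.0 * (pred - y) / n  # derivative of the risk w.r.t. this prediction
        gc += r
        for j in range(k):
            gv[j] += r * h[j]
            d = r * v[j] * h[j] * (1.0 - h[j])  # chain rule through the logistic
            gw[j] += d * x
            gb[j] += d
    return (
        [wj - lr * g for wj, g in zip(w, gw)],
        [bj - lr * g for bj, g in zip(b, gb)],
        [vj - lr * g for vj, g in zip(v, gv)],
        c - lr * gc,
    )


def fit(xs, ys, hidden=10, lr=0.02, steps=300, seed=0):
    """Fit by plain gradient descent; all defaults are illustrative assumptions."""
    rng = random.Random(seed)
    params = (
        [rng.gauss(0.0, 2.0) for _ in range(hidden)],  # inner weights w
        [rng.gauss(0.0, 2.0) for _ in range(hidden)],  # inner biases b
        [rng.gauss(0.0, 0.1) for _ in range(hidden)],  # outer weights v
        0.0,                                           # outer bias c
    )
    history = [mse(params, xs, ys)]
    for _ in range(steps):
        params = gradient_step(params, xs, ys, lr)
        history.append(mse(params, xs, ys))
    return params, history


# Toy data (an assumption for this example): noiseless samples of sin(pi * x).
xs = [i / 25.0 - 1.0 for i in range(51)]
ys = [math.sin(math.pi * x) for x in xs]
params, history = fit(xs, ys)
print(f"empirical risk before: {history[0]:.4f}, after: {history[-1]:.4f}")
```

The learning rate and step count are fixed by hand here; in the paper they are chosen from the data, which is part of what the theoretical error bound relies on.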