Personalized Daily ArXiv Papers 2025-05-14

[gpt-4o]	Prompt	Completion	Total
Token	38723	5195	43918
Cost	$0.1	$0.05	$0.15

Total arXiv papers: 475

Total scanned papers: 312

Total relevant papers: 17

Table of contents with paper titles:

Iteratively reweighted kernel machines efficiently learn sparse functions Authors: Libin Zhu, Damek Davis, Dmitriy Drusvyatskiy, Maryam Fazel
Super-fast rates of convergence for Neural Networks Classifiers under the Hard Margin Condition Authors: Nathanael Tepakbong, Ding-Xuan Zhou, Xiang Zhou
Learning Advanced Self-Attention for Linear Transformers in the Singular Value Domain Authors: Hyowon Wi, Jeongwhan Choi, Noseong Park
Lost in Transmission: When and Why LLMs Fail to Reason Globally Authors: Tobias Schnabel, Kiran Tomlinson, Adith Swaminathan, Jennifer Neville
Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints Authors: Jian-Qiao Zhu, Haijiang Yan, Thomas L. Griffiths
InfoPO: On Mutual Information Maximization for Large Language Model Alignment Authors: Teng Xiao, Zhen Ge, Sujay Sanghavi, Tian Wang, Julian Katz-Samuels, Marc Versage, Qingjun Cui, Trishul Chilimbi
Blockbuster, Part 1: Block-level AI Operator Fusion Authors: Ofer Dekel
PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts Authors: Yang Su, Na Yan, Yansha Deng, Robert Schober
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments Authors: Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma
Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders Authors: Dong Shu, Xuansheng Wu, Haiyan Zhao, Mengnan Du, Ninghao Liu
Rapid Overfitting of Multi-Pass Stochastic Gradient Descent in Stochastic Convex Optimization Authors: Shira Vansover-Hager, Tomer Koren, Roi Livni
Mirror Mirror on the Wall, Have I Forgotten it All? A New Framework for Evaluating Machine Unlearning Authors: Brennon Brimhall, Philip Mathew, Neil Fendley, Yinzhi Cao, Matthew Green
SPAT: Sensitivity-based Multihead-attention Pruning on Time Series Forecasting Models Authors: Suhan Guo, Jiahong Deng, Mengjun Yi, Furao Shen, Jian Zhao
Manifold Learning with Normalizing Flows: Towards Regularity, Expressivity and Iso-Riemannian Geometry Authors: Willem Diepeveen, Deanna Needell
Behind the Noise: Conformal Quantile Regression Reveals Emergent Representations Authors: Petrus H. Zwart, Tamas Varga, Odeta Qafoku, James A. Sethian
Scaling Laws for Speculative Decoding Authors: Siyuan Yan, Mo Zhu, Guo-qing Jiang, Jianfei Wang, Jiaxing Chen, Wentai Zhang, Xiang Liao, Xiao Cui, Chen Zhang, Zhuoran Song, Ran Zhu
The Correspondence Between Bounded Graph Neural Networks and Fragments of First-Order Logic Authors: Bernardo Cuenca Grau, Przemys{\l}aw A. Wa{\l}\k{e}ga

1. Iteratively reweighted kernel machines efficiently learn sparse functions

ArXiv ID: 2505.08277

Authors: Libin Zhu, Damek Davis, Dmitriy Drusvyatskiy, Maryam Fazel

Abstract: The impressive practical performance of neural networks is often attributed to their ability to learn low-dimensional data representations and hierarchical structure directly from data. In this work, we argue that these two phenomena are not unique to neural networks, and can be elicited from classical kernel methods. Namely, we show that the derivative of the kernel predictor can detect the influential coordinates with low sample complexity. Moreover, by iteratively using the derivatives to reweight the data and retrain kernel machines, one is able to efficiently learn hierarchical polynomials with finite leap complexity. Numerical experiments illustrate the developed theory.

Comment: The paper explores sparse function learning using kernel machines, which aligns with representation learning through sparse methods. It provides theoretical insights into kernel methods, making it relevant to foundational research.