Personalized Daily ArXiv Papers 2025-05-02

[gpt-4o]	Prompt	Completion	Total
Token	31381	3815	35196
Cost	$0.08	$0.04	$0.12

Total arXiv papers: 348

Total scanned papers: 223

Total relevant papers: 12

Table of contents with paper titles:

1. Optimal Vector Compressed Sensing Using James Stein Shrinkage
   Authors: Apratim Dey, David Donoho
2. Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
   Authors: Cong Xu, Wenbin Liang, Mo Yu, Anan Liu, Ke-Yue Zhang, Lizhuang Ma, Jianyong Wang, Jun Wang, Wei Zhang
3. On the generalization of language models from in-context learning and finetuning: a controlled study
   Authors: Andrew K. Lampinen, Arslan Chaudhry, Stephanie C. Y. Chan, Cody Wild, Diane Wan, Alex Ku, Jörg Bornschein, Razvan Pascanu, Murray Shanahan, James L. McClelland
4. FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension
   Authors: Jushi Kai, Boyi Zeng, Yixuan Wang, Haoli Bai, Bo Jiang, Zhouhan Lin
5. Empirical Evaluation of Progressive Coding for Sparse Autoencoders
   Authors: Hans Peter, Anders Søgaard
6. FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation
   Authors: Chaitali Bhattacharyya, Yeseong Kim
7. LangVAE and LangSpace: Building and Probing for Language Model VAEs
   Authors: Danilo S. Carvalho, Yingji Zhang, Harriet Unsworth, André Freitas
8. Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors
   Authors: Xinyu Ding, Lexuan Chen, Siyu Liao, Zhongfeng Wang
9. Block Circulant Adapter for Large Language Models
   Authors: Xinyu Ding, Meiqi Wang, Siyu Liao, Zhongfeng Wang
10. On the expressivity of deep Heaviside networks
    Authors: Insung Kong, Juntong Chen, Sophie Langer, Johannes Schmidt-Hieber
11. Optimizing Deep Neural Networks using Safety-Guided Self Compression
    Authors: Mohammad Zbeeb, Mariam Salman, Mohammad Bazzi, Ammar Mohanna
12. Scaling On-Device GPU Inference for Large Generative Models
    Authors: Jiuqiang Tang, Raman Sarokin, Ekaterina Ignasheva, Grant Jensen, Lin Chen, Juhyun Lee, Andrei Kulik, Matthias Grundmann

1. Optimal Vector Compressed Sensing Using James Stein Shrinkage

ArXiv ID: 2505.00326

Authors: Apratim Dey, David Donoho

Abstract: The trend in modern science and technology is to take vector measurements rather than scalars, ruthlessly scaling to ever higher dimensional vectors. For about two decades now, traditional scalar Compressed Sensing has been synonymous with a Convex Optimization based procedure called Basis Pursuit. In the vector recovery case, the natural tendency is to return to a straightforward vector extension of Basis Pursuit, also based on Convex Optimization. However, Convex Optimization is provably suboptimal, particularly when $B$ is large. In this paper, we propose SteinSense, a lightweight iterative algorithm, which is provably optimal when $B$ is large. It does not have any tuning parameter, does not need any training data, requires zero knowledge of sparsity, is embarrassingly simple to implement, and all of this makes it easily scalable to high vector dimensions. We conduct a massive volume of both real and synthetic experiments that confirm the efficacy of SteinSense, and also provide theoretical justification based on ideas from Approximate Message Passing. Fascinatingly, we discover that SteinSense is quite robust, delivering the same quality of performance on real data, and even under substantial departures from conditions under which existing theory holds.

Comment: The paper introduces SteinSense, a lightweight iterative algorithm for vector compressed sensing that is provably optimal when $B$ is large, needs no tuning parameter or training data, and scales to high vector dimensions. This aligns with foundational research in model compression and efficiency.
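
For readers unfamiliar with the shrinkage rule named in the title, the following is a minimal sketch of the classical positive-part James-Stein estimator applied row by row. It is not the SteinSense algorithm itself, whose iteration the abstract does not spell out. The function name `james_stein_rows`, the known noise level `sigma`, and the reading of $B$ as the length of each vector are assumptions made for illustration only.

```python
from typing import List, Sequence


def james_stein_rows(rows: Sequence[Sequence[float]], sigma: float) -> List[List[float]]:
    """Positive-part James-Stein shrinkage of each row toward zero.

    Illustrative only: each row is treated as a noisy B-dimensional vector
    with i.i.d. Gaussian noise of standard deviation `sigma` (an assumption).
    """
    shrunk: List[List[float]] = []
    for row in rows:
        b = len(row)
        if b < 3:
            # The classical James-Stein result requires dimension >= 3.
            raise ValueError("James-Stein shrinkage needs vectors of length >= 3")
        norm_sq = sum(v * v for v in row)
        if norm_sq == 0.0:
            # A zero row stays zero; this also avoids division by zero.
            shrunk.append([0.0] * b)
            continue
        # Shrinkage factor (1 - (B - 2) * sigma^2 / ||y||^2), clipped at zero.
        factor = max(0.0, 1.0 - (b - 2) * sigma ** 2 / norm_sq)
        shrunk.append([factor * v for v in row])
    return shrunk


if __name__ == "__main__":
    noisy = [[3.0, 4.0, 0.0, 0.0], [0.1, 0.1, 0.1, 0.1]]
    print(james_stein_rows(noisy, sigma=1.0))
```

Rows with large norm are barely changed, while rows whose norm is comparable to the noise level are set exactly to zero. This may help explain why a rule of this kind needs no explicit sparsity parameter.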