Personalized Daily ArXiv Papers 2025-06-16

[gpt-4o]	Prompt	Completion	Total
Token	41131	5352	46483
Cost	$0.1	$0.05	$0.16

Total arXiv papers: 610

Total scanned papers: 396

Total relevant papers: 26

Table of contents with paper titles:

Forward Target Propagation: A Forward-Only Approach to Global Error Credit Assignment via Local Losses Authors: Nazmus Saadat As-Saquib, A N M Nafiz Abeer, Hung-Ta Chien, Byung-Jun Yoon, Suhas Kumar, Su-in Yi
HEIST: A Graph Foundation Model for Spatial Transcriptomics and Proteomics Data Authors: Hiren Madhu, Jo\~ao Felipe Rocha, Tinglin Huang, Siddharth Viswanath, Smita Krishnaswamy, Rex Ying
Large Language Models and Emergence: A Complex Systems Perspective Authors: David C. Krakauer, John W. Krakauer, Melanie Mitchell
Interpretable representation learning of quantum data enabled by probabilistic variational autoencoders Authors: Paulin de Schoulepnikoff, Gorka Mu\~noz-Gil, Hendrik Poulsen Nautrup, Hans J. Briegel
Boost Post-Training Quantization via Null Space Optimization for Large Language Models Authors: Jiaqi Zhao, Miao Zhang, Weili Guan, Liqiang Nie
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation Authors: Zhuguanyu Wu, Shihe Wang, Jiayi Zhang, Jiaxin Chen, Yunhong Wang
Dynamic Sparse Training of Diagonally Sparse Networks Authors: Abhishek Tyagi, Arjun Iyer, William H Renninger, Christopher Kanan, Yuhao Zhu
A Framework for Non-Linear Attention via Modern Hopfield Networks Authors: Ahmed Farooq
How Visual Representations Map to Language Feature Space in Multimodal LLMs Authors: Constantin Venhoff, Ashkan Khakzar, Sonia Joseph, Philip Torr, Neel Nanda
Long-Short Alignment for Effective Long-Context Modeling in LLMs Authors: Tianqi Du, Haotian Huang, Yifei Wang, Yisen Wang
MoTE: Mixture of Task-specific Experts for Pre-Trained ModelBased Class-incremental Learning Authors: Linjie Li, Zhenyu Wu, Yang Ji
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration Authors: Hanzhi Zhang, Heng Fan, Kewei Sha, Yan Huang, Yunhe Feng
Solving Inverse Problems in Stochastic Self-Organising Systems through Invariant Representations Authors: Elias Najarro, Nicolas Bessone, Sebastian Risi
Spectral Estimation with Free Decompression Authors: Siavash Ameli, Chris van der Heide, Liam Hodgkinson, Michael W. Mahoney
Delayformer: spatiotemporal transformation for predicting high-dimensional dynamics Authors: Zijian Wang, Peng Tao, Luonan Chen
Lifting Data-Tracing Machine Unlearning to Knowledge-Tracing for Foundation Models Authors: Yuwen Tan, Boqing Gong
Brewing Knowledge in Context: Distillation Perspectives on In-Context Learning Authors: Chengye Li, Haiyun Liu, Yuanxi Li
Tversky Neural Networks: Psychologically Plausible Deep Learning with Differentiable Tversky Similarity Authors: Moussa Koulako Bala Doumbouya, Dan Jurafsky, Christopher D. Manning
STRCMP: Integrating Graph Structural Priors with Language Models for Combinatorial Optimization Authors: Xijun Li, Jiexiang Yang, Jinghao Wang, Bo Peng, Jianguo Yao, Haibing Guan
TruncQuant: Truncation-Ready Quantization for DNNs with Flexible Weight Bit Precision Authors: Jinhee Kim, Seoyeon Yoon, Taeho Lee, Joo Chan Lee, Kang Eun Jeon, Jong Hwan Ko
PolyMicros: Bootstrapping a Foundation Model for Polycrystalline Material Structure Authors: Michael Buzzy, Andreas Robertson, Peng Chen, Surya Kalidindi
Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision Making Authors: Xiaopeng Yuan, Xingjian Zhang, Ke Xu, Yifan Xu, Lijun Yu, Jindong Wang, Yushun Dong, Haohan Wang
GenFT: A Generative Parameter-Efficient Fine-Tuning Method for Pretrained Foundation Models Authors: Baoquan Zhang, Guangning Xu, Michael. K. Ng
Understanding Input Selectivity in Mamba: Impact on Approximation Power, Memorization, and Associative Recall Capacity Authors: Ningyuan Huang, Miguel Sarabia, Abhinav Moudgil, Pau Rodriguez, Luca Zappella, Federico Danieli
An Attention-based Spatio-Temporal Neural Operator for Evolving Physics Authors: Vispi Karkaria, Doksoo Lee, Yi-Ping Chen, Yue Yu, Wei Chen
Improving Large Language Model Safety with Contrastive Representation Learning Authors: Samuel Simko, Mrinmaya Sachan, Bernhard Sch\"olkopf, Zhijing Jin

1. Forward Target Propagation: A Forward-Only Approach to Global Error Credit Assignment via Local Losses

ArXiv ID: 2506.11030

Authors: Nazmus Saadat As-Saquib, A N M Nafiz Abeer, Hung-Ta Chien, Byung-Jun Yoon, Suhas Kumar, Su-in Yi

Abstract: Training neural networks has traditionally relied on backpropagation (BP), a gradient-based algorithm that, despite its widespread success, suffers from key limitations in both biological and hardware perspectives. These include backward error propagation by symmetric weights, non-local credit assignment, and frozen activity during backward passes. We propose Forward Target Propagation (FTP), a biologically plausible and computationally efficient alternative that replaces the backward pass with a second forward pass. FTP estimates layerwise targets using only feedforward computations, eliminating the need for symmetric feedback weights or learnable inverse functions, hence enabling modular and local learning. We evaluate FTP on fully connected networks, CNNs, and RNNs, demonstrating accuracies competitive with BP on MNIST, CIFAR10, and CIFAR100, as well as effective modeling of long-term dependencies in sequential tasks. Moreover, FTP outperforms BP under quantized low-precision and emerging hardware constraints while also demonstrating substantial efficiency gains over other biologically inspired methods such as target propagation variants and forward-only learning algorithms. With its minimal computational overhead, forward-only nature, and hardware compatibility, FTP provides a promising direction for energy-efficient on-device learning and neuromorphic computing.

Comment: The paper introduces Forward Target Propagation, a novel approach to error credit assignment, relevant to model architecture and training dynamics.

Relevance: 9 Novelty: 9

2. HEIST: A Graph Foundation Model for Spatial Transcriptomics and Proteomics Data

ArXiv ID: 2506.11152

Authors: Hiren Madhu, Jo\~ao Felipe Rocha, Tinglin Huang, Siddharth Viswanath, Smita Krishnaswamy, Rex Ying

Abstract: Single-cell transcriptomics has become a great source for data-driven insights into biology, enabling the use of advanced deep learning methods to understand cellular heterogeneity and transcriptional regulation at the single-cell level. With the advent of spatial transcriptomics data we have the promise of learning about cells within a tissue context as it provides both spatial coordinates and transcriptomic readouts. However, existing models either ignore spatial resolution or the gene regulatory information. Gene regulation in cells can change depending on microenvironmental cues from neighboring cells, but existing models neglect gene regulatory patterns with hierarchical dependencies across levels of abstraction. In order to create contextualized representations of cells and genes from spatial transcriptomics data, we introduce HEIST, a hierarchical graph transformer-based foundation model for spatial transcriptomics and proteomics data. HEIST models tissue as spatial cellular neighborhood graphs, and each cell is, in turn, modeled as a gene regulatory network graph. The framework includes a hierarchical graph transformer that performs cross-level message passing and message passing within levels. HEIST is pre-trained on 22.3M cells from 124 tissues across 15 organs using spatially-aware contrastive learning and masked auto-encoding objectives. Unsupervised analysis of HEIST representations of cells, shows that it effectively encodes the microenvironmental influences in cell embeddings, enabling the discovery of spatially-informed subpopulations that prior models fail to differentiate. Further, HEIST achieves state-of-the-art results on four downstream task such as clinical outcome prediction, cell type annotation, gene imputation, and spatially-informed cell clustering across multiple technologies, highlighting the importance of hierarchical modeling and GRN-based representations.

Comment: The paper introduces HEIST, a hierarchical graph transformer-based foundation model for spatial transcriptomics and proteomics data, which aligns with the AI for Science criterion focusing on foundational research in molecular/protein modeling.