Personalized Daily Arxiv Papers 3/31/2025

[gpt-4o]	Prompt	Completion	Total
Token	25192	3444	28636
Cost	$0.06	$0.03	$0.1

Total arXiv papers: 353

Total scanned papers: 223

Total relevant papers: 15

Table of contents with paper titles:

1. Meta-Representational Predictive Coding: Biomimetic Self-Supervised Learning
   Authors: Alexander Ororbia, Karl Friston, Rajesh P. N. Rao
2. STADE: Standard Deviation as a Pruning Metric
   Authors: Diego Coello de Portugal Mecke, Haya Alyoussef, Ilia Koloiarov, Maximilian Stubbemann, Lars Schmidt-Thieme
3. Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities
   Authors: Raman Dutt, Harleen Hanspal, Guoxuan Xia, Petru-Daniel Tudosiu, Alexander Black, Yongxin Yang, Steven McDonagh, Sarah Parisot
4. An Efficient Training Algorithm for Models with Block-wise Sparsity
   Authors: Ding Zhu, Zhiqun Zuo, Mohammad Mahdi Khalili
5. Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation
   Authors: Zhuo-Yang Song, Zeyu Li, Qing-Hong Cao, Ming-xing Luo, Hua Xing Zhu
6. Local Normalization Distortion and the Thermodynamic Formalism of Decoding Strategies for Large Language Models
   Authors: Tom Kempton, Stuart Burrell
7. Concise One-Layer Transformers Can Do Function Evaluation (Sometimes)
   Authors: Lena Strobl, Dana Angluin, Robert Frank
8. AdaRank: Adaptive Rank Pruning for Enhanced Model Merging
   Authors: Chanhyuk Lee, Jiho Choi, Chanryeol Lee, Donggyun Kim, Seunghoon Hong
9. Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
   Authors: Audrey Huang, Adam Block, Qinghua Liu, Nan Jiang, Dylan J. Foster, Akshay Krishnamurthy
10. Arch-LLM: Taming LLMs for Neural Architecture Generation via Unsupervised Discrete Representation Learning
    Authors: Deshani Geethika Poddenige, Sachith Seneviratne, Damith Senanayake, Mahesan Niranjan, PN Suganthan, Saman Halgamuge
11. MixFunn: A Neural Network for Differential Equations with Improved Generalization and Interpretability
    Authors: Tiago de Souza Farias, Gubio Gomes de Lima, Jonas Maziero, Celso Jorge Villas-Boas
12. ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models
    Authors: Chung-En Sun, Ge Yan, Tsui-Wei Weng
13. MSPLoRA: A Multi-Scale Pyramid Low-Rank Adaptation for Efficient Model Fine-Tuning
    Authors: Jiancheng Zhao, Xingda Yu, Zhen Yang
14. A Proposal for Networks Capable of Continual Learning
    Authors: Zeki Doruk Erden, Boi Faltings
15. Efficient Joint Prediction of Multiple Future Tokens
    Authors: Kwangjun Ahn, Alex Lamb, John Langford

1. Meta-Representational Predictive Coding: Biomimetic Self-Supervised Learning

ArXiv ID: 2503.21796

Authors: Alexander Ororbia, Karl Friston, Rajesh P. N. Rao

Abstract: Self-supervised learning has become an increasingly important paradigm in the domain of machine intelligence. Furthermore, evidence for self-supervised adaptation, such as contrastive formulations, has emerged in recent computational neuroscience and brain-inspired research. Nevertheless, current work on self-supervised learning relies on biologically implausible credit assignment -- in the form of backpropagation of errors -- and feedforward inference, typically a forward-locked pass. Predictive coding, in its mechanistic form, offers a biologically plausible means to sidestep these backprop-specific limitations. However, unsupervised predictive coding rests on learning a generative model of raw pixel input (akin to "generative AI" approaches), which entails predicting a potentially high dimensional input; on the other hand, supervised predictive coding, which learns a mapping between inputs to target labels, requires human annotation, and thus incurs the drawbacks of supervised learning. In this work, we present a scheme for self-supervised learning within a neurobiologically plausible framework that appeals to the free energy principle, constructing a new form of predictive coding that we call meta-representational predictive coding (MPC). MPC sidesteps the need for learning a generative model of sensory input (e.g., pixel-level features) by learning to predict representations of sensory input across parallel streams, resulting in an encoder-only learning and inference scheme. This formulation rests on active inference (in the form of sensory glimpsing) to drive the learning of representations, i.e., the representational dynamics are driven by sequences of decisions made by the model to sample informative portions of its sensorium.

Comment: The paper introduces Meta-Representational Predictive Coding (MPC), a self-supervised framework that is relevant to representation learning: it is encoder-only, predicts representations across parallel streams instead of reconstructing raw input, and relies on biologically plausible predictive coding in place of backpropagation. It also offers theoretical insight into predictive coding and active inference.
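
Illustration: the abstract's core idea, predicting representations across parallel streams rather than reconstructing pixels, can be shown with a toy example. The sketch below is not the authors' algorithm. It assumes two one-layer tanh encoders, a 1-D signal standing in for sensory input, uniformly random glimpse positions in place of an active-inference policy, and a plain squared-error objective. All function names and hyperparameters are hypothetical.

```python
import math
import random


def encode(weights, glimpse):
    """One-layer encoder: z = tanh(W x)."""
    return [math.tanh(sum(w * x for w, x in zip(row, glimpse))) for row in weights]


def take_glimpse(signal, start, width):
    """Return a contiguous window of the signal (a stand-in for a sensory glimpse)."""
    return signal[start:start + width]


def mismatch(w_a, w_b, signal, starts, width):
    """Mean cross-stream error, 0.5 * ||z_a - z_b||^2, over all pairs of distinct glimpses."""
    total, count = 0.0, 0
    for sa in starts:
        for sb in starts:
            if sa == sb:
                continue
            z_a = encode(w_a, take_glimpse(signal, sa, width))
            z_b = encode(w_b, take_glimpse(signal, sb, width))
            total += 0.5 * sum((a - b) ** 2 for a, b in zip(z_a, z_b))
            count += 1
    return total / count


def local_update(weights, glimpse, z, error, lr):
    """Adjust one stream using only its own input, its own output, and its error signal."""
    for i, row in enumerate(weights):
        delta = error[i] * (1.0 - z[i] ** 2)  # error scaled by the tanh derivative
        for j in range(len(row)):
            row[j] -= lr * delta * glimpse[j]


def train_demo(seed=0, steps=2000, lr=0.05, width=4, rep_dim=3):
    """Train both streams to predict each other's representation; return (before, after) error."""
    rng = random.Random(seed)
    signal = [rng.random() for _ in range(16)]  # assumed toy "sensorium"
    starts = list(range(0, len(signal) - width + 1, 2))
    w_a = [[rng.uniform(-0.5, 0.5) for _ in range(width)] for _ in range(rep_dim)]
    w_b = [[rng.uniform(-0.5, 0.5) for _ in range(width)] for _ in range(rep_dim)]
    before = mismatch(w_a, w_b, signal, starts, width)
    for _ in range(steps):
        sa, sb = rng.sample(starts, 2)  # random glimpses; no learned glimpse policy here
        x_a = take_glimpse(signal, sa, width)
        x_b = take_glimpse(signal, sb, width)
        z_a, z_b = encode(w_a, x_a), encode(w_b, x_b)
        err_a = [a - b for a, b in zip(z_a, z_b)]  # stream A's error against B's representation
        err_b = [-e for e in err_a]                # stream B's error against A's representation
        local_update(w_a, x_a, z_a, err_a, lr)
        local_update(w_b, x_b, z_b, err_b, lr)
    after = mismatch(w_a, w_b, signal, starts, width)
    return before, after


if __name__ == "__main__":
    before, after = train_demo()
    print(f"cross-stream error before: {before:.4f}, after: {after:.4f}")
```

Note that nothing is decoded back to the input: the only learning signal is the disagreement between the two encoders' representations. Left unconstrained, this toy objective can be satisfied by a constant representation. The sketch does not try to prevent that and makes no claim about how MPC handles it.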