Personalized Daily Arxiv Papers 4/03/2025

[gpt-4o]	Prompt	Completion	Total
Token	23735	2804	26539
Cost	$0.06	$0.03	$0.09

Total arXiv papers: 379

Total scanned papers: 225

Total relevant papers: 13

Table of contents with paper titles:

AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical Knowledge Authors: You-Le Fang, Dong-Shan Jian, Xiang Li, Yan-Qing Ma
Estimating Unbounded Density Ratios: Applications in Error Control under Covariate Shift Authors: Shuntuo Xu, Zhou Yu, Jian Huang
Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design Authors: Mohan Zhang, Pingzhi Li, Jie Peng, Mufan Qiu, Tianlong Chen
InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation Authors: Bowen Cao, Deng Cai, Wai Lam
Sparse Gaussian Neural Processes Authors: Tommy Rochussen, Vincent Fortuin
Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure Authors: Boshi Wang, Huan Sun
Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? Authors: Celine Lee, Alexander M. Rush, Keyon Vafa
A Unified Approach to Analysis and Design of Denoising Markov Models Authors: Yinuo Ren, Grant M. Rotskoff, Lexing Ying
Denoising guarantees for optimized sampling schemes in compressed sensing Authors: Yaniv Plan, Matthew S. Scott, Xia Sheng, Ozgur Yilmaz
FLAMES: A Hybrid Spiking-State Space Model for Adaptive Memory Retention in Event-Based Learning Authors: Biswadeep Chakraborty, Saibal Mukhopadhyay
R2DN: Scalable Parameterization of Contracting and Lipschitz Recurrent Deep Networks Authors: Nicholas H. Barbara, Ruigang Wang, Ian R. Manchester
Analysis of an Idealized Stochastic Polyak Method and its Application to Black-Box Model Distillation Authors: Robert M. Gower, Guillaume Garrigos, Nicolas Loizou, Dimitris Oikonomou, Konstantin Mishchenko, Fabian Schaipp
Hessian-aware Training for Enhancing DNNs Resilience to Parameter Corruptions Authors: Tahmid Hasan Prato, Seijoon Kim, Lizhong Chen, Sanghyun Hong

1. AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical Knowledge

ArXiv ID: 2504.01538

Authors: You-Le Fang, Dong-Shan Jian, Xiang Li, Yan-Qing Ma

Abstract: Current limitations in human scientific discovery necessitate a new research paradigm. While advances in artificial intelligence (AI) offer a highly promising solution, enabling AI to emulate human-like scientific discovery remains an open challenge. To address this, we propose AI-Newton, a concept-driven discovery system capable of autonomously deriving physical laws from raw data -- without supervision or prior physical knowledge. The system integrates a knowledge base and knowledge representation centered on physical concepts, along with an autonomous discovery workflow. As a proof of concept, we apply AI-Newton to a large set of Newtonian mechanics problems. Given experimental data with noise, the system successfully rediscovers fundamental laws, including Newton's second law, energy conservation and law of gravitation, using autonomously defined concepts. This achievement marks a significant step toward AI-driven autonomous scientific discovery.

Comment: AI-Newton represents a novel paradigm for autonomous scientific discovery, which aligns with the 'AI for Science' criterion and introduces a concept-driven approach to deriving physical laws.

Relevance: 9 Novelty: 9

2. Estimating Unbounded Density Ratios: Applications in Error Control under Covariate Shift

ArXiv ID: 2504.01031

Authors: Shuntuo Xu, Zhou Yu, Jian Huang

Abstract: The density ratio is an important metric for evaluating the relative likelihood of two probability distributions, with extensive applications in statistics and machine learning. However, existing estimation theories for density ratios often depend on stringent regularity conditions, mainly focusing on density ratio functions with bounded domains and ranges. In this paper, we study density ratio estimators using loss functions based on least squares and logistic regression. We establish upper bounds on estimation errors with standard minimax optimal rates, up to logarithmic factors. Our results accommodate density ratio functions with unbounded domains and ranges. We apply our results to nonparametric regression and conditional flow models under covariate shift and identify the tail properties of the density ratio as crucial for error control across domains affected by covariate shift. We provide sufficient conditions under which loss correction is unnecessary and demonstrate effective generalization capabilities of a source estimator to any suitable target domain. Our simulation experiments support these theoretical findings, indicating that the source estimator can outperform those derived from loss correction methods, even when the true density ratio is known.

Comment: The paper addresses density ratio estimation under relaxed conditions, which is foundational for representation learning and generalization under covariate shift. The theoretical contributions are significant and align with the criteria for foundational research.