Á. Cía Mina, J. López Fidalgo, L. Deldossi, C. Tommasi

In large-scale regression problems, subsampling is often used to enhance computational efficiency. Traditional subsampling techniques primarily focus on accurate parameter estimation, yet in many practical applications, the ultimate objective is to improve predictive performance. This study presents a new subsampling strategy for linear models that explicitly accounts for model misspecification. The approach leverages the distribution of covariates and is particularly useful in scenarios where acquiring response variable labels is expensive. By targeting the reduction of bias in the random-X prediction error, the proposed method enhances predictive accuracy. Theoretical results establish its advantage in lowering prediction mean squared error, and simulation studies further validate its effectiveness compared to existing approaches.

Keywords: Model misspecification, Random-X regression, Optimal Design of Experiments

Scheduled

Design of Experiments I
June 11, 2025  3:30 PM
MR 1


Other papers in the same session

Estimators based on AUC for classification and optimal subsampling for them

C. de la Calle-Arroyo, J. López Fidalgo, P. Urruchi Mohino

Optimal discrimination between mixed models through nature-inspired algorithms

C. Tommasi, V. M. Casero Alonso, J. López Fidalgo, S. Pozuelo Campos, W. K. Wong

Robust subsampling to minimise MSPE

C. de la Calle-Arroyo, L. Deldossi, C. Tommasi


Cookie policy

We use cookies in order to be able to identify and authenticate you on the website. They are necessary for the correct functioning of it, and therefore they can not be disabled. If you continue browsing the website, you are agreeing with their acceptance, as well as our Privacy Policy.

Additionally, we use Google Analytics in order to analyze the website traffic. They also use cookies and you can accept or refuse them with the buttons below.

You can read more details about our Cookie Policy and our Privacy Policy.