M. D. Jiménez Gamero, M. R. Sillero Denamiel

In some practical settings, the population is divided into a large number k of subpopulations (by countries, cities, age groups, etc). In such a case, one may be interested in testing the equality of the k subpopulations, which is itself of interest and as a previous step in some Machine Learning approaches such as classification problems. With this aim, an unbiased estimator of the Gini covariance is taken as a test statistic. The asymptotic distribution of the test statistic is stated under the null hypothesis as well as under alternatives, assuming k large and small to moderate sample sizes. Specifically, it is shown that the test statistic is asymptotically free distributed under the null hypothesis, avoiding the use of complicated resampling procedures. The finite sample performance of the test based on the asymptotic null distribution is studied via simulation and compared with existing methods. An application to a real dataset about the quality of the air is shown.

Keywords: k-sample problem, energy distance, Gini correlation, asymptotic power, consistency

Scheduled

AMC4 Prediction and Classification
June 11, 2025  10:30 AM
MR 1


Other papers in the same session


Cookie policy

We use cookies in order to be able to identify and authenticate you on the website. They are necessary for the correct functioning of it, and therefore they can not be disabled. If you continue browsing the website, you are agreeing with their acceptance, as well as our Privacy Policy.

Additionally, we use Google Analytics in order to analyze the website traffic. They also use cookies and you can accept or refuse them with the buttons below.

You can read more details about our Cookie Policy and our Privacy Policy.