Séminaire Probabilités et Statistiques
Finite-sample performance of the maximum likelihood estimator in logistic regression
10
oct. 2024
logo_team
Intervenant : Jaouad Mourtada
Institution : ENSAE
Heure : 15h30 - 16h30
Lieu : 3L15

The logistic model is a classical linear model to describe the dependence of binary responses to multivariate covariates. We consider the predictive performance of the maximum likelihood estimator (MLE) for logistic regression, assessed in terms of logistic risk. We consider two questions: first, that of the existence of the MLE (which occurs when the data is not linearly separated), and second that of its accuracy when it exists. These properties depend on both the dimension of covariates and on the signal strength.
In the case of Gaussian covariates and a well-specified logistic model, we describe sharp (up to universal constants) non-asymptotic guarantees for the existence and excess prediction risk of the MLE. This complements asymptotic results of Sur and Candès, and tightens existing non-asymptotic upper bounds. We then generalize these results in two ways: first, to non-Gaussian covariates satisfying a certain regularity condition, and second to the case of a misspecified logistic model.

Voir tous les événements