[rede.APPIA] DaSSWeb – Data Science and Statistics Webinar – 24 September – Peter J. Rousseeuw – Robust Principal Components by Casewise and Cellwise Weighting

DaSSWeb – Data Science and Statistics Webinar

 

Tuesday, 24 September, 14:30 (WEST)

 

Speaker

Peter J. Rousseeuw

Department of Mathematics, KU Leuven

Leuven, Belgium

 

Title

Robust Principal Components by Casewise and Cellwise Weighting

 

Zoom link

Abstract

Principal component analysis (PCA) is a fundamental tool for analyzing multivariate data. Here the focus is on dimension reduction to the principal subspace, characterized by its projection matrix. The  classical principal subspace can be strongly affected by the presence of outliers. Traditional robust approaches consider casewise outliers, that is, cases generated by an unspecified outlier distribution that differs from that of the clean cases. But there may also be cellwise outliers, which are suspicious entries that can occur anywhere in the data matrix. Another common issue is that some cells may be missing. This paper proposes a new robust PCA method, called cellPCA, that can simultaneously deal with casewise outliers, cellwise outliers, and missing cells. Its single objective function combines two robust loss functions that together mitigate the effect of casewise and cellwise outliers. The objective function is minimized by an iteratively reweighted least squares (IRLS) algorithm. Residual cellmaps and enhanced outlier maps are proposed for outlier detection. Simulations and real data examples illustrate the performance of cellPCA.

 

More information at