Each training point is predicted by a GP model trained on all other points. Observed values are plotted against these LOO predictions — points on the diagonal indicate good model fit.
Each point has a vertical error bar showing the GP's 95% confidence interval. Points far from the diagonal, or where the diagonal falls outside the CI, indicate regions where the model struggles. The R² value summarizes overall fit quality.
Interactivity: