A series of interactive tutorials introducing PCA, clustering, linear modelling and cross-validation for large datasets.