Statistical Learning

About this Course

This is an introductory-level course in supervised learning, with a focus on regression and classification methods. The syllabus includes: linear and polynomial regression, logistic regression and linear discriminant analysis; cross-validation and the bootstrap, model selection and regularization methods (ridge and lasso); nonlinear models, splines and generalized additive models; tree-based methods, random forests and boosting; support-vector machines; neural networks and deep learning; survival models; multiple testing. Some unsupervised learning methods are discussed: principal components and clustering (k-means and hierarchical). This is not a math-heavy class, so we try and describe the methods without heavy reliance on formulas and complex mathematics. We focus on what we consider to be the important elements of modern data science. Computing is done in R. There are lectures devoted to R, giving tutorials from the ground up, and progressing with more detailed sessions that implement the techniques in each chapter. The lectures cover all the material in An Introduction to Statistical Learning, with Applications in R (second addition) by James, Witten, Hastie and Tibshirani (Springer, 2021). The pdf for this book is available for free on the book website.

Created by: Stanford University

Level: Introductory


Related Online Courses

Develop the skills necessary to create structured database environments using a relational database management system (RDBMS), such as MySQL, that incorporates basic processing functionality and... more
This proctored examination assesses all concepts, methods and techniques introduced across the following four courses within the LSE MicroBachelors program in Statistics Fundamentals: Statistics 1:... more
Sustainable development is the most important global movement of our time. In 2015, the 193 member states of the United Nations unanimously adopted the 2030 Agenda for Sustainable Development and... more
Los contenidos de este curso se han pensado para permitir que los usuarios de Tableau mejoren a un nivel intermedio las propias capacidades en el empleo de la herramienta. En los precedentes... more
Statistical inference and modeling are indispensable for analyzing data affected by chance, and thus essential for data scientists. In this course, you will learn these key concepts through a... more

CONTINUE SEARCH

FOLLOW COLLEGE PARENT CENTRAL