From Basic Machine Learning models to Advanced Kernel Learning (2025

Course description

Statistical learning is about the construction and study of systems that can automatically learn from data. With the emergence of massive datasets commonly encountered today, the need for powerful machine learning is of acute importance. Examples of successful applications include effective web search, anti-spam software, computer vision, robotics, practical speech recognition, and a deeper understanding of the human genome.

This course gives an introduction to this exciting field. In the first part, we will introduce basic techniques such as linear regression, logistic regression, multilayer perceptrons, nearest neighbor approaches, both from a theoretical and methodological point of views. In the second part, we will focus on more advanced techniques such as kernel methods, which is a versatile tool to represent data, in combination with (un)supervised learning techniques that are agnostic to the type of data that is learned from. The learning techniques that will be covered include regression, classification, clustering and dimension reduction. We will cover both the theoretical underpinnings of kernels, as well as a series of kernels that are important in practical applications. Finally we will touch upon topics of active research, such as large-scale kernel methods and the use of kernel methods to develop theoretical foundations of deep learning models.

Evaluation

The grading of the class will be done with (i) one final exam, (ii) two homeworks.

Reading material

Francis Bach, Learning Theory from First Principles, 2023.
Vapnik, The nature of statistical learning theory. Springer.
Hastie, Tibshirani, Friedman, The elements of statistical learning. (free online).
Shawe-Taylor, Cristianini. Kernel methods for pattern analysis. 2004.

Optional exercises (ungraded)

Addtional exercises (on kernel learning) can be found in this booklet.

Final exam

The final exam will take place some day between the 26th of January 2026 and 4th of February 2026, it will last 2 hours (the location and precise date is to be confirmed). You are allowed to bring one single A4 sheet of handwritten notes.

Schedule

Lectures are scheduled from 2pm to 5pm on Tuesdays. Please bring your laptop as their will sometimes be coding exercises. The first part of the class will recap basic supervised learning techniques, theory, and algorithms. Lecture notes will be continuously updated here, and I'll be looking for motivated volunteers in each session to help write them. A cheat sheet with key formulas and theorems for the course is also available at the end of the lecture notes—feel free to contribute by adding anything you think might be useful for you and your classmates!

You can download the slides for all lectures of the second part of the class (advanced kernel methods) here! Each lecture corresponds to a range of slides. Slides are frequently updated. Please let us know if you spot typos!

Homework 1. The first homework will be due on Monday 8th of December 2025 and is available here. The report can be written in English or in French. You can work alone or in pairs. Submission details: Email before Monday 8 December, 22:00 to scott.pesme@inria.fr with subject “Homework 1 – [Name 1] and [Name 2]”. Attach a single .zip containing the PDF report, you code ReLU.ipynb and the given data folder.

Homework 2. The second homework will be due by 13/01/2026 at 1pm and is available here. This is an individual homework. Students must submit a single PDF to the GradeScope plateform. Please follow the instructions to create an account on GradeScope and to submit the homework. To access the course plateform you need to provide the following Course entry code: WNBGBN. For questions about the Homework please send an email to , the object of the email must start with the course entry code.

#	Date	Room	Teacher	Topic
1	23/09/2025	A022	SP	Introduction
1bis	23/09/2025	A022	SP	Supervised learning basics
2	30/09/2025	H206	SP	Linear regression (quizzes)
2bis	30/09/2025	H206	SP	Linear regression (notebook)
3	07/10/2025	Amphi H	SP	Bias / variance tradeoff, logistic regression (quizzes)
3bis	07/10/2025	Amphi H	SP	Notebooks
4	14/10/2025	H206	SP	Unsupervised learning
4bis	14/10/2025	H206	SP	Notebook
5	21/10/2025	Amphi E
5bis	21/10/2025	Amphi E
6	04/11/2025	Amphi H
6bis	04/11/2025	Amphi H
7	18/11/2025	H105
7bis	18/11/2025	H105
8	25/11/2025	H206
8bis	25/11/2025	H206
9	02/12/2025	A022
9bis	02/12/2025	A022
10	09/12/2025	Amphi E
10bis	09/12/2025	Amphi E
11	06/01/2026	Amphi E
11bis	06/01/2026	Amphi E
12	13/01/2026	Amphi E
12bis	13/01/2026	Amphi E
EXAM	TBD	TBD		Final Exam

From Basic Machine Learning models to Advanced Kernel Learning

Course description

Evaluation

Reading material

Optional exercises (ungraded)

Final exam

Schedule

Instructors

Julien Mairal

Scott Pesme

Michael Arbel