Quentin Duchemin

Documents

Here I share some works done during my studies.

Some works done during my research Master (MVA)

Project for the course on Kernel Methods with the questions.

Project on a unifying framework for representer theorems.

Homeworks (1, 2 and 3) related to the course “Probabilistic Graphical Models”

Poster done for a Machine-Learning project on classification by diffusion.

A list of good references that I read during my master and my PhD with some comments

  • General references on the theoretical aspects of Machine Learning and data-science
    – A large list of nice Theoretical Computer Science references are available here.
    – I recommand also the book recently written by Francis Bach “Learning from first Principles”
  • Markov chains
    – Meyn & Tweedie’s book: A good reference for theoretical aspects on (discrete time) Markov chains on general state space
    – Levin’s book on mixing time for Markov chains. This book deals mainly with Markov chain with finite state space and introduces important notions related to mixing like the spectral gap of a transition kernel or the Cheeger constant.
    – For a friendly version of the previous book, I recommend the lectures of Justin Salez (from a course at Sorbonne University).
    – For hidden Markov chains, one can read the book wirtten by Eric Moulines on the subject.
  • Concentration of measure
    – P.Massart’s book is the main reference on the subject
    – I also recommand the lecture notes from Ana Ben Amou to get a more synthetic introduction to this topic.
  • Convex Optimization
    – S. Bubeck’s book is a nice reference to understand the most important algorithms in convex optimization.
    – Boyd’s book is one of the most famous reference in the field of convex optimization.
  • General books on Optimization
    – Numerical Optimization from Nocedal and Wright
    – Convex Analysis and Monotone Operator Theory in Hilbert Spaces from Combettes and al. is a great reference to understand ADMM, proximal algorithms or Douglas Rachford algorithm.
  • High dimensional statistics
    – The book from R. Vershynin introduces important concepts of high dimensional statistics with a specific focus on geometry and concentration of measure and their interactions. This book presents in particular sub-gaussian and exponential random variables, concentration inequalities for quadratic forms (Hanson Wright’s inequality), Slepian, Gordon, Sudakov and Dudley inequalities (and much more).
    – To focus specifically on Random Matrices, there exists the Saint Flour lecture notes of Alice Guionnet.
    – Wainwright, High-Dimensional Statistics: A Non-Asymptotic Viewpoint This book covers a larger number of topics compared to the previous one with chapters on RKHS or PCA. I also spends several chapters presenting theoretical results for sparse linear models in high dimensions. The author takes the framework the more general possible which makes some sections difficult to grasp.
    – Giraud, Introduction to High-Dimensional Statistics
    – Giné, Mathematical Foundations of Infinite-Dimensional Statistical Models
  • Non parametric statistics
    – Non parametric inference, Tsybakov
    – Lecture Notes written by John Duchi were also really helpful for me.
  • Sparsity and compressed sensing
    – Statistical Learning with Sparsity, Hastie: one of the few books that present some “recent” and important topics such as Post-selection inference and the polyhedral lemma.
    – Books from Sara Van de Geer are widely used in the community (but are technically difficult).
    – Claire Boyer’s lecture notes on compressed sensing
  • Multiple testing procedures
    – Lecture notes from Emmanuel Candès offer a really nice historical overview of the subject. A specific attention is given to the work personally done by Pr. Candès (such as the Knockoff filter).
    – The recent handbook “Handbook of Multiple Comparisons” edited by Cui, Dickhaus and al. presents advanced research topics.
  • Probability theory
    – Books from Billingsley are often used as references for master courses on probability theory and limit theorems.
    – Giné, Decoupling – From dependence to independence : Book introducing decoupling methods.
  • Stochastic Calculus
    – Marc Yor’s book on Continuous Martingales and brownian motion describes in detail a variety of techniques used by probabilists in the field of random processes. This is not a book for beginners on the subject and is more intended for doctoral students or reseachers in the field.
    – Jean-François Le Gall’s lecture notes are a great reference for people that do not work precisely on Stochastic Calculus.
  • Optimal Transport
    – A nice reference that was the purpose of a reading group during my master MVA with Professor Vianney Perchet is the book written by Santambrogio: Optimal transport.
    – For a more computational view point (and maybe a more easy to grasp presentation of concepts from a theoretical point of view), I recommend the book from Gabriel Peyré and Marco Cuturi: “Computational Optimal Transport”.
    – Lectures Notes from Lenaic Chizat give an impressive well-structured summary of the two previous references.