- 7/26: Lectures 7 and 9 are out.
- 7/25: The paper review report is due on 8/2, 12 pm.
- 7/19: The schedule of presentations is out.
- 7/18: A draft of Lecture 4 is out.
- 7/17: Drafts of Lectures 3, 5, and 6 are out.
- 7/12: A draft of Lecture 2 is out.
- 7/12: Some references for random feature models, Barron spaces, and the regularization theory of two-layer nets have been added.
- 7/9: A draft of Lecture 1 is out.
- 7/9: Homework 2 is out. It is due on Tuesday, 7/16, 12 pm.
- 7/6: Homework 1 is out. It is due on Friday, 7/12, 12 pm.
Instructors:
- Weinan E
- Lei Wu, leiwu@princeton.edu
- Chao Ma, chaom@princeton.edu
Time: Tue: 2:00-5:00 pm; Thu: 2:00-5:00 pm; Fri: 3:00-5:00 pm.
Location: Room 515, Teaching Building 2
Description:
This course introduces the basic models for supervised learning, including the kernel method, two-layer neural networks, and residual networks. We then provide a unified approach to analyzing these models (the three model classes are sketched below).
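For orientation, the three model classes can be written schematically as follows. This is a sketch in standard notation; the scaling conventions used in the lectures may differ:

```latex
% Kernel method: expansion over a positive definite kernel k
f(x) = \sum_{i=1}^{n} a_i \, k(x, x_i)

% Two-layer neural network with m hidden neurons and activation \sigma
f(x) = \frac{1}{m} \sum_{j=1}^{m} a_j \, \sigma(w_j^\top x)

% Residual network: a composition of residual blocks
z_0 = V x, \qquad z_{l+1} = z_l + U_l \, \sigma(W_l z_l), \qquad f(x) = \alpha^\top z_L
```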
Topics:
- Supervised learning, generalization/approximation/estimation error, a priori/a posteriori estimates (the standard error decomposition is sketched after this list)
- Kernel method, two-layer neural network, residual network
- Reproducing kernel Hilbert space, Barron space, compositional function space
- Rademacher complexity, margin, gradient descent, implicit regularization
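As a point of reference for the first topic above, the excess risk of an empirical risk minimizer splits into two parts. This is a generic sketch with standard notation, not the course's exact formulation:

```latex
% Hypothesis class \mathcal{H}, target f^*, empirical risk minimizer \hat{f} over \mathcal{H}
R(\hat{f}) - R(f^*)
  = \underbrace{\inf_{f \in \mathcal{H}} R(f) - R(f^*)}_{\text{approximation error}}
  + \underbrace{R(\hat{f}) - \inf_{f \in \mathcal{H}} R(f)}_{\text{estimation error}}
```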
Prerequisites:
- A solid background in linear algebra, real analysis, and probability/measure theory
- Basic knowledge of (convex) optimization and statistics
Coursework:
- Homework (45%)
- Paper review (45%): You are asked to choose a paper from this paper list and write a review. The review should not only summarize the paper but also identify the novelty and limitations of the result. A good paper review at least attempts to answer the following four questions:
- What is the main result of the paper?
- Why is the result important and significant compared with other papers?
- What is the limitation of the result?
- What is the potential research direction inspired by the paper?
You are required to give a presentation (15%) and submit a 3-page report (30%).
- Scribe notes (10%): You are asked to scribe notes in LaTeX. The scribe notes can be done in pairs. Please use this template:
Collaboration policy: We encourage you to form study groups and discuss the coursework. However, you must write up all coursework from scratch independently, without referring to anyone else's notes.
References:
- Peter Bartlett's course: Statistical Learning Theory
- MIT's course: Statistical Learning Theory
- Mohri's book: Foundations of Machine Learning
- Shai Shalev-Shwartz's book: Understanding Machine Learning: From Theory to Algorithms
Schedule:
- Tue 7/2: Introduction to supervised learning methods
- Thu 7/4: Overview of mathematical theory for neural network models
- Fri 7/5: Rademacher complexity, covering number, metric entropy, and uniform bounds (the basic definition and bound are sketched below)
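As a quick reference for this lecture: for a class of functions taking values in [0, 1], the empirical Rademacher complexity and the uniform bound it yields take the following standard form (a sketch; exact constants vary across references):

```latex
% Empirical Rademacher complexity of \mathcal{F} on a sample S = (x_1, \dots, x_n);
% \sigma_1, \dots, \sigma_n are i.i.d. uniform random signs
\hat{\mathcal{R}}_S(\mathcal{F})
  = \mathbb{E}_{\sigma} \Big[ \sup_{f \in \mathcal{F}} \frac{1}{n} \sum_{i=1}^{n} \sigma_i f(x_i) \Big]

% Uniform bound for f \in \mathcal{F} with values in [0, 1]:
% with probability at least 1 - \delta over the sample,
\sup_{f \in \mathcal{F}} \Big( \mathbb{E}[f] - \frac{1}{n} \sum_{i=1}^{n} f(x_i) \Big)
  \le 2 \hat{\mathcal{R}}_S(\mathcal{F}) + 3 \sqrt{\frac{\log(2/\delta)}{2n}}
```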
- Reproducing kernel Hilbert space and the random feature model (the model is sketched after this block)
- Error estimates for the random feature model with explicit and implicit regularization
- Lecture 5
- The analysis of implicit regularization for the random feature model can be found in this paper
- Learning with SGD and Random Features
- Optimal Rates for the Regularized Least-Squares Algorithm
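For reference, the random feature model discussed in these lectures can be written as follows; a generic sketch with standard notation:

```latex
% Random feature model: features \phi(\cdot\,; w) with w_1, \dots, w_m drawn i.i.d.
% from a fixed distribution \pi; only the coefficients a_1, \dots, a_m are trained
f_m(x; a) = \frac{1}{m} \sum_{j=1}^{m} a_j \, \phi(x; w_j),
  \qquad w_j \overset{\text{i.i.d.}}{\sim} \pi

% As m \to \infty, the model approximates the kernel method with kernel
k(x, x') = \mathbb{E}_{w \sim \pi} \big[ \phi(x; w) \, \phi(x'; w) \big]
```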
- Barron space and regularization theory of two-layer neural networks
- Lecture 6
- Properties of Barron space can be found in Section 2 of this paper
- The a priori estimates of regularized two-layer neural networks can be found in this paper
- The must-read classic paper by Andrew Barron (This is the first paper that provides an approximation rate without the curse of dimensionality; the result is sketched below.)
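For context, Barron's classical result can be stated schematically as follows (a sketch with constants omitted):

```latex
% If f has finite spectral (Barron) norm
%   C_f = \int_{\mathbb{R}^d} \|\omega\| \, |\hat{f}(\omega)| \, d\omega < \infty,
% then there exists a two-layer network f_m with m neurons such that
\| f - f_m \|_{L^2(\mu)}^2 \lesssim \frac{C_f^2}{m}
% The rate O(1/m) does not depend on the input dimension d: this is the sense
% in which the result avoids the curse of dimensionality.
```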
- Implicit regularization for two-layer neural networks
- A priori estimates for regularized deep residual networks
- F-principle and its application in deep learning (Guest speakers: Zhiqin Xu, Yaoyu Zhang, Tao Luo; the principle is sketched after this block)
- An introduction to F-principle Lecture 9.1
- Application of F-principle in learning two-layer neural networks Lecture 9.2
- General theory of F-principle Lecture 9.3
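For these guest lectures, the F-principle (frequency principle) refers to the empirical observation that neural networks trained by gradient descent tend to fit the low-frequency components of the target function before the high-frequency ones. Schematically, in Fourier terms (a sketch, not the speakers' exact formulation):

```latex
% Relative error of the network f_{\theta(t)} against the target f^* at frequency \xi:
e(\xi, t) = \frac{\big| \widehat{f_{\theta(t)}}(\xi) - \widehat{f^*}(\xi) \big|}
                 {\big| \widehat{f^*}(\xi) \big|}
% F-principle: e(\xi, t) decays earlier in training time t at low \|\xi\|
% than at high \|\xi\|.
```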