
Add schedule&runtime tutorial doc #499

Merged: 33 commits merged into open-mmlab:master on Nov 17, 2021

Conversation

@Ezra-Yu (Collaborator) commented Oct 22, 2021

Motivation

Add schedule and runtime tutorial documentation.

Modification

  1. Add schedule and runtime tutorial documentation.
  2. Add api.models.heads link in CN_doc.

Checklist

Before PR:

  • Pre-commit or other linting tools are used to fix the potential lint issues.
  • Bug fixes are fully covered by unit tests, the case that causes the bug should be added in the unit tests.
  • The modification is covered by complete unit tests. If not, please add more unit test to ensure the correctness.
  • The documentation has been modified accordingly, like docstring or example tutorials.

After PR:

  • If the modification has potential influence on downstream or other related projects, this PR should be tested with those projects, like MMDet or MMSeg.
  • CLA has been signed and all committers have signed the CLA in this PR.

codecov bot commented Oct 22, 2021

Codecov Report

Merging #499 (18550a2) into master (dc35eb6) will increase coverage by 0.39%.
The diff coverage is n/a.


```
@@            Coverage Diff             @@
##           master     #499      +/-   ##
==========================================
+ Coverage   79.48%   79.87%   +0.39%
==========================================
  Files         106      107       +1
  Lines        5975     6093     +118
  Branches      968      987      +19
==========================================
+ Hits         4749     4867     +118
+ Misses       1095     1094       -1
- Partials      131      132       +1
```

| Flag | Coverage Δ |
| --- | --- |
| unittests | 79.87% <ø> (+0.39%) ⬆️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ |
| --- | --- |
| mmcls/models/backbones/timm_backbone.py | 78.94% <0.00%> (-2.01%) ⬇️ |
| mmcls/models/heads/cls_head.py | 83.33% <0.00%> (ø) |
| mmcls/models/backbones/__init__.py | 100.00% <0.00%> (ø) |
| mmcls/models/backbones/mlp_mixer.py | 95.45% <0.00%> (ø) |
| mmcls/models/losses/cross_entropy_loss.py | 98.33% <0.00%> (+0.15%) ⬆️ |
| mmcls/apis/inference.py | 20.00% <0.00%> (+0.35%) ⬆️ |
| mmcls/datasets/pipelines/transforms.py | 88.17% <0.00%> (+1.36%) ⬆️ |

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update dc35eb6...18550a2. Read the comment docs.

@Ezra-Yu Ezra-Yu requested a review from mzr1996 October 27, 2021 10:02
Comment on lines 79 to 81
Create the `mmcls/core/optimizer` folder and the `mmcls/core/optimizer/__init__.py` file.
The newly defined module should be imported in `mmcls/core/optimizer/__init__.py` so that the registry will
find the new module and add it:

Users also need to add `from .optimizer import *` into `mmcls/core/__init__.py` to register the optimizer.
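A minimal sketch of what this registration would look like, assuming a hypothetical `MyOptimizer` defined in `mmcls/core/optimizer/my_optimizer.py` (names are illustrative, not from this PR):

```python
# mmcls/core/optimizer/__init__.py
from .my_optimizer import MyOptimizer  # hypothetical custom optimizer

__all__ = ['MyOptimizer']

# mmcls/core/__init__.py
from .optimizer import *  # noqa: F401,F403  (makes the registry see MyOptimizer)
```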

docs/tutorials/customize_runtime.md (outdated, resolved)

The default optimizer constructor is implemented [here](https://github.com/open-mmlab/mmcv/blob/9ecd6b0d5ff9d2172c49a182eaa669e9f27bb8e7/mmcv/runner/optimizer/default_constructor.py#L11),
which could also serve as a template for new optimizer constructor.

The `DefaultOptimizerConstructor` supports `paramwise_cfg`; please add an example of how to use it in the config file.
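For illustration, a hedged example of what such a `paramwise_cfg` snippet could look like in a config file, assuming the installed MMCV supports `norm_decay_mult` and `custom_keys` (values are arbitrary):

```python
# Disable weight decay for normalization layers and use a 10x smaller
# learning rate for all backbone parameters.
optimizer = dict(
    type='SGD',
    lr=0.01,
    momentum=0.9,
    weight_decay=0.0001,
    paramwise_cfg=dict(
        norm_decay_mult=0.0,  # no weight decay on norm layers
        custom_keys={'backbone': dict(lr_mult=0.1)}))  # smaller lr for backbone
```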


## Customize Training Schedules

we use step learning rate with default value in config files, this calls [`StepLRHook`](https://github.com/open-mmlab/mmcv/blob/f48241a65aebfe07db122e9db320c31b685dc674/mmcv/runner/hooks/lr_updater.py#L153) in MMCV.

we -> We


so that 1 epoch for training and 1 epoch for validation will be run iteratively.

:::{note}

Use `` ```{note} `` instead of `:::{note}`.

You can also set the priority of the hook by adding key `priority` to `'NORMAL'` or `'HIGHEST'` as below

'NORMAL' or 'HIGHEST'?
Here is the priority level table, and users can also use a specific value to modify priority finely.

| Level | Value |
| --- | --- |
| HIGHEST | 0 |
| VERY_HIGH | 10 |
| HIGH | 30 |
| ABOVE_NORMAL | 40 |
| NORMAL | 50 |
| BELOW_NORMAL | 60 |
| LOW | 70 |
| VERY_LOW | 90 |
| LOWEST | 100 |
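For instance, a numeric priority could be assigned like this (a sketch with a hypothetical hook name):

```python
# Any integer in [0, 100] is accepted; 35 sits between HIGH (30) and ABOVE_NORMAL (40).
custom_hooks = [dict(type='MyHook', priority=35)]
```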

docs/tutorials/customize_runtime.md (outdated, resolved)
The above-mentioned tutorials already cover how to modify `optimizer_config`, `momentum_config`, and `lr_config`.
Here we show what we can do with `log_config`, `checkpoint_config`, and `evaluation`.

#### Checkpoint config

Please rearrange these sections and rename titles according to the above list.

Comment on lines 140 to 144
Some models need gradient clip to clip the gradients to stabilize the training process. An example is as below:

```python
optimizer_config = dict(grad_clip=dict(max_norm=35, norm_type=2))
```

The `optimizer_config` will be passed to `OptimizerHook`; users may easily confuse it with the `optimizer_cfg` in the optimizer constructor.
Users can also specify different kinds of optimizer hooks here, like `GradientCumulativeOptimizerHook`. Consider adding some introduction here.
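As a sketch of such an introduction, switching to the gradient-cumulative hook could look roughly like this (values are illustrative):

```python
# Accumulate gradients over 4 iterations before each optimizer step,
# giving an effective batch size 4 times larger. grad_clip is still
# accepted because this hook extends OptimizerHook.
optimizer_config = dict(
    type='GradientCumulativeOptimizerHook',
    cumulative_iters=4,
    grad_clip=dict(max_norm=35, norm_type=2))
```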

#### Evaluation config

The config of `evaluation` will be used to initialize the [`EvalHook`](https://github.com/open-mmlab/mmclassification/blob/master/mmcls/core/evaluation/eval_hooks.py).
Except the key `interval`, other arguments such as `metrics` will be passed to the `dataset.evaluate()`

The `EvaluationHook` supports `save_best` now; many users may want this feature, so consider adding an introduction about it.
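A hedged example of what such an introduction might show, assuming the accuracy metric is used:

```python
# Evaluate every epoch and keep the checkpoint with the best accuracy.
evaluation = dict(interval=1, metric='accuracy', save_best='accuracy')
```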

Ezra-Yu and others added 3 commits November 3, 2021 14:55
Co-authored-by: Ma Zerun <mzr1996@163.com>
Co-authored-by: Ma Zerun <mzr1996@163.com>
@Ezra-Yu changed the title from "Add custom runtime tutorial doc" to "Add schedule&runtime tutorial doc" on Nov 5, 2021

### Warmup strategy

For warmup with the step learning rate schedule in the config file, the main parameters are the following:

Missing translation


In academic research and industrial practice, it may be necessary to use optimization methods not implemented by MMClassification, and users can add them through the following methods.

```(note)

Use `` ```{note} `` instead of `` ```(note) ``.

Comment on lines 10 to 12
- [CheckpointSaverHook](#checkpointsaverhook)
- [LoggerHooks](#loggerhooks)
- [EvaluationHook](#evaluationhook)

The TOC's links don't match titles.

@@ -0,0 +1,304 @@
# Tutorial 6: Customize Schedule

In this tutorial, we will introduce some methods about how to construct optimizers, customize learning rate and momentum schedules, use multiple learning rates and weight_decay, gradient clipping, gradient accumulation, and customize self-implemented methods for the project.

The underscore is only used in variable names.

checkpoint_config = dict(interval=1)
```

The users could set `max_keep_ckpts` to save only a small number of checkpoints, or decide whether to store the state dict of the optimizer by `save_optimizer`.

"The users" can be used in comments, which are for developers. In the documentation, just use "we" or "you".
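For reference, a checkpoint config using the options mentioned above might look like this (values are illustrative):

```python
# Save a checkpoint every epoch, keep only the 3 most recent ones,
# and store the optimizer state so training can be resumed later.
checkpoint_config = dict(interval=1, max_keep_ckpts=3, save_optimizer=True)
```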


**After completing your configuration file, you could use the [learning rate visualization tool](https://mmclassification.readthedocs.io/zh_CN/latest/tools/visualization.html#id3) to draw the corresponding learning rate adjustment curve.**

## Use multiple learning rates and weight_decays

Suggested change
## Use multiple learning rates and weight_decays
## Parameter-wise finely configuration

Examples are as follows:

```python
optimizer_config = dict(grad_clip=dict(max_norm=35, norm_type=2))
```

Add a comment about `norm_type`, since it's not self-explanatory.
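A hedged sketch of the kind of comment being asked for:

```python
optimizer_config = dict(
    grad_clip=dict(
        max_norm=35,   # clip gradients whose norm exceeds 35
        norm_type=2))  # measure gradient magnitude with the L2 norm
```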

Comment on lines 186 to 189
When the optimizer hook type is not specified, `OptimizerHook` is used by default, and the above is equivalent to:

```python
optimizer_config = dict(type="OptimizerHook", grad_clip=dict(max_norm=35, norm_type=2))
```
@mzr1996 Nov 8, 2021

This section is not relevant to gradient clipping, move it to where users need to change it.


### Gradient accumulation

When computing resources are lacking, BatchSize can only be set to a small value, which affects the effect of the resulting model. Gradient accumulation can be used to circumvent this problem.

BatchSize? Strange uppercase.

Comment on lines 197 to 207
- CosineAnnealing schedule:

```python
lr_config = dict(
policy='CosineAnnealing',
warmup='linear',
warmup_iters=1000,
warmup_ratio=1.0 / 10,
min_lr_ratio=1e-5)
```


Forgot to replace the example?

@@ -0,0 +1,265 @@
# Tutorial 7: Customize Runtime Settings

In this tutorial, we will introduce some methods about how to customize optimization methods, training schedules, workflow and hooks when running your own settings for the project.

How to customize optimization methods has been moved to tutorial 6


```{note}
1. The parameters of model will not be updated during val epoch.
2. Keyword `total_epochs` in the config only controls the number of training epochs and will not affect the validation workflow.

total_epochs or max_epochs?


## Customize Workflow

By default, we recommend users to use **`EvaluationHook`** to do evaluation after training epoch, but they can still use `val` workflow as an alternative.

This line should be moved after the introduction.
And consider adding a note to remind users that modifying workflow is unnecessary in most situations.

workflow = [('train', 1)]
```

which means running 1 epoch for training.

Watch out for the uppercase.
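For context, the alternative `val` workflow mentioned earlier would be configured roughly like this (a sketch, not text from the PR):

```python
# Run 1 training epoch followed by 1 validation epoch, repeatedly.
workflow = [('train', 1), ('val', 1)]
```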


The hook mechanism is widely used in the OpenMMLab open source algorithm library. Combined with the `Runner`, the entire life cycle of the training process can be managed easily. You can learn more about the hook through [related article](https://www.calltutors.com/blog/what-is-hook/).

Hooks only work when they are registered in the constructor. At present, hooks are mainly divided into two categories:

Suggested change
Hooks only work when they are registered in the constructor. At present, hooks are mainly divided into two categories:
Hooks only work after being registered into the runner. At present, hooks are mainly divided into two categories:
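As a hedged illustration of registering a hook into the runner through the config, assuming a hypothetical `MyHook`:

```python
# e.g. mmcls/core/utils/my_hook.py (hypothetical location; the module must be
# imported somewhere so the registry can find the class)
from mmcv.runner import HOOKS, Hook

@HOOKS.register_module()
class MyHook(Hook):
    def before_run(self, runner):
        runner.logger.info('MyHook is registered and active.')

# In the config file:
custom_hooks = [dict(type='MyHook', priority='NORMAL')]
```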

```{note}
1. In the default configuration files of MMClassification, the evaluation field is generally placed in the datasets configs.

2. 'EvalHook' in 'MMClassification/mmcls/core/evaluation/eval_hooks.py' will be deprecated, recommend to use 'EvaluationHook' in MMCV as above.

This note can be removed because config file modification is not relevant to which implementation is used.


### Use implemented hooks

Some hooks have been already implemented in MMCV 和 MMClassification:

和 -> and


- load_from : only imports model weights, which is mainly used to load pre-trained or trained models;

- resume_from : not only import model weights, but also optimizer information, current epoch information, mainly used to continue training from the breakpoint.

breakpoint -> checkpoint
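For reference, these settings appear in a config roughly as follows (paths are illustrative):

```python
# Initialize from pre-trained weights only:
load_from = 'checkpoints/resnet50_pretrained.pth'

# Or resume an interrupted run, restoring optimizer state and the current epoch:
resume_from = 'work_dirs/my_experiment/latest.pth'
```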


- resume_from : not only import model weights, but also optimizer information, current epoch information, mainly used to continue training from the breakpoint.

- init_cfg.Pretrained : load the model weight, and you can specify a specific ‘key’ layer to load.

Suggested change
- init_cfg.Pretrained : load the model weight, and you can specify a specific ‘key’ layer to load.
- init_cfg.Pretrained : Load weights during weight initialization, and you can specify which module to load. This is usually used when fine-tuning a model.

- init_cfg.Pretrained : load the model weight, and you can specify a specific ‘key’ layer to load.

```{note}
It is recommended to specify pre-training weights using init_cfg.Pretrained when fine-tuning the model.

Add a link to the fine-tuning tutorial.

@Ezra-Yu Ezra-Yu requested a review from mzr1996 November 12, 2021 05:13
@@ -200,6 +200,15 @@ momentum_config = dict(
optimizer_config = dict(grad_clip=dict(max_norm=35, norm_type=2))
```

When inheriting from and modifying a base config, if `grad_clip=None` in the base config, you need to add `_delete_=True`. For more about `_delete_`, refer to [Tutorial 1: How to Write Config Files](https://mmclassification.readthedocs.io/zh_CN/latest/tutorials/config.html#id16). An example is as follows:

“参靠”, “案列” (typos; should be “参考” and “案例”)
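For context, such an override would presumably look like the sketch below (not the exact example from the PR; the base config name is hypothetical):

```python
_base_ = ['./resnet50_b32x8_imagenet.py']  # hypothetical base config with grad_clip=None

# _delete_=True replaces the whole optimizer_config from the base config
# instead of merging into it.
optimizer_config = dict(
    _delete_=True, grad_clip=dict(max_norm=35, norm_type=2))
```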

@mzr1996 mzr1996 left a comment


LGTM

@mzr1996 mzr1996 merged commit 771d105 into open-mmlab:master Nov 17, 2021
@Ezra-Yu Ezra-Yu deleted the schedule branch July 18, 2022 08:45
mzr1996 added a commit to mzr1996/mmpretrain that referenced this pull request Nov 24, 2022
* add cn tutorials/config.md

* add heads api and doc title link

* Update tutorials index

* Update tutorials index

* Update config.md

* add english version

* Update config.md

* add custom_runtime

* Update docs

* modify title

* modify en to zh_CN in chinses docs

* Update Readme

* fix punctuations

* Update docs/tutorials/customize_runtime.md

Co-authored-by: Ma Zerun <mzr1996@163.com>

* Update docs/tutorials/customize_runtime.md

Co-authored-by: Ma Zerun <mzr1996@163.com>

* split to schedule and runtime

* fix lint

* improve docs after review

* fix TOC

* imporve expersion

* fix an error

* Imporve schedule.md

* Improve runtime.md

* Improve chinese docs.

* Fix toc-tree

* fix en link and add a case of gradient clipping

* fix wrong word

Co-authored-by: Ma Zerun <mzr1996@163.com>