
model.train(False) affects gradient tracking? #2230

Closed
MaverickMeerkat opened this issue Mar 2, 2023 · 3 comments · Fixed by #2372
Assignees
Labels
docathon-h1-2023 A label for the docathon in H1 2023 easy intro question

Comments

@MaverickMeerkat

MaverickMeerkat commented Mar 2, 2023

In this tutorial, a code comment says "# We don't need gradients on to do reporting". From what I understand, the train flag only affects layers such as dropout and batch normalization. Does it also affect gradient calculations, or is the comment wrong?

# We don't need gradients on to do reporting

cc @suraj813
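A minimal sketch of the distinction being asked about (the model architecture here is a hypothetical stand-in, not the tutorial's model): `model.train(False)` only changes layer behavior, while `torch.no_grad()` is what actually disables gradient tracking.

```python
import torch
import torch.nn as nn

# Hypothetical toy model, just to illustrate the flag semantics.
model = nn.Sequential(nn.Linear(4, 8), nn.Dropout(0.5), nn.Linear(8, 1))
x = torch.randn(2, 4)

# model.train(False) (equivalent to model.eval()) only switches layer
# behavior (dropout is disabled, batchnorm uses running stats);
# autograd still records operations on the forward pass.
model.train(False)
out = model(x)
print(out.requires_grad)  # True: the graph is still built

# torch.no_grad() is what actually disables gradient tracking.
with torch.no_grad():
    out = model(x)
print(out.requires_grad)  # False: no graph was built
```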

@chedatomasz

This comment, and the accompanying code, is wrong. A more correct pattern is shown at https://pytorch.org/tutorials/beginner/basics/optimization_tutorial.html#full-implementation, although that one omits the train/eval switches and thus does not generalize to models with batch normalization or dropout.
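A sketch of a validation loop combining both switches, under the assumption that `model`, `loader`, and `loss_fn` are the usual training-loop objects (the names are illustrative, not the tutorial's):

```python
import torch

def validate(model, loader, loss_fn):
    model.eval()               # switch dropout/batchnorm to eval behavior
    total_loss = 0.0
    with torch.no_grad():      # additionally skip building autograd graphs
        for inputs, targets in loader:
            outputs = model(inputs)
            total_loss += loss_fn(outputs, targets).item()
    model.train()              # restore training behavior afterwards
    return total_loss / len(loader)
```

Using both together generalizes to any model: `eval()` handles layer semantics, `no_grad()` saves the memory and compute of graph construction.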

@chedatomasz

Related issues: #2083 #1756 #208

@svekars svekars added easy docathon-h1-2023 A label for the docathon in H1 2023 labels May 31, 2023
@JoseLuisC99
Contributor

/assigntome

carljparker pushed a commit that referenced this issue Jun 1, 2023
* refactored train loop in trainingyt.py, resolves issue #2230

* Simplified numpy function call, resolves issue #1038