-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Switch Apex with Pytorch #336
Conversation
Signed-off-by: Jason <jasoli@nvidia.com>
…o_torch Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 1 alert when merging 675b0fe into 18b528e - view on LGTM.com fixed alerts:
|
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 1 alert when merging d535720 into 18b528e - view on LGTM.com fixed alerts:
|
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 1 alert when merging a9be01a into 18b528e - view on LGTM.com fixed alerts:
|
Signed-off-by: Jason <jasoli@nvidia.com>
|
This pull request introduces 1 alert and fixes 1 when merging 71d4bff into 4f299f4 - view on LGTM.com new alerts:
fixed alerts:
|
This pull request introduces 1 alert and fixes 1 when merging 64ccf26 into c6a3cdd - view on LGTM.com new alerts:
fixed alerts:
|
64ccf26
to
71d4bff
Compare
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request introduces 1 alert and fixes 1 when merging 7235317 into c6a3cdd - view on LGTM.com new alerts:
fixed alerts:
|
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 1 alert when merging f627f44 into 54a8e9e - view on LGTM.com fixed alerts:
|
This pull request fixes 1 alert when merging b85cb40 into 54a8e9e - view on LGTM.com fixed alerts:
|
…o_torch Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 1 alert when merging f1a57bb into 403238f - view on LGTM.com fixed alerts:
|
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 1 alert when merging dabaea2 into 403238f - view on LGTM.com fixed alerts:
|
Signed-off-by: Jason <jasoli@nvidia.com>
…o_torch Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 2 alerts when merging a71da02 into f072029 - view on LGTM.com fixed alerts:
|
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 2 alerts when merging 390c410 into f072029 - view on LGTM.com fixed alerts:
|
Signed-off-by: Jason <jasoli@nvidia.com>
This pull request fixes 2 alerts when merging 8c26247 into f072029 - view on LGTM.com fixed alerts:
|
* cli: use non-zero exit status for error scenarios Invoking sys.exit() with no args results in an exit status of zero, which traditionally indicates success. This is not appropriate for error scenarios. Signed-off-by: Daniel P. Berrangé <berrange@redhat.com> * lab: exit program after catching and reporting exceptions Creating the app failed with an exception raised due to a missing model file. Execution carried on, however, leading to use of an undefined 'app' variable. Fix this, and a couple of other places which also caught (unrecoverable) exceptions and forgot to exit. Signed-off-by: Daniel P. Berrangé <berrange@redhat.com> --------- Signed-off-by: Daniel P. Berrangé <berrange@redhat.com>
This PR switches from apex's DistributedDataParallel to torch's DistributedDataParallel.
Warning: that gradient_predivide_factor is no longer working after this switch
Warning: in multi-gpu runs, neural modules with no weights MUST inherit from NonTrainableNM.