Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implementation of Early Stopping for DDP training #8345

Merged
merged 10 commits into from
Jun 29, 2022

Commits on Jun 26, 2022

  1. Implementation of Early Stopping for DDP training

    This edit correctly uses the broadcast_object_list() function to send slave processes a boolean so to end the training phase if the variable is True, thus allowing the master process to destroy the process group and terminate.
    giacomoguiduzzi committed Jun 26, 2022
    Configuration menu
    Copy the full SHA
    6dcc47b View commit details
    Browse the repository at this point in the history

Commits on Jun 27, 2022

  1. Configuration menu
    Copy the full SHA
    17c56a8 View commit details
    Browse the repository at this point in the history

Commits on Jun 28, 2022

  1. Configuration menu
    Copy the full SHA
    9f6d318 View commit details
    Browse the repository at this point in the history
  2. Update train.py

    glenn-jocher committed Jun 28, 2022
    Configuration menu
    Copy the full SHA
    4aa4305 View commit details
    Browse the repository at this point in the history
  3. Update train.py

    glenn-jocher committed Jun 28, 2022
    Configuration menu
    Copy the full SHA
    953aaa3 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    0e87ad6 View commit details
    Browse the repository at this point in the history
  5. Update train.py

    glenn-jocher committed Jun 28, 2022
    Configuration menu
    Copy the full SHA
    d6ad680 View commit details
    Browse the repository at this point in the history
  6. Update train.py

    glenn-jocher committed Jun 28, 2022
    Configuration menu
    Copy the full SHA
    39c1f11 View commit details
    Browse the repository at this point in the history
  7. Update train.py

    glenn-jocher committed Jun 28, 2022
    Configuration menu
    Copy the full SHA
    227a77a View commit details
    Browse the repository at this point in the history
  8. Further cleanup

    This cleans up the definition of broadcast_list and removes the requirement for clear() afterward.
    glenn-jocher committed Jun 28, 2022
    Configuration menu
    Copy the full SHA
    58bc763 View commit details
    Browse the repository at this point in the history