Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: backend support for inference metric tracking part 1 #7375

Merged
merged 48 commits into from
Jul 26, 2023
Merged

Conversation

tayritenour
Copy link
Contributor

@tayritenour tayritenour commented Jul 12, 2023

Description

MLG-637

Design Doc: https://hpe.sharepoint.com/:w:/t/detai/EVAlau0Y2exMgXyX63inrmEB7rUFHEfB8bDUmdLTxJ4Bjw?e=ZjAZJ7&clickparams=eyJBcHBOYW1lIjoiVGVhbXMtRGVza3RvcCIsIkFwcFZlcnNpb24iOiIyOC8yMzA1MDEwMDQyMiIsIkhhc0ZlZGVyYXRlZFVzZXIiOmZhbHNlfQ%3D%3D

Adds the ability to keep track of what checkpoints were used in a given trial. When leveraged with the new generic metrics APIs, this will allow us to track and find the metrics for all inference runs that utilize a given checkpoint.

Related PRs:

Test Plan

The integration tests pass. This isn't integrated fully for the user to see in a release party.

Commentary (optional)

Checklist

  • Changes have been manually QA'd
  • User-facing API changes need the "User-facing API Change" label.
  • Release notes should be added as a separate file under docs/release-notes/.
    See Release Note for details.
  • Licenses should be included for new code which was copied and/or modified from any external code.

Ticket

@tayritenour tayritenour requested a review from a team as a code owner July 12, 2023 22:59
@tayritenour tayritenour requested review from carolinaecalderon and removed request for a team July 12, 2023 22:59
@cla-bot cla-bot bot added the cla-signed label Jul 12, 2023
@netlify
Copy link

netlify bot commented Jul 12, 2023

Deploy Preview for determined-ui canceled.

Name Link
🔨 Latest commit f88ad56
🔍 Latest deploy log https://app.netlify.com/sites/determined-ui/deploys/64c17e9f83ba900008c63a67

@tayritenour tayritenour requested a review from ioga July 12, 2023 23:58
harness/determined/determined.code-workspace Outdated Show resolved Hide resolved
Comment on lines +61 to +62
for _, val := range trialIds {
if err := CanGetTrialsExperimentAndCheckCanDoAction(ctx, val.TrialID,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

if there's a ton of trial ids, it may be more optimal to join trial_source_info to trials to experiments, get workspace ids, and check permissions once per workspace id.
I won't insist on adding it right now, but perhaps as a TODO or something to keep an eye on from perf perspective.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added a comment about this

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we also usually want a ticket associated with these inline todos?

proto/src/determined/api/v1/trial.proto Outdated Show resolved Hide resolved
proto/src/determined/trial/v1/trial.proto Outdated Show resolved Hide resolved
@tayritenour tayritenour requested a review from a team as a code owner July 20, 2023 23:55
@determined-ci determined-ci added the documentation Improvements or additions to documentation label Jul 20, 2023
@determined-ci
Copy link
Collaborator

determined-ci commented Jul 20, 2023

Hello! DesignKit diffs for commit b6163b2 are available for you to view here

@determined-ci determined-ci removed the documentation Improvements or additions to documentation label Jul 21, 2023
Copy link
Contributor

@hamidzr hamidzr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@tayritenour asked I take a quick look. I commented on some bits that jumped out w/o fully understanding everything.

proto/src/determined/trial/v1/trial.proto Outdated Show resolved Hide resolved
master/internal/trials/utils.go Show resolved Hide resolved
Comment on lines +61 to +62
for _, val := range trialIds {
if err := CanGetTrialsExperimentAndCheckCanDoAction(ctx, val.TrialID,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we also usually want a ticket associated with these inline todos?

master/internal/api_model.go Show resolved Hide resolved
@tayritenour tayritenour enabled auto-merge (squash) July 26, 2023 20:56
@tayritenour tayritenour merged commit d29d49d into main Jul 26, 2023
17 of 19 checks passed
@tayritenour tayritenour deleted the MLG-637 branch July 26, 2023 23:47
@dannysauer dannysauer added this to the 0.24.0 milestone Feb 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants