-
Notifications
You must be signed in to change notification settings - Fork 348
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: backend support for inference metric tracking part 1 #7375
Conversation
✅ Deploy Preview for determined-ui canceled.
|
for _, val := range trialIds { | ||
if err := CanGetTrialsExperimentAndCheckCanDoAction(ctx, val.TrialID, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if there's a ton of trial ids, it may be more optimal to join trial_source_info to trials to experiments, get workspace ids, and check permissions once per workspace id.
I won't insist on adding it right now, but perhaps as a TODO or something to keep an eye on from perf perspective.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added a comment about this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we also usually want a ticket associated with these inline todos?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@tayritenour asked I take a quick look. I commented on some bits that jumped out w/o fully understanding everything.
for _, val := range trialIds { | ||
if err := CanGetTrialsExperimentAndCheckCanDoAction(ctx, val.TrialID, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we also usually want a ticket associated with these inline todos?
Description
MLG-637
Design Doc: https://hpe.sharepoint.com/:w:/t/detai/EVAlau0Y2exMgXyX63inrmEB7rUFHEfB8bDUmdLTxJ4Bjw?e=ZjAZJ7&clickparams=eyJBcHBOYW1lIjoiVGVhbXMtRGVza3RvcCIsIkFwcFZlcnNpb24iOiIyOC8yMzA1MDEwMDQyMiIsIkhhc0ZlZGVyYXRlZFVzZXIiOmZhbHNlfQ%3D%3D
Adds the ability to keep track of what checkpoints were used in a given trial. When leveraged with the new generic metrics APIs, this will allow us to track and find the metrics for all inference runs that utilize a given checkpoint.
Related PRs:
Test Plan
The integration tests pass. This isn't integrated fully for the user to see in a release party.
Commentary (optional)
Checklist
docs/release-notes/
.See Release Note for details.
Ticket