feat: backend support for inference metric tracking part 1 #7375

tayritenour · 2023-07-12T22:59:04Z

Description

Design Doc: https://hpe.sharepoint.com/:w:/t/detai/EVAlau0Y2exMgXyX63inrmEB7rUFHEfB8bDUmdLTxJ4Bjw?e=ZjAZJ7&clickparams=eyJBcHBOYW1lIjoiVGVhbXMtRGVza3RvcCIsIkFwcFZlcnNpb24iOiIyOC8yMzA1MDEwMDQyMiIsIkhhc0ZlZGVyYXRlZFVzZXIiOmZhbHNlfQ%3D%3D

Adds the ability to keep track of what checkpoints were used in a given trial. When leveraged with the new generic metrics APIs, this will allow us to track and find the metrics for all inference runs that utilize a given checkpoint.

Related PRs:

Test Plan

The integration tests pass. This isn't integrated fully for the user to see in a release party.

Commentary (optional)

Checklist

Changes have been manually QA'd
User-facing API changes need the "User-facing API Change" label.
Release notes should be added as a separate file under docs/release-notes/.
See Release Note for details.
Licenses should be included for new code which was copied and/or modified from any external code.

Ticket

…e metrics

…ueries with bun

netlify · 2023-07-12T22:59:13Z

✅ Deploy Preview for determined-ui canceled.

Name	Link
🔨 Latest commit	`f88ad56`
🔍 Latest deploy log	https://app.netlify.com/sites/determined-ui/deploys/64c17e9f83ba900008c63a67

harness/determined/determined.code-workspace

ioga · 2023-07-20T22:22:41Z

master/internal/trials/api_trial_source_info.go

+	for _, val := range trialIds {
+		if err := CanGetTrialsExperimentAndCheckCanDoAction(ctx, val.TrialID,


if there's a ton of trial ids, it may be more optimal to join trial_source_info to trials to experiments, get workspace ids, and check permissions once per workspace id.
I won't insist on adding it right now, but perhaps as a TODO or something to keep an eye on from perf perspective.

Added a comment about this

do we also usually want a ticket associated with these inline todos?

proto/src/determined/api/v1/trial.proto

proto/src/determined/trial/v1/trial.proto

determined-ci · 2023-07-20T23:58:30Z

Hello! DesignKit diffs for commit b6163b2 are available for you to view here

hamidzr

@tayritenour asked I take a quick look. I commented on some bits that jumped out w/o fully understanding everything.

proto/src/determined/trial/v1/trial.proto

master/internal/trials/utils.go

hamidzr · 2023-07-21T21:46:15Z

master/internal/trials/api_trial_source_info.go

+	for _, val := range trialIds {
+		if err := CanGetTrialsExperimentAndCheckCanDoAction(ctx, val.TrialID,


do we also usually want a ticket associated with these inline todos?

master/internal/trials/api_trial_source_info.go

master/internal/api_model.go

tayritenour added 19 commits June 26, 2023 10:42

basic db migrations necessary for saving trial_source_info

63bf1cb

wip protobuf

ec77c19

wip, adding stuff to protobuf, supporting model_versions

6906e51

wip setting up the protobuf

37df6ce

wip getting the endpoints lined up

841dd82

wip, trying to add the create trial info endpoint

ab41f26

backend for the CreateTrialSourceInfo

1371b13

migrating query to bun

0c5b367

wip, adding the ability to query by the trial source info and populat…

deb4156

…e metrics

more attempts to use the metrics endpoint correct and fully run the q…

ae7f3d3

…ueries with bun

work in progress

4b04626

better testing, more stable endpoints

b9be74d

start on SDK integration

f7bbf29

Merge branch 'main' into MLG-637

bb2241c

make sure db migrations are in order again

8bed4bf

touch ups

f6213ac

stabilizing

1a8651b

Merge remote-tracking branch 'upstream/main' into MLG-637

aef59b9

remove comment

54f62f6

tayritenour requested a review from a team as a code owner July 12, 2023 22:59

tayritenour requested review from carolinaecalderon and removed request for a team July 12, 2023 22:59

cla-bot bot added the cla-signed label Jul 12, 2023

tayritenour added 5 commits July 12, 2023 16:02

fix proto linting

9edef0d

comments in proto

e1fac69

make the proto checks happy

ddcd5ff

go back to using the db.GetMetrics the original way

e60c096

one more try

7d780a1

tayritenour requested a review from ioga July 12, 2023 23:58

ioga reviewed Jul 20, 2023

View reviewed changes

merge main, remove vscode stuff

7f4ea25

tayritenour force-pushed the MLG-637 branch from 42c6b72 to 7f4ea25 Compare July 20, 2023 23:55

tayritenour requested a review from a team as a code owner July 20, 2023 23:55

determined-ci added the documentation Improvements or additions to documentation label Jul 20, 2023

tayritenour added 5 commits July 20, 2023 17:05

update names for the model version ids

b6163b2

adding comment warning about RBAC checks

0b720ac

format the proto

557385e

rebuild proto

74c1cfd

Merge branch 'main' into MLG-637

14e3ebe

determined-ci removed the documentation Improvements or additions to documentation label Jul 21, 2023

ioga approved these changes Jul 21, 2023

View reviewed changes

hamidzr reviewed Jul 21, 2023

View reviewed changes

tayritenour added 8 commits July 24, 2023 17:36

testing for authZ stuff

986167f

don't commit this thing

3a206d9

for some reason, the linter is okay with this

53d7bc2

make linter happy

87f503f

reverting changes to testing structure

0bb72eb

fixing very bizarre mock issues

27e1da6

Merge branch 'main' into MLG-637

e01b7b0

removing some invalid comments

d545ea7

hamidzr approved these changes Jul 26, 2023

View reviewed changes

tayritenour added 2 commits July 26, 2023 13:07

Merge branch 'main' into MLG-637

3efc5ad

update the migration

f88ad56

tayritenour enabled auto-merge (squash) July 26, 2023 20:56

tayritenour disabled auto-merge July 26, 2023 23:36

tayritenour merged commit d29d49d into main Jul 26, 2023
17 of 19 checks passed

tayritenour deleted the MLG-637 branch July 26, 2023 23:47

dannysauer added this to the 0.24.0 milestone Feb 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: backend support for inference metric tracking part 1 #7375

feat: backend support for inference metric tracking part 1 #7375

tayritenour commented Jul 12, 2023 •

edited by jira bot

Loading

netlify bot commented Jul 12, 2023 •

edited

Loading

ioga Jul 20, 2023

tayritenour Jul 21, 2023

hamidzr Jul 21, 2023

determined-ci commented Jul 20, 2023 •

edited

Loading

hamidzr left a comment

hamidzr Jul 21, 2023

		for _, val := range trialIds {
		if err := CanGetTrialsExperimentAndCheckCanDoAction(ctx, val.TrialID,

feat: backend support for inference metric tracking part 1 #7375

feat: backend support for inference metric tracking part 1 #7375

Conversation

tayritenour commented Jul 12, 2023 • edited by jira bot Loading

Description

Test Plan

Commentary (optional)

Checklist

Ticket

netlify bot commented Jul 12, 2023 • edited Loading

✅ Deploy Preview for determined-ui canceled.

ioga Jul 20, 2023

Choose a reason for hiding this comment

tayritenour Jul 21, 2023

Choose a reason for hiding this comment

hamidzr Jul 21, 2023

Choose a reason for hiding this comment

determined-ci commented Jul 20, 2023 • edited Loading

hamidzr left a comment

Choose a reason for hiding this comment

hamidzr Jul 21, 2023

Choose a reason for hiding this comment

tayritenour commented Jul 12, 2023 •

edited by jira bot

Loading

netlify bot commented Jul 12, 2023 •

edited

Loading

determined-ci commented Jul 20, 2023 •

edited

Loading