Skip to content

Releases: determined-ai/determined

0.29.0

05 Mar 18:48
Compare
Choose a tag to compare

Release Notes

0.29.0

Changelog

  • 5079570 chore: bump version: 0.29.0-rc4 -> 0.29.0
  • 8fa5b5a docs: add release notes for 0.29.0 (#8955)
  • fffde7f chore: bump version: 0.29.0-rc3 -> 0.29.0-rc4
  • f939a0f fix: no data plot in chart with data (#8935)
  • 5a74e37 build: bump ci cpu image to latest ubuntu 2004 (#8940)
  • ad84759 build: bump up ci setup_remote_docker version (#8942)
  • f0d9768 fix: malformed config with gcp up with --initial-user-password (#8936)
  • 2a61ab3 chore: bump version: 0.29.0-rc2 -> 0.29.0-rc3
  • 435e90a chore: fix mp.pool test_streaming_metrics_api (#8917)
  • 18e2ea4 chore: bump version: 0.29.0-rc1 -> 0.29.0-rc2
  • 799373f fix: slurm launcher authenticates preemption notification (#8928)
  • 641174c tc: Add release note 8851 (#8864)
  • f275252 chore: bump up ebs size to 400gb for genai deployments
  • 8c855b7 fix: SSO button link target (#8925)
  • 8d4acd5 tc: Remove broken link (#8924)
  • 06875df fix: canonicalize master urls shim code (#8919)
  • e5ae865 chore: bump version: 0.29.0-rc0 -> 0.29.0-rc1
  • b847ede chore: bump version: 0.29.0-dev0 -> 0.29.0-rc0
  • 28c385c chore: lock published urls to preserve redirects
  • cbfd3c2 chore: lock api state for backward compatibility check
  • b30f609 chore: bump version: 0.28.2-dev0 -> 0.29.0-dev0
  • ad94c17 fix: return error from websocket handler if socket id is taken (#8877)
  • 4618389 style: update genai logo on sidebar (#8907)
  • 8f82087 test: fix tensorboard reattach k8s flake [RM-39] (#8906)
  • d24b19a test: unquarantine deploy-local tests (#8896)
  • 7c6bec9 chore: refactor proto, schema, and jobservice for multiRM (#8875)
  • ca96da1 fix: Genai helm service fix (#8885)
  • a89e51e fix: trial comparison text overflow bug fix (#8869)
  • 9817a4d chore: add trigger to abort checkpoint deletion (#8878)
  • 2689b0b chore: delete unused functions [RM-41] (#8888)
  • 9a6afd2 docs: Organize docs (#8898)
  • a8ac657 chore: small build system fixes (#8900)
  • fa98bf3 fix: add missing ci context to preview cluster
  • b15d508 fix: add deploy last main missing ci context (#8892)
  • b47b477 chore: cleanup stray comments (#8889)
  • ae08265 feat: force default user passwords for all det deploy and CI clusters [RM-28] (#8851)
  • be1ab85 fix: unnecessary group related api calls during the initial group page loading (#8882)
  • f37bc3e fix: move e2e_tests changes for slurm test from EE to OSS (#8887)
  • 93ced86 fix: add missing check for external sessions on exp launch (#8859)
  • 944732a ci: more e2e test fixes (#8881)
  • ab9505c ci: fix e2e tests in ee (#8880)
  • 0bc3106 docs: Add llm blog link to home page (#8874)
  • c029327 docs: add link checker utility (#8738)
  • e1da471 chore: api's default retry now session's default retry (#8872)
  • 7bb9dbc chore: master config updates for multirm [RM-3, RM-4, RM-5, RM-7, RM-29] (#8831)
  • f101f3d chore: add allocation info for cluster ui [DET-10018] (#8616) (#8876)
  • 72d54be chore: canonicalize master urls everywhere [MLG-878] (#8670)
  • e3709bd chore: document internal api errors (#8865)
  • 27a279e fix: e2e CPU tests have wrong maxSlotsPerPod number (#8870)
  • 03b9b30 chore: bunify postgres_jobs.go (#8858)
  • e9ac112 build(deps): bump peter-evans/create-or-update-comment from 3 to 4 (#8760)
  • dc3e41e Fix broken links (#8825)
  • bccdf0c fix: stop allowing multi-container allocations to launch in single agent config (#8833)
  • a1214d7 chore: add allocation info for cluster ui [DET-10018] (#8616)
  • 76ec233 chore: refactor a bunch of auth-related python (#8347)
  • 66b1e6c chore: bump version: 0.28.1-dev -> 0.28.2-dev0
  • f250ad9 chore: add docs dropdown link for new version
  • 9d44ca1 docs: add release notes for 0.28.1 (#8861)
  • ac8c440 fix: allow experiments to configure k8s sidecars (#8854)
  • d07ec40 ci: fix broken ci due to queue version change (#8853)
  • c656aac chore: use npm build for hew (#8845)
  • 6b63750 feat: add a master API to fetch a trial by external id. (#8730)
  • e78a4c0 fix: correctly source bucket region when using minio (#8850)
  • dba5f0f fix: replace react-window with react-virtuoso in transfer component (#8800)
  • 2a183da ci: fix performance feature branch using wrong db (#8835)
  • 47061fa fix: revert config work from #8765 and #8789 due to feature regressions (#8849)
  • a5f38cb chore: remove GetAllocationSummary from RM interface (#8846)
  • de28a57 chore: cover postgres_jobs.go (#8841)
  • ba8250a chore: update backend coverage target (#8798)
  • 556639d fix: show error message from backend API for workspace deletion (#8848)
  • 08dfa43 fix: job queue test failures (#8843)
  • 876f9c3 chore: configure agent log level through config file (#8819)
  • ba03375 chore: move project id onto runs (#8794)

0.28.1

20 Feb 22:58
Compare
Choose a tag to compare

Release Notes

0.28.1

Changelog

  • f6cb624 chore: bump version: 0.28.1-rc3 -> 0.28.1
  • baaa3bd docs: add release notes for 0.28.1 (#8861)
  • fbf9df4 chore: bump version: 0.28.1-rc2 -> 0.28.1-rc3
  • a965f15 ci: fix broken ci due to queue version change (#8853)
  • d91e8b0 chore: bump version: 0.28.1-rc1 -> 0.28.1-rc2
  • 3129d33 fix: revert config work from #8765 and #8789 due to feature regressions (#8849)
  • 1888b90 chore: bump version: 0.28.1-rc0 -> 0.28.1-rc1
  • c443073 fix: show error message from backend API for workspace deletion (#8848)
  • 1fc1496 fix: job queue test failures (#8843)
  • a74685f chore: bump version: 0.28.1-dev -> 0.28.1-rc0
  • 5b2e32d chore: cleanup the last traces of experiment git fields. [MD-258] (#8830)
  • 92a380f feat: Generic task restore (#8802)
  • b0fa7dc feat: generic tasks: support startup hooks (#8840)
  • ca80022 chore: bunify postgres_checkpoints and add tests (#8783)
  • a4dbc03 chore: fix error on terminating experiments on restart (#8837)
  • aa98d82 chore: agent state wasn't getting deleted and logged error (#8838)
  • bb469fa fix: update hew with bugfixes (#8839)
  • 393cfde Fix broken ref (#8836)
  • 7a13863 perf: improve GetExperiments + SearchExperiments counting (#8801)
  • d8d9965 chore: remove unused SetAllocationName (#8829)
  • 1946d9a docs: Update slurm install (#8832)
  • 1fd21e7 fix: Fix small typo in Webhook documentation (#8820)
  • e341e27 feat: Generic Tasks (#8724)
  • fff85e3 fix: handle helm templating in older go template versions (#8828)
  • f300d97 chore: hide genai helm values config and fix var name (#8821)
  • 6206bde feat: add streaming updates core functionality and project streaming (#8669)
  • ed61121 fix: stop truncating log timestamps to avoid missing logs [WEB-1791] (#8815)
  • 43d3f21 fix: check for models before deleting workspace (#8804)
  • bb59fa2 ci: wait longer for performance test db to startup (#8796)
  • cfffe96 docs: Remove legacy pages (#8818)
  • 1c3f3c4 fix: mitigate many unnecessary api calls in user management table (#8816)
  • 4612c41 fix: agent config precedence (#8656)
  • 762fcef feat: Deploy GenAI in Helm (#8727)
  • 8e067d9 fix: remove possible hang from ship_logs.py [MLG-1565] (#8803)
  • 1daf9d3 docs: remove duplicated note (#8813)
  • 56e7000 fix: remove extra quotes around IdentifyTask (#8792)
  • 3805ebd chore: add testing for k8s informer panic (#8810)
  • a35696d refactor: condense trial update functions (#8808)
  • 45c578b chore: bump version: 0.28.0-dev0 -> 0.28.1-dev
  • 6520629 chore: add docs dropdown link for new version
  • ed2136d docs: add release notes for 0.28.0 (#8807)
  • 8258565 chore: bump version: 0.27.2-dev0 -> 0.28.0-dev
  • c5afb6c fix: fetch experiment in case config data is not contained (#8789)
  • 4e17ef7 chore: differentiate between programmatic and web page requests (#8795)
  • a1a6e20 chore: add ee helm chart changes to oss (#8799)
  • 65c811c docs: Add mention of RPMs to on-prem _index.rst (#8773)
  • ad765d4 docs: adds/corrects EE changes, merges to OSS (#8788)
  • f1a45ae perf: update proto_checkpoint_view to use index (#8793)
  • abd590d Revert "docs: Update oidc and saml docs (#8777)" (#8791)
  • bb88b01 fix: improve trial log request cancelling (#8787)
  • 17f305f ci: make perf tests only alert on failure (#8790)
  • 422f5aa perf: avoid loading model def in experiment model (#8742)
  • 7698452 perf: improve GetExperiments showTrialData performance (#8753)
  • e801cfe perf: add index to checkpoints_v2 id (#8758)
  • 71db4e1 perf: add indexes to tasks and allocations (#8757)
  • e0e6cf0 perf: improve get_workspaces query (#8751)
  • ef656bc perf: improve resource agg performance (#8735)
  • e873381 fix: retry watcher failure causes infinite loop (#8786)
  • ba2f190 fix: replace experiment config (#8765)
  • 85d1053 chore: rename postgres_command_intg_test.go (#8785)
  • 40a70cf test: performance test CI work (#8761)
  • 36a2e29 chore: bunify db/postgres_tasks.go (#8764)
  • 07494cf fix: update hew to a version without broken documentcard prompts (#8782)
  • 9502059 feat: GCS client should retry on TooManyRequests. (#8780)
  • ec850ae test: add intg tests for db/postgres_tasks.go (#8750)
  • 9ec2f7d chore: update gke version to comply with latest release for e2e tests (#8781)
  • cefa242 chore: persist checkpoint storage backend ID (#8690)
  • 905e449 chore: migrate db schema trials to runs (#8723)
  • dfbb926 chore: clean up leftover debug print statements (#8755)

0.28.0

06 Feb 20:56
Compare
Choose a tag to compare

Release Notes

0.28.0

Changelog

  • 7f9b082 chore: bump version: 0.28.0-rc4 -> 0.28.0
  • ed1b7f0 docs: add release notes for 0.28.0 (#8807)
  • c4b6f57 chore: bump version: 0.28.0-rc3 -> 0.28.0-rc4
  • 27ce0a2 chore: add ee helm chart changes to oss (#8799)
  • 959a096 chore: add ee helm chart changes to oss (#8799)
  • f513174 chore: bump version: 0.28.0-rc2 -> 0.28.0-rc3
  • 083c314 chore: bump version: 0.28.0-rc1 -> 0.28.0-rc2
  • e272bf0 chore: bump version: 0.28.0-rc0 -> 0.28.0-rc1
  • 6080d39 chore: bump version: 0.27.2-rc4 -> 0.28.0-rc0
  • 3cbce1d chore: bump version: 0.27.2-rc3 -> 0.27.2-rc4
  • 89df98b docs: adds/corrects EE changes, merges to OSS (#8788)
  • 2e27c71 chore: bump version: 0.27.2-rc2 -> 0.27.2-rc3
  • 1abe34f chore: bump version: 0.27.2-rc1 -> 0.27.2-rc2
  • e23e162 fix: improve trial log request cancelling (#8787)
  • 55b5bd4 chore: bump version: 0.27.2-rc0 -> 0.27.2-rc1
  • 5edfd81 fix: retry watcher failure causes infinite loop (#8786)
  • 6a21d44 fix: update hew to a version without broken documentcard prompts (#8782)
  • 74e341d chore: bump version: 0.27.2-dev0 -> 0.27.2-rc0
  • ea9e903 chore: lock published urls to preserve redirects
  • 0321e1f chore: lock api state for backward compatibility check
  • 3783f2b docs: Update oidc and saml docs (#8777)
  • 141afa4 docs: update dependency version in contributing readme (#8776)
  • 994527f fix: Text filter on ProjectMoveModal (#8775)
  • aa65c07 chore: use vite-plugin-svg-to-jsx package (#8772)
  • 98c61f3 test: do not import model_hub test requirements (#8771)
  • 1e2da10 ci: retry git fetch for early stopping checks (#6318)
  • c73712b docs: Replace basic quickstart (#8770)
  • fda515d fix: python requirements for pytest and moto (#8769)
  • 78929c0 fix: Filter value resets when switching column types [WEB-1949] (#8731)
  • 7ddf965 docs: Fix minor issues (#8768)
  • 31f6f99 fix: add default transport to proxy connection (#8767)
  • 149b7fa build(deps): bump slackapi/slack-github-action from 1.24.0 to 1.25.0 (#8766)
  • 719169a docs: Fix dropdown url (#8763)
  • 56406a2 Update helm chart config ref (#8762)
  • 5973f8e chore: bump version: 0.27.1-dev0 -> 0.27.2-dev0
  • 260c2bc chore: add docs dropdown link for new version
  • 4b4d14a docs: add release notes for 0.27.1 (#8746)
  • 7841d9e feat: the new quick start guide link (#8759)
  • 995311a feat: expconf flag to force scheduling on a single node/container/pod (#8743)
  • 64d588f refactor: use hew Tree and Divider components [WEB-1920] (#8736)
  • f771acb fix: cease many model fetch api calls in checkpoint tab (#8749)
  • 96b9064 docs: Add qs for webui users (#8754)
  • d68ffaa docs: API deprecate returning config for bulk endpoints (#8732)
  • f21a516 tests: cover queries inside internal/users/postgres_users.go (#8729)
  • 90a57cb fix: Experiment table, right-click context menu [WEB-1942] (#8756)
  • 2ffc18f chore: import missing EE helm chart change [ci skip] (#8747)
  • 87b6cf3 fix: use the new genai docker repo (#8745)
  • 7c3650f chore: make devcluster to rebuild bindings before harness and webui. (#8748)
  • 7f3ddfb feat: Add a modal to enable/disable Agents [WEB-1718] (#8721)
  • bd0a9ea fix: pagination fix in model detail page (#8744)
  • 43c074e feat: helm option to mount shared_fs checkpoints to master (#8741)
  • 9f06d35 fix: use selected checkpoints when registering (#8739)
  • 6db8c06 test: cover agent_state.go SQL queries (#8740)
  • eb48302 test: cover db.GroupCheckpointUUIDsByExperimentID (#8508)
  • d661404 fix: compress data from API for the page load performance improvement (#8720)
  • b71da7a fix: batch metric writes to TensorBoard [MLG-990] (#8688)
  • bd78ec1 feat: Preserve 'redirect' query during logout [GAS-489] (#8728)
  • 62941a2 refactor: remove antd App component [WEB-1922] (#8713)
  • bf5b1d1 chore: fix unused-imports warning in protos build. (#8726)
  • 0bda0d9 fix: use hew Alert [WEB-1918] (#8711)
  • 1c21f6a chore: Move from internal glide-table-grid to v6.0.0 [WEB-1945] (#8725)
  • f32e015 fix: local checkpoint download path fix (#8722)
  • 190af1d docs: [FE-270] add PBS known issue - Cluster tab does not display GPU information (#8719)
  • 6d744f7 feat: content-length for tar checkpoint downloads (#8684)
  • 11e3ba9 chore: upgrade vitest@1.2.1 (#8718)
  • 92fe3a6 docs: [FE-269] Add documentation detailing configuration steps to set the values for ngpus. (#8714)
  • b69a49c chore: update github path in docker docs (#8687)
  • faea553 chore: codecov reports to match go coverage reports (#8696)
  • 0782c35 chore: standardize oidc/saml group & display attribute names in helm config (#8689)
  • acca434 chore: update oss/ee oidc & saml helm config (#8680)
  • 7188b69 fix: use Hew dropdown on FilterGroup [WEB-1938] (#8715)
  • a410c45 chore: Upgrade to vite 5 (#8676)
  • dbeb458 fix: support CommandState for experiment icon (#8709)
  • 83fe474 docs: fix references on children of "training reference" root (#8708)
  • 71eaa5a chore: Replace antd reset.css with modern-normalize (#8706)
  • e0e08b6 fix: Update hew for chart fix, avoid error from Typography.Label (#8712)
  • fef93a4 build(deps): bump actions/cache from 3 to 4 (#8710)
  • 4aedded docs: Update docs to pass linter (#8705)
  • 00c2746 fix: restore original user store poll on leaving workspace details (#8702)
  • 73760cd Revert "docs: Update docs to pass linter" (#8704)
  • 4b7b705 [docs] Update docs to pass linter (#8703)
  • 2402133 docs: Update Docker Installation Instructions (#8659)
  • f8a2434 docs: Update Linux distros, add WSL, and archs to Quickstart (#8662)
  • 132919f docs: Overhaul WSL deployment instructions (#8658)
  • e7dc7aa chore: Replace custom archived note with Hew badge (#8695)
  • f2899cc fix: fix CreateExperiment for Remote Users (#8700)
  • f00768f chore: remove unused files (#8698)
  • 2e60167 chore: TrialsComparisonModal style fixes [WEB-1919] [WEB-1909] (#8674)
  • e8d6448 Revert "fix: restore original user store poll on leaving workspace details"
  • 6d3f9ff fix: playwright fix (#8699)
  • c869ce7 fix: restore original user store poll on leaving workspace details

0.27.1

24 Jan 23:32
e05d57d
Compare
Choose a tag to compare

Release Notes

0.27.1

Changelog

0.27.0

10 Jan 15:22
Compare
Choose a tag to compare

Release Notes

0.27.0

Changelog

0.26.7

18 Dec 20:21
Compare
Choose a tag to compare

Release Notes

0.26.7

Changelog

  • a125dd4 chore: bump version: 0.26.7-rc1 -> 0.26.7
  • cc161d5 docs: add release notes for 0.26.7 (#8601)
  • ed4d11a chore: bump version: 0.26.7-rc0 -> 0.26.7-rc1
  • a138b0a fix: Change slot number back to 2 in Keras example (#8595)
  • 47c885b chore: fix api state (#8589)
  • dc175a1 chore: bump version: 0.26.7-dev0 -> 0.26.7-rc0
  • 0584703 chore: lock published urls to preserve redirects
  • 40d6c32 chore: lock api state for backward compatibility check
  • 6e224ce chore: add docs dropdown link for new version
  • e1c1749 fix(tasks): persist rendezvous readiness (#8545)
  • 150eae4 chore: bump version: 0.26.6-dev0 -> 0.26.7-dev0
  • 660de26 feat: Update the "Continue Single trial experiment" workflow (#8526)
  • 940d1d7 fix: master cannot download s3 from us-east-1 (#8558)
  • dac3fe8 build: avoid installing playwright by make build (#8569)
  • 005634b ci: unit-test login scenarios, and others (#8471)
  • 4a808c3 chore: remove --dry-run option from det deploy aws [MLG-983] (#8542)
  • aa71929 chore: implement GetSlot, GetSlots, and GetAgent for K8s rm (#8464)
  • 791e532 chore: release notes for 0.26.6 (#8566)
  • 51f9f71 chore: fix bad arg validation for det deploy aws up (#8576)
  • e5566ee docs: Rename submit experiment (#8570)
  • c05eab1 chore: include cluster name in stack deletion confirmation (#8575)
  • 2a55fc1 docs: Update oidc group claim name notes (#8573)
  • c9ba474 chore: add lore help and experimental disclaimer (#8563)
  • 8527011 feat: filter out inactive users from list and add option to see all users (#8421)
  • 431dec0 chore: Model metadata sections move to Surface [WEB-1813] (#8557)
  • 6263695 Revert "docs: Add a link to mldes trial (#8561)" (#8572)
  • 3371746 chore: Port smaller Antd.Modals to standard Hew Modal (#8567)
  • 799d6d4 ci: make backend own codeowner of go.mod / go.sum (#8568)
  • 6232ce2 chore: bump version: 0.26.5-dev0 -> 0.26.6-dev0
  • faa9dee docs: add release notes for 0.26.5 (#8564)
  • 05c5320 test: store test results for test-e2e-gke. (#8560)
  • 921826a docs: Fix workspaces projects left nav (#8565)
  • 4bccbce fix: prevent workspace list race condition (#8524)
  • ef9a686 test: add more Go db tests to internal/db (#8553)
  • 17666a4 docs: Add a link to mldes trial (#8561)
  • f461c74 Mention det python sdk demo (#8550)
  • 8d3f6fb chore: Rename show_ssh_command in CLI and add message for VSCode WSL Users (#8387)
  • f988566 chore: replace readme logo with svg [WEB-314] (#8559)
  • f543c41 chore: remove OIDC Config from OSS (#8534)
  • 12e31af build(deps): bump golang.org/x/net from 0.7.0 to 0.17.0 in /master (#8124)
  • f9fc1d1 docs: Describe remote user management webui (#8479)
  • bb37b53 test: skip all failing mmdetection tests. (#8540)
  • 3fff399 test: fix failing nightly test splits (#8549)
  • ff97304 test: add more tests to db/postgres_experiments.go (#8537)
  • c08613a fix: correct docs link in empty project [WEB-1879] (#8548)
  • b9caae5 docs: Improve webhooks tasklog (#8544)
  • 816d203 chore: Use Hew SplitPane [WEB-1682] (#8482)
  • 76bdfe9 test: make DET_MASTER configurable in the perf test Makefile. (#8533)
  • a6ae0c7 feat: add the slots property to the props (#8498)
  • ef4b787 fix(api): delete experiment error handling corrections (#8510)
  • 203477d chore: remove CI go unit tests and rename CI integration target (#8530)
  • 65227fa ci: add Go coverage regex match so we can require some functions to be tested (#8514)
  • 22af6f7 fix: allow slots per trial to be 0 [WEB-1871] (#8521)
  • 0771185 test: add more Go db tests (#8519)
  • 1c9575f fix: callback webhook action modals to re-fetch webhooks after successful call [WEB-1869] (#8522)
  • 3d18c13 fix: trial spinner (#8528)
  • ad55716 feat: Enable Retry for multi-trial exp with errored trials (#8518)
  • 88784cc fix: Model delete redirect only needed if inside the model [WEB-1867] (#8512)
  • 9a09d15 feat: add lore redirect (#8492)
  • 6b4f52e test: remove cifar10_tf_keras tests and examples. (#8444)
  • 0590e80 fix: ensure filter columns are valid when selecting special columns (#8517)
  • 9f14c6e fix: typo (#8516)
  • 0d59196 chore: add ElementsMatch to usergroup test (#8501)
  • 5fff579 feat: add experiment id to SDK Trial objects (#8499)
  • 8e6252f refactor: replace resource pool looping with direct call (#8503)
  • b37403f fix: fetch projects after archive/unarchive (#8504)
  • 7da2777 chore(deps): bump actions/setup-java from 3 to 4 (#8507)
  • 7f5de0f fix(agentrm): resource pools must filter agents.list() by name (#8509)
  • e33ebb7 chore: remove defunct yogadl dependency (#8450)
  • 72ef508 test: fix failing test_delete_experiment_removes_tensorboard_files (#8511)
  • 9abdbf1 fix: Overwrite omnibar antd modal with Hew standard modal [WEB-1830] (#8476)
  • 17f1efb chore: re-enable CI metrics upload (#8506)
  • c8804e5 chore: take out submodule update cmd (#8502)
  • 42b38fb feat: Enable multi-trial "retry" for errored/cancelled exp (#8495)
  • 0b855c7 feat: hide hyperparameter search for unmanaged multi tiral experiments (#8497)
  • 7077ab4 fix: Update Trial Download Error Message (#8398)
  • 05e5974 ci: add date to python cache, fix moto linting issue (#8493)
  • bf79a43 ci: fix Go build cache by not making it run get-deps (#8483)
  • ec90590 feat: allow creation of tasklog webhooks in webui + docs (#8434)
  • 324f148 chore: add UpdateUsergroupMembership (#8489)
  • 66b7225 chore: run migration moving trials to view and rename [DET-9989] (#8440)

0.26.6

08 Dec 03:19
Compare
Choose a tag to compare

Release Notes

0.26.6

Changelog

  • 3946169 chore: bump version: 0.26.5 -> 0.26.6

0.26.5

08 Dec 02:12
Compare
Choose a tag to compare

Release Notes

0.26.5

Changelog

  • cfa7730 chore: bump version: 0.26.5-rc3 -> 0.26.5
  • 2755e5e docs: add release notes for 0.26.5 (#8564)
  • 617fb0c chore: bump version: 0.26.5-rc2 -> 0.26.5-rc3
  • 5eff3b4 chore: bump version: 0.26.5-rc1 -> 0.26.5-rc2
  • 54aa7a8 feat: add the slots property to the props (#8498)
  • 52adafb fix: allow slots per trial to be 0 [WEB-1871] (#8521)
  • 6eb6bf0 fix(api): delete experiment error handling corrections (#8510)
  • 54f2cec fix: trial spinner (#8528)
  • 5cfc4cc chore: bump version: 0.26.5-rc0 -> 0.26.5-rc1
  • 5f4a7f4 fix: ensure filter columns are valid when selecting special columns (#8517)
  • 189b4c1 fix(agentrm): resource pools must filter agents.list() by name (#8509)
  • f47bde5 ci: add date to python cache, fix moto linting issue (#8493)
  • 22d6d25 chore: bump version: 0.26.5-dev0 -> 0.26.5-rc0
  • 275ea84 chore: lock api state for backward compatibility check
  • cea9ff4 fix: Experiment state now is an ExperimentState (#8457)
  • 06b7b79 fix: tqdm logs within wrap_rank [MLG-1236] (#8488)
  • f50f7db fix: use explicit e.state in bulk experiment delete query (#8491)
  • 0f65698 fix(rm): tasks shouldn't hang on restore failures (#8486)
  • f847b26 chore: Revert "test: quarantine GPU execution of test_task_logs (#8261)" (#8484)
  • f085e10 fix: failing custom searcher due to ExtraEnvVars being overwritten (#8490)
  • 88f64f6 chore: update libraries (#8463)
  • 2e48a6f chore: migrate detaileduser and experimentitem types to io-ts (#8477)
  • d8ed945 chore: add aliases to det dev commads (#8156)
  • a2730cb feat: adding PACHD_ADDRESS and DEX_TOKEN to task env (#8473)
  • 837bc29 chore: Update Hew Version to 0.6.12 (#8481)
  • 1ee9b81 chore: clean up version dropdown update script (#8415)
  • 398f879 docs: Add requirement and known issue for singularity-suid (#8478)
  • 64e299e fix: wrong skip experiment config regex for log policies (#8475)
  • 0b4e1d2 chore: cleanup some spurious cluster logs (#8468)
  • 8c9dfbf fix: add delete cascade to generic metrics (#8469)
  • 0b28148 ci: register unit pytest marks (#8470)
  • 815f5ae fix: Kill task permission on interactive page (#8358)
  • e67807d chore: preserve CI logs when bringing an AWS cluster down (#8461)
  • 50d40bd chore: update trial complete or early exit to always notify searcher (#8466)
  • 6bdf061 Update k8s install info (#8465)
  • e06b472 chore: export AddUserTx (#8458)
  • 16ea5a0 chore: introduce and use observables with improved update checking (#8405)
  • 5805df2 chore: set up ownership for .circleci [skip ci] (#8402)
  • c2a211d fix(api): handle delete experiment failures correctly (#8459)
  • dfb4dc5 chore(actors): remove pkg/actor (#8452)
  • 167e237 chore: add error check to KillNTSC (#8441)
  • 4edbc7f chore: log RestoreAllCommands error (#8454)
  • 9d71abd Fix minor issues including hard coded reference (#8427)
  • 756a79c chore: bump CI node version to 20.9.0 (#8455)
  • 3090e42 chore: use ResourcePool info for consistent capacity calculation [WEB-1796] (#8447)
  • 192a2b3 chore(actors): remove pkg/actor usage from agentrm (#8395)
  • 5ded38d chore: bump version: 0.26.4-dev0 -> 0.26.5-dev0
  • 4339f67 docs: add release notes for 0.26.4 (#8451)
  • a25e4f5 fix: add back pin icon in experiment list header (#8429)
  • adb5191 ci: store npm log artifacts (#8449)
  • e29006b fix: det slot task name for no-permissions RBAC users (#8416)
  • 695a648 fix: SDK list_checkpoints not defaulting to searcher metric sort (#8448)
  • 1ddad7d feat: add Topology into the RP details page (#8276)
  • 2f7dda6 ci: cache install Python (#8426)
  • cde18df fix: Calculate allocation bar stats same as overview [WEB-1822] (#8431)
  • 9a48ff1 docs: Update upgrade instructions (#8346)
  • e9a199a fix: k8s autoscaling nodes not counted towards RP (#8439)
  • bd19e7a chore: command actor refactor & add intg test [DET-9660] (#8136)
  • 99c4cea test: create a test for delete-tensorboards via det e delete (#8336)
  • 7b7d1eb feat: Add remote user settings to Users table [WEB-1798] (#8397)
  • 8051039 ci: fix linting with responses==0.24.1 (#8436)
  • b087c10 chore: add version dropdown url for previous release (#8437)
  • 8e9c505 test: fix model registry rbac wrong user regression (#8420)
  • 292c75d fix: new experiment list tooltip styling (#8433)
  • 1b47bf0 ci: delete broken fixture (#8428)
  • bf07e61 fix: Wrap older modals in theme class [WEB-1824] (#8432)
  • 0c8fad9 chore: filterformstore comment re: change tracking (#8386)
  • 1041e56 fix: replace antd select with hew select (#8424)
  • aa34aa7 feat: add workspace/project creation/deletion (#8430)
  • 2e0a5a2 feat: client gets list_models, too. (#8425)
  • 69df80e Revert "feat: Client gets list_models, too."
  • d1343ca feat: Client gets list_models, too.
  • a1e660d chore: converting SearchGroupsWithoutPersonalGroups into tx (#8419)
  • 6ba688d test: fix TestAddAndRemoveBindings flake (#8423)
  • d4c4195 chore: update Column and Row from Hew (#8412)
  • d1d09e7 docs: Update non root container instructions (#8273)

0.26.4

17 Nov 21:45
Compare
Choose a tag to compare

Release Notes

0.26.4

Changelog

  • bf665ae chore: bump version: 0.26.4-rc4 -> 0.26.4
  • 2f86950 docs: add release notes for 0.26.4 (#8451)
  • f2ef0fe chore: bump version: 0.26.4-rc3 -> 0.26.4-rc4
  • f0a37a9 fix: Calculate allocation bar stats same as overview [WEB-1822] (#8431)
  • 9acfbf2 chore: bump version: 0.26.4-rc2 -> 0.26.4-rc3
  • 9dd0211 fix: k8s autoscaling nodes not counted towards RP (#8439)
  • 3bf6647 chore: bump version: 0.26.4-rc1 -> 0.26.4-rc2
  • 47397a4 fix: new experiment list tooltip styling (#8433)
  • 680ac02 ci: fix linting with responses==0.24.1 (#8436)
  • d4200d2 chore: add version dropdown url for previous release (#8437)
  • 0a4b6bc test: fix model registry rbac wrong user regression (#8420)
  • 242ff97 fix: Wrap older modals in theme class [WEB-1824] (#8432)
  • e3c109a chore: bump version: 0.26.4-rc0 -> 0.26.4-rc1
  • 22e18ae fix: replace antd select with hew select (#8424)
  • 00d349a feat: add workspace/project creation/deletion (#8430)
  • 9f727fe feat: client gets list_models, too. (#8425)
  • b8c1be7 chore: update Column and Row from Hew (#8412)
  • 4e6fd52 chore: bump version: 0.26.4-dev0 -> 0.26.4-rc0
  • e9a457d chore: lock published urls to preserve redirects
  • 2fae9ba chore: add docs dropdown link for new version
  • 6c3bf84 chore: make insert-dropdown-url.sh executable (#8418)
  • b5ca7f4 chore: fail deployment if launching part of the service fails (#8409)
  • 8498674 fix: allow --json in det master config CLI command (#8413)
  • d123932 fix: Place modal inside of ResourcePoolCard (#8414)
  • ff19924 chore: Add eslint rule for ?? operator (#8410)
  • d56b3ae chore: convert DOS line endings to Unix (#8411)
  • c1219eb fix: Hide stats card when 0 on cluster page (#8359)
  • da77efb fix: added permission check on GetAllocation (#8281)
  • 3b0550c chore: Bumpenvs 0.26.4 (#8407)
  • e48d03d fix: user flag to prompt for password during user requests (#8158)
  • 513e6d7 fix: Project and Workspace cards wrap modal divs (#8378)
  • 2497d84 chore: export AddUserTx (#8403)
  • ad764f0 refactor: implement Glossary component from Hew (#8385)
  • 52326d1 feat: change cli command for patch master log config DET[9720] (#8054)
  • 1e9155d chore(type): stricter tsconfig (#8349)
  • 16f18cc chore: revert task obfuscation lint failures (#8406)
  • dde3156 chore: Implement Theming updates in Determined [WEB-1726] (#8388)
  • 4edfc3c ci: move packaging test to test-e2e-longrunning (#8381)
  • d3c208a ci: cache go modules deps and build cache (#8383)
  • 8924996 chore: temporarily disable CI upload job (#8399)
  • 356f651 Revert "chore: temporarily disable upload_test_results job step"
  • 6dd9701 chore: temporarily disable upload_test_results job step
  • ba49dbd ci: up parallelism for slowest test_e2e premerge tests (#8374)
  • 5f3e556 ci: finish removing growforest (#8389)
  • 62084e2 fix: NTSC task and slot viewing obscured for RBAC users with no Viewer Permissions (#8311)
  • 0254f7d chore: fix nil ptr on allocation.Proto() (#8372)
  • 119e759 chore: fix profiler test in CI (#8382)
  • b428d5e feat: add hide column header menu item to explist (#8342)
  • 7ae0501 chore: update the lore service port (#8375)
  • 052cf8d feat: Cluster historical usage charts move to UI Kit LineChart [WEB-1786] [WEB-1764] (#8327)
  • 819948d feat: clear filter from experiment table header (#8376)
  • a590999 test: fix slow delete_checkpoint test (#8377)
  • b0505db chore: Job/task displays Running instead of Scheduled (#8335)
  • 1d64941 chore: short dsat e2e tests (#8288)
  • 6afa836 chore: fix CI mnist_pytorch (#8364)
  • 4d3eaab chore: Update Horovod Cycle Time (#8362)
  • d3b01cb docs: Add det pach tutorial (#8082)
  • 7cebc30 fix: adjust card size on workspaces page (#8370)
  • 5c93cb0 chore: enable more Go linters (#8333)
  • a279967 fix: aws deployment can deploy priority scheduler (#8345)
  • 3d9293c fix: fixed bug in error handling in experiment.go (#8339)
  • 194bfd5 fix: Cell can be undefined in experiment list table (#8360)
  • 1da92aa chore: bump environment images to ubuntu 18.04 [MLG-1194] (#8356)
  • 990c56f chore: add list_experiments to experimental.client (#8361)
  • 3a7d9ea fix(tests): lower e2e_gpu_quarantine parallelism (#8363)
  • 4c48458 fix: patched remote users were able to login with password (#8337)
  • baf5c96 chore: port over PyTorch example to use Trainer API [MLG-1181] (#8292)
  • 235bd8f feat: delete TB files from the SDK (#8329)
  • 2fe3d99 chore: update Typography from UI kit (#8323)
  • 2b23674 fix: prevent carriage return in env from crashing deepspeed launcher (#8321)
  • 461c307 chore: Remove DesignKit since it's now maintained in Hew [WEB-1790] (#8338)
  • 5ee87ec fix: Set group name and number columns to handle Safari [DET-9948] [DET-9949] (#8355)
  • 10deef9 fix(experiments): transient errors shouldn't leave trial hung (#8352)
  • 512b9f3 chore: remove accidental mock commit (#8354)
  • 9d17dbf feat: Show "-" for null values in data cells for experiment list (#8343)
  • ea50987 fix: properly interpret flag values (#8326)
  • 8b6fc68 fix: Allow SAML and OIDC logins to work differently [WEB-1797] (#8308)
  • 274288e docs: fix linting failure (#8351)
  • 73bf0e8 docs: log policies (#8302)
  • 8418029 chore: ft slot capacity check for each trial [DET-9897] (#8213)
  • 494ca57 fix: replace TODO with ctx for deleteTensorboard (#8332)
  • cfde2f6 docs: Docs Version Dropdown Automation (#8340)
  • 8e69941 chore: Remove examples/legacy (#8153)
  • af995ba fix: cli is not a library! (#7891)
  • bf0a03d test: fix ray.air.session import. (#8344)
  • 9bb10cc ci: mypy fix for responses>=0.24.0 (#8341)
  • b924b25 fix: add pin icon in dropdown (#8324)
  • 62b7f3b chore: remove fit-content from TimeAgoc (#8328)
  • 86d6962 chore: update determined-ui to hew (#8334)
  • f580385 fix: metric group charts have more than one color (#8304)
  • 1966373 feat: Add tensorboard delete command to CLI (#8227)
  • 656c8b2 chore: bump version: 0.26.3-dev0 -> 0.26.4-dev0
  • af43248 docs: add release notes for 0.26.3 (#8322)
  • b262a3d chore: Update lore.yaml to use the new version
  • d64a0ac chore: use a single .golangci.yml file (#8320)
  • ad94d20 chore: Add progress bar from UI Kit [WEB-1675] (#8181)
  • b3b5be0 feat: implement CodeSample from UI Kit [WEB-1677] (#8270)
  • d723b7f docs: fix typo in user edit release note (#8319)
  • 6e5d840 chore: initial experiment actor refactor (#8229)
  • 8a1ff58 chore: use a single root level go mod (#8285)
  • 3511abf chore: delete dead code (#8313)
  • d0e6375 chore: add a new deployment type for aws (#8279)
  • 3929e8c chore(actors): remove ctx usage in agent_state.go (#8267)
  • 5bf1b87 ci: delete broken wait_for helper (#8312)
  • 50535f1 test: quarantine GPU execution of test_task_logs (#8261)
  • d5b8e80 chore: deployment's --dry-run option doesn't print template (#8303)
  • ac89d44 fix: allow experiments with directory checkpoint storage to parse (#8310)
  • 306c0c3 fix: Project info not presists when forking (#8307)
  • dc1b131 chore: sort out issues after bringing EE e2e_tests into OSS (#8084)
  • d182abe chore: slurm support for blocklist (#1111)
  • efdf62b fix: return correct location URL for /Users SCIM API endpoint (#1115)
  • 37a84d1 fix: ruamel.yaml fixes for EE
  • 0ce925a chore: Update nightly tests that use legacy cifar10_pytorch (#1102)
  • 6cad296 fix: update for error message change in product (#1098)
  • 1e302b5 chore: update e2e tests affected by examples_pruning (#1100)
  • ad3dcda chore: cleanup model registry rbac test
  • a3ffb5d test: enable command run tests for PBS (#1073)
  • 9dd0e42 test: enable command and deepspeed tests run on slurm/pbs (#1044)
  • ea4f4c4 chore(templates): ee fixes for template rbac
  • c48e48d fix: Test test_slurm_verify_home fails with podman and it shouldn't [FE-136] (#1028)
  • 760a738 test: Add pytorch2 distributed e2e tests on slurm [FE-168] (#1007)
  • b5aee79 chore: use longer running no op experiment when seeding workspace (#994)
  • facbda9 test: run test_hpc_job_pending_reason only on gcp vm (#996)
  • 393d0b5 ci: FE-133 Configure non agent slurm/pbs tests to skip without explicitly listing test names in circleci. (#977)
  • fd15535 ci: add ee-only files to the import-restrictions linter exclusions.
  • 9cf2a26 test: slurm/pbs test for pending reason (FE-90) (#960)
  • b3c2ca3 chore(actors): allocation.go, ee side
  • eb7d1a1 test: [ALLGCP] Add e2e test for HPC that verifies that user HOME is preserved (#972)
  • f3a8b0e test: fix test_slurm.py lint error (#949)
  • 71896f3 chore: FE-91: Update base images (slurm/pbs) to include a populated singularity_image_cache (#943)
  • 5891567 feat: add rbac to api/v1/master/config [DET-9633] (#931)
  • 0a5c32e ci: FE-72: Add test-e2e-pbs-*-gcp tests (#941)
  • 4c233c0 feat: add rbac for strict job queue control (#927)
  • 6e23aa2 chore: removed admin dependency from delete model/version (#912)
  • 7c6c59e feat: rbac for templates (#909)
  • e89cc08 ci: DET 9622: (ee) test_slurm.py::test_cifar10_pytorch_distributed failures (#919)
  • c6ee094 fix: test_rbac goes to wrong url (#918)
  • 6b34e0d fix: DET-9483 successfully run e2e_slurm_preemption tests as part of nightly workflow (#903)
  • 4f6277d ci: FE-14 Migrate test-e2e-slurm to GCP slurmcluster (#879)
  • f4507f7 tests: fix a miss indentation leading to missing project err (#878)
  • 5cad9bb chore: fix a missing check for global permissions in jq (#874)
  • bca3848 feat: add rbac support for reading job queue (#871)
  • 5c79474 chore: update how we wait for tasks to be ready (#863)
  • b292862 test: fix test_master_host [DET-9482]. (#851)
  • 5375a08 ci: quarantine flaky slurm tests (#850)
  • 8e44c6c fix: Patch groups test [DET-9473] (#845)
  • 49d2e08 fix: fix bug with launching tensorboards on trials (#842)
  • d4dcbe5 test: Fix and add e2e_slurm_preemption tests to nightly workflow [D...
Read more

0.26.3

03 Nov 19:50
Compare
Choose a tag to compare

Release Notes

0.26.3

Changelog