[ML] Update trained model inference endpoint #556

valeriy42 · 2023-07-07T11:25:10Z

The infer trained model deployment API has been deprecated, so I changed the code to use the infer trained model API that replaces it.
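For context, the deprecated endpoint and its replacement differ only in the REST path. A minimal sketch of the two paths as documented in the Elasticsearch reference; the helper names and the model ID are hypothetical illustrations, not code from eland:

```python
from urllib.parse import quote


def deprecated_infer_path(model_id: str) -> str:
    """Path of the deprecated infer trained model deployment API."""
    return f"/_ml/trained_models/{quote(model_id, safe='')}/deployment/_infer"


def infer_path(model_id: str) -> str:
    """Path of the infer trained model API that replaces it."""
    return f"/_ml/trained_models/{quote(model_id, safe='')}/_infer"


if __name__ == "__main__":
    # "my-model" is a placeholder model ID used only for illustration.
    print(deprecated_infer_path("my-model"))
    print(infer_path("my-model"))
```

Both requests are sent with `POST`; only the `deployment/` path segment is dropped in the newer endpoint.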

droberts195 · 2023-07-07T11:43:39Z

README.md

-[API](https://www.elastic.co/guide/en/elasticsearch/reference/current/start-trained-model-deployment.html)
-you will be able to set the threading options to make best use of your
+[API](https://www.elastic.co/guide/en/elasticsearch/reference/current/infer-trained-model.html)
+you will be able to set the threading options to make the best use of your


This change isn't necessary. The start trained model deployment API still exists, and is the one where threading options are set.
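To illustrate where the threading options live, here is a minimal sketch that builds a start trained model deployment request. The `number_of_allocations` and `threads_per_allocation` query parameters come from the Elasticsearch reference; the helper name and the model ID are hypothetical, and the defaults mirror the one-allocation, one-thread behaviour the README describes for `--start`:

```python
from urllib.parse import quote, urlencode


def start_deployment_request(
    model_id: str,
    number_of_allocations: int = 1,
    threads_per_allocation: int = 1,
) -> tuple[str, str]:
    """Return the HTTP method and URL for starting a model deployment."""
    query = urlencode(
        {
            "number_of_allocations": number_of_allocations,
            "threads_per_allocation": threads_per_allocation,
        }
    )
    path = f"/_ml/trained_models/{quote(model_id, safe='')}/deployment/_start"
    return "POST", f"{path}?{query}"


if __name__ == "__main__":
    # Placeholder model ID; two allocations with four threads each.
    print(start_deployment_request("my-model", 2, 4))
```

The inference endpoint takes no such options, which is why the README sentence has to keep pointing at the start API.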

droberts195 · 2023-07-07T11:55:32Z

tests/ml/pytorch/test_pytorch_model_upload_pytest.py

+
+        self.quantize = (
+            True
+            if not (platform.system() == "Darwin" and platform.machine() == "arm64")


The changes in this file should go in a separate PR. It will be hard to search for them in the future if they're in a PR that's about updating the model inference endpoint.

davidkyle

Thanks for fixing the test

davidkyle · 2023-07-07T11:59:48Z

README.md

@@ -244,8 +244,8 @@ command line and instead start the model using the ML UI in Kibana.
 The `--start` argument will deploy the model with one allocation and one
 thread per allocation, which will not offer good performance. When starting
 the model deployment using the ML UI in Kibana or the Elasticsearch
-[API](https://www.elastic.co/guide/en/elasticsearch/reference/current/start-trained-model-deployment.html)
-you will be able to set the threading options to make best use of your
+[API](https://www.elastic.co/guide/en/elasticsearch/reference/current/infer-trained-model.html)


Suggested change

-[API](https://www.elastic.co/guide/en/elasticsearch/reference/current/infer-trained-model.html)
+[API](https://www.elastic.co/guide/en/elasticsearch/reference/current/start-trained-model-deployment.html)

This is referring to the start API

davidkyle · 2023-07-07T12:08:42Z

eland/ml/exporters/_sklearn_deserializers.py

-                ("n_node_samples", "<i8"),
-                ("weighted_n_node_samples", "<f8"),
-            ],
+            dtype={


We should update the scikit-learn version in requirements-dev.txt and setup.py; otherwise these tests will fail for anyone on an older version.

Currently it is:

scikit-learn>=0.22.1,<2

Suggest:

scikit-learn>=1.3,<2
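As a rough illustration of what the suggested pin accepts, the sketch below checks a version string against `>=1.3,<2`. The function names are hypothetical, and the check deliberately ignores pre-release and local version suffixes, which pip treats more carefully:

```python
import re


def release_tuple(version: str) -> tuple[int, int]:
    """Extract the (major, minor) pair from a version string such as '1.3.0'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def satisfies_suggested_pin(version: str) -> bool:
    """True if the version falls within scikit-learn>=1.3,<2 (simplified)."""
    return (1, 3) <= release_tuple(version) < (2, 0)


if __name__ == "__main__":
    for candidate in ("0.22.1", "1.2.2", "1.3.0", "2.0.0"):
        print(candidate, satisfies_suggested_pin(candidate))
```

Under the current pin, `0.22.1` would be installable but falls outside the suggested range.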

valeriy42 · 2023-07-10T08:26:11Z

Thank you for your comments. I moved the unit test fixes into #558. Once that is merged, the changes here will relate only to the deprecated API.

droberts195

LGTM

initial commit

361a43b

valeriy42 requested a review from davidkyle July 7, 2023 11:25

valeriy42 added the bug, topic:NLP, and refactor labels Jul 7, 2023

fix formatting

36b2d7c

droberts195 reviewed Jul 7, 2023


davidkyle reviewed Jul 7, 2023


valeriy42 removed the bug and refactor labels Jul 10, 2023

valeriy42 added 2 commits July 10, 2023 10:29

change back the link in README

e6b5025

Merge branch 'main' into inference-endpoint-update

bd9d36c

valeriy42 requested a review from droberts195 July 10, 2023 13:17

droberts195 approved these changes Jul 10, 2023


valeriy42 merged commit 77781b9 into elastic:main Jul 11, 2023
2 checks passed

valeriy42 deleted the inference-endpoint-update branch July 11, 2023 08:55
