FEAT add method to call inference backend #105

Merged
8 commits merged into skops-dev:main from the inference branch on Aug 24, 2022

Conversation

adrinjalali (Member)

This PR introduces a wrapper to make it convenient for users to get outputs for array-like inputs.

I'm not sure about the name of the function.

I'll be adding tests.

@BenjaminBossan would you mind having a look and seeing if this looks okay as a start to you?

@adrinjalali added this to the 0.2 milestone on Aug 18, 2022
@adrinjalali (Member Author) commented on Aug 18, 2022

Note: this cannot be tested or released until huggingface/huggingface_hub#998 is released (unless we patch that variable in this library, but I'd rather not do that).

@BenjaminBossan (Collaborator)

LGTM so far.

Regarding the naming, I have no strong opinion. Is there a reason to prefer "output" over "prediction"? At least when it comes to the planned features, we would always call model.predict, right? Not sure if we ever would want to have, say, transform or decision_function. Alternatively, we could use the word "inference", since we're using the inference API.

@adrinjalali (Member Author)

Yeah, that's why I didn't call it predict. I'm thinking a feature we can add is to let users set a "predict_method" in the configuration file, and get predict_proba or transform instead of predict, for instance.
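
For illustration only, a sketch of what that future option might look like; the "predict_method" key, its placement, and the surrounding structure are hypothetical and nothing here is decided in this PR:

# Hypothetical configuration entry; the key name and placement are
# illustrative only and not part of this PR.
config = {
    "sklearn": {
        "predict_method": "predict_proba",  # instead of the default "predict"
    }
}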

@BenjaminBossan (Collaborator)

In that case, it makes sense to not call it predict (except if that feature would be provided through separate functions).

@adrinjalali (Member Author)

I'm still not sure on that design though. But we'll get there when we start working on it.

@adrinjalali (Member Author)

The classifier test passes, but the regression one fails; I need to figure out why.

@adrinjalali (Member Author)

Seems like huggingface/api-inference-community#83 is not deployed yet. Once that's done, the tests here should pass.

@adrinjalali (Member Author)

@BenjaminBossan why do you think the tests work for a classifier but not a regressor?

>       assert all(output == data.target[:5])
E       assert False
E        +  where False = all(array([206.11...128.45984241]) == 0    151.0\n1 ...dtype: float64
E           Full diff:
E           + array([206.11706979,  68.07234761, 176.88406035, 166.91796559,
E           +        128.45984241],
E           + )
E           - 0    151.0
E           - 1     75.0
E           - 2    141.0
E           - 3    206.0
E           - 4    135.0
E           - Name: target, dtype: float64)

@BenjaminBossan (Collaborator)

Please correct me if I misunderstand, but doesn't that check test if the prediction is 100% accurate? This may occur with classification but is almost impossible with regression.

@adrinjalali (Member Author)

so the issue was that I was comparing with the training data instead of the model output lol 🤦🏼

Question: should we add this to the user guide? If we do, the user guide will take at least 30 seconds more to run.

@BenjaminBossan (Collaborator)

> Question: should we add this to the user guide? If we do, the user guide will take at least 30 seconds more to run.

Hmm, maybe it's sufficient to just document it well, without actually running it?

@adrinjalali (Member Author)

ready for review

@BenjaminBossan (Collaborator) left a comment

Basically perfect for me. I have two minor comments, please take a look.

inputs=inputs
)

if isinstance(res, list):
@BenjaminBossan (Collaborator)

Can we rely on the assumption that if the output is a list, everything went well, else it didn't?

@BenjaminBossan (Collaborator)

Okay, and we can be sure that if an error is raised, the API will never return a list. Ideally, I would wish for a response object with a status code but api inference doesn't seem to support that.

On the other hand, when the response is ok, we know it's a list because of how the Pipelines are implemented. I guess this introduces some coupling, as future pipelines must always return a list as output too. For now it works, so we can proceed, but it doesn't feel very robust to me.

@adrinjalali (Member Author)

The response comes as a JSON object, so if there are only values in it, it's a list. But I agree that a more complex response, or warnings in the header, would maybe be better. For now I'd say we can merge, and we can change this if the logic on the backend changes. I don't think we can easily future-proof this bit here.
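
As a rough sketch of the pattern being discussed here (not the actual wrapper in this PR; the helper name, endpoint URL, payload shape, and error handling are assumptions for illustration):

import requests

def call_inference_api(repo_id, data, token):
    # Hypothetical helper showing the list-vs-error check discussed above;
    # the real wrapper goes through huggingface_hub.
    url = f"https://api-inference.huggingface.co/models/{repo_id}"
    res = requests.post(
        url,
        headers={"Authorization": f"Bearer {token}"},
        json={"data": data},  # payload shape is a stand-in, not the real contract
    ).json()

    # The backend pipelines return a plain JSON list on success; anything
    # else (typically a dict) is treated as an error payload.
    if isinstance(res, list):
        return res
    raise RuntimeError(f"Inference API returned an error: {res}")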

X_test = data.data.head(5)
y_pred = model.predict(X_test)
output = get_model_output(repo_id, data=X_test, token=HF_HUB_TOKEN)
assert np.allclose(output, y_pred)
@BenjaminBossan (Collaborator)

nit: How about moving the assert to the end of the test, so that cleanup is performed even if the assert fails? Even better would be a fixture or context manager that does the setup and cleanup, but moving the assert would already help.

@adrinjalali (Member Author)

Didn't end up using fixtures for this because I couldn't figure out how to do them easily, and the pytest docs are really sparse when it comes to using the request fixture.
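
For reference, a yield-based fixture is one way to get that setup/teardown behavior without touching request; a minimal sketch, where create_test_repo and delete_test_repo are hypothetical stand-ins for however the test suite pushes and removes a model repo:

import pytest

@pytest.fixture
def pushed_repo():
    # Setup: push a model repo to the Hub (hypothetical helper).
    repo_id = create_test_repo()
    # Hand the repo id to the test; everything after the yield runs as
    # teardown, even if the test's assert fails.
    yield repo_id
    delete_test_repo(repo_id)

def test_inference_backend(pushed_repo):
    ...  # call get_model_output(pushed_repo, ...) and assert against model.predict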

@BenjaminBossan (Collaborator) left a comment

LGTM.

Regarding the fixture, I agree that it would be overkill for this test.

If you want, I can merge, or do you want to wait for more reviews? (Edit: I see now that the tests failed, not sure why)


@BenjaminBossan merged commit 527d05c into skops-dev:main on Aug 24, 2022
@adrinjalali deleted the inference branch on August 24, 2022 at 15:47