
Getting "ValueError: not enough values to unpack" when using text_embedding models #533

Closed
devnarekm opened this issue Apr 28, 2023 · 6 comments · Fixed by #535


devnarekm commented Apr 28, 2023

Hello!

I'm getting the following error when trying to deploy a model with the task type "text_embedding"

[screenshot: ValueError traceback]

Here is the command I used:

```sh
docker run -it --rm --network host \
  elastic/eland \
  eland_import_hub_model \
  --url "my_url" \
  -u "my_usrnm" -p "my_pswd" \
  --hub-model-id sentence-transformers/msmarco-MiniLM-L-12-v3 \
  --task-type text_embedding \
  --start
```

I tried all-mpnet-base-v2 as well as some other models and got the same error. Strangely, the task type text_classification works just fine.

I tried changing line 653 in eland/ml/pytorch/transformers.py so that it unpacks only one value, but that led to a further API error.

Any suggestions are appreciated!
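For context, this error is the standard Python failure mode when a call that used to return a pair starts returning a single value. A minimal sketch, using a hypothetical stand-in for the traced model's `sample_output`:

```python
def sample_output():
    # Hypothetical stand-in: the traced model now returns a single
    # value instead of a (tensor, tensor) pair
    return ("embedding",)

try:
    # The unpacking on eland's line 653 expects two values, so a
    # one-element result raises ValueError
    sample_embedding, pooled = sample_output()
except ValueError as e:
    print(e)  # not enough values to unpack (expected 2, got 1)
```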

devnarekm (Author) commented:

Update:

I managed to run this by applying two alterations:

1. In \usr\local\lib\python3.9\dist-packages\eland\ml\pytorch\transformers.py, change line 653 to:

```python
sample_embedding = self._traceable_model.sample_output()
```

Reason: only one value is returned, so unpacking into two variables fails.

After this change I started getting another error (now REST API related) stating that the request body couldn't be parsed due to an extra field "embedding_size", which leads to the second change.

2. In \usr\local\lib\python3.9\dist-packages\elasticsearch_sync\client_base.py, I added the following snippet at line 288:

```python
try:
    body["inference_config"]["text_embedding"].pop("embedding_size", None)
except (KeyError, TypeError):
    pass
```

Reason: as per the documentation here https://www.elastic.co/guide/en/elasticsearch/reference/current/put-trained-models.html, the request body's inference_config/text_embedding entry does not have a field 'embedding_size'.

This temporarily solves my problem, hope it gets resolved soon on your end!
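To illustrate the second workaround in isolation: the same effect can be had by stripping the field from the request body before it is sent. The `body` dict below is a hypothetical reconstruction of what eland builds, not the library's actual internal structure:

```python
# Hypothetical reconstruction of the PUT trained model request body
body = {
    "inference_config": {
        "text_embedding": {
            "tokenization": {"bert": {"do_lower_case": True}},
            "embedding_size": 384,  # rejected by pre-8.8 clusters
        }
    }
}

# Remove the field that Elasticsearch versions before 8.8 do not accept
body["inference_config"]["text_embedding"].pop("embedding_size", None)
```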

davidkyle (Member) commented:

Thanks for reporting this, @NarekMargaryan. I opened #535 to check the output of the model.

The embedding_size field was added in elastic/elasticsearch#95176 (version 8.8). It is helpful to know the number of dimensions the embedding has when creating the dense_vector field mapping.

It is documented in the 8.8 docs: https://www.elastic.co/guide/en/elasticsearch/reference/8.8/put-trained-models.html

davismcphee added a commit to davismcphee/eland that referenced this issue May 12, 2023
melfebulu commented:

So, how can I use Eland with 8.7.1 now? And why can't I find \usr\local\lib\python3.9\dist-packages\elasticsearch_sync\client_base.py? The error is: ValueError: not enough values to unpack (expected 2, got 1)

melfebulu commented:

I tried this: commenting out the embedding_size lines.

```python
elif self._task_type == "text_embedding":
    sample_embedding = self._traceable_model.sample_output()
    # sample_embedding, _ = self._traceable_model.sample_output()
    # embedding_size = sample_embedding.size(-1)
    inference_config = TASK_TYPE_TO_INFERENCE_CONFIG[self._task_type](
        tokenization=tokenization_config,
        # embedding_size=embedding_size,
    )
```

davidkyle (Member) commented:

Hi @melfebulu the problem will be fixed in the next release.

If commenting out the code works for you, then great. Alternatively, you can check out the 8.7.0 release, which does not have the problematic code:

```sh
git checkout v8.7.0
```

One question, if you don't mind: are you building Eland from source, or installing the latest release via pip or a similar mechanism? Thanks.

melfebulu commented:

Thanks. I build from source; commenting it out is working now :)
