Make embeddings computing async and add support for batching. #267

drazvan · 2024-01-22T21:52:09Z

Enable batching when computing the embeddings. Initial tests show a 3x throughput increase.

Updated the BasicEmbeddingsIndex to use a more flexible cache configuration setup. We replaced the bool flag and the CacheEmbeddings instance with a single unified `cache_config` parameter that can accept either a dictionary or an EmbeddingsCacheConfig instance for enhanced configurability. Additionally, the `_get_embeddings` method is now an asynchronous function to allow for non-blocking I/O operations per NVIDIA#267.

Enhanced the EmbeddingsIndex class by adding a 'cache_config' attribute for customizable cache management. Also, updated the '_get_embeddings' method to be asynchronous per NVIDIA#267.

Make embeddings computing async and add support for batching.

2f3108e

drazvan added this to the v0.8.0 milestone Jan 22, 2024

drazvan self-assigned this Jan 22, 2024

drazvan mentioned this pull request Feb 6, 2024

Implement cache embeddings (resolves #200) #208

Merged

drazvan marked this pull request as ready for review February 12, 2024 21:13

drazvan merged commit fb09f4a into develop Feb 12, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make embeddings computing async and add support for batching. #267

Make embeddings computing async and add support for batching. #267

drazvan commented Jan 22, 2024 •

edited

Loading

Make embeddings computing async and add support for batching. #267

Make embeddings computing async and add support for batching. #267

Conversation

drazvan commented Jan 22, 2024 • edited Loading

drazvan commented Jan 22, 2024 •

edited

Loading