t2v-transformers-models-rs

This repository contains the Proof of Concept (PoC) for a Rust-native sentence embedding API for use within Weaviate's module vectorization framework.

In its current form, it uses huggingface/candle to load HuggingFace models from the Hub into Rust. The models are then used to generate embeddings for input text.

The API layer is provided by the tokio-rs/axum framework, which handles requests asynchronously using event-loop-based concurrency.

The inference workloads themselves are scheduled onto a rayon thread pool using andybarron/tokio-rayon, a lightweight wrapper that lets a tokio async function await rayon jobs. Ideally, we would re-implement this wrapper internally to avoid the risk of relying on an external dependency. For now, it is used as-is for the proof of concept.
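
The hand-off described above (run the CPU-bound job somewhere else, then wait for its single result) can be sketched with the standard library alone. This is not the code used in this repository and not how tokio-rayon is implemented: `spawn_compute`, `embed_batch`, and `fake_embed` are hypothetical names, `fake_embed` is a toy stand-in for a model forward pass, a plain `std::thread` stands in for the rayon pool, and a blocking `mpsc` channel stands in for whatever awaitable channel an async re-implementation would presumably use.

```rust
use std::sync::mpsc::{self, Receiver};
use std::thread;

/// Hypothetical stand-in for a model forward pass: CPU-bound work that
/// should not run on the thread that accepts requests.
fn fake_embed(text: &str) -> Vec<f32> {
    // Toy "embedding": byte length and vowel count, NOT a real model output.
    let vowels = text.chars().filter(|c| "aeiouAEIOU".contains(*c)).count();
    vec![text.len() as f32, vowels as f32]
}

/// Runs `job` on a separate worker thread and returns a receiver that
/// yields the job's result exactly once.
/// Assumption: a real implementation would submit to a bounded pool
/// instead of spawning one thread per job.
fn spawn_compute<T, F>(job: F) -> Receiver<T>
where
    T: Send + 'static,
    F: FnOnce() -> T + Send + 'static,
{
    let (tx, rx) = mpsc::channel();
    thread::spawn(move || {
        // If the caller dropped the receiver, the send fails; ignore that.
        let _ = tx.send(job());
    });
    rx
}

/// Dispatches every input first, then collects the results in input order,
/// so the jobs can run concurrently.
fn embed_batch(inputs: Vec<String>) -> Vec<Vec<f32>> {
    let pending: Vec<Receiver<Vec<f32>>> = inputs
        .into_iter()
        .map(|text| spawn_compute(move || fake_embed(&text)))
        .collect();
    pending
        .into_iter()
        .map(|rx| rx.recv().expect("worker thread panicked"))
        .collect()
}
```

Calling `embed_batch(vec!["hello".to_string(), "rust".to_string()])` returns one vector per input, in input order. The property the sketch illustrates is that the caller holds only a receiver while the work runs elsewhere; in an async setting, that receiver would need to be awaitable rather than blocking, so the request-handling thread stays free.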