Packages

Model2Vec

State-of-the-art static embeddings distilled from any sentence transformer.

Embeddings, Distillation, Python

SemHash

Multimodal semantic deduplication, outlier filtering, and representative sampling.

Data deduplication, Multimodal, Dataset curation, Python

Vicinity

Flexible nearest neighbor search with multiple backends and evaluation.

Search, Retrieval, Python

Semble

Fast and accurate code search for agents.

Code Search, MCP Server, Agents, Python

Tokenlearn

Pre-train Model2Vec models on large corpora with efficient embedding distillation.

Pretraining, Distillation, Python

Model2Vec-rs

High-performance Rust inference for Model2Vec models with low-overhead deployment.

Rust, Inference