BGEM3EmbeddingFunction
BGEM3EmbeddingFunction is a class in pymilvus that handles encoding text into embeddings using the BGE M3 model to support embedding retrieval in Milvus.
pymilvus.model.hybrid.BGEM3EmbeddingFunction
Constructor
Constructs a BGEM3EmbeddingFunction for common use cases.
BGEM3EmbeddingFunction(
    model_name: str = "BAAI/bge-m3",
    batch_size: int = 16,
    device: str = "",
    normalize_embeddings: bool = True,
    use_fp16: bool = True,
    return_dense: bool = True,
    return_sparse: bool = True,
    return_colbert_vecs: bool = False,
    **kwargs,
)
PARAMETERS:
- model_name (string) - The name of the model to use for encoding. The value defaults to BAAI/bge-m3.
- batch_size (int) - The batch size used for the computation. Defaults to 16.
- device (string) - The device to use, with cpu for the CPU and cuda:n for the nth GPU device.
- normalize_embeddings (bool) - Whether to normalize embedding vectors to unit length.
- use_fp16 (bool) - Whether to utilize 16-bit floating-point precision (fp16). Set this to False when device is cpu.
- return_dense (bool) - Whether to return dense embedding vectors.
- return_sparse (bool) - Whether to return sparse embedding vectors.
- return_colbert_vecs (bool) - Whether to return ColBERT-style contextualized embedding vectors.
- **kwargs - Allows additional keyword arguments to be passed to the model initialization. For more information, refer to bge_m3.
Examples
from pymilvus import model
bge_m3_ef = model.hybrid.BGEM3EmbeddingFunction(
    model_name='BAAI/bge-m3', # Specify the model name
    device='cpu', # Specify the device to use, e.g., 'cpu' or 'cuda:0'
    use_fp16=False # Whether to use fp16. `False` for `device='cpu'`.
)
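Once constructed, the instance can be used to embed text. The following is a minimal usage sketch; it assumes the companion encode_documents() and encode_queries() methods and the dim property documented for this class, and uses placeholder sentences for illustration.
docs = [
    "Artificial intelligence was founded as an academic discipline in 1956.",
    "Born in Maida Vale, London, Turing was raised in southern England.",
]
# Encode documents; the result is a dict keyed by the enabled output types,
# e.g. 'dense' and 'sparse' with the defaults shown above.
docs_embeddings = bge_m3_ef.encode_documents(docs)
# Encode search queries in the same way for retrieval.
query_embeddings = bge_m3_ef.encode_queries(["When was artificial intelligence founded?"])
# Inspect the dense embedding dimension reported by the model.
print(bge_m3_ef.dim["dense"])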