Add exported openvino model 'openvino_model_qint8_quantized.xml'

by tomaarsen HF Staff - opened about 18 hours ago

base: refs/heads/main

←

from: refs/pr/4

Discussion Files changed

+11046

-0

tomaarsen

Owner about 18 hours ago

⚙️

sentence-transformers/backend-export

Hello!

This pull request adds an exported openvino model (openvino_model_qint8_quantized.xml).

Config

OVQuantizationConfig(
    quant_method=<OVQuantizationMethod.DEFAULT: 'default'>
)

Testing this pull request

You can test this pull request before merging by loading the model from this PR with the revision argument:

from sentence_transformers import SentenceTransformer

# NOTE: Update this to the number of your pull request
pr_number = 2
model = SentenceTransformer(
    "tomaarsen/distilroberta-base-nli-v2-bf16-bf16",
    revision=f"refs/pr/{pr_number}",
    backend="openvino",
    model_kwargs={"file_name": "openvino_model_qint8_quantized.xml"},
)

# Verify that everything works as expected
embeddings = model.encode(["The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium."])
print(embeddings.shape)

similarities = model.similarity(embeddings, embeddings)
print(similarities)

This PR was auto-generated with export_static_quantized_openvino_model.

Add exported openvino model 'openvino_model_qint8_quantized.xml'ecce7e41

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment