Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Added
new_with_tokenizer
constructor forSentenceEmbeddingsModel
allowing passing custom tokenizers for sentence embeddings pipelines.tokenizer.json
andspecial_token_map.json
tokenizer files.kind
parameter to specify the model weight precision. If not provided, will default to full precision on CPU, or the serialized weights precision otherwise.Fixed
tokenizer_forbidden_ngram_chars
to specify characters that should be excluded from n-grams (allows filtering m-grams spanning multiple sentences).sparse_grad
flag to false forgather
operationsChanged
torch
2.1 (viatch
0.14.0).Result
for improved error handling